Papers
🌟 arXiv Spotlight-
ViewTracing the Representation Geometry of Language Models from Pretraining to Post-training
-
ViewDeceive, Detect, and Disclose Large Language Models Play Mini-Mafia
-
ViewLLM Watermark Evasion via Bias Inversion
-
ViewMoE-PHDS One MoE checkpoint for flexible runtime sparsity
-
ViewCreative Adversarial Testing (CAT) A Novel Framework for Evaluating Goal-Oriented Agentic AI Syste
-
ViewAI Noether -- Bridging the Gap Between Scientific Laws Derived by AI Systems and Canonical Knowled
-
ViewPhysically Plausible Multi-System Trajectory Generation and Symmetry Discovery
-
ViewADAM A Diverse Archive of Mankind for Evaluating and Enhancing LLMs in Biographical Reasoning
-
ViewTowards Strategic Persuasion with Language Models
-
ViewNot only a helper, but also a teacher Interactive LLM Cascade
-
ViewFunctional Critic Modeling for Provably Convergent Off-Policy Actor-Critic
-
ViewTiny-QMoE
-
ViewWhat Matters More For In-Context Learning under Matched Compute Budgets Pretraining on Natural Tex
-
ViewUnsupervised Speech Enhancement using Data-defined Priors
-
ViewCompute-Optimal Quantization-Aware Training
-
ViewMonoCon A general framework for learning ultra-compact high-fidelity representations using monoton
-
ViewLarge language models management of medications three performance analyses
-
ViewSoft-Di[M]O Improving One-Step Discrete Image Generation with Soft Embeddings
-
ViewRethinking Large Language Model Distillation A Constrained Markov Decision Process Perspective
-
ViewTY-RIST Tactical YOLO Tricks for Real-time Infrared Small Target Detection