Papers
🌟 arXiv Spotlight
- VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing
- Toward a Physics of Deep Learning and Brains
- CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning
- Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs
- Hierarchical Representation Matching for CLIP-based Class-Incremental Learning
- WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Rei
- Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity
- Language Models Can Learn from Verbal Feedback Without Scalar Rewards
- Variational Reasoning for Language Models
- Towards Efficient Online Exploration for Reinforcement Learning with Human Feedback
- StateX: Enhancing RNN Recall via Post-training State Expansion
- Learning Admissible Heuristics for A*: Theory and Practice
- A Theoretical Analysis of Discrete Flow Matching Generative Models
- IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning
- Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting
- Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspect
- Quantile Advantage Estimation for Entropy-Safe Reasoning
- Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinf
- Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time
- UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration