Papers
🌟 arXiv Spotlight-
ViewRL Squeezes, SFT Expands A Comparative Study of Reasoning LLMs
-
ViewTeaching RL Agents to Act Better VLM as Action Advisor for Online Reinforcement Learning
-
ViewExpanding Reasoning Potential in Foundation Model by Learning Diverse Chains of Thought Patterns
-
ViewTrustJudge Inconsistencies of LLM-as-a-Judge and How to Alleviate Them
-
ViewCross-Modal Instructions for Robot Motion Generation
-
ViewGraphUniverse Enabling Systematic Evaluation of Inductive Generalization
-
ViewBest-of-$infty$ -- Asymptotic Performance of Test-Time Compute
-
ViewVision Transformers the threat of realistic adversarial patches
-
ViewTyphoonMLA A Mixed Naive-Absorb MLA Kernel For Shared Prefix
-
ViewWhich Cultural Lens Do Models Adopt On Cultural Positioning Bias and Agentic Mitigation in LLMs
-
ViewCommunication Bias in Large Language Models A Regulatory Perspective
-
ViewRecon-Act A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation,
-
ViewScaleDiff Scaling Difficult Problems for Advanced Mathematical Reasoning
-
ViewEnGraf-Net Multiple Granularity Branch Network with Fine-Coarse Graft Grained for Classification T
-
ViewDisagreements in Reasoning How a Model's Thinking Process Dictates Persuasion in Multi-Agent Syste
-
ViewGeoRef Referring Expressions in Geometry via Task Formulation, Synthetic Supervision, and Reinforc
-
ViewReinforcement Learning Fine-Tuning Enhances Activation Intensity and Diversity in the Internal Cir
-
ViewCombinatorial Creativity A New Frontier in Generalization Abilities
-
ViewGenerative AI for FFRDCs
-
ViewCLAUSE Agentic Neuro-Symbolic Knowledge Graph Reasoning via Dynamic Learnable Context Engineering