Papers
🌟 arXiv Spotlight-
ViewTRACE Learning to Compute on Graphs
-
ViewYou Can't Steal Nothing Mitigating Prompt Leakages in LLMs via System Vectors
-
ViewPosition The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards
-
ViewNo Prompt Left Behind Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-G
-
ViewUnlocking the Essence of Beauty Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimiz
-
ViewEnhancing Low-Rank Adaptation with Structured Nonlinear Transformations
-
ViewReimagining Agent-based Modeling with Large Language Model Agents via Shachi
-
ViewGraph of Agents Principled Long Context Modeling by Emergent Multi-Agent Collaboration
-
ViewBeyond Johnson-Lindenstrauss Uniform Bounds for Sketched Bilinear Forms
-
ViewDeepTravel An End-to-End Agentic Reinforcement Learning Framework for Autonomous Travel Planning A
-
ViewCan Large Language Models Autoformalize Kinematics
-
ViewDiTraj training-free trajectory control for video diffusion transformer
-
ViewAxiomatic Choice and the Decision-Evaluation Paradox
-
ViewDS-STAR Data Science Agent via Iterative Planning and Verification
-
ViewProRe A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration
-
ViewChaosNexus A Foundation Model for Universal Chaotic System Forecasting with Multi-scale Representa
-
ViewD-Artemis A Deliberative Cognitive Framework for Mobile GUI Multi-Agents
-
ViewEvaluating and Improving Cultural Awareness of Reward Models for LLM Alignment
-
ViewFastGRPO Accelerating Policy Optimization via Concurrency-aware Speculative Decoding and Online Dr
-
ViewUnbiased Binning Fairness-aware Attribute Representation