Papers
🌟 arXiv Spotlight-
ViewSPEC-RL Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
-
ViewLeave No Observation Behind Real-time Correction for VLA Action Chunks
-
ViewOne-Shot Multi-Label Causal Discovery in High-Dimensional Event Sequences
-
ViewTowards Monotonic Improvement in In-Context Reinforcement Learning
-
ViewPARL-MT Learning to Call Functions in Multi-Turn Conversation with Progress Awareness
-
ViewAutoEP LLMs-Driven Automation of Hyperparameter Evolution for Metaheuristic Algorithms
-
ViewUnderstanding and Enhancing the Planning Capability of Language Models via Multi-Token Prediction
-
ViewLimit Analysis for Symbolic Multi-step Reasoning Tasks with Information Propagation Rules Based on
-
ViewWARBERT A Hierarchical BERT-based Model for Web API Recommendation
-
ViewTRAX TRacking Axles for Accurate Axle Count Estimation
-
ViewDense associative memory on the Bures-Wasserstein space
-
ViewDeep Learning-Based Detection of Cognitive Impairment from Passive Smartphone Sensing with Routine
-
ViewAI-Enhanced Distributed Channel Access for Collision Avoidance in Future Wi-Fi 8
-
ViewCoordination Requires Simplification Thermodynamic Bounds on Multi-Objective Compromise in Natural
-
ViewMathBode Frequency-Domain Fingerprints of LLM Mathematical Reasoning
-
ViewTrust Region Reward Optimization and Proximal Inverse Reward Optimization Algorithm
-
ViewSysMoBench Evaluating AI on Formally Modeling Complex Real-World Systems
-
ViewC$^2$GSPG Confidence-calibrated Group Sequence Policy Gradient towards Self-aware Reasoning
-
ViewTransferring Vision-Language-Action Models to Industry Applications Architectures, Performance, an
-
ViewRHYTHM Reasoning with Hierarchical Temporal Tokenization for Human Mobility