Papers
🌟 arXiv Spotlight
- Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning
- Continuous-Time Reinforcement Learning for Asset-Liability Management
- Vid-Freeze: Protecting Images from Malicious Image-to-Video Generation via Temporal Freezing
- Socio-Economic Model of AI Agents
- Learning Regional Monsoon Patterns with a Multimodal Attention U-Net
- GUI-PRA: Process Reward Agent for GUI Tasks
- Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key I
- Agentic AI Reasoning for Mobile Edge General Intelligence: Fundamentals, Approaches, and Directions
- Adaptive Token-Weighted Differential Privacy for LLMs: Not All Tokens Require Equal Protection
- Online Dynamic Goal Recognition in Gym Environments
- Self-Consistency as a Free Lunch: Reducing Hallucinations in Vision-Language Models via Self-Reflec
- Patch Rebirth: Toward Fast and Transferable Model Inversion of Vision Transformers
- $p$-less Sampling: A Robust Hyperparameter-Free Approach for LLM Decoding
- SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
- Leave No Observation Behind: Real-time Correction for VLA Action Chunks
- One-Shot Multi-Label Causal Discovery in High-Dimensional Event Sequences
- Towards Monotonic Improvement in In-Context Reinforcement Learning
- PARL-MT: Learning to Call Functions in Multi-Turn Conversation with Progress Awareness
- AutoEP: LLMs-Driven Automation of Hyperparameter Evolution for Metaheuristic Algorithms
- Understanding and Enhancing the Planning Capability of Language Models via Multi-Token Prediction