π arXiv Spotlight
- AI Review: Planning with Reasoning using Vision Language World Model
- AI Review: UI-TARS-2 Technical Report Advancing GUI Agent with Multi-Turn Reinforcement Learning
- AI Review: FLM-Audio Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training
- AI Review: A Survey Towards Privacy and Security in Mobile Large Language Models
- AI Review: Towards Agents That Know When They Don't Know Uncertainty as a Control Signal for Structured Reaso
- AI Review: DCPO Dynamic Clipping Policy Optimization
- AI Review: ReCode Improving LLM-based Code Repair with Fine-Grained Retrieval-Augmented Generation
- AI Review: Beyond Ensembles Simulating All-Atom Protein Dynamics in a Learned Latent Space
- AI Review: SALAD -- Semantics-Aware Logical Anomaly Detection
- AI Review: Align-Then-stEer Adapting the Vision-Language Action Models through Unified Latent Guidance
- AI Review: Empowering Large Language Model for Sequential Recommendation via Multimodal Embeddings and Semant
- AI Review: 2D Gaussian Splatting with Semantic Alignment for Image Inpainting
- AI Review: AHAMask Reliable Task Specification for Large Audio Language Models without Instructions
- AI Review: Q-Sched Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling
- AI Review: O-DisCo-Edit Object Distortion Control for Unified Realistic Video Editing
- AI Review: In-N-Out A Parameter-Level API Graph Dataset for Tool Agents
- AI Review: Counterfactual Sensitivity for Faithful Reasoning in Language Models
- AI Review: DeepResearch Arena The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
- AI Review: Conformal Predictive Monitoring for Multi-Modal Scenarios
- AI Review: LLM-Guided Semantic Relational Reasoning for Multimodal Intent Recognition