Papers
🌟 arXiv Spotlight-
ViewInformative Text-Image Alignment for Visual Affordance Learning with Foundation Models
-
ViewIntention-aware Hierarchical Diffusion Model for Long-term Trajectory Anomaly Detection
-
ViewRALLM-POI Retrieval-Augmented LLM for Zero-shot Next POI Recommendation with Geographical Rerankin
-
ViewFrom domain-landmark graph learning to problem-landmark graph generation
-
ViewTactfulToM Do LLMs Have the Theory of Mind Ability to Understand White Lies
-
ViewA Chain-of-thought Reasoning Breast Ultrasound Dataset Covering All Histopathology Categories
-
ViewFrom Easy to Hard The MIR Benchmark for Progressive Interleaved Multi-Image Reasoning
-
ViewKAHAN Knowledge-Augmented Hierarchical Analysis and Narration for Financial Data Narration
-
ViewThe Transfer Neurons Hypothesis An Underlying Mechanism for Language Latent Space Transitions in M
-
ViewWhen Color-Space Decoupling Meets Diffusion for Adverse-Weather Image Restoration
-
ViewAdaptive Overclocking Dynamic Control of Thinking Path Length via Real-Time Reasoning Signals
-
ViewAdvancing Speech Understanding in Speech-Aware Language Models with GRPO
-
ViewPTQTP Post-Training Quantization to Trit-Planes for Large Language Models
-
ViewLeveraging Multiple Speech Enhancers for Non-Intrusive Intelligibility Prediction for Hearing-Impa
-
ViewThe 1st Solution for 7th LSVOS RVOS Track SaSaSa2VA
-
ViewGradient Interference-Aware Graph Coloring for Multitask Learning
-
ViewQuantum Abduction A New Paradigm for Reasoning under Uncertainty
-
ViewAirQA A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation
-
ViewEquip Pre-ranking with Target Attention by Residual Quantization
-
ViewCross-Attention with Confidence Weighting for Multi-Channel Audio Alignment