Papers
🌟 arXiv Spotlight-
ViewMaskVCT Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllabili
-
ViewSAEC Scene-Aware Enhanced Edge-Cloud Collaborative Industrial Vision Inspection with Multimodal LL
-
ViewScenGAN Attention-Intensive Generative Model for Uncertainty-Aware Renewable Scenario Forecasting
-
ViewMCTS-EP Empowering Embodied Planning with Online Preference Optimization
-
ViewPrompt-with-Me in-IDE Structured Prompt Management for LLM-Driven Software Engineering
-
ViewUltra-short-term solar power forecasting by deep learning and data reconstruction
-
View$texttt{DiffSyn}$ A Generative Diffusion Approach to Materials Synthesis Planning
-
ViewGoverning Automated Strategic Intelligence
-
ViewInformative Text-Image Alignment for Visual Affordance Learning with Foundation Models
-
ViewIntention-aware Hierarchical Diffusion Model for Long-term Trajectory Anomaly Detection
-
ViewRALLM-POI Retrieval-Augmented LLM for Zero-shot Next POI Recommendation with Geographical Rerankin
-
ViewFrom domain-landmark graph learning to problem-landmark graph generation
-
ViewTactfulToM Do LLMs Have the Theory of Mind Ability to Understand White Lies
-
ViewA Chain-of-thought Reasoning Breast Ultrasound Dataset Covering All Histopathology Categories
-
ViewFrom Easy to Hard The MIR Benchmark for Progressive Interleaved Multi-Image Reasoning
-
ViewKAHAN Knowledge-Augmented Hierarchical Analysis and Narration for Financial Data Narration
-
ViewThe Transfer Neurons Hypothesis An Underlying Mechanism for Language Latent Space Transitions in M
-
ViewWhen Color-Space Decoupling Meets Diffusion for Adverse-Weather Image Restoration
-
ViewAdaptive Overclocking Dynamic Control of Thinking Path Length via Real-Time Reasoning Signals
-
ViewAdvancing Speech Understanding in Speech-Aware Language Models with GRPO