Papers
🌟 arXiv Spotlight-
ViewAudio-Guided Dynamic Modality Fusion with Stereo-Aware Attention for Audio-Visual Navigation
-
ViewPGSTalker Real-Time Audio-Driven Talking Head Generation via 3D Gaussian Splatting with Pixel-Awar
-
ViewFedEL Federated Elastic Learning for Heterogeneous Devices
-
ViewME-Mamba Multi-Expert Mamba with Efficient Knowledge Capture and Fusion for Multimodal Survival An
-
ViewLearning from Gene Names, Expression Values and Images Contrastive Masked Text-Image Pretraining f
-
ViewLLMs as Layout Designers A Spatial Reasoning Perspective
-
ViewDynamic Expert Specialization Towards Catastrophic Forgetting-Free Multi-Domain MoE Adaptation
-
ViewPhysHDR When Lighting Meets Materials and Scene Geometry in HDR Reconstruction
-
ViewseqBench A Tunable Benchmark to Quantify Sequential Reasoning Limits of LLMs
-
ViewLarge Language Models as End-to-end Combinatorial Optimization Solvers
-
ViewAdaptiveGuard Towards Adaptive Runtime Safety for LLM-Powered Software
-
ViewThe Principles of Human-like Conscious Machine
-
ViewShadowServe Interference-Free KV Cache Fetching for Distributed Prefix Caching
-
ViewRoundtable Policy Improving Scientific Reasoning and Narratives through Confidence-Weighted Consen
-
ViewSemantic-Driven Topic Modeling for Analyzing Creativity in Virtual Brainstorming
-
ViewRobot Learning with Sparsity and Scarcity
-
ViewKANO Kolmogorov-Arnold Neural Operator
-
ViewSMART-3D Three-Dimensional Self-Morphing Adaptive Replanning Tree
-
ViewPrompt-Driven Agentic Video Editing System Autonomous Comprehension of Long-Form, Story-Driven Med
-
ViewAutomated Procedural Analysis via Video-Language Models for AI-assisted Nursing Skills Assessment