Papers
🌟 arXiv Spotlight-
ViewVision-Grounded Machine Interpreting Improving the Translation Process through Visual Cues
-
ViewExplore-Execute Chain Towards an Efficient Structured Reasoning Paradigm
-
ViewEasy Turn Integrating Acoustic and Linguistic Modalities for Robust Turn-Taking in Full-Duplex Spo
-
ViewDiffusion Models are Kelly Gamblers
-
ViewHiViS Hiding Visual Tokens from the Drafter for Speculative Decoding in Vision-Language Models
-
ViewTaming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Few
-
ViewGraph Mixing Additive Networks
-
ViewContinual Learning to Generalize Forwarding Strategies for Diverse Mobile Wireless Networks
-
ViewFrom Neural Networks to Logical Theories The Correspondence between Fibring Modal Logics and Fibri
-
ViewEWC-Guided Diffusion Replay for Exemplar-Free Continual Learning in Medical Imaging
-
ViewInterpreting deep learning-based stellar mass estimation via causal analysis and mutual informatio
-
ViewPreserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios
-
ViewDynamic Orthogonal Continual Fine-tuning for Mitigating Catastrophic Forgettings
-
ViewGradient Flow Convergence Guarantee for General Neural Network Architectures
-
ViewTowards Understanding Subliminal Learning When and How Hidden Biases Transfer
-
ViewTunable-Generalization Diffusion Powered by Self-Supervised Contextual Sub-Data for Low-Dose CT Re
-
ViewQuant Fever, Reasoning Blackholes, Schrodinger's Compliance, and More Probing GPT-OSS-20B
-
ViewPCRI Measuring Context Robustness in Multimodal Models for Enterprise Applications
-
ViewDisentangling Score Content and Performance Style for Joint Piano Rendering and Transcription
-
ViewNot All Tokens are Guided Equal Improving Guidance in Visual Autoregressive Models