Papers
🌟 arXiv Spotlight-
ViewEvaluating LLM-Generated Versus Human-Authored Responses in Role-Play Dialogues
-
ViewEngiBench A Benchmark for Evaluating Large Language Models on Engineering Problem Solving
-
ViewTurk-LettuceDetect A Hallucination Detection Models for Turkish RAG Applications
-
ViewMechanistic Interpretability with SAEs Probing Religion, Violence, and Geography in Large Language
-
ViewSD-VLM Spatial Measuring and Understanding with Depth-Encoded Vision-Language Models
-
ViewVideoArtGS Building Digital Twins of Articulated Objects from Monocular Video
-
ViewAuditoryBench++ Can Language Models Understand Auditory Knowledge without Hearing
-
ViewMSCoRe A Benchmark for Multi-Stage Collaborative Reasoning in LLM Agents
-
ViewSeqBattNet A Discrete-State Physics-Informed Neural Network with Aging Adaptation for Battery Mode
-
ViewAutiHero Leveraging Generative AI in Social Narratives to Engage Parents in Story-Driven Behaviora
-
ViewTable2LaTeX-RL High-Fidelity LaTeX Code Generation from Table Images via Reinforced Multimodal Lan
-
ViewInterpreting Attention Heads for Image-to-Text Information Flow in Large Vision-Language Models
-
ViewLIMI Less is More for Agency
-
ViewMRN Harnessing 2D Vision Foundation Models for Diagnosing Parkinson's Disease with Limited 3D MR D
-
ViewAn Empirical Study on the Robustness of YOLO Models for Underwater Object Detection
-
ViewMontePrep Monte-Carlo-Driven Automatic Data Preparation without Target Data Instances
-
ViewCan LLMs Reason Over Non-Text Modalities in a Training-Free Manner A Case Study with In-Context Re
-
ViewIs It Certainly a Deepfake Reliability Analysis in Detection & Generation Ecosystem
-
ViewA Multimodal Conversational Assistant for the Characterization of Agricultural Plots from Geospati
-
ViewEvaluating the Energy Efficiency of NPU-Accelerated Machine Learning Inference on Embedded Microco