Papers
🌟 arXiv Spotlight
- RoMedQA: The First Benchmark for Romanian Medical Question Answering
- GLARE: Agentic Reasoning for Legal Judgment Prediction
- MizanQA: Benchmarking Large Language Models on Moroccan Legal Question Answering
- Causal Beam Selection for Reliable Initial Access in AI-driven Beam Management
- Confusion is the Final Barrier: Rethinking Jailbreak Evaluation and Investigating the Real Misuse T…
- Uppaal Coshy: Automatic Synthesis of Compact Shields for Hybrid Systems
- Unsupervised Online Detection of Pipe Blockages and Leakages in Water Distribution Networks
- Vevo2: Bridging Controllable Speech and Singing Voice Generation via Unified Prosody Learning
- LLMSymGuard: A Symbolic Safety Guardrail Framework Leveraging Interpretable Jailbreak Concepts
- Retrieval Enhanced Feedback via In-context Neural Error-book
- Exploiting Information Redundancy in Attention Maps for Extreme Quantization of Vision Transformer…
- Do What? Teaching Vision-Language-Action Models to Reject the Impossible
- AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
- The next question after Turing's question: Introducing the Grow-AI test
- Representation Learning of Auxiliary Concepts for Improved Student Modeling and Exercise Recommend…
- From Confidence to Collapse in LLM Factual Robustness
- MCPVerse: An Expansive, Real-World Benchmark for Agentic Tool Use
- A Reduction of Input/Output Logics to SAT
- A XAI-based Framework for Frequency Subband Characterization of Cough Spectrograms in Chronic Resp…
- FlexMUSE: Multimodal Unification and Semantics Enhancement Framework with Flexible interaction for…