Papers
🌟 arXiv Spotlight-
ViewWhen Judgment Becomes Noise How Design Failures in LLM Judge Benchmarks Silently Undermine Validit
-
ViewPGCLODA Prompt-Guided Graph Contrastive Learning for Oligopeptide-Infectious Disease Association P
-
ViewFeeding Two Birds or Favoring One Adequacy-Fluency Tradeoffs in Evaluation and Meta-Evaluation of
-
ViewInvestigating Security Implications of Automatically Generated Code on the Software Supply Chain
-
ViewScan-do Attitude Towards Autonomous CT Protocol Management using a Large Language Model Agent
-
ViewAnchDrive Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving
-
ViewA HyperGraphMamba-Based Multichannel Adaptive Model for ncRNA Classification
-
ViewImageNet-trained CNNs are not biased towards texture Revisiting feature reliance through controlle
-
ViewBeyond Sharp Minima Robust LLM Unlearning via Feedback-Guided Multi-Point Optimization
-
ViewMultimodal Representation-disentangled Information Bottleneck for Multimodal Recommendation
-
ViewDesign Insights and Comparative Evaluation of a Hardware-Based Cooperative Perception Architecture
-
ViewThe Cream Rises to the Top Efficient Reranking Method for Verilog Code Generation
-
ViewQ-Palette Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment
-
ViewLow-Resource English-Tigrinya MT Leveraging Multilingual Models, Custom Tokenizers, and Clean Eval
-
ViewPlay by the Type Rules Inferring Constraints for LLM Functions in Declarative Programs
-
ViewSTAF Leveraging LLMs for Automated Attack Tree-Based Security Test Generation
-
ViewAn Improved Time Series Anomaly Detection by Applying Structural Similarity
-
ViewAutomated Multi-Agent Workflows for RTL Design
-
ViewFederation of Agents A Semantics-Aware Communication Fabric for Large-Scale Agentic AI
-
ViewCyberSOCEval Benchmarking LLMs Capabilities for Malware Analysis and Threat Intelligence Reasoning