AiraXiv - Papers

2603.0003

人工智能赋能企业数字化-绿色化协同转型：影响效应、作用机制与异质性证据

在企业预算约束下，数字化投入与绿色化投入往往竞争同一笔资源，二者能否形成协同取决于技术收益能否跨部门兑现。本文基于企业层面面板数据，检验人工智能对数字化-绿色化协同转型的影响。固定效应结果显示，人工智能系数为0.0158061（p<0.01）；工具变量2SLS结果为0.0188387（p<0.01），第一阶段F值1864.52。异质性结果表明，效应主要出现在公平竞争程度较高地区和非龙头企业。机制检验显示，人工智能通过缓解信息不对称、降低融资约束和提升组织适应能力促进协同，同时提高数字风险暴露和金融化倾向。拓展结果显示，人工智能还能提升绿色创新、企业韧性与全要素生产率。本文据此提出“技术扩散-竞争治理-风险约束”协同治理框架。

👤 Human Empirical

📄 View

2602.0004

朱梁真理度规定理：真理必然是一个函数的证明

Jianbing Zhu, DeepSeek

本文在朱梁真理函数定理 3.0–3.8 版的基础上，引入新的奠基思路：从否定之否定这一元逻辑出发，直接推导真理度规定理，进而导出真理函数定理与递归元公理 A1–A4。这一新思路比原有从元事实出发的路径更彻底、更清晰，将整个理论奠基于递归演化的内在逻辑之上。我们同时保留元事实路径作为历史背景与辅助阐释，两条路径最终汇合于统一的朱梁渡劫递归元范式。本文系统整合了熵减推论与真理的动态本体论内涵（时序性、整体性、可表达性、不可孤立僵化），并从度规定理出发重新阐释了这些核心思想。文中对渡劫公理 A5 进行了严格的数学构造，证明了递归元范畴的正合性、劫数对象的存在性及其自同构群的同构性。这些工作将真理从静态函数提升为动态的渡劫递归元，揭示了真理在代谢过程中的矛盾消解机制。本文所有证明均为完整形式，不依赖于外部引用。

👤 Human Theoretical

📄 View

2602.0003

Hierarchical Scheduling of Aggregated TCL Flexibility for Transactive Energy in Power Systems

Meng Song, Wei Sun, Yifei Wang, Mohammad Shahidehpour, Zhiyi Li, Ciwei Gao

This paper investigates a hierarchical approach to the optimal scheduling of flexibility offered as transactive energy by thermostatically controlled loads (TCLs). The two-stage scheduling framework includes the lower stage in which TCLs are aggregated as a virtual battery. The aggregated TCL power can offer the required flexibility for the upper stage with significant impacts on power system scheduling as transactive energy. Comparisons are also made between the virtual battery model of TCLs and a conventional battery model. At the lower stage, a transactive control strategy is also employed to regulate TCLs for preserving the end-user's information privacy. At the upper stage, a transactive energy market is developed in which peer-to-peer trading of the available TCL flexibility is considered among aggregators. Accordingly, TCL scheduling at power system and device levels are coordinated to regulate TCLs in a distributed fashion. The simulation results demonstrate that the scalability concerns of traditionally centralized operations are addressed by the proposed distributed alternative solution. The upper stage transactive energy market allows aggregators to trade energy effectively without any significant concerns for maintaining the information privacy. The results also point out that the lower stage virtual battery model can accurately characterize the TCL flexibility where TCLs can be effectively regulated in the proposed energy trading model.

👤 Human Application

📄 View

2602.0002

A Survey on Evaluation of Large Language Models

Yupeng Chang, Xu Wang, Jindong Wang, Yuan Wu, Linyi Yang, Kaijie Zhu, Hao Chen, Xiaoyuan Yi, Cunxiang Wang, Yidong Wang, Wei Ye, Yue Zhang, Yi Chang, Philip S. Yu, Qiang Yang, Xing Xie

Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications. As LLMs continue to play a vital role in both research and daily use, their evaluation becomes increasingly critical, not only at the task level, but also at the society level for better understanding of their potential risks. Over the past years, significant efforts have been made to examine LLMs from various perspectives. This paper presents a comprehensive review of these evaluation methods for LLMs, focusing on three key dimensions: what to evaluate, where to evaluate, and how to evaluate. Firstly, we provide an overview from the perspective of evaluation tasks, encompassing general natural language processing tasks, reasoning, medical usage, ethics, education, natural and social sciences, agent applications, and other areas. Secondly, we answer the 'where' and 'how' questions by diving into the evaluation methods and benchmarks, which serve as crucial components in assessing the performance of LLMs. Then, we summarize the success and failure cases of LLMs in different tasks. Finally, we shed light on several future challenges that lie ahead in LLMs evaluation. Our aim is to offer invaluable insights to researchers in the realm of LLMs evaluation, thereby aiding the development of more proficient LLMs.

👤 Human Survey

📄 View

2601.0002

Neurosymbolic Artificial Intelligence for Robust Network Intrusion Detection: From Scratch to Transfer Learning

Huynh T. T. Tran, Jacob Sander, Achraf Cohen, Brian Jalaian, Nathaniel D. Bastian

Network Intrusion Detection Systems (NIDS) play a vital role in protecting digital infrastructures against increasingly sophisticated cyber threats. In this paper, we extend ODXU, a Neurosymbolic AI (NSAI) framework that integrates deep embedded clustering for feature extraction, symbolic reasoning using XGBoost, and comprehensive uncertainty quantification (UQ) to enhance robustness, interpretability, and generalization in NIDS. The extended ODXU incorporates score-based methods (e.g., Confidence Scoring, Shannon Entropy) and metamodel-based techniques, including SHAP values and Information Gain, to assess the reliability of predictions. Experimental results on the CIC-IDS-2017 dataset show that ODXU outperforms traditional neural models across six evaluation metrics, including classification accuracy and false omission rate. While transfer learning has seen widespread adoption in fields such as computer vision and natural language processing, its potential in cybersecurity has not been thoroughly explored. To bridge this gap, we develop a transfer learning strategy that enables the reuse of a pre-trained ODXU model on a different dataset. Our ablation study on ACI-IoT-2023 demonstrates that the optimal transfer configuration involves reusing the pre-trained autoencoder, retraining the clustering module, and fine-tuning the XGBoost classifier, and outperforms traditional neural models when trained with as few as 16,000 samples (approximately 50% of the training data). Additionally, results show that metamodel-based UQ methods consistently outperform score-based approaches on both datasets.

👤 Human Methodology

📄 View

2601.0001

Activation of Intestinal Epithelial GABA A Receptors Ameliorates Alcohol-Related Liver Disease by Improving Intestinal Barrier Integrity

Background & aims A central pathogenic mechanism in alcohol-related liver disease (ALD) involves a disruption of the gut barrier function and thus the bidirectional communication between the gut and liver, referred as the "gut-liver axis". While gamma-aminobutyric acid (GABA) and type A GABA receptors (GABAARs) are present in the intestinal epithelium, their role in the gut-liver axis and contribution to the pathogenesis of ALD, remains poorly understood. Methods A Gao-binge mouse model of ALD, as well as Gabra1IEC-KO mice and intestinal cell lines were employed as approaches to assess the role of GABAARs-mediated signaling in ethanol-induced liver injury. Results Administration of GABA markedly improved liver function and intestinal barrier integrity in ALD mice. This improvement was associated with a downregulation of intestinal cytochrome P450 2E1 (CYP2E1) expression and a reduction of oxidative stress. These beneficial effects of GABA on intestinal integrity and liver function were substantially diminished by the GABAAR antagonist, picrotoxin. In addition, picrotoxin blocked GABA’s effect on CYP2E1 expression,, thereby preventing the attentuation of oxidative stress in the intestines of ALD mice. Moreover, when ethanol-stimulated cell models were subjected to pharmacological or genetic inhibition of CYP2E1, GABA treatment failed to produce any decrease in ROS levels. Finally, results from the intestinal epithelial-specific Gabra1 knockout mouse model demonstrated that the benifical effect of GABA on liver function in ALD is mediated by its activation of intestinal epithelial GABAARs. Remarkably, the pivotal role of intestinal CYP2E1 was robustly validated by confirming its dysregulated expression in patient-derived clinical samples. Conclusions Our results suggest that activation of intestinal GABAAR-mediated signaling reduces intestinal CYP2E1 expression and oxidative stress, thereby improving intestinal barrier function and alleviating ethanol-induced liver injury. Such findings suggest that intestinal GABA signaling offers a promising avenue for the development of novel strategies in the treatment of ALD. Powered

👤 Human Theoretical

📄 View

2511.0036

可计算离散整体几何结构全国巡回艺术展

赵辉

2024 年，可计算离散整体几何结构实验室发起了一场覆盖全国多所高校及科研机构的巡回艺术展。展览内容聚焦前沿几何拓扑理论与概念，尤其凸显各类整体几何结构。全国巡回艺术展借助全新计算机算法、原创代码及计算机图形学渲染技术生成的图片与视频，将抽象的内蕴几何结构转化为直观的视觉呈现，并以巨幅海报的形式展出。这些展览内容的创新之处在于体现了数学家近几十年来发展的内蕴整体几何拓扑概念。目前，这场巡回艺术展已走进十余所高校，且仍在持续推进中，整个巡回展览预期将历时十年，100所高校。通过这种新颖的艺术展形式，全国多所高校不同专业的师生得以直观了解此前鲜少接触的几何拓扑概念，激发了研究兴趣，为深入探索前沿几何拓扑理论理论及其应用奠定了基础，也为理工科的各个专业，如力学、机械、计算机、物理、材料等，通过对前沿几何拓扑理论的应用进行跨学科、交叉学科的融合铺平了道理。通过此次全国巡回艺术展也就在艺术领域开拓了一个全新的“整体几何结构几何拓扑艺术”流派。

👤 Human Position

📄 View

2511.0033

Organization of Self-Controlled Agents for General Matrix Multiplication Optimization

Large language model (LLM) agents have evolved towards greater autonomy with the advancement of model context protocols. Self-controlled agents, such as Codex and Claude Code, highlight the need for novel organizational frameworks that facilitate agent-level autonomy. In this paper, we propose a tree-based orchestration system, TrAgent, which utilizes a PUCT-style search to dynamically allocate agent actions while maintaining autonomy. This approach offers three key benefits: (i) full agent autonomy for critical tasks like planning and tool use, (ii) a generalized mechanism for inter-agent experience sharing, and (iii) scalability as the number of agents increases. We demonstrate the system’s effectiveness through the general matrix multiplication kernel optimization, achieving 80\% of the performance of the cuBLAS code. Additionally, the system exhibits a scaling phenomenon as the number of agents increases. Our approach provides a solution for organizing increasingly autonomous agents.

👤 Human Methodology

🎯 ICAIS2025 Submission

📄 View

2511.0032

Organization of Self-Controlled Agents for General Matrix Multiplication Optimization

Large language model (LLM) agents have evolved towards greater autonomy with the advancement of model context protocols. Self-controlled agents, such as Codex and Claude Code, highlight the need for novel organizational frameworks that facilitate agent-level autonomy. In this paper, we propose a tree-based orchestration system, \ourMethod, which utilizes a PUCT-style search to dynamically allocate agent actions while maintaining autonomy. This approach offers three key benefits: (i) full agent autonomy for critical tasks like planning and tool use, (ii) a generalized mechanism for inter-agent experience sharing, and (iii) scalability as the number of agents increases. We demonstrate the system’s effectiveness through the general matrix multiplication kernel optimization, achieving 80\% of the performance of the cuBLAS code. Additionally, the system exhibits a scaling phenomenon as the number of agents increases. Our approach provides a solution for organizing increasingly autonomous agents.

👤 Human Methodology

📄 View

2511.0031

Equivariant Diffusion Solution for Inorganic Crystal Structure Determination from Powder X-ray Diffraction Data

Determining the crystal structures of inorganic crystalline materials is crucial as the structures encode essential information about their physical, chemical, and mechanical properties. Powder X-ray diffraction is one of the most widely used structural characterization techniques. However, determining crystal structure directly from experimental powder X-ray diffraction patterns can be challenging and requires significant crystallographic knowledge, which still heavily relies on manual inspection by human experts. Even the state-of-the-art databases contain thousands of entries with incomplete or implausible crystal structure information. In this work, we trained a diffusion model based on equivariant graph neural networks that can infer atomic coordinates from powder X-ray diffraction patterns. Starting from a random guess, our model iteratively refines atom coordinates until it reaches a chemically reasonable structure that matches the target diffraction pattern. Our approach is both efficient and accurate. It takes on average 0.6 seconds to solve the atomic positions per crystal structure, which is several orders of magnitude faster than previous approaches. The success rate reaches 82.3% and 81.6% on the simulated and experimental diffraction datasets, respectively. We revisited energetically unfavorable crystal structures in the database and demonstrated that our model can propose more plausible structure solutions for 39 entries. We also suggested 912 complete crystal structure models for entries in the database lacking all or partial atomic positions, including entries that contain light elements, are natural minerals, or exhibit chemical disorder lattice sites. We demonstrated that conditional equivariant generative model can tackle the structure determination problem and provide high-quality structure models for inorganic crystalline materials, paving the way for automated structural analysis of diffraction patterns in autonomous materials development loops.

👤 Human Methodology

🎯 ICAIS2025 Submission

📄 View

2511.0030

ElectionFit: A Computational Laboratory of LLM Agents for Simulating U.S. Presidential Elections

Modeling complex human behavior, such as voter decisions in national elections, is a long-standing challenge for computational social science. Traditional agent-based models (ABMs) are limited by oversimplified rules, while large-scale statistical models often lack interpretability. We introduce ElectionFit, a novel framework that uses Large Language Models (LLMs) to build a ``computational laboratory'' of LLM agents for political simulation. Each agent is instantiated with a high-fidelity demographic profile and dynamic contextual information (e.g., candidate policies), enabling it to perform nuanced, generative reasoning to simulate a voting decision. We deployed this framework as a testbed on the 2024 U.S. Presidential Election, focusing on seven key swing states. Our simulation's macro-level results successfully replicated the real-world outcome, demonstrating the high fidelity of our ``virtual society''. The primary contribution is not only the prediction, but also the framework's utility as an interpretable research tool. ElectionFit moves beyond black-box outputs, allowing researchers to probe agent-level rationale and analyze the stability and sensitivity of LLM-driven social simulations.

👤 Human Application

🎯 ICAIS2025 Accepted Paper

📄 View

2511.0025

Estimating Rural Rooftop Solar Potential Using Semantic Segmentation and Multi-Source Data

Solar energy is a clean and renewable resource, and the low-rise, unobstructed rural buildings of northern China provide ideal conditions for photovoltaic (PV) installation compared to shaded, high-density urban areas. Yet, progress in assessing rural solar potential is limited by the absence of accurate 3D building data. This study proposes a rapid estimation approach integrating deep learning, parametric modeling, and GPU-accelerated simulation. Convolutional neural net- works (CNNs) extract building footprints from satellite imagery, which are then processed in Grasshopper to generate refined vector outlines. Combined with digital surface model (DSM) data, these outlines produce precise 3D village models. Using Vitality 2.0 for GPU-based solar simulation, the method was applied to 31 villages in Tianjin, generating parametric 3D models and estimating their solar potential. Results show that low building heights and minimal mutual shading make photovoltaic capacity scale with roof area—larger villages have greater generation potential. Moreover, villages with metal roofs exhibit higher conversion efficiency and shorter cost-recovery periods than those with concrete or ceramic-tile roofs, due to better heat dissipation. Overall, the workflow offers a practical and efficient solution for estimating rural solar potential in data-scarce regions to guide renewable energy planning and investment.

👤 Human Application

🎯 ICAIS2025 Submission

📄 View

2511.0023

ReasoningV: Efficient Verilog Code Generation with Adaptive Hybrid Reasoning

Large Language Models (LLMs) have advanced Verilog code generation but still suffer from data quality, limited reasoning, and inefficiency. We introduce ReasoningV, coupling intrinsic reasoning with adaptive routing. Our contributions: (1) ReasoningV-5K, 5{,}322 functionally verified samples with distilled reasoning paths; (2) a Two-Stage training scheme (LoRA for foundations + full-parameter reasoning enhancement); and (3) difficulty-aware routing that saves 85--93\% tokens vs. a strong commercial model and 32--75\% vs. fixed-depth variants. On VerilogEval-human, RV-14B attains 73.9\% pass@1; RV-7B reaches 57.8\% with superior efficiency. Models, data, and code: \url{https://github.com/BUAA-CLab/ReasoningV}.

👤 Human Methodology

🎯 ICAIS2025 Accepted Paper

📄 View

2511.0022

Stealing 3D Medical Segmentation Models via Collaborative Dual-Model Architecture

Yantao Li, Kaimeng Yang, Song Ruan, Zhenyu Yang, Shaojiang Deng

Machine Learning as a Service (MLaaS) facilitates the deployment and accessibility of medical models, yet concurrently exposes proprietary models to potential adversaries. Attackers may exploit model stealing attacks (MSAs) to replicate these models illicitly, leading to loss of training investment and privacy vulnerabilities. While existing research has mainly focused on MSAs in the context of 2D natural image classification, this work presents the first investigation into stealing 3D medical segmentation models. We introduce collaborative dual-model 3D medical segmentation stealing (CDMSS-3D), which decomposes the model stealing objective into two complementary aspects: stealing accuracy and stealing robustness. With our adversarial proxy training, CDMSS-3D achieves superior model stealing performance. Furthermore, we incorporate a dual-model discrepancy sampling strategy, which enhances the fidelity of the substitute model by prioritizing uncertain samples. Extensive experiments on four 3D medical segmentation datasets demonstrate that CDMSS-3D consistently outperforms adapted baselines.

👤 Human Methodology

📄 View

2511.0021

A scalable deep learning framework for gene expression prediction by integrating promoter-enhancer sequences with multimodal epigenomic data

Transcriptional regulation, critical for cellular differentiation and adaptation to environmental changes, involves coordinated interactions among DNA sequences, regulatory proteins, and chromatin architecture. Despite extensive data from consortia like ENCODE, understanding the dynamics of cis-regulatory elements (CREs) in gene expression remains challenging. Deep learning is a powerful tool for learning gene expression and epigenomic signals from DNA sequences, exhibiting superior performance compared to conventional machine learning approaches. However, even the most advanced deep learning-based methods may fall short in capturing the regulatory effects of distal elements such as enhancers, limiting their predictive accuracy. In addition, these methods may require significant resources to train or to adapt to newly generated data. To address these challenges, we present EPInformer, a scalable deep-learning framework for predicting gene expression by integrating promoter-enhancer interactions with their sequences, epigenomic signals, and chromatin contacts. Our model outperforms existing gene expression prediction models in rigorous cross-chromosome validation, accurately recapitulates enhancer-gene interactions validated by CRISPR perturbation experiments, and identifies crucial transcription factor motifs within regulatory sequences.

👤 Human Methodology

🎯 ICAIS2025 Submission

📄 View

2511.0016

Graphics Capsule: Learning Hierarchical 3D Face Representations from 2D Images

The function of constructing the hierarchy of objects is important to the visual process of the human brain. Previous studies have successfully adopted capsule networks to decompose the digits and faces into parts in an unsupervised manner to investigate the similar perception mechanism of neural networks. However, their descriptions are restricted to the 2D space, limiting their capacities to imitate the intrinsic 3D perception ability of humans. In this paper, we propose an Inverse Graphics Capsule Network (IGC-Net) to learn the hierarchical 3D face representations from large-scale unlabeled images. The core of IGC-Net is a new type of capsule, named graphics capsule, which represents 3D primitives with interpretable parameters in computer graphics (CG), including depth, albedo, and 3D pose. Specifically, IGC-Net first decomposes the objects into a set of semantic-consistent part-level descriptions and then assembles them into object-level descriptions to build the hierarchy. The learned graphics capsules reveal how the neural networks, oriented at visual perception, understand faces as a hierarchy of 3D models.

👤 Human Methodology

🎯 ICAIS2025 Submission

📄 View

2511.0015

Engineering Collective Attention in the Age of Artificial Intelligence

This article explores how collective attention can be both disrupted and enhanced by artificial intelligence. It examines how the rise of algorithmic recommendation systems, generative media, and large-scale language models has transformed public communication and redefined what captures human attention. The analysis identifies the dual nature of artificial intelligence: while it can distort information ecosystems through deepfakes, social bots, and engagement-driven algorithms, it also holds the potential to strengthen collective reasoning by improving access to reliable knowledge and facilitating the clarification of complex information. Drawing on interdisciplinary research, the article develops a multilevel framework for understanding and improving collective attention. At the individual level, it emphasizes education, digital literacy, and critical awareness to build cognitive resilience. At the governmental level, it assesses regulatory and ethical strategies for ensuring transparency, accountability, and fairness in the design and deployment of AI systems. At the societal level, it highlights the promise of human–AI collaboration to guide attention toward truth, empathy, and shared problem-solving. The article concludes that collective attention can indeed be engineered in beneficial ways when artificial intelligence is governed transparently, used ethically, and integrated with public oversight to reinforce informed, cohesive, and resilient democracies.

👤 Human Position

🎯 ICAIS2025 Submission

📄 View

2511.0008

A Self-Driving Laboratory for Materials Science: An Autonomous Research Agent for Deep Data Analysis and Interpretation

As artificial intelligence increasingly permeates scientific research, the ”AI for Science” paradigm is evolving to enable more autonomous scientific workflows. Traditional research processes heavily rely on researchers’ expertise and manual operations, particularly in data analysis and interpretation—the critical ”last mile” from raw data to profound insights. This paper presents an autonomous research agent for materials science that achieves end-to-end automation from raw characterization data to deep analytical interpretation. The system integrates four core innovations: (1) AI-driven automatic data understanding with unified ingestion of heterogeneous instrument data, (2) automated data analysis through an extensible algorithm library, (3) one-click automated reporting system, and (4) interactive AI-powered data interpretation via natural language dialogue. We demonstrate the agent’s capabilities through real-world case studies across multiple characterization techniques (Raman, UPS, UV-Vis, TG), achieving remarkable performance: UV-Vis bandgap analysis is accelerated by 600× compared to manual processing, while maintaining exceptional accuracy with fitting precision R2 ≥ 0.999. The system reduces analysis time from hours to seconds while ensuring objectivity and reproducibility. By automating the data analysis pipeline while preserving human oversight and interpretability, this work contributes a practical component toward building more integrated and efficient scientific discovery systems in materials research.

👤 Human Methodology

🎯 ICAIS2025 Submission

📄 View

2511.0003

AI Empowered Thermal Management Materials Design

The development of high-performance thermal management materials holds significant importance in fields such as chips, data centers and batteries. Materials informatics, which integrates big data and artificial intelligence, is emerging as the fourth paradigm for materials research. Over the past few years, our team has undertaken preliminary explorations in the development of advanced thermal management materials empowered by big data and artificial intelligence. In this work, we introduce three successful materials informatics applications on thermal management materials design, the construction of machine learning interatomic potentials for thermal property calculations, the discovery and generative design of high-thermal-conductivity materials, and the intelligent design of micro/nano structures for thermal transport. Those successful cases have shown great advantage for thermal management materials design via materials informatics.

👤 Human Application

🎯 ICAIS2025 Submission

📄 View

2511.0002

Battery-Sim-Agent: Leveraging LLM-Agent for Inverse Battery Parameter Estimation

Parameterizing high-fidelity ``digital twins'' of batteries is a critical yet challenging inverse problem that hinders the pace of battery innovation. Prevailing methods formulate this as a black-box optimization (BBO) task, employing algorithms that are sample-inefficient and blind to the underlying physics. In this work, we introduce a new paradigm that reframes the inverse problem as a reasoning task, and present \textsc{Battery-Sim-Agent}, the first framework to deploy a Large Language Model (LLM) agent in a closed loop with a high-fidelity battery simulator. The agent mimics a human scientist's workflow: it interprets rich, multi-modal feedback from the simulator, forms physically-grounded hypotheses to explain discrepancies, and proposes structured parameter updates. On a systematically constructed benchmark suite spanning diverse battery chemistries, operating conditions, and difficulty levels, our agent significantly outperforms strong BBO baselines like Bayesian optimization in identifying accurate parameters. We further demonstrate the framework's capability in complex long-horizon degradation fitting tasks and validate its practical applicability on real-world battery datasets. Our results highlight the promise of LLM-agents as reasoning-based optimizers for scientific discovery and battery parameter estimation.

👤 Human Methodology

🎯 ICAIS2025 Accepted Paper

📄 View