Skip to main content

Showing 101–150 of 2,444 results for author: Zheng, H

.
  1. arXiv:2503.01122  [pdf, other

    cs.CV

    ACCORD: Alleviating Concept Coupling through Dependence Regularization for Text-to-Image Diffusion Personalization

    Authors: Shizhan Liu, Hao Zheng, Hang Yu, Jianguo Li

    Abstract: Image personalization has garnered attention for its ability to customize Text-to-Image generation using only a few reference images. However, a key challenge in image personalization is the issue of conceptual coupling, where the limited number of reference images leads the model to form unwanted associations between the personalization target and other concepts. Current methods attempt to tackle… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

  2. arXiv:2503.00968  [pdf, other

    physics.ins-det hep-ex

    Simulation of the Background from $^{13}$C$(α, n)^{16}$O Reaction in the JUNO Scintillator

    Authors: JUNO Collaboration, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Costas Andreopoulos, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Beretta, Antonio Bergnoli, Nikita Bessonov, Daniel Bick, Lukas Bieger, Svetlana Biktemerova , et al. (608 additional authors not shown)

    Abstract: Large-scale organic liquid scintillator detectors are highly efficient in the detection of MeV-scale electron antineutrinos. These signal events can be detected through inverse beta decay on protons, which produce a positron accompanied by a neutron. A noteworthy background for antineutrinos coming from nuclear power reactors and from the depths of the Earth (geoneutrinos) is generated by ($α, n$)… ▽ More

    Submitted 2 May, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

    Comments: 25 pages, 14 figures, 4 tables

  3. arXiv:2502.20821  [pdf, other

    hep-ex

    Improved measurement of absolute branching fraction of the inclusive decay $Λ_{c}^{+} \to K_{S}^{0} X$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (679 additional authors not shown)

    Abstract: By analyzing $4.5$ fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated with the BESIII detector at center-of-mass energies ranging from $4599.53$ MeV to $4698.82$ MeV, we report the measurement of the absolute branching fraction (BF) of the inclusive decay $Λ_{c}^{+} \to K_{S}^{0} X$ using the double-tag technique. The result is $\mathcal{B}(Λ_{c}^{+} \to K_{S}^{0} X)=(10.9\pm0.2\pm0.1)\%$, where… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

  4. arXiv:2502.20587  [pdf, other

    cs.LG

    Cache-of-Thought: Master-Apprentice Framework for Cost-Effective Vision Language Model Inference

    Authors: Mingyuan Wu, Jize Jiang, Haozhen Zheng, Meitang Li, Zhaoheng Li, Beitong Tian, Bo Chen, Yongjoo Park, Minjia Zhang, Chengxiang Zhai, Klara Nahrstedt

    Abstract: Vision Language Models (VLMs) have achieved remarkable success in a wide range of vision applications of increasing complexity and scales, yet choosing the right VLM model size involves a trade-off between response quality and cost. While smaller VLMs are cheaper to run, they typically produce responses only marginally better than random guessing on benchmarks such as MMMU. In this paper, we pro… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: Mingyuan, Jize, and Haozhen contributed equally, while Minjia, Chengxiang, and Klara advised equally

  5. arXiv:2502.19850  [pdf, other

    hep-ex

    Precision measurement of the branching fraction for the decay $ψ(2S)\rightarrowτ^{+}τ^{-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (691 additional authors not shown)

    Abstract: Using $(2259.3 \pm 11.1)\times10^{6}$ $ψ(2S)$ events acquired with the BESIII detector, the branching fraction of $ψ(2S)\rightarrowτ^{+}τ^{-}$ is measured with improved precision to be $\mathcal{B}_{ψ(2S)\rightarrowτ^{+}τ^{-}}=(3.240~\pm~0.023~\pm~0.081)\times 10^{-3}$, where the first and second uncertainties are statistical and systematic, respectively, which is consistent with the world average… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: 10 page, 5 figures

  6. arXiv:2502.19247  [pdf, other

    cs.CV

    ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding

    Authors: Qihang Peng, Henry Zheng, Gao Huang

    Abstract: Embodied intelligence requires agents to interact with 3D environments in real time based on language instructions. A foundational task in this domain is ego-centric 3D visual grounding. However, the point clouds rendered from RGB-D images retain a large amount of redundant background data and inherent noise, both of which can interfere with the manifold structure of the target regions. Existing p… ▽ More

    Submitted 27 February, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: 12 pages, 3 figures. Accepted by CVPR2025

  7. arXiv:2502.19070  [pdf, other

    cs.LG cs.CR

    A Sample-Level Evaluation and Generative Framework for Model Inversion Attacks

    Authors: Haoyang Li, Li Bai, Qingqing Ye, Haibo Hu, Yaxin Xiao, Huadi Zheng, Jianliang Xu

    Abstract: Model Inversion (MI) attacks, which reconstruct the training dataset of neural networks, pose significant privacy concerns in machine learning. Recent MI attacks have managed to reconstruct realistic label-level private data, such as the general appearance of a target person from all training images labeled on him. Beyond label-level privacy, in this paper we show sample-level privacy, the private… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: Accepted to be appeared in 39th Annual AAAI Conference on Artificial Intelligence (AAAI-25)

  8. arXiv:2502.18731  [pdf, other

    astro-ph.IM astro-ph.GA astro-ph.SR

    A Challenge of Developing a Classifier for Multi-Band Classification of Variable Stars

    Authors: Xiao-Hui Xu, Qing-Feng Zhu, Xu-Zhi Li, Hang Zheng, Jin-Sheng Qiu

    Abstract: Variable stars play a very important role in our understanding of the Milky Way and the universe. In recent years, many survey projects have generated a large amount of photometric data, necessitating classifiers that can quickly identify various types of variable stars. However, obtaining these classifiers often requires substantial manpower and computational resources. To conserve these resource… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: 19 pages, 3 figures

  9. arXiv:2502.17494  [pdf, other

    cs.IR cs.AI cs.LG

    External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation

    Authors: Mingfu Liang, Xi Liu, Rong Jin, Boyang Liu, Qiuling Suo, Qinghai Zhou, Song Zhou, Laming Chen, Hua Zheng, Zhiyuan Li, Shali Jiang, Jiyan Yang, Xiaozhen Xia, Fan Yang, Yasmine Badr, Ellie Wen, Shuyu Xu, Hansey Chen, Zhengyu Zhang, Jade Nie, Chunzhi Yang, Zhichen Zeng, Weilin Zhang, Xingliang Huang, Qianru Li , et al. (80 additional authors not shown)

    Abstract: Ads recommendation is a prominent service of online advertising systems and has been actively studied. Recent studies indicate that scaling-up and advanced design of the recommendation model can bring significant performance improvement. However, with a larger model scale, such prior studies have a significantly increasing gap from industry as they often neglect two fundamental challenges in indus… ▽ More

    Submitted 23 April, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

    Comments: Accepted by the ACM Web Conference (WWW) 2025 Industrial Track as Oral Presentation

  10. arXiv:2502.17157  [pdf, other

    cs.CV

    DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

    Authors: Canyu Zhao, Mingyu Liu, Huanyi Zheng, Muzhi Zhu, Zhiyue Zhao, Hao Chen, Tong He, Chunhua Shen

    Abstract: Our primary goal here is to create a good, generalist perception model that can tackle multiple tasks, within limits on computational resources and training data. To achieve this, we resort to text-to-image diffusion models pre-trained on billions of images. Our exhaustive evaluation metrics demonstrate that DICEPTION effectively tackles multiple perception tasks, achieving performance on par with… ▽ More

    Submitted 24 February, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: 29 pages, 20 figures. Homepage: https://aim-uofa.github.io/Diception, Huggingface Demo: https://huggingface.co/spaces/Canyu/Diception-Demo

  11. arXiv:2502.17039  [pdf, other

    cs.CV

    LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR

    Authors: Xinxin Feng, Haoran Sun, Haifeng Zheng

    Abstract: Vehicle-to-Infrastructure (V2I) collaborative perception leverages data collected by infrastructure's sensors to enhance vehicle perceptual capabilities. LiDAR, as a commonly used sensor in cooperative perception, is widely equipped in intelligent vehicles and infrastructure. However, its superior performance comes with a correspondingly high cost. To achieve low-cost V2I, reducing the cost of LiD… ▽ More

    Submitted 4 March, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

  12. arXiv:2502.16982  [pdf, other

    cs.LG cs.AI cs.CL

    Muon is Scalable for LLM Training

    Authors: Jingyuan Liu, Jianlin Su, Xingcheng Yao, Zhejun Jiang, Guokun Lai, Yulun Du, Yidao Qin, Weixin Xu, Enzhe Lu, Junjie Yan, Yanru Chen, Huabin Zheng, Yibo Liu, Shaowei Liu, Bohong Yin, Weiran He, Han Zhu, Yuzhi Wang, Jianzhou Wang, Mengnan Dong, Zheng Zhang, Yongsheng Kang, Hao Zhang, Xinran Xu, Yutao Zhang , et al. (3 additional authors not shown)

    Abstract: Recently, the Muon optimizer based on matrix orthogonalization has demonstrated strong results in training small-scale language models, but the scalability to larger models has not been proven. We identify two crucial techniques for scaling up Muon: (1) adding weight decay and (2) carefully adjusting the per-parameter update scale. These techniques allow Muon to work out-of-the-box on large-scale… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  13. arXiv:2502.16822  [pdf, ps, other

    hep-ph nucl-th

    Deciphering the $G$(3900) quantum numbers $J^{PC}$ in $e^+e^-$ collisions

    Authors: Jian Cao, Wen-Chao Zhang, Zhi-Lei She, An-Ke Lei, Jin-Peng Zhang, Hua Zheng, Dai-Mei Zhou, Yu-Liang Yan, Zhong-Qi Wang, Ben-Hao Sa

    Abstract: Inspired by the BESIII observation of exotic hadron $G$(3900) [PRL 133(2024)081901], we use the parton and hadron cascade model PACIAE to simulate the $G$(3900) production in $e^+e^-$ annihilations at $\sqrt s$=4.95 GeV. The $G$(3900) candidates are recombined by Dynamically Constrained Phase-space Coalescence model using component mesons of $D\bar D$ or $D\bar D^*/\bar DD^*$ in the PACIAE simulat… ▽ More

    Submitted 12 March, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

    Comments: five pages, 1 table, 2 figures, minor revision has been done for the manuscript before submission

  14. arXiv:2502.16484  [pdf

    cs.CL

    A Fine-Tuning Approach for T5 Using Knowledge Graphs to Address Complex Tasks

    Authors: Xiaoxuan Liao, Binrong Zhu, Jacky He, Guiran Liu, Hongye Zheng, Jia Gao

    Abstract: With the development of deep learning technology, large language models have achieved remarkable results in many natural language processing tasks. However, these models still have certain limitations in handling complex reasoning tasks and understanding rich background knowledge. To solve this problem, this study proposed a T5 model fine-tuning method based on knowledge graphs, which enhances the… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  15. arXiv:2502.16084  [pdf, other

    hep-ex

    Single Inclusive $π^\pm$ and $K^\pm$ Production in $e^+e^-$ Annihilation at center-of-mass Energies from 2.000 to 3.671GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (707 additional authors not shown)

    Abstract: Using data samples with a total integrated luminosity of 253 $\rm pb^{-1}$ collected by the BESIII detector operating at the BEPCII collider, the differential cross-sections of inclusive $π^\pm$ and $K^\pm$ production, as a function of momentum and normalized by the total hadronic cross-section, are measured at center-of-mass energies from 2.000 to 3.671 GeV. The measured $π^{\pm}$ cross sections… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

  16. arXiv:2502.15997  [pdf, ps, other

    quant-ph

    Atoms, Worldlines, and the Failure of the Scalar Approximation

    Authors: He Zheng, Daniel A. Steck

    Abstract: The worldline path-integral method, developed thus far for scalar fields, offers promising computational efficiency in general geometries, However, it relies so far on the scalar approximation that decomposes electromagnetic waves into two independent polarizations. In this work, we investigate different theoretical frameworks of fluctuation-induced effects and analyze the limitations of the world… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 19 pages

  17. arXiv:2502.15770  [pdf, other

    cs.SE cs.AI

    Performance Review on LLM for solving leetcode problems

    Authors: Lun Wang, Chuanqi Shi, Shaoshui Du, Yiyi Tao, Yixian Shen, Hang Zheng, Yanxin Shen, Xinyu Qiu

    Abstract: This paper presents a comprehensive performance evaluation of Large Language Models (LLMs) in solving programming challenges from Leetcode, a widely used platform for algorithm practice and technical interviews. We began by crawling the Leetcode website to collect a diverse set of problems encompassing various difficulty levels and topics. Using this dataset, we generated solutions with multiple L… ▽ More

    Submitted 2 March, 2025; v1 submitted 16 February, 2025; originally announced February 2025.

  18. arXiv:2502.15261  [pdf, other

    cs.CL cs.AI

    Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction

    Authors: Jingheng Ye, Shang Qin, Yinghui Li, Hai-Tao Zheng, Shen Wang, Qingsong Wen

    Abstract: Grammatical Error Correction (GEC) faces a critical challenge concerning explainability, notably when GEC systems are designed for language learners. Existing research predominantly focuses on explaining grammatical errors extracted in advance, thus neglecting the relationship between explanations and corrections. To address this gap, we introduce EXGEC, a unified explainable GEC framework that in… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 19 pages, 2 figures, and 9 tables

  19. arXiv:2502.13540  [pdf, other

    hep-ex

    Amplitude analysis of $ψ(3686)\to γK_S^0 K_S^0 $

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (704 additional authors not shown)

    Abstract: Using $(2712\pm14)\times10^6$ $ψ(3686)$ events collected with the BESIII detector, we perform the first amplitude analysis of the radiative decay $ψ(3686)\to γK_S^0 K_S^0$ within the mass region $M_{K_S^0 K_S^0 }<2.8$ GeV/$c^2$. Employing a one-channel K-matrix approach for the description of the dynamics of the $K^0_S K^0_S$ system, the data sample is well described with four poles for the $f_0$-… ▽ More

    Submitted 7 May, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: 20 pages, 4 figures, submitted to JHEP

  20. arXiv:2502.13189  [pdf, other

    cs.LG cs.AI cs.CL

    MoBA: Mixture of Block Attention for Long-Context LLMs

    Authors: Enzhe Lu, Zhejun Jiang, Jingyuan Liu, Yulun Du, Tao Jiang, Chao Hong, Shaowei Liu, Weiran He, Enming Yuan, Yuzhi Wang, Zhiqi Huang, Huan Yuan, Suting Xu, Xinran Xu, Guokun Lai, Yanru Chen, Huabin Zheng, Junjie Yan, Jianlin Su, Yuxin Wu, Neo Y. Zhang, Zhilin Yang, Xinyu Zhou, Mingxing Zhang, Jiezhong Qiu

    Abstract: Scaling the effective context length is essential for advancing large language models (LLMs) toward artificial general intelligence (AGI). However, the quadratic increase in computational complexity inherent in traditional attention mechanisms presents a prohibitive overhead. Existing approaches either impose strongly biased structures, such as sink or window attention which are task-specific, or… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 15 pages

  21. arXiv:2502.12751  [pdf, other

    cs.LG

    Architect of the Bits World: Masked Autoregressive Modeling for Circuit Generation Guided by Truth Table

    Authors: Haoyuan Wu, Haisheng Zheng, Shoubo Hu, Zhuolun He, Bei Yu

    Abstract: Logic synthesis, a critical stage in electronic design automation (EDA), optimizes gate-level circuits to minimize power consumption and area occupancy in integrated circuits (ICs). Traditional logic synthesis tools rely on human-designed heuristics, often yielding suboptimal results. Although differentiable architecture search (DAS) has shown promise in generating circuits from truth tables, it f… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  22. arXiv:2502.12732  [pdf, other

    cs.LG

    Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment

    Authors: Haoyuan Wu, Haisheng Zheng, Yuan Pu, Bei Yu

    Abstract: Understanding the structure and function of circuits is crucial for electronic design automation (EDA). Circuits can be formulated as And-Inverter graphs (AIGs), enabling efficient implementation of representation learning through graph neural networks (GNNs). Masked modeling paradigms have been proven effective in graph representation learning. However, masking augmentation to original circuits w… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  23. arXiv:2502.12502  [pdf, other

    cs.CL

    Efficient OpAmp Adaptation for Zoom Attention to Golden Contexts

    Authors: Haoyuan Wu, Rui Ming, Haisheng Zheng, Zhuolun He, Bei Yu

    Abstract: Large language models (LLMs) have shown significant promise in question-answering (QA) tasks, particularly in retrieval-augmented generation (RAG) scenarios and long-context applications. However, their performance is hindered by noisy reference documents, which often distract from essential information. Despite fine-tuning efforts, Transformer-based architectures struggle to prioritize relevant c… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  24. arXiv:2502.11890  [pdf, other

    cs.CL

    Revisiting Classification Taxonomy for Grammatical Errors

    Authors: Deqing Zou, Jingheng Ye, Yulu Liu, Yu Wu, Zishan Xu, Yinghui Li, Hai-Tao Zheng, Bingxu An, Zhao Wei, Yong Xu

    Abstract: Grammatical error classification plays a crucial role in language learning systems, but existing classification taxonomies often lack rigorous validation, leading to inconsistencies and unreliable feedback. In this paper, we revisit previous classification taxonomies for grammatical errors by introducing a systematic and qualitative evaluation framework. Our approach examines four aspects of a tax… ▽ More

    Submitted 17 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: 26 pages, 4 figures and 5 tables

  25. arXiv:2502.11811  [pdf, other

    cs.CL

    FineFilter: A Fine-grained Noise Filtering Mechanism for Retrieval-Augmented Large Language Models

    Authors: Qianchi Zhang, Hainan Zhang, Liang Pang, Hongwei Zheng, Yongxin Tong, Zhiming Zheng

    Abstract: Retrieved documents containing noise will hinder Retrieval-Augmented Generation (RAG) from detecting answer clues, necessitating noise filtering mechanisms to enhance accuracy. Existing methods use re-ranking or summarization to identify the most relevant sentences, but directly and accurately locating answer clues from these large-scale and complex documents remains challenging. Unlike these docu… ▽ More

    Submitted 17 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  26. arXiv:2502.11493  [pdf, other

    cs.CL

    DAST: Context-Aware Compression in LLMs via Dynamic Allocation of Soft Tokens

    Authors: Shaoshen Chen, Yangning Li, Zishan Xu, Yinghui Li, Xin Su, Zifei Shan, Hai-tao Zheng

    Abstract: Large Language Models (LLMs) face computational inefficiencies and redundant processing when handling long context inputs, prompting a focus on compression techniques. While existing semantic vector-based compression methods achieve promising performance, these methods fail to account for the intrinsic information density variations between context chunks, instead allocating soft tokens uniformly… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  27. arXiv:2502.11047  [pdf, ps, other

    hep-ex

    Search for the Cabibbo-suppressed decays $Λ_c^{+}\toΣ^0K^{+}π^{0}$ and $Λ_c^{+}\toΣ^0K^{+}π^{+}π^{-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (687 additional authors not shown)

    Abstract: Utilizing 4.5 $fb^-$ of $e^+e^-$ annihilation data collected at center-of-mass energies ranging from 4599.53 MeV to 4698.82 MeV by the BESIII detector at the BEPCII collider, we search for the singly Cabibbo-suppressed hadronic decays $Λ_{c}^{+}\toΣ^{0} K^{+}π^{0}$ and $Λ_{c}^{+}\toΣ^{0}K^{+}π^+π^-$ with a single-tag method. No significant signals are observed for both decays. The upper limits on… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

    Comments: 12 pages, 6 figures

  28. arXiv:2502.10891  [pdf, other

    cs.NI

    AquaScope: Reliable Underwater Image Transmission on Mobile Devices

    Authors: Beitong Tian, Lingzhi Zhao, Bo Chen, Mingyuan Wu, Haozhen Zheng, Deepak Vasisht, Francis Y. Yan, Klara Nahrstedt

    Abstract: Underwater communication is essential for both recreational and scientific activities, such as scuba diving. However, existing methods remain highly constrained by environmental challenges and often require specialized hardware, driving research into more accessible underwater communication solutions. While recent acoustic-based communication systems support text messaging on mobile devices, their… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

    Comments: 15 pages, 26 figures

  29. arXiv:2502.10857  [pdf, other

    cs.CL

    Divergent Thoughts toward One Goal: LLM-based Multi-Agent Collaboration System for Electronic Design Automation

    Authors: Haoyuan Wu, Haisheng Zheng, Zhuolun He, Bei Yu

    Abstract: Recently, with the development of tool-calling capabilities in large language models (LLMs), these models have demonstrated significant potential for automating electronic design automation (EDA) flows by interacting with EDA tool APIs via EDA scripts. However, considering the limited understanding of EDA tools, LLMs face challenges in practical scenarios where diverse interfaces of EDA tools exis… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

  30. arXiv:2502.10454  [pdf, other

    cs.LG cs.AI cs.CL

    One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

    Authors: Yinghui Li, Jiayi Kuang, Haojing Huang, Zhikun Xu, Xinnian Liang, Yi Yu, Wenlian Lu, Yangning Li, Xiaoyu Tan, Chao Qu, Ying Shen, Hai-Tao Zheng, Philip S. Yu

    Abstract: Leveraging mathematical Large Language Models (LLMs) for proof generation is a fundamental topic in LLMs research. We argue that the ability of current LLMs to prove statements largely depends on whether they have encountered the relevant proof process during training. This reliance limits their deeper understanding of mathematical theorems and related concepts. Inspired by the pedagogical method… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  31. arXiv:2502.09086  [pdf

    cs.CL

    A Hybrid Model for Few-Shot Text Classification Using Transfer and Meta-Learning

    Authors: Jia Gao, Shuangquan Lyu, Guiran Liu, Binrong Zhu, Hongye Zheng, Xiaoxuan Liao

    Abstract: With the continuous development of natural language processing (NLP) technology, text classification tasks have been widely used in multiple application fields. However, obtaining labeled data is often expensive and difficult, especially in few-shot learning scenarios. To solve this problem, this paper proposes a few-shot text classification model based on transfer learning and meta-learning. The… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  32. arXiv:2502.08929  [pdf, ps, other

    hep-ex

    Precise Measurement of the $χ_{c0}$ Resonance Parameters and Branching Fractions of $χ_{c0,c2}\toπ^+π^-/K^+K^-$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (648 additional authors not shown)

    Abstract: By analyzing a $ψ(3686)$ data sample containing $(107.7\pm0.6)\times10^{6}$ events taken with the BESIII detector at the BEPCII storage ring in 2009, the $χ_{c0}$ resonance parameters are precisely measured using $χ_{c0,c2} \to π^+π^-/K^+K^-$ events. The mass of $χ_{c0}$ is determined to be $M(χ_{c0})=(3415.67\pm0.07\pm0.06\pm0.07$)~MeV/$c^2$, and its full width is… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: 9 pages, 1 figure

  33. MixDec Sampling: A Soft Link-based Sampling Method of Graph Neural Network for Recommendation

    Authors: Xiangjin Xie, Yuxin Chen, Ruipeng Wang, Kai Ouyang, Zihan Zhang, Hai-Tao Zheng, Buyue Qian, Hansen Zheng, Bo Hu, Chengxiang Zhuo, Zang Li

    Abstract: Graph neural networks have been widely used in recent recommender systems, where negative sampling plays an important role. Existing negative sampling methods restrict the relationship between nodes as either hard positive pairs or hard negative pairs. This leads to the loss of structural information, and lacks the mechanism to generate positive pairs for nodes with few neighbors. To overcome limi… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: 10 pages, 6 figures

  34. arXiv:2502.07406  [pdf, other

    hep-ex

    Search for $e^+e^-\to K_S^0 K_S^0 h_c$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (642 additional authors not shown)

    Abstract: Using $e^+e^-$ collision data at 13 center-of-mass energies ranging from 4.600 to 4.950 GeV collected with the BESIII detector, we search for the unmeasured $e^+e^-\to K_S^0 K_S^0 h_c$ process . No significant signal is observed, and the upper limits of the Born cross sections at each center-of-mass energy are presented.

    Submitted 11 February, 2025; originally announced February 2025.

  35. arXiv:2502.07184  [pdf, other

    cs.CL cs.AI

    Refine Knowledge of Large Language Models via Adaptive Contrastive Learning

    Authors: Yinghui Li, Haojing Huang, Jiayi Kuang, Yangning Li, Shu-Yu Guo, Chao Qu, Xiaoyu Tan, Hai-Tao Zheng, Ying Shen, Philip S. Yu

    Abstract: How to alleviate the hallucinations of Large Language Models (LLMs) has always been the fundamental goal pursued by the LLMs research community. Looking through numerous hallucination-related studies, a mainstream category of methods is to reduce hallucinations by optimizing the knowledge representation of LLMs to change their output. Considering that the core focus of these works is the knowledge… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: Accepted to ICLR 2025

  36. arXiv:2502.05467  [pdf, other

    cs.CL cs.AI

    Position: LLMs Can be Good Tutors in Foreign Language Education

    Authors: Jingheng Ye, Shen Wang, Deqing Zou, Yibo Yan, Kun Wang, Hai-Tao Zheng, Zenglin Xu, Irwin King, Philip S. Yu, Qingsong Wen

    Abstract: While recent efforts have begun integrating large language models (LLMs) into foreign language education (FLE), they often rely on traditional approaches to learning tasks without fully embracing educational methodologies, thus lacking adaptability to language learning. To address this gap, we argue that LLMs have the potential to serve as effective tutors in FLE. Specifically, LLMs can play three… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 18 pages, 4 figures

  37. arXiv:2502.05330  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 Challenge

    Authors: Muhammad Imran, Jonathan R. Krebs, Vishal Balaji Sivaraman, Teng Zhang, Amarjeet Kumar, Walker R. Ueland, Michael J. Fassler, Jinlong Huang, Xiao Sun, Lisheng Wang, Pengcheng Shi, Maximilian Rokuss, Michael Baumgartner, Yannick Kirchhof, Klaus H. Maier-Hein, Fabian Isensee, Shuolin Liu, Bing Han, Bong Thanh Nguyen, Dong-jin Shin, Park Ji-Woo, Mathew Choi, Kwang-Hyun Uhm, Sung-Jea Ko, Chanwoong Lee , et al. (38 additional authors not shown)

    Abstract: Multi-class segmentation of the aorta in computed tomography angiography (CTA) scans is essential for diagnosing and planning complex endovascular treatments for patients with aortic dissections. However, existing methods reduce aortic segmentation to a binary problem, limiting their ability to measure diameters across different branches and zones. Furthermore, no open-source dataset is currently… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  38. Observation of $D\to \bar{K}_{1}(1270)μ^+ν_μ$ and test of lepton flavor universality with $D\to \bar{K}_1(1270) \ell^{+} ν_{\ell}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (646 additional authors not shown)

    Abstract: By analyzing 7.93 $\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector operated at the BEPCII collider, we report the observation of the semimuonic decays of $D^+\to \bar K_1(1270)^0μ^+ν_μ$ and $D^0\to K_1(1270)^-μ^+ν_μ$ with statistical significances of $12.5σ$ and $6.0σ$, respectively. Their decay branching fractions are determined… ▽ More

    Submitted 18 April, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

    Comments: 11 pages, 5 figures

    Journal ref: Phys. Rev. D 111, L071101(2025)

  39. arXiv:2502.02326  [pdf, other

    cs.HC

    NoteFlow: Recommending Charts as Sight Glasses for Tracing Data Flow in Computational Notebooks

    Authors: Yuan Tian, Dazhen Deng, Sen Yang, Huawei Zheng, Bowen Shi, Kai Xiong, Xinjing Yi, Yingcai Wu

    Abstract: Exploratory Data Analysis (EDA) is a routine task for data analysts, often conducted using flexible computational notebooks. During EDA, data workers process, visualize, and interpret data tables, making decisions about subsequent analysis. However, the cell-by-cell programming approach, while flexible, can lead to disorganized code, making it difficult to trace the state of data tables across cel… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

  40. Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale

    Authors: Elisa Tsai, Neal Mangaokar, Boyuan Zheng, Haizhong Zheng, Atul Prakash

    Abstract: Terms and conditions for online shopping websites often contain terms that can have significant financial consequences for customers. Despite their impact, there is currently no comprehensive understanding of the types and potential risks associated with unfavorable financial terms. Furthermore, there are no publicly available detection systems or datasets to systematically identify or mitigate th… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: This paper has been accepted to The Web Conference 2025 (WWW '25)

    ACM Class: H.3.3; K.4.1; K.4.2; I.2.7

  41. arXiv:2502.00631  [pdf, other

    cs.CV

    MedConv: Convolutions Beat Transformers on Long-Tailed Bone Density Prediction

    Authors: Xuyin Qi, Zeyu Zhang, Huazhan Zheng, Mingxi Chen, Numan Kutaiba, Ruth Lim, Cherie Chiang, Zi En Tham, Xuan Ren, Wenxin Zhang, Lei Zhang, Hao Zhang, Wenbing Lv, Guangzhen Yao, Renda Han, Kangsheng Wang, Mingyuan Li, Hongtao Mao, Yu Li, Zhibin Liao, Yang Zhao, Minh-Son To

    Abstract: Bone density prediction via CT scans to estimate T-scores is crucial, providing a more precise assessment of bone health compared to traditional methods like X-ray bone density tests, which lack spatial resolution and the ability to detect localized changes. However, CT-based prediction faces two major challenges: the high computational complexity of transformer-based architectures, which limits t… ▽ More

    Submitted 3 April, 2025; v1 submitted 1 February, 2025; originally announced February 2025.

    Comments: Accepted to IJCNN 2025

  42. arXiv:2502.00550  [pdf, other

    cs.LG math.NA physics.comp-ph

    Muti-Fidelity Prediction and Uncertainty Quantification with Laplace Neural Operators for Parametric Partial Differential Equations

    Authors: Haoyang Zheng, Guang Lin

    Abstract: Laplace Neural Operators (LNOs) have recently emerged as a promising approach in scientific machine learning due to the ability to learn nonlinear maps between functional spaces. However, this framework often requires substantial amounts of high-fidelity (HF) training data, which is often prohibitively expensive to acquire. To address this, we propose multi-fidelity Laplace Neural Operators (MF-LN… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: 30 pages, 11 figures

  43. arXiv:2501.19069  [pdf, other

    cs.CV cs.AI

    Improving vision-language alignment with graph spiking hybrid Networks

    Authors: Siyu Zhang, Wenzhe Liu, Yeming Chen, Yiming Wu, Heming Zheng, Cheng Cheng

    Abstract: To bridge the semantic gap between vision and language (VL), it is necessary to develop a good alignment strategy, which includes handling semantic diversity, abstract representation of visual information, and generalization ability of models. Recent works use detector-based bounding boxes or patches with regular partitions to represent visual semantics. While current paradigms have made strides,… ▽ More

    Submitted 2 March, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

  44. arXiv:2501.18542  [pdf

    cs.AI

    Semantic Web and Creative AI -- A Technical Report from ISWS 2023

    Authors: Raia Abu Ahmad, Reham Alharbi, Roberto Barile, Martin Böckling, Francisco Bolanos, Sara Bonfitto, Oleksandra Bruns, Irene Celino, Yashrajsinh Chudasama, Martin Critelli, Claudia d'Amato, Giada D'Ippolito, Ioannis Dasoulas, Stefano De Giorgis, Vincenzo De Leo, Chiara Di Bonaventura, Marco Di Panfilo, Daniil Dobriy, John Domingue, Xuemin Duan, Michel Dumontier, Sefika Efeoglu, Ruben Eschauzier, Fakih Ginwa, Nicolas Ferranti , et al. (52 additional authors not shown)

    Abstract: The International Semantic Web Research School (ISWS) is a week-long intensive program designed to immerse participants in the field. This document reports a collaborative effort performed by ten teams of students, each guided by a senior researcher as their mentor, attending ISWS 2023. Each team provided a different perspective to the topic of creative AI, substantiated by a set of research quest… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: Technical Report

  45. arXiv:2501.18419  [pdf, other

    hep-ph hep-th

    Optimizers for Stabilizing Likelihood-free Inference

    Authors: G. Bruno De Luca, Benjamin Nachman, Eva Silverstein, Henry Zheng

    Abstract: A growing number of applications in particle physics and beyond use neural networks as unbinned likelihood ratio estimators applied to real or simulated data. Precision requirements on the inference tasks demand a high-level of stability from these networks, which are affected by the stochastic nature of training. We show how physics concepts can be used to stabilize network training through a phy… ▽ More

    Submitted 1 March, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

    Comments: 11 pages, 2 figures, v2: fixed references

  46. arXiv:2501.17323  [pdf, other

    cs.LG stat.ML

    Exploring Non-Convex Discrete Energy Landscapes: A Langevin-Like Sampler with Replica Exchange

    Authors: Haoyang Zheng, Ruqi Zhang, Guang Lin

    Abstract: Gradient-based Discrete Samplers (GDSs) are effective for sampling discrete energy landscapes. However, they often stagnate in complex, non-convex settings. To improve exploration, we introduce the Discrete Replica EXchangE Langevin (DREXEL) sampler and its variant with Adjusted Metropolis (DREAM). These samplers use two GDSs at different temperatures and step sizes: one focuses on local exploitat… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

    Comments: 7 figures, 23 pages

  47. arXiv:2501.15447  [pdf, ps, other

    hep-ex

    Observation of $h_{c}$ radiative decays to multiple light hadrons and the tensor state $f_2(1270)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (666 additional authors not shown)

    Abstract: Using $ψ(3686)\rightarrow π^{0} h_{c}$ decays from a data sample of $(27.12\pm0.14)\times10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider, $h_c$ radiative decays to $γπ^{+}π^{-},~γπ^{+}π^{-}η,~\gamma2(π^{+}π^{-})$, and $γp\bar{p}$ are observed for the first time, each with a significance greater than $5σ$. The corresponding branching fractions are measured. Furtherm… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

  48. arXiv:2501.15085  [pdf, other

    cs.AI cs.LG eess.SY

    Data Center Cooling System Optimization Using Offline Reinforcement Learning

    Authors: Xianyuan Zhan, Xiangyu Zhu, Peng Cheng, Xiao Hu, Ziteng He, Hanfei Geng, Jichao Leng, Huiwen Zheng, Chenhui Liu, Tianshun Hong, Yan Liang, Yunxin Liu, Feng Zhao

    Abstract: The recent advances in information technology and artificial intelligence have fueled a rapid expansion of the data center (DC) industry worldwide, accompanied by an immense appetite for electricity to power the DCs. In a typical DC, around 30~40% of the energy is spent on the cooling system rather than on computer servers, posing a pressing need for developing new energy-saving optimization techn… ▽ More

    Submitted 14 February, 2025; v1 submitted 25 January, 2025; originally announced January 2025.

    Comments: Accepted in ICLR 2025

  49. arXiv:2501.14951  [pdf, other

    cs.LG cs.CL cs.SC

    E-Gen: Leveraging E-Graphs to Improve Continuous Representations of Symbolic Expressions

    Authors: Hongbo Zheng, Suyuan Wang, Neeraj Gangwar, Nickvash Kani

    Abstract: Vector representations have been pivotal in advancing natural language processing (NLP), with prior research focusing on embedding techniques for mathematical expressions using mathematically equivalent formulations. While effective, these approaches are constrained by the size and diversity of training data. In this work, we address these limitations by introducing E-Gen, a novel e-graph-based da… ▽ More

    Submitted 9 March, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

  50. arXiv:2501.14859  [pdf

    cs.CL cs.LG

    Dynamic Adaptation of LoRA Fine-Tuning for Efficient and Task-Specific Optimization of Large Language Models

    Authors: Xiaoxuan Liao, Chihang Wang, Shicheng Zhou, Jiacheng Hu, Hongye Zheng, Jia Gao

    Abstract: This paper presents a novel methodology of fine-tuning for large language models-dynamic LoRA. Building from the standard Low-Rank Adaptation framework, this methodology further adds dynamic adaptation mechanisms to improve efficiency and performance. The key contribution of dynamic LoRA lies within its adaptive weight allocation mechanism coupled with an input feature-based adaptive strategy. The… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.