Skip to main content

Showing 1–50 of 898 results for author: Wen, L

.
  1. arXiv:2507.04613  [pdf

    cs.CV cs.AI

    HiLa: Hierarchical Vision-Language Collaboration for Cancer Survival Prediction

    Authors: Jiaqi Cui, Lu Wen, Yuchen Fei, Bo Liu, Luping Zhou, Dinggang Shen, Yan Wang

    Abstract: Survival prediction using whole-slide images (WSIs) is crucial in cancer re-search. Despite notable success, existing approaches are limited by their reliance on sparse slide-level labels, which hinders the learning of discriminative repre-sentations from gigapixel WSIs. Recently, vision language (VL) models, which incorporate additional language supervision, have emerged as a promising solu-tion.… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: Accepted by MICCAI2025

  2. arXiv:2506.22586  [pdf, ps, other

    nucl-ex hep-ex physics.ins-det

    Sensitivity of nEXO to $^{136}$Xe Charged-Current Interactions: Background-free Searches for Solar Neutrinos and Fermionic Dark Matter

    Authors: G. Richardson, B. G. Lenardo, D. Gallacher, R. Saldanha, P. Acharya, S. Al Kharusi, A. Amy, E. Angelico, A. Anker, I. J. Arnquist, A. Atencio, J. Bane, V. Belov, E. P. Bernard, T. Bhatta, A. Bolotnikov, J. Breslin, P. A. Breur, J. P. Brodsky, S. Bron, E. Brown, T. Brunner, B. Burnell, E. Caden, G. F. Cao , et al. (113 additional authors not shown)

    Abstract: We study the sensitivity of nEXO to solar neutrino charged-current interactions, $ν_e + ^{136}$Xe$\rightarrow ^{136}$Cs$^* + e^-$, as well as analogous interactions predicted by models of fermionic dark matter. Due to the recently observed low-lying isomeric states of $^{136}$Cs, these interactions will create a time-delayed coincident signal observable in the scintillation channel. Here we develo… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  3. arXiv:2506.21786  [pdf, ps, other

    stat.ME

    Estimating Average Causal Effects with Incomplete Exposure and Confounders

    Authors: Lan Wen, Glen McGee

    Abstract: Standard methods for estimating average causal effects require complete observations of the exposure and confounders. In observational studies, however, missing data are ubiquitous. Motivated by a study on the effect of prescription opioids on mortality, we propose methods for estimating average causal effects when exposures and potential confounders may be missing. We consider missingness at rand… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  4. arXiv:2506.21250  [pdf, ps, other

    cs.RO

    ACTLLM: Action Consistency Tuned Large Language Model

    Authors: Jing Bi, Lianggong Bruce Wen, Zhang Liu, Chenliang Xu

    Abstract: This paper introduces ACTLLM (Action Consistency Tuned Large Language Model), a novel approach for robot manipulation in dynamic environments. Traditional vision-based systems often struggle to learn visual representations that excel in both task execution and spatial reasoning, thereby limiting their adaptability in dynamic environments. ACTLLM addresses these challenges by harnessing language to… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  5. Evolution of Cluster Alignments as Evidence of Large-scale Structure Formation in the Universe

    Authors: Michael J. West, Roberto De Propris, Maret Einasto, Z. L. Wen, J. L. Han

    Abstract: The universe's large-scale structure forms a vast, interconnected network of filaments, sheets, and voids known as the cosmic web. For decades, astronomers have observed that the orientations of neighboring galaxy clusters within these elongated structures are often aligned over separations of tens of Mpc. Using the largest available catalog of galaxy clusters, we show for the first time that clus… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: 13 pages, 7 figures. Accepted for publication in ApJ Letters

  6. arXiv:2506.18800  [pdf, ps, other

    hep-ph hep-ex hep-lat

    Electromagnetic polarizabilities of the spin-$\frac{3}{2}$ baryons in heavy baryon chiral perturbation theory

    Authors: Liang-Zhen Wen, Yan-Ke Chen, Lu Meng, Shi-Lin Zhu

    Abstract: We employ Heavy Baryon Chiral Perturbation Theory (HB$χ$PT), a non-relativistic effective field theory that treats baryons as heavy static sources, to calculate the electromagnetic polarizabilities of spin-3/2 baryons in two sectors: the light-flavor decuplet baryons and singly heavy sextet baryons. We derive the analytical expressions up to $\mathcal{O}\left(p^3\right)$. Our results indicate that… ▽ More

    Submitted 25 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

    Comments: 22 pages, 1 figures, 9 tables. Comments are welcome. arXiv admin note: substantial text overlap with arXiv:2412.02297

  7. arXiv:2506.16749  [pdf, ps, other

    cond-mat.mtrl-sci

    Giant Magneto-Optical Effects in Two-Dimensional Flat-Band Antiferromagnets

    Authors: Ping Yang, Wanxiang Feng, Siyuan Liu, Shan Guan, Liwei Wen, Wei Jiang, Gui-Bin Liu, Yugui Yao

    Abstract: In this work, we reveal giant magneto-optical responses in two-dimensional(2D) antiferromagnets with nearly flat electronic bands, based on first-principles calculations and group-theoretical analysis. We identify a record-large second-order magneto-optical Schafer-Hubert(SH) effect, featuring a polarization rotation angle of 28 degree, in monolayer antiferromagnetic RuOCl2, driven by flatband-enh… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 6 pages, 3 figures

  8. arXiv:2506.07971  [pdf, ps, other

    cs.CV

    CyberV: Cybernetics for Test-time Scaling in Video Understanding

    Authors: Jiahao Meng, Shuyang Sun, Yue Tan, Lu Qi, Yunhai Tong, Xiangtai Li, Longyin Wen

    Abstract: Current Multimodal Large Language Models (MLLMs) may struggle with understanding long or complex videos due to computational demands at test time, lack of robustness, and limited accuracy, primarily stemming from their feed-forward processing nature. These limitations could be more severe for models with fewer parameters. To address these limitations, we propose a novel framework inspired by cyber… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  9. arXiv:2506.00783  [pdf, other

    cs.CL cs.AI

    KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision

    Authors: Rong Wu, Pinlong Cai, Jianbiao Mei, Licheng Wen, Tao Hu, Xuemeng Yang, Daocheng Fu, Botian Shi

    Abstract: Large language models (LLMs) have made remarkable strides in various natural language processing tasks, but their performance on complex reasoning problems remains hindered by a lack of explainability and trustworthiness. This issue, often manifesting as hallucinations or unattributable reasoning processes, limits their applicability in complex reasoning scenarios. To address this, we propose Know… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: 23 pages, 13 figures

  10. arXiv:2505.21027  [pdf, ps, other

    cs.LG cs.AI

    TabAttackBench: A Benchmark for Adversarial Attacks on Tabular Data

    Authors: Zhipeng He, Chun Ouyang, Lijie Wen, Cong Liu, Catarina Moreira

    Abstract: Adversarial attacks pose a significant threat to machine learning models by inducing incorrect predictions through imperceptible perturbations to input data. While these attacks have been extensively studied in unstructured data like images, their application to tabular data presents new challenges. These challenges arise from the inherent heterogeneity and complex feature interdependencies in tab… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 63 pages, 22 figures, 6 tables

  11. arXiv:2505.16582  [pdf, ps, other

    cs.CL cs.AI

    O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering

    Authors: Jianbiao Mei, Tao Hu, Daocheng Fu, Licheng Wen, Xuemeng Yang, Rong Wu, Pinlong Cai, Xinyu Cai, Xing Gao, Yu Yang, Chengjun Xie, Botian Shi, Yong Liu, Yu Qiao

    Abstract: Large Language Models (LLMs), despite their advancements, are fundamentally limited by their static parametric knowledge, hindering performance on tasks requiring open-domain up-to-date information. While enabling LLMs to interact with external knowledge environments is a promising solution, current efforts primarily address closed-end problems. Open-ended questions, which characterized by lacking… ▽ More

    Submitted 26 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: 25 pages, 9 figures

  12. arXiv:2505.12627  [pdf, ps, other

    cs.NE

    Efficient Heuristics Generation for Solving Combinatorial Optimization Problems Using Large Language Models

    Authors: Xuan Wu, Di Wang, Chunguo Wu, Lijie Wen, Chunyan Miao, Yubin Xiao, You Zhou

    Abstract: Recent studies exploited Large Language Models (LLMs) to autonomously generate heuristics for solving Combinatorial Optimization Problems (COPs), by prompting LLMs to first provide search directions and then derive heuristics accordingly. However, the absence of task-specific knowledge in prompts often leads LLMs to provide unspecific search directions, obstructing the derivation of well-performin… ▽ More

    Submitted 11 June, 2025; v1 submitted 18 May, 2025; originally announced May 2025.

    Comments: Accepted by SIGKDD 2025

  13. arXiv:2505.02500  [pdf, other

    cs.SE

    Automating Automotive Software Development: A Synergy of Generative AI and Formal Methods

    Authors: Fengjunjie Pan, Yinglei Song, Long Wen, Nenad Petrovic, Krzysztof Lebioda, Alois Knoll

    Abstract: As the automotive industry shifts its focus toward software-defined vehicles, the need for faster and reliable software development continues to grow. However, traditional methods show their limitations. The rise of Generative Artificial Intelligence (GenAI), particularly Large Language Models (LLMs), introduces new opportunities to automate automotive software development tasks such as requiremen… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  14. arXiv:2505.02370  [pdf, other

    cs.CV cs.AI cs.LG

    SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing

    Authors: Ming Li, Xin Gu, Fan Chen, Xiaoying Xing, Longyin Wen, Chen Chen, Sijie Zhu

    Abstract: Due to the challenges of manually collecting accurate editing data, existing datasets are typically constructed using various automated methods, leading to noisy supervision signals caused by the mismatch between editing instructions and original-edited image pairs. Recent efforts attempt to improve editing models through generating higher-quality edited images, pre-training on recognition tasks,… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: Code, Data and Models are available at: https://github.com/bytedance/SuperEdit

  15. arXiv:2505.00359  [pdf, other

    cs.LG cs.AI cs.NE

    TNStream: Applying Tightest Neighbors to Micro-Clusters to Define Multi-Density Clusters in Streaming Data

    Authors: Qifen Zeng, Haomin Bao, Yuanzhuo Hu, Zirui Zhang, Yuheng Zheng, Luosheng Wen

    Abstract: In data stream clustering, systematic theory of stream clustering algorithms remains relatively scarce. Recently, density-based methods have gained attention. However, existing algorithms struggle to simultaneously handle arbitrarily shaped, multi-density, high-dimensional data while maintaining strong outlier resistance. Clustering quality significantly deteriorates when data density varies compl… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 21 pages, 9 figures, 8 tables, under review at Expert Systems with Applications (ESWA)

    MSC Class: 68T05; 68W20 ACM Class: H.2.8; I.5.3

  16. arXiv:2505.00063  [pdf, other

    cs.CL cs.CV

    GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling

    Authors: Siqi Li, Yufan Shen, Xiangnan Chen, Jiayi Chen, Hengwei Ju, Haodong Duan, Song Mao, Hongbin Zhou, Bo Zhang, Bin Fu, Pinlong Cai, Licheng Wen, Botian Shi, Yong Liu, Xinyu Cai, Yu Qiao

    Abstract: The rapid advancement of multimodal large language models (MLLMs) has profoundly impacted the document domain, creating a wide array of application scenarios. This progress highlights the need for a comprehensive benchmark to evaluate these models' capabilities across various document-specific tasks. However, existing benchmarks often fail to locate specific model weaknesses or guide systematic im… ▽ More

    Submitted 22 May, 2025; v1 submitted 30 April, 2025; originally announced May 2025.

  17. arXiv:2504.15681  [pdf, other

    cs.CV

    Vidi: Large Multimodal Models for Video Understanding and Editing

    Authors: Vidi Team, Celong Liu, Chia-Wen Kuo, Dawei Du, Fan Chen, Guang Chen, Jiamin Yuan, Lingxi Zhang, Lu Guo, Lusha Li, Longyin Wen, Qingyu Chen, Rachel Deng, Sijie Zhu, Stuart Siew, Tong Jin, Wei Lu, Wen Zhong, Xiaohui Shen, Xin Gu, Xing Mei, Xueqiong Qu

    Abstract: Humans naturally share information with those they are connected to, and video has become one of the dominant mediums for communication and expression on the Internet. To support the creation of high-quality large-scale video content, a modern pipeline requires a comprehensive understanding of both the raw input materials (e.g., the unedited footage captured by cameras) and the editing components… ▽ More

    Submitted 24 April, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

  18. arXiv:2504.15464  [pdf, other

    physics.ins-det hep-ex

    Ultra-sensitive radon assay using an electrostatic chamber in a recirculating system

    Authors: nEXO Collaboration, A. Anker, P. A. Breur, B. Mong, P. Acharya, A. Amy, E. Angelico, I. J. Arnquist, A. Atencio, J. Bane, V. Belov, E. P. Bernard, T. Bhatta, A. Bolotnikov, J. Breslin, J. P. Brodsky, S. Bron, E. Brown, T. Brunner, B. Burnell, E. Caden, L. Q. Cao, G. F. Cao, D. Cesmecioglu, D. Chernyak , et al. (116 additional authors not shown)

    Abstract: Rare event searches such as neutrinoless double beta decay and Weakly Interacting Massive Particle detection require ultra-low background detectors. Radon contamination is a significant challenge for these experiments, which employ highly sensitive radon assay techniques to identify and select low-emission materials. This work presents the development of ultra-sensitive electrostatic chamber (ESC)… ▽ More

    Submitted 24 April, 2025; v1 submitted 21 April, 2025; originally announced April 2025.

    Comments: 14 pages, 9 figures, 1 table

  19. arXiv:2504.09665  [pdf, ps, other

    cs.CL

    CLEAR-KGQA: Clarification-Enhanced Ambiguity Resolution for Knowledge Graph Question Answering

    Authors: Liqiang Wen, Guanming Xiong, Tong Mo, Bing Li, Weiping Li, Wen Zhao

    Abstract: This study addresses the challenge of ambiguity in knowledge graph question answering (KGQA). While recent KGQA systems have made significant progress, particularly with the integration of large language models (LLMs), they typically assume user queries are unambiguous, which is an assumption that rarely holds in real-world applications. To address these limitations, we propose a novel framework t… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: This work has been accepted by the IJCNN 2025 main track

  20. arXiv:2504.07369  [pdf

    cond-mat.str-el

    Ultrahigh room-temperature hole conductivity in a perovskite cuprate with vanishing electron-correlation

    Authors: Meng Wang, Jianbing Zhang, Liang Si, Sijie Wu, Caiyong Li, Wenfeng Wu, Xiaodong Zhang, Cong Li, Lu Wang, Fachao Li, Lingzhi Wen, Yang Liu, Jinling Zhou, Masahiro Sawada, Nianpeng Lu, Qing He, Peng Gao, Tian Liang, Shuyun Zhou, Yeliang Wang, Fumitaka Kagawa, Pu Yu

    Abstract: Electron-correlated two-dimensional (2D) cuprates have been extensively studied since the discovery of high-Tc superconductivity, in contrast, the three-dimensional (3D) counterpart perovskite cuprates remain largely unexplored due to their chemical instability and synthesis challenges. Herein, we develop an efficient two-step approach that combines symmetry-selective growth and topotactic oxidiza… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: 5 figures

  21. arXiv:2504.07089  [pdf, ps, other

    cs.CV cs.CL

    OmniCaptioner: One Captioner to Rule Them All

    Authors: Yiting Lu, Jiakang Yuan, Zhen Li, Shitian Zhao, Qi Qin, Xinyue Li, Le Zhuo, Licheng Wen, Dongyang Liu, Yuewen Cao, Xiangchao Yan, Xin Li, Tianshuo Peng, Shufei Zhang, Botian Shi, Tao Chen, Zhibo Chen, Lei Bai, Peng Gao, Bo Zhang

    Abstract: We propose OmniCaptioner, a versatile visual captioning framework for generating fine-grained textual descriptions across a wide variety of visual domains. Unlike prior methods limited to specific image types (e.g., natural images or geometric visuals), our framework provides a unified solution for captioning natural images, visual text (e.g., posters, UIs, textbooks), and structured visuals (e.g.… ▽ More

    Submitted 2 June, 2025; v1 submitted 9 April, 2025; originally announced April 2025.

    Comments: More visualizations on Homepage: https://alpha-innovator.github.io/OmniCaptioner-project-page and Official code: https://github.com/Alpha-Innovator/OmniCaptioner

  22. arXiv:2504.03294  [pdf, ps, other

    hep-ph nucl-th physics.atom-ph

    Relativistic dynamics of charmonia in strong magnetic fields

    Authors: Liuyuan Wen, Meijian Li, Yiyu Zhou, Yang Li, James P. Vary

    Abstract: We investigate the properties of charmonium systems in strong external magnetic fields using a relativistic light-front Hamiltonian approach within the Basis Light-Front Quantization (BLFQ) framework. By solving the eigenvalue problem for the invariant mass squared operator with confinement potentials and one-gluon-exchange interactions, we obtain the mass spectrum and wave functions under varying… ▽ More

    Submitted 18 June, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

    Comments: 24 pages, 9 figures. Added the derivation of the quantum many-body Hamiltonian from the minimally coupled Lagrangian in Appendix A. To appear in Phys. Rev. D

  23. arXiv:2504.03151  [pdf, other

    cs.CL cs.LG

    Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1)

    Authors: Jing Bi, Susan Liang, Xiaofei Zhou, Pinxin Liu, Junjia Guo, Yunlong Tang, Luchuan Song, Chao Huang, Guangyu Sun, Jinxi He, Jiarui Wu, Shu Yang, Daoan Zhang, Chen Chen, Lianggong Bruce Wen, Zhang Liu, Jiebo Luo, Chenliang Xu

    Abstract: Reasoning is central to human intelligence, enabling structured problem-solving across diverse tasks. Recent advances in large language models (LLMs) have greatly enhanced their reasoning abilities in arithmetic, commonsense, and symbolic domains. However, effectively extending these capabilities into multimodal contexts-where models must integrate both visual and textual inputs-continues to be a… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  24. arXiv:2504.00679  [pdf, other

    physics.app-ph

    QUEST: A Quantized Energy-Aware SNN Training Framework for Multi-State Neuromorphic Devices

    Authors: Sai Li, Linliang Chen, Yihao Zhang, Zhongkui Zhang, Ao Du, Biao Pan, Zhaohao Wang, Lianggong Wen, Weisheng Zhao

    Abstract: Neuromorphic devices, leveraging novel physical phenomena, offer a promising path toward energy-efficient hardware beyond CMOS technology by emulating brain-inspired computation. However, their progress is often limited to proof-of-concept studies due to the lack of flexible spiking neural network (SNN) algorithm frameworks tailored to device-specific characteristics, posing a significant challeng… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  25. arXiv:2503.22587  [pdf, other

    cs.SE

    LLM-enabled Instance Model Generation

    Authors: Fengjunjie Pan, Nenad Petrovic, Vahid Zolfaghari, Long Wen, Alois Knoll

    Abstract: In the domain of model-based engineering, models are essential components that enable system design and analysis. Traditionally, the creation of these models has been a manual process requiring not only deep modeling expertise but also substantial domain knowledge of target systems. With the rapid advancement of generative artificial intelligence, large language models (LLMs) show potential for au… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  26. arXiv:2503.21699  [pdf, other

    cs.MM cs.AI cs.CV cs.SD eess.AS

    MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX

    Authors: Liuyue Xie, George Z. Wei, Avik Kuthiala, Ce Zheng, Ananya Bal, Mosam Dabhi, Liting Wen, Taru Rustagi, Ethan Lai, Sushil Khyalia, Rohan Choudhury, Morteza Ziyadi, Xu Zhang, Hao Yang, László A. Jeni

    Abstract: Frontier models have either been language-only or have primarily focused on vision and language modalities. Although recent advancements in models with vision and audio understanding capabilities have shown substantial progress, the field lacks a standardized evaluation framework for thoroughly assessing their cross-modality perception performance. We introduce MAVERIX~(Multimodal Audio-Visual Eva… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  27. arXiv:2503.21353  [pdf, ps, other

    hep-ex

    Neutrino type identification for atmospheric neutrinos in a large homogeneous liquid scintillation detector

    Authors: Jiaxi Liu, Fanrui Zeng, Hongyue Duyang, Wanlei Guo, Xinhai He, Teng Li, Zhen Liu, Wuming Luo, Wing Yan Ma, Xiaohan Tan, Liangjian Wen, Zekun Yang, Yongpeng Zhang

    Abstract: Atmospheric neutrino oscillations are important to the study of neutrino properties, including the neutrino mass ordering problem. A good capability to identify neutrinos' flavor and neutrinos against antineutrinos is crucial in such measurements. In this paper, we present a machine-learning-based approach for identifying atmospheric neutrino events in a large homogeneous liquid scintillator detec… ▽ More

    Submitted 13 June, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

  28. arXiv:2503.13891  [pdf, other

    cs.CV cs.CL

    Where do Large Vision-Language Models Look at when Answering Questions?

    Authors: Xiaoying Xing, Chia-Wen Kuo, Li Fuxin, Yulei Niu, Fan Chen, Ming Li, Ying Wu, Longyin Wen, Sijie Zhu

    Abstract: Large Vision-Language Models (LVLMs) have shown promising performance in vision-language understanding and reasoning tasks. However, their visual understanding behaviors remain underexplored. A fundamental question arises: to what extent do LVLMs rely on visual input, and which image regions contribute to their responses? It is non-trivial to interpret the free-form generation of LVLMs due to thei… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  29. arXiv:2503.11938  [pdf, ps, other

    hep-ph hep-ex hep-lat nucl-ex nucl-th

    The ${φNN,J/ψNN,η_c NN}$ systems based on HAL QCD interactions

    Authors: Liang-Zhen Wen, Yao Ma, Lu Meng, Shi-Lin Zhu

    Abstract: We investigate the existence of bound states and resonances in the ${φNN, J/ψNN, η_c NN}$ systems using HAL QCD interactions for ${φN, J/ψN}$, and ${η_c N}$. We employ the Gaussian expansion method to solve the complex-scaled Schrödinger equation and find no resonances or bound states in the ${J/ψNN}$ and ${η_c NN}$ systems. We estimate the interaction between charmonium and nuclei, concluding tha… ▽ More

    Submitted 2 July, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: 16 pages, 9 figures. Comments are welcome

    Journal ref: Phys. Rev. D 111, 114004 (2025)

  30. arXiv:2503.10460  [pdf, other

    cs.CL cs.LG

    Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

    Authors: Liang Wen, Yunke Cai, Fenrui Xiao, Xin He, Qi An, Zhenyu Duan, Yimin Du, Junchen Liu, Lifu Tang, Xiaowei Lv, Haosheng Zou, Yongchao Deng, Shousheng Jia, Xiangzheng Zhang

    Abstract: This paper introduces Light-R1, an open-source suite for training long reasoning models using reproducible and cost-effective methodology. Given the proprietary nature of data used in the DeepSeek-R1 series, we develop an alternative approach leveraging exclusively public data and models. Our curriculum training progressively increases data difficulty, combined with multi-staged post-training. Our… ▽ More

    Submitted 28 May, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: v4: ACL'25 industry track camera ready; v3: minor modifications; v2: better writing & format for later submission; all release at https://github.com/Qihoo360/Light-R1

  31. arXiv:2503.05180  [pdf, other

    cs.RO cs.LG

    Safety-Critical Traffic Simulation with Adversarial Transfer of Driving Intentions

    Authors: Zherui Huang, Xing Gao, Guanjie Zheng, Licheng Wen, Xuemeng Yang, Xiao Sun

    Abstract: Traffic simulation, complementing real-world data with a long-tail distribution, allows for effective evaluation and enhancement of the ability of autonomous vehicles to handle accident-prone scenarios. Simulating such safety-critical scenarios is nontrivial, however, from log data that are typically regular scenarios, especially in consideration of dynamic adversarial interactions between the fut… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: Accepted by ICRA 2025

  32. arXiv:2503.04636  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking

    Authors: Yijie Xu, Aiwei Liu, Xuming Hu, Lijie Wen, Hui Xiong

    Abstract: As open-source large language models (LLMs) like Llama3 become more capable, it is crucial to develop watermarking techniques to detect their potential misuse. Existing watermarking methods either add watermarks during LLM inference, which is unsuitable for open-source LLMs, or primarily target classification LLMs rather than recent generative LLMs. Adapting these watermarks to open-source LLMs fo… ▽ More

    Submitted 15 March, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

    Comments: Accepted by the ICLR 2025 Workshop on GenAI Watermarking

  33. arXiv:2503.00968  [pdf, other

    physics.ins-det hep-ex

    Simulation of the Background from $^{13}$C$(α, n)^{16}$O Reaction in the JUNO Scintillator

    Authors: JUNO Collaboration, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Costas Andreopoulos, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Beretta, Antonio Bergnoli, Nikita Bessonov, Daniel Bick, Lukas Bieger, Svetlana Biktemerova , et al. (608 additional authors not shown)

    Abstract: Large-scale organic liquid scintillator detectors are highly efficient in the detection of MeV-scale electron antineutrinos. These signal events can be detected through inverse beta decay on protons, which produce a positron accompanied by a neutron. A noteworthy background for antineutrinos coming from nuclear power reactors and from the depths of the Earth (geoneutrinos) is generated by ($α, n$)… ▽ More

    Submitted 2 May, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

    Comments: 25 pages, 14 figures, 4 tables

  34. arXiv:2502.17852  [pdf, other

    cs.CV

    Sketch-1-to-3: One Single Sketch to 3D Detailed Face Reconstruction

    Authors: Liting Wen, Zimo Yang, Xianlin Zhang, Chi Ding, Yue Zhang, Mingdao Wang, Xueming Li

    Abstract: 3D face reconstruction from a single sketch is a critical yet underexplored task with significant practical applications. The primary challenges stem from the substantial modality gap between 2D sketches and 3D facial structures, including: (1) accurately extracting facial keypoints from 2D sketches; (2) preserving diverse facial expressions and fine-grained texture details; and (3) training a hig… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  35. arXiv:2502.13367  [pdf, ps, other

    cond-mat.quant-gas quant-ph

    Asymptotic Freedom of Two Heavy Impurities in a Bose-Einstein Condensate

    Authors: Dong-Chen Zheng, Lin Wen, Renyuan Liao

    Abstract: We consider two heavy impurities immersed in a Bose-Einstein condensate, and calculate the self-energy using the Wilsonian renormalization. The polaron energy, quasiparticle residue and damping rate are extracted from the self-energy. We demonstrate that various effective potentials emerge from the polaron energy under the specific conditions. In the limit of large separation between the impuritie… ▽ More

    Submitted 3 March, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: 7 pages, 5 figures

  36. arXiv:2502.11598  [pdf, other

    cs.CL

    Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?

    Authors: Leyi Pan, Aiwei Liu, Shiyu Huang, Yijian Lu, Xuming Hu, Lijie Wen, Irwin King, Philip S. Yu

    Abstract: The radioactive nature of Large Language Model (LLM) watermarking enables the detection of watermarks inherited by student models when trained on the outputs of watermarked teacher models, making it a promising tool for preventing unauthorized knowledge distillation. However, the robustness of watermark radioactivity against adversarial actors remains largely unexplored. In this paper, we investig… ▽ More

    Submitted 24 May, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Accepted by ACL 2025 (Main)

    MSC Class: 68T50 ACM Class: I.2.7

  37. arXiv:2502.09269  [pdf, other

    cs.CV

    Memory-based Ensemble Learning in CMR Semantic Segmentation

    Authors: Yiwei Liu, Ziyi Wu, Liang Zhong, Lingyi Wen, Yuankai Wu

    Abstract: Existing models typically segment either the entire 3D frame or 2D slices independently to derive clinical functional metrics from ventricular segmentation in cardiac cine sequences. While performing well overall, they struggle at the end slices. To address this, we leverage spatial continuity to extract global uncertainty from segmentation variance and use it as memory in our ensemble learning me… ▽ More

    Submitted 17 February, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

  38. arXiv:2502.09170  [pdf, other

    cs.RO

    LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement

    Authors: Daocheng Fu, Naiting Zhong, Xu Han, Pinlong Cai, Licheng Wen, Song Mao, Botian Shi, Yu Qiao

    Abstract: Closed-loop simulation environments play a crucial role in the validation and enhancement of autonomous driving systems (ADS). However, certain challenges warrant significant attention, including balancing simulation accuracy with duration, reconciling functionality with practicality, and establishing comprehensive evaluation mechanisms. This paper addresses these challenges by introducing the Lim… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  39. arXiv:2502.06950  [pdf, other

    astro-ph.IM astro-ph.EP astro-ph.HE astro-ph.SR

    Cryoscope: A Cryogenic Infrared Survey Telescope in Antarctica

    Authors: Mansi M. Kasliwal, Nicholas Earley, Roger Smith, Tristan Guillot, Tony Travouillon, Jason Fucik, Lyu Abe, Timothee Greffe, Abdelkrim Agabi, Michael C. B. Ashley, Amaury H. M. J. Triaud, Samaporn Tinyanont, Sarah Antier, Philippe Bendjoya, Rohan Bhattarai, Rob Bertz, James Brugger, Artem Burdanov, Ilaria Caiazzo, Benoit Carry, Luca Casagrande, Brad Cenko, Jeff Cooke, Kishalay De, Richard Dekany , et al. (36 additional authors not shown)

    Abstract: We present Cryoscope--a new 50 deg$^2$ field-of-view, 1.2 m aperture, $K_{dark}$ survey telescope to be located at Dome C, Antarctica. Cryoscope has an innovative optical-thermal design wherein the entire telescope is cryogenically cooled. Cryoscope also explores new detector technology to cost-effectively tile the full focal plane. Leveraging the dark Antarctic sky and minimizing telescope therma… ▽ More

    Submitted 21 March, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: 40 pages, 19 figures, 4 tables; accepted for publication in PASP on 2025-03-21

  40. arXiv:2502.01906  [pdf, other

    cs.CV

    Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models

    Authors: Chia-Wen Kuo, Sijie Zhu, Fan Chen, Xiaohui Shen, Longyin Wen

    Abstract: Large vision-and-language models (LVLMs) typically treat visual and textual embeddings as homogeneous inputs to a large language model (LLM). However, these inputs are inherently different: visual inputs are multi-dimensional and contextually rich, often pre-encoded by models like CLIP, while textual inputs lack this structure. In this paper, we propose Decomposed Attention (D-Attn), a novel metho… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

  41. arXiv:2502.01141  [pdf, other

    cs.LG cs.AI

    Beyond Yes or No: Predictive Compliance Monitoring Approaches for Quantifying the Magnitude of Compliance Violations

    Authors: Qian Chen, Stefanie Rinderle-Ma, Lijie Wen

    Abstract: Most existing process compliance monitoring approaches detect compliance violations in an ex post manner. Only predicate prediction focuses on predicting them. However, predicate prediction provides a binary yes/no notion of compliance, lacking the ability to measure to which extent an ongoing process instance deviates from the desired state as specified in constraints. Here, being able to quantif… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

  42. arXiv:2501.12583  [pdf, other

    cs.CE

    Chasing price drains liquidity

    Authors: Yizhou Cao, Yepeng Ding, Ruichao Jiang, Long Wen

    Abstract: Assuming that the price in a Uniswap v3 style Automated Market Maker (AMM) follows a Geometric Brownian Motion (GBM), we prove that the strategy that adjusts the position of liquidity to track the current price leads to a deterministic and exponentially fast decay of liquidity. Next, assuming that there is a Centralized Exchange (CEX), in which the price follows a GBM and the AMM price mean revert… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

  43. arXiv:2501.08168  [pdf, other

    cs.AI

    LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking

    Authors: Yukai Ma, Tiantian Wei, Naiting Zhong, Jianbiao Mei, Tao Hu, Licheng Wen, Xuemeng Yang, Botian Shi, Yong Liu

    Abstract: While autonomous driving technology has made remarkable strides, data-driven approaches still struggle with complex scenarios due to their limited reasoning capabilities. Meanwhile, knowledge-driven autonomous driving systems have evolved considerably with the popularization of visual language models. In this paper, we propose LeapVAD, a novel method based on cognitive perception and dual-process… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

  44. arXiv:2501.06555  [pdf, ps, other

    cond-mat.quant-gas

    Chiral supersolid and dissipative time crystal in Rydberg-dressed Bose-Einstein condensates with Raman-induced spin-orbit coupling

    Authors: Xianghua Su, Xiping Fu, Yang He, Ying Shang, Kaiyuan Ji, Linghua Wen

    Abstract: Spin-orbit coupling (SOC) is one of the key factors that affect the chiral symmetry of matter by causing the spatial symmetry breaking of the system. We find that Raman-induced SOC can induce a chiral supersolid phase with a helical antiskyrmion lattice in balanced Rydberg-dressed two-component Bose-Einstein condensates (BECs) in a harmonic trap by modulating the Raman coupling strength, strong co… ▽ More

    Submitted 11 January, 2025; originally announced January 2025.

    Comments: 13 pages,5 figures

  45. arXiv:2501.03580  [pdf

    cs.CV

    BASIC: Semi-supervised Multi-organ Segmentation with Balanced Subclass Regularization and Semantic-conflict Penalty

    Authors: Zhenghao Feng, Lu Wen, Yuanyuan Xu, Binyu Yan, Xi Wu, Jiliu Zhou, Yan Wang

    Abstract: Semi-supervised learning (SSL) has shown notable potential in relieving the heavy demand of dense prediction tasks on large-scale well-annotated datasets, especially for the challenging multi-organ segmentation (MoS). However, the prevailing class-imbalance problem in MoS caused by the substantial variations in organ size exacerbates the learning difficulty of the SSL network. To address this issu… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  46. Search for continuous gravitational waves from known pulsars in the first part of the fourth LIGO-Virgo-KAGRA observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné , et al. (1794 additional authors not shown)

    Abstract: Continuous gravitational waves (CWs) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent ana… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: main paper: 12 pages, 6 figures, 4 tables

    Report number: LIGO-P2400315

    Journal ref: Astrophys.J. 983 (2025) 2, 99

  47. arXiv:2501.00929  [pdf

    physics.optics physics.app-ph

    Gradient polaritonic surface with space-variant switchable light-matter interactions in 2D moire superlattices

    Authors: Zhen-Bing Dai, Hua Fan, Vyacheslav Semenenko, Xinyu Lv, Lu Wen, Zhen Zhang, Shijie Fang, Vasili Perebeinos, Yue Zhao, Zhiqiang Li

    Abstract: Polaritons in two-dimensional (2D) materials provide unique opportunities for controlling light at nanoscales. Tailoring these polaritons via gradient polaritonic surfaces with space-variant response can enable versatile light-matter interaction platforms with advanced functionalities. However, experimental progress has been hampered by the optical losses and poor light confinement of conventional… ▽ More

    Submitted 1 January, 2025; originally announced January 2025.

    Comments: 18 pages, 4 figures

    Journal ref: Science Advance, 10,eadq7445(2024)

  48. arXiv:2501.00871  [pdf, other

    hep-ph hep-ex physics.atom-ph

    Trilepton and tetralepton bound and resonant states: the QED counterpart of multiquark states

    Authors: Yao Ma, Lu Meng, Liang-Zhen Wen, Shi-Lin Zhu

    Abstract: This work presents the first prediction of tetralepton resonant states containing muons, extending beyond the simplest tetralepton system, dipositronium ($\mathrm{Ps}_2$). With the rapid advancements in experimental facilities, the production and study of these intriguing states may be within reach. We perform a comprehensive analysis of S-wave trilepton and tetralepton systems within the framewor… ▽ More

    Submitted 4 April, 2025; v1 submitted 1 January, 2025; originally announced January 2025.

    Comments: 14 pages, 11 figures. Comments are welcome

    Journal ref: Phys. Rev. D 111, 073001 (2025)

  49. Comprehensive Measurement of the Reactor Antineutrino Spectrum and Flux at Daya Bay

    Authors: F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

    Abstract: This Letter reports the precise measurement of reactor antineutrino spectrum and flux based on the full data set of 4.7 million inverse-beta-decay (IBD) candidates collected at Daya Bay near detectors. Expressed in terms of the IBD yield per fission, the antineutrino spectra from all reactor fissile isotopes and the specific $\mathrm{^{235}U}$ and $\mathrm{^{239}Pu}$ isotopes are measured with 1.3… ▽ More

    Submitted 22 May, 2025; v1 submitted 1 January, 2025; originally announced January 2025.

  50. arXiv:2412.18108  [pdf, other

    cs.CV

    Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach

    Authors: Jing Bi, Junjia Guo, Yunlong Tang, Lianggong Bruce Wen, Zhang Liu, Chenliang Xu

    Abstract: Recent advancements in Multimodal Large Language Models (MLLMs) have demonstrated remarkable progress in visual understanding. This impressive leap raises a compelling question: how can language models, initially trained solely on linguistic data, effectively interpret and process visual content? This paper aims to address this question with systematic investigation across 4 model families and 4 m… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

    Journal ref: CVPR 2025 (IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025)