Skip to main content

Showing 1–50 of 468 results for author: shen, M

.
  1. arXiv:2506.17218  [pdf, ps, other

    cs.CV cs.AI

    Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

    Authors: Zeyuan Yang, Xueyang Yu, Delin Chen, Maohao Shen, Chuang Gan

    Abstract: Vision-language models (VLMs) excel at multimodal understanding, yet their text-only decoding forces them to verbalize visual reasoning, limiting performance on tasks that demand visual imagination. Recent attempts train VLMs to render explicit images, but the heavy image-generation pre-training often hinders the reasoning ability. Inspired by the way humans reason with mental imagery-the internal… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: Project page: https://vlm-mirage.github.io/

  2. arXiv:2506.15662  [pdf, ps, other

    cs.CL

    CC-LEARN: Cohort-based Consistency Learning

    Authors: Xiao Ye, Shaswat Shrivastava, Zhaonan Li, Jacob Dineen, Shijie Lu, Avneet Ahuja, Ming Shen, Zhikun Xu, Ben Zhou

    Abstract: Large language models excel at many tasks but still struggle with consistent, robust reasoning. We introduce Cohort-based Consistency Learning (CC-Learn), a reinforcement learning framework that improves the reliability of LLM reasoning by training on cohorts of similar questions derived from shared programmatic abstractions. To enforce cohort-level consistency, we define a composite objective com… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  3. arXiv:2506.14888  [pdf

    cond-mat.str-el

    Time-domain decoding of unconventional charge order mechanisms in nonmagnetic and magnetic kagome metals

    Authors: Seongyong Lee, Byungjune Lee, Hoyoung Jang, Xueliang Wu, Jimin Kim, Gyeongbo Kang, Choongjae Won, Hyeongi Choi, Sang-Youn Park, Kyle M. Shen, Federico Cilento, Aifeng Wang, Jae-Hoon Park, Mingu Kang

    Abstract: In kagome lattice materials, quantum interplay between charge, spin, orbital, and lattice degrees of freedom gives rise to a remarkably rich set of emergent phenomena, ranging from unconventional charge order and superconductivity to topological magnetism. While the exact nature of these exotic orders is often challenging to comprehend in static experiments, time-resolved techniques can offer crit… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 4 figures

  4. arXiv:2506.14808  [pdf, ps, other

    cs.LG

    PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models

    Authors: Jenny Schmalfuss, Nadine Chang, Vibashan VS, Maying Shen, Andres Bruhn, Jose M. Alvarez

    Abstract: Vision language models (VLMs) respond to user-crafted text prompts and visual inputs, and are applied to numerous real-world problems. VLMs integrate visual modalities with large language models (LLMs), which are well known to be prompt-sensitive. Hence, it is crucial to determine whether VLMs inherit this instability to varying prompts. We therefore investigate which prompt variations VLMs are mo… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: Accepted to CVPR 2025

  5. arXiv:2506.13502  [pdf, ps, other

    cs.CL

    BOW: Bottlenecked Next Word Exploration

    Authors: Ming Shen, Zhikun Xu, Xiao Ye, Jacob Dineen, Ben Zhou

    Abstract: Large language models (LLMs) are typically trained via next-word prediction (NWP), which provides strong surface-level fluency but often lacks support for robust reasoning. We propose BOttlenecked next Word exploration (BOW), a novel RL framework that rethinks NWP by introducing a reasoning bottleneck where a policy model first generates a reasoning path rather than predicting the next token direc… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  6. arXiv:2506.12198  [pdf, ps, other

    cs.CV

    ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models

    Authors: Sibo Dong, Ismail Shaheen, Maggie Shen, Rupayan Mallick, Sarah Adel Bargal

    Abstract: Text-to-image diffusion models have achieved remarkable success, yet generating coherent image sequences for visual storytelling remains challenging. A key challenge is effectively leveraging all previous text-image pairs, referred to as history text-image pairs, which provide contextual information for maintaining consistency across frames. Existing auto-regressive methods condition on all past i… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  7. arXiv:2506.08123  [pdf, ps, other

    cs.CL

    QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA

    Authors: Jacob Dineen, Aswin RRV, Qin Liu, Zhikun Xu, Xiao Ye, Ming Shen, Zhaonan Li, Shijie Lu, Chitta Baral, Muhao Chen, Ben Zhou

    Abstract: Alignment of large language models with explicit principles (such as helpfulness, honesty, and harmlessness) is crucial for ensuring safe and reliable AI systems. However, standard reward-based alignment methods typically collapse diverse feedback into a single scalar reward, entangling multiple objectives into one opaque training signal, which hinders interpretability. In this work, we introduce… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  8. arXiv:2506.06664  [pdf, ps, other

    cs.RO cs.CV

    Generalized Trajectory Scoring for End-to-end Multimodal Planning

    Authors: Zhenxin Li, Wenhao Yao, Zi Wang, Xinglong Sun, Joshua Chen, Nadine Chang, Maying Shen, Zuxuan Wu, Shiyi Lan, Jose M. Alvarez

    Abstract: End-to-end multi-modal planning is a promising paradigm in autonomous driving, enabling decision-making with diverse trajectory candidates. A key component is a robust trajectory scorer capable of selecting the optimal trajectory from these candidates. While recent trajectory scorers focus on scoring either large sets of static trajectories or small sets of dynamically generated ones, both approac… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: The 1st place solution of the End-to-end Driving Track at the CVPR 2025 Autonomous Grand Challenge

  9. arXiv:2506.03065  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers

    Authors: Pengtao Chen, Xianfang Zeng, Maosen Zhao, Peng Ye, Mingzhu Shen, Wei Cheng, Gang Yu, Tao Chen

    Abstract: While Diffusion Transformers (DiTs) have achieved breakthroughs in video generation, this long sequence generation task remains constrained by the quadratic complexity of attention mechanisms, resulting in significant inference latency. Through detailed analysis of attention maps in Video Diffusion Transformer (vDiT), we identify three recurring sparsity patterns: diagonal, multi-diagonal, and ver… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  10. arXiv:2505.23604  [pdf, ps, other

    cs.CL cs.AI cs.SE

    Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

    Authors: Guangtao Zeng, Maohao Shen, Delin Chen, Zhenting Qi, Subhro Das, Dan Gutfreund, David Cox, Gregory Wornell, Wei Lu, Zhang-Wei Hong, Chuang Gan

    Abstract: Language models (LMs) perform well on standardized coding benchmarks but struggle with real-world software engineering tasks such as resolving GitHub issues in SWE-Bench, especially when model parameters are less than 100B. While smaller models are preferable in practice due to their lower computational cost, improving their performance remains challenging. Existing approaches primarily rely on su… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  11. arXiv:2505.22477  [pdf

    cs.HC cs.AI cs.CY

    Human-Centered Human-AI Collaboration (HCHAC)

    Authors: Qi Gao, Wei Xu, Hanxi Pan, Mowei Shen, Zaifeng Gao

    Abstract: In the intelligent era, the interaction between humans and intelligent systems fundamentally involves collaboration with autonomous intelligent agents. Human-AI Collaboration (HAC) represents a novel type of human-machine relationship facilitated by autonomous intelligent machines equipped with AI technologies. In this paradigm, AI agents serve not only as auxiliary tools but also as active teamma… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: This article is a chapter from the upcoming book Handbook of Human-Centered Artificial Intelligence

  12. arXiv:2505.19482  [pdf, ps, other

    cs.CR

    Language of Network: A Generative Pre-trained Model for Encrypted Traffic Comprehension

    Authors: Di Zhao, Bo Jiang, Song Liu, Susu Cui, Meng Shen, Dongqi Han, Xingmao Guan, Zhigang Lu

    Abstract: The increasing demand for privacy protection and security considerations leads to a significant rise in the proportion of encrypted network traffic. Since traffic content becomes unrecognizable after encryption, accurate analysis is challenging, making it difficult to classify applications and detect attacks. Deep learning is currently the predominant approach for encrypted traffic classification… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  13. arXiv:2505.16086  [pdf, other

    cs.AI cs.CL

    Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development

    Authors: Ming Shen, Raphael Shu, Anurag Pratik, James Gung, Yubin Ge, Monica Sunkara, Yi Zhang

    Abstract: We have seen remarkable progress in large language models (LLMs) empowered multi-agent systems solving complex tasks necessitating cooperation among experts with diverse skills. However, optimizing LLM-based multi-agent systems remains challenging. In this work, we perform an empirical case study on group optimization of role-based multi-agent systems utilizing natural language feedback for challe… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  14. arXiv:2505.15989  [pdf, other

    eess.SP

    AI-Assisted NLOS Sensing for RIS-Based Indoor Localization in Smart Factories

    Authors: Taofeek A. O. Yusuf, Sigurd S. Petersen, Puchu Li, Jian Ren, Placido Mursia, Vincenzo Sciancalepore, Xavier Costa Pérez, Gilberto Berardinelli, Ming Shen

    Abstract: In the era of Industry 4.0, precise indoor localization is vital for automation and efficiency in smart factories. Reconfigurable Intelligent Surfaces (RIS) are emerging as key enablers in 6G networks for joint sensing and communication. However, RIS faces significant challenges in Non-Line-of-Sight (NLOS) and multipath propagation, particularly in localization scenarios, where detecting NLOS cond… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Accepted 7 pages Paper for VTCSpring2025 Conference

  15. arXiv:2505.15403  [pdf, ps, other

    eess.SP

    RIS Beam Calibration for ISAC Systems: Modeling and Performance Analysis

    Authors: Mengting Li, Hui Chen, Sigurd Sandor Petersen, Huiping Huang, Alireza Pourafzal, Yu Ge, Ming Shen, Henk Wymeersch

    Abstract: High-accuracy localization is a key enabler for integrated sensing and communication (ISAC), playing an essential role in various applications such as autonomous driving. Antenna arrays and reconfigurable intelligent surface (RIS) are incorporated into these systems to achieve high angular resolution, assisting in the localization process. However, array and RIS beam patterns in practice often dev… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  16. arXiv:2505.15034  [pdf, ps, other

    cs.LG cs.AI cs.CL

    RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning

    Authors: Kaiwen Zha, Zhengqi Gao, Maohao Shen, Zhang-Wei Hong, Duane S. Boning, Dina Katabi

    Abstract: Reinforcement learning (RL) has recently emerged as a compelling approach for enhancing the reasoning capabilities of large language models (LLMs), where an LLM generator serves as a policy guided by a verifier (reward model). However, current RL post-training methods for LLMs typically use verifiers that are fixed (rule-based or frozen pretrained) or trained discriminatively via supervised fine-t… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Tech report. The first two authors contributed equally

  17. arXiv:2505.09892  [pdf, other

    cs.CR

    Correlating Account on Ethereum Mixing Service via Domain-Invariant feature learning

    Authors: Zheng Che, Taoyu Li, Meng Shen, Hanbiao Du, Liehuang Zhu

    Abstract: The untraceability of transactions facilitated by Ethereum mixing services like Tornado Cash poses significant challenges to blockchain security and financial regulation. Existing methods for correlating mixing accounts suffer from limited labeled data and vulnerability to noisy annotations, which restrict their practical applicability. In this paper, we propose StealthLink, a novel framework that… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: Cryptocurrency, Ethereum, mixing services, GNN

  18. arXiv:2505.07569  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Melting of Charge Density Waves in Low Dimensions

    Authors: Jeremy M. Shen, Alex Stangel, Suk Hyun Sung, Ismail El Baggari, Kai Sun, Robert Hovden

    Abstract: Charge density waves (CDWs) are collective electronic states that can reshape and melt, even while confined within a rigid atomic crystal. In two dimensions, melting is predicted to be distinct, proceeding through partially ordered nematic and hexatic states that are neither liquid nor crystal. Here we measure and explain how continuous, hexatic melting of incommensurate CDWs occurs in low-dimensi… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 18 pages, 7 figures (includes supplemental)

  19. arXiv:2505.03531  [pdf, ps, other

    cs.CL cs.LG

    Faster MoE LLM Inference for Extremely Large Models

    Authors: Haoqi Yang, Luohe Shi, Qiwei Li, Zuchao Li, Ping Wang, Bo Du, Mengjia Shen, Hai Zhao

    Abstract: Sparse Mixture of Experts (MoE) large language models (LLMs) are gradually becoming the mainstream approach for ultra-large-scale models. Existing optimization efforts for MoE models have focused primarily on coarse-grained MoE architectures. With the emergence of DeepSeek Models, fine-grained MoE models are gaining popularity, yet research on them remains limited. Therefore, we want to discuss th… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  20. arXiv:2505.02927  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech physics.comp-ph

    The Physics of Local Optimization in Complex Disordered Systems

    Authors: Mutian Shen, Gerardo Ortiz, Zhiqiao Dong, Martin Weigel, Zohar Nussinov

    Abstract: Limited resources motivate decomposing large-scale problems into smaller, "local" subsystems and stitching together the so-found solutions. We explore the physics underlying this approach and discuss the concept of "local hardness", i.e., complexity from the local solver perspective, in determining the ground states of both P- and NP-hard spin-glasses and related systems. Depending on the model co… ▽ More

    Submitted 2 June, 2025; v1 submitted 5 May, 2025; originally announced May 2025.

    Comments: 8+14 pages, 8+16 figures. Add two figures (S4, S5)

  21. arXiv:2505.02024  [pdf, other

    cs.AI

    From Mind to Machine: The Rise of Manus AI as a Fully Autonomous Digital Agent

    Authors: Minjie Shen, Qikai Yang

    Abstract: Manus AI is a general-purpose AI agent introduced in early 2025, marking a significant advancement in autonomous artificial intelligence. Developed by the Chinese startup Monica.im, Manus is designed to bridge the gap between "mind" and "hand" - combining the reasoning and planning capabilities of large language models with the ability to execute complex, end-to-end tasks that produce tangible out… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  22. arXiv:2504.17457  [pdf, other

    cs.CV

    Unveiling Hidden Vulnerabilities in Digital Human Generation via Adversarial Attacks

    Authors: Zhiying Li, Yeying Jin, Fan Shen, Zhi Liu, Weibin Chen, Pengju Zhang, Xiaomei Zhang, Boyu Chen, Michael Shen, Kejian Wu, Zhaoxin Fan, Jin Dong

    Abstract: Expressive human pose and shape estimation (EHPS) is crucial for digital human generation, especially in applications like live streaming. While existing research primarily focuses on reducing estimation errors, it largely neglects robustness and security aspects, leaving these systems vulnerable to adversarial attacks. To address this significant challenge, we propose the \textbf{Tangible Attack… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 14 pages, 7 figures

  23. arXiv:2504.04471  [pdf, other

    cs.CV

    VideoAgent2: Enhancing the LLM-Based Agent System for Long-Form Video Understanding by Uncertainty-Aware CoT

    Authors: Zhuo Zhi, Qiangqiang Wu, Minghe shen, Wenbo Li, Yinchuan Li, Kun Shao, Kaiwen Zhou

    Abstract: Long video understanding has emerged as an increasingly important yet challenging task in computer vision. Agent-based approaches are gaining popularity for processing long videos, as they can handle extended sequences and integrate various tools to capture fine-grained information. However, existing methods still face several challenges: (1) they often rely solely on the reasoning ability of larg… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

  24. arXiv:2504.04066  [pdf, other

    eess.IV cs.CV

    Performance Analysis of Deep Learning Models for Femur Segmentation in MRI Scan

    Authors: Mengyuan Liu, Yixiao Chen, Anning Tian, Xinmeng Wu, Mozhi Shen, Tianchou Gong, Jeongkyu Lee

    Abstract: Convolutional neural networks like U-Net excel in medical image segmentation, while attention mechanisms and KAN enhance feature extraction. Meta's SAM 2 uses Vision Transformers for prompt-based segmentation without fine-tuning. However, biases in these models impact generalization with limited data. In this study, we systematically evaluate and compare the performance of three CNN-based models,… ▽ More

    Submitted 5 April, 2025; originally announced April 2025.

  25. arXiv:2504.03559  [pdf, other

    hep-ph hep-ex physics.ins-det

    Constraints on dark matter boosted by supernova shock within the effective field theory framework from the CDEX-10 experiment

    Authors: J. Z. Wang, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, H. Chen, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, H. X. Huang, T. C. Huang, S. Karmakar, H. B. Li , et al. (62 additional authors not shown)

    Abstract: Supernova shocks can boost dark matter (DM) particles to high, yet nonrelativistic, velocities, providing a suitable mechanism for analysis within the framework of the nonrelativistic effective field theory (NREFT). These accelerated DM sources extend the experimental ability to scan the parameter space of light DM into the sub-GeV region. In this study, we specifically analyze DM accelerated by t… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 9 pages, 5 figures

  26. arXiv:2504.02168  [pdf, other

    cs.CV cs.AI cs.LG

    MDP: Multidimensional Vision Model Pruning with Latency Constraint

    Authors: Xinglong Sun, Barath Lakshmanan, Maying Shen, Shiyi Lan, Jingde Chen, Jose M. Alvarez

    Abstract: Current structural pruning methods face two significant limitations: (i) they often limit pruning to finer-grained levels like channels, making aggressive parameter reduction challenging, and (ii) they focus heavily on parameter and FLOP reduction, with existing latency-aware methods frequently relying on simplistic, suboptimal linear models that fail to generalize well to transformers, where mult… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Accepted at CVPR 2025

  27. arXiv:2503.23778  [pdf, other

    cond-mat.mtrl-sci

    Efficient defect healing of single-walled cabron nanotubes through $ \mathrm{C}_{2}\mathrm{H}_{2} $-assisted multiple-cycle treatment with air exposure

    Authors: Man Shen, Taiki Inoue, Mengyue Wang, Yuanjia Liu, Yoshihiro Kobayashi

    Abstract: Defects in single-walled carbon nanotubes (SWCNTs) degrade their mechanical,electrical, and thermal properties, limiting their potential applications. To realize the diverse applications of SWCNTs, it is essential to enhance their crystallinity through effective defect healing. However, traditional thermal treatments typically require temperatures above 1800°C, which can alter the nanotube structu… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

    Comments: submitted version

    Journal ref: ACS Appl. Mater. Interfaces 2025

  28. arXiv:2503.17793  [pdf, other

    cs.LG cs.AI cs.CL

    Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM

    Authors: Codefuse, Ling Team, :, Wenting Cai, Yuchen Cao, Chaoyu Chen, Chen Chen, Siba Chen, Qing Cui, Peng Di, Junpeng Fang, Zi Gong, Ting Guo, Zhengyu He, Yang Huang, Cong Li, Jianguo Li, Zheng Li, Shijie Lian, BingChang Liu, Songshan Luo, Shuo Mao, Min Shen, Jian Wu, Jiaolong Yang , et al. (8 additional authors not shown)

    Abstract: Recent advancements in code large language models (LLMs) have demonstrated remarkable capabilities in code generation and understanding. It is still challenging to build a code LLM with comprehensive performance yet ultimate efficiency. Many attempts have been released in the open source community to break the trade-off between performance and efficiency, such as the Qwen Coder series and the Deep… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: 20 pages, 6 figures

    ACM Class: I.2.7

  29. arXiv:2503.17174  [pdf, other

    econ.GN

    How to Promote Autonomous Driving with Evolving Technology: Business Strategy and Pricing Decision

    Authors: Mingliang Li, Yanrong Li, Lai Wei, Wei Jiang, Zuo-Jun Max Shen

    Abstract: Recently, autonomous driving system (ADS) has been widely adopted due to its potential to enhance travel convenience and alleviate traffic congestion, thereby improving the driving experience for consumers and creating lucrative opportunities for manufacturers. With the advancement of data sensing and control technologies, the reliability of ADS and the purchase intentions of consumers are continu… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  30. arXiv:2503.15918  [pdf, other

    cs.LG cs.AI

    Denoising-based Contractive Imitation Learning

    Authors: Macheng Shen, Jishen Peng, Zefang Huang

    Abstract: A fundamental challenge in imitation learning is the \emph{covariate shift} problem. Existing methods to mitigate covariate shift often require additional expert interactions, access to environment dynamics, or complex adversarial training, which may not be practical in real-world applications. In this paper, we propose a simple yet effective method (DeCIL) to mitigate covariate shift by incorpora… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  31. arXiv:2503.06730  [pdf, other

    cs.LG

    Adaptive Test-Time Intervention for Concept Bottleneck Models

    Authors: Matthew Shen, Aliyah Hsu, Abhineet Agarwal, Bin Yu

    Abstract: Concept bottleneck models (CBM) aim to improve model interpretability by predicting human level "concepts" in a bottleneck within a deep learning model architecture. However, how the predicted concepts are used in predicting the target still either remains black-box or is simplified to maintain interpretability at the cost of prediction performance. We propose to use Fast Interpretable Greedy Sum-… ▽ More

    Submitted 14 April, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

  32. arXiv:2502.17806  [pdf, other

    astro-ph.SR physics.space-ph

    Radial dependence of ion fluences in the 2023 July 17 SEP event from Parker Solar Probe to STEREO and ACE

    Authors: G. D. Muro, C. M. S Cohen, Z. Xu, R. A. Leske, E. R. Christian, A. C. Cummings, G. De Nolfo, M. I. Desai, F. Fraschetti, J. Giacalone, A. Labrador, D. J. McComas, J. G. Mitchell, D. G. Mitchell, J. Rankin, N. A. Schwadron, M. Shen, M. E. Wiedenbeck, S. D. Bale, O. Romeo, A. Vourlidas

    Abstract: In the latter moments of 17 July 2023, the solar active region 13363, near the southwestern face of the Sun, was undergoing considerable evolution, which resulted in a significant solar energetic particle (SEP) event measured by Parker Solar Probe's Integrated Science Investigation of the Sun (ISOIS) and near-Earth spacecraft. Remote observations from GOES and CHASE captured two M5.0+ solar flares… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: The Astrophysical Journal: 10 pages, 13 figures

  33. arXiv:2502.16084  [pdf, other

    hep-ex

    Single Inclusive $π^\pm$ and $K^\pm$ Production in $e^+e^-$ Annihilation at center-of-mass Energies from 2.000 to 3.671GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (707 additional authors not shown)

    Abstract: Using data samples with a total integrated luminosity of 253 $\rm pb^{-1}$ collected by the BESIII detector operating at the BEPCII collider, the differential cross-sections of inclusive $π^\pm$ and $K^\pm$ production, as a function of momentum and normalized by the total hadronic cross-section, are measured at center-of-mass energies from 2.000 to 3.671 GeV. The measured $π^{\pm}$ cross sections… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

  34. arXiv:2502.15664  [pdf, ps, other

    cond-mat.stat-mech physics.comp-ph

    The Eggbox Ising Model

    Authors: Mutian Shen, Yichen Xu, Zohar Nussinov

    Abstract: We introduce a simple and versatile model that enables controlled design of rugged energy landscapes that realize different types of Parisi overlap distributions. This model captures quintessential aspects of Replica Symmetry Breaking (RSB) theory and may afford additional insights into complex systems and numerical methods for their analysis.

    Submitted 21 February, 2025; originally announced February 2025.

  35. arXiv:2502.09030  [pdf, ps, other

    math.CA

    $L^p\to L^q$ estimates for Stein's spherical maximal operators

    Authors: Naijia Liu, Minxing Shen, Liang Song, Lixin Yan

    Abstract: In this article we consider a modification of the Stein's spherical maximal operator of complex order $α$ on ${\mathbb R^n}$: $$ {\mathfrak M}^α_{[1,2]} f(x) =\sup\limits_{t\in [1,2]} \big| {1\over Γ(α) } \int_{|y|\leq 1} \left(1-|y|^2 \right)^{α-1} f(x-ty) dy\big|. $$ We show that when $n\geq 2$, suppose $\|{\mathfrak M}^α_{[1,2]} f \|_{L^q({\mathbb R^n})} \leq C\|f \|_{L^p({\mathbb R^n})}$ holds… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 14 pages, 1 figures

  36. arXiv:2502.07317  [pdf, other

    physics.ins-det hep-ex

    Position reconstruction and surface background model for the PandaX-4T detector

    Authors: Zhicheng Qian, Linhui Gu, Chen Cheng, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Zhixing Gao, Lisheng Geng, Karl Giboni, Xunan Guo, Xuyuan Guo, Zichao Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Houqi Huang, Junting Huang, Ruquan Hou , et al. (78 additional authors not shown)

    Abstract: We report the position reconstruction methods and surface background model for the PandaX-4T dark matter direct search experiment. This work develops two position reconstruction algorithms: template matching (TM) method and photon acceptance function (PAF) method. Both methods determine the horizontal position of events based on the light pattern of secondary scintillation collected by the light s… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 22 pages, 15 figures, 2 tables

  37. arXiv:2502.07165  [pdf, other

    cs.CL cs.AI

    Don't Just Demo, Teach Me the Principles: A Principle-Based Multi-Agent Prompting Strategy for Text Classification

    Authors: Peipei Wei, Dimitris Dimitriadis, Yan Xu, Mingwei Shen

    Abstract: We present PRINCIPLE-BASED PROMPTING, a simple but effective multi-agent prompting strategy for text classification. It first asks multiple LLM agents to independently generate candidate principles based on analysis of demonstration samples with or without labels, consolidates them into final principles via a finalizer agent, and then sends them to a classifier agent to perform downstream classifi… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: To be published in AAAI 2025 Workshop on Advancing LLM-Based Multi-Agent Collaboration

  38. arXiv:2502.04923  [pdf, other

    cs.CV cs.AI

    Cached Multi-Lora Composition for Multi-Concept Image Generation

    Authors: Xiandong Zou, Mingzhu Shen, Christos-Savvas Bouganis, Yiren Zhao

    Abstract: Low-Rank Adaptation (LoRA) has emerged as a widely adopted technique in text-to-image models, enabling precise rendering of multiple distinct elements, such as characters and styles, in multi-concept image generation. However, current approaches face significant challenges when composing these LoRAs for multi-concept image generation, resulting in diminished generated image quality. In this paper,… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: The Thirteenth International Conference on Learning Representations (ICLR 2025)

  39. arXiv:2502.03658  [pdf, other

    cs.LG cs.CV

    Advancing Weight and Channel Sparsification with Enhanced Saliency

    Authors: Xinglong Sun, Maying Shen, Hongxu Yin, Lei Mao, Pavlo Molchanov, Jose M. Alvarez

    Abstract: Pruning aims to accelerate and compress models by removing redundant parameters, identified by specifically designed importance scores which are usually imperfect. This removal is irreversible, often leading to subpar performance in pruned models. Dynamic sparse training, while attempting to adjust sparse structures during training for continual reassessment and refinement, has several limitations… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: Accepted at WACV 2025

  40. arXiv:2502.03017  [pdf, other

    nucl-ex

    Search for Double Beta Decay of $^{136}$Xe to the $0^+_1$ Excited State of $^{136}$Ba with PandaX-4T

    Authors: PandaX Collaboration, Lingyin Luo, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingji Fang, Deqing Fang, Zhixing Gao, Lisheng Geng, Karl Giboni, Xunan Guo, Xuyuan Guo, Zichao Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Houqi Huang, Junting Huang, Ruquan Hou, Yu Hou , et al. (76 additional authors not shown)

    Abstract: We perform a search of double beta decay of $^{136}$Xe to the excited state, $0^+_1$, of $^{136}$Ba (2$νββ$-0$_1^+$), using the dual-phase xenon detector of PandaX-4T with the first 94.9-day commissioning data. The multi-site events are reconstructed up to the MeV energy scale, which helps to improve the background model significantly. The background contribution from the stainless steel platform… ▽ More

    Submitted 7 March, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  41. arXiv:2502.02508  [pdf, ps, other

    cs.CL cs.AI

    Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

    Authors: Maohao Shen, Guangtao Zeng, Zhenting Qi, Zhang-Wei Hong, Zhenfang Chen, Wei Lu, Gregory Wornell, Subhro Das, David Cox, Chuang Gan

    Abstract: Large language models (LLMs) have demonstrated remarkable reasoning capabilities across diverse domains. Recent studies have shown that increasing test-time computation enhances LLMs' reasoning capabilities. This typically involves extensive sampling at inference time guided by an external LLM verifier, resulting in a two-player system. Despite external guidance, the effectiveness of this system d… ▽ More

    Submitted 15 June, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

  42. arXiv:2501.19208  [pdf, other

    stat.ML cs.LG math.OC

    Learning While Repositioning in On-Demand Vehicle Sharing Networks

    Authors: Hansheng Jiang, Chunlin Sun, Zuo-Jun Max Shen, Shunan Jiang

    Abstract: We consider a network inventory problem motivated by one-way, on-demand vehicle sharing services. Due to uncertainties in both demand and returns, as well as a fixed number of rental units across an $n$-location network, the service provider must periodically reposition vehicles to match supply with demand spatially while minimizing costs. The optimal repositioning policy under a general $n$-locat… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

  43. arXiv:2501.18871  [pdf, other

    cs.LG stat.ML

    Neural SDEs as a Unified Approach to Continuous-Domain Sequence Modeling

    Authors: Macheng Shen, Chen Cheng

    Abstract: Inspired by the ubiquitous use of differential equations to model continuous dynamics across diverse scientific and engineering domains, we propose a novel and intuitive approach to continuous sequence modeling. Our method interprets time-series data as \textit{discrete samples from an underlying continuous dynamical system}, and models its time evolution using Neural Stochastic Differential Equat… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  44. arXiv:2501.15942  [pdf, other

    cs.LG

    TimeHF: Billion-Scale Time Series Models Guided by Human Feedback

    Authors: Yongzhi Qi, Hao Hu, Dazhou Lei, Jianshen Zhang, Zhengxin Shi, Yulin Huang, Zhengyu Chen, Xiaoming Lin, Zuo-Jun Max Shen

    Abstract: Time series neural networks perform exceptionally well in real-world applications but encounter challenges such as limited scalability, poor generalization, and suboptimal zero-shot performance. Inspired by large language models, there is interest in developing large time series models (LTM) to address these issues. However, current methods struggle with training complexity, adapting human feedbac… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  45. arXiv:2501.15381  [pdf, other

    physics.optics

    Two-optical-cycle pulses from nanophotonic two-color soliton compression

    Authors: Robert M. Gray, Ryoto Sekine, Maximilian Shen, Thomas Zacharias, James Williams, Selina Zhou, Rahul Chawlani, Luis Ledezma, Nicolas Englebert, Alireza Marandi

    Abstract: Few- and single-cycle optical pulses and their associated ultra-broadband spectra have been crucial in the progress of ultrafast science and technology. Moreover, multi-color waveforms composed of independently manipulable ultrashort pulses in distinct spectral bands offer unique advantages in pulse synthesis and attosecond science. However, the generation and control of ultrashort pulses has requ… ▽ More

    Submitted 18 February, 2025; v1 submitted 25 January, 2025; originally announced January 2025.

    Comments: 24 pages, 5 figures

  46. arXiv:2501.14923  [pdf, other

    astro-ph.SR astro-ph.IM physics.space-ph

    Comparing Methods for Calculating Solar Energetic Particle Intensities: Re-binning versus Spectral Binning

    Authors: M. E. Cuesta, L. Y. Khoo, G. Livadiotis, M. M. Shen, J. R. Szalay, D. J. McComas, J. S. Rankin, R. Bandyopadhyay, H. A. Farooki, J. T. Niehof, C. M. S. Cohen, R. A. Leske, Z. Xu, E. R. Christian, M. I. Desai, M. A. Dayeh

    Abstract: Solar energetic particle (SEP) events have been observed for decades in the interplanetary medium by spacecraft measuring the intensity of energetic ions and electrons. These intensities provide valuable information about particle acceleration, the effects of bulk plasma dynamics on particle transport, and the anisotropy of particle distributions. Since measured intensities are typically reported… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 17 pages, 9 Figures, Accepted for Publication in ApJS

  47. arXiv:2501.07329  [pdf, other

    cs.SD cs.CL eess.AS

    Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding

    Authors: Jiliang Hu, Zuchao Li, Mengjia Shen, Haojun Ai, Sheng Li, Jun Zhang

    Abstract: Spoken language understanding (SLU) is a structure prediction task in the field of speech. Recently, many works on SLU that treat it as a sequence-to-sequence task have achieved great success. However, This method is not suitable for simultaneous speech recognition and understanding. In this paper, we propose a joint speech recognition and structure learning framework (JSRSL), an end-to-end SLU mo… ▽ More

    Submitted 17 January, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: 5 pages, 2 figures, accepted by ICASSP 2025

  48. arXiv:2501.03880  [pdf, other

    eess.IV cs.CV cs.LG

    SELMA3D challenge: Self-supervised learning for 3D light-sheet microscopy image segmentation

    Authors: Ying Chen, Rami Al-Maskari, Izabela Horvath, Mayar Ali, Luciano Hoher, Kaiyuan Yang, Zengming Lin, Zhiwei Zhai, Mengzhe Shen, Dejin Xun, Yi Wang, Tony Xu, Maged Goubran, Yunheng Wu, Kensaku Mori, Johannes C. Paetzold, Ali Erturk

    Abstract: Recent innovations in light sheet microscopy, paired with developments in tissue clearing techniques, enable the 3D imaging of large mammalian tissues with cellular resolution. Combined with the progress in large-scale data analysis, driven by deep learning, these innovations empower researchers to rapidly investigate the morphological and functional properties of diverse biological samples. Segme… ▽ More

    Submitted 12 January, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

    Comments: 2st version

  49. Search for Solar Boosted Dark Matter Particles at the PandaX-4T Experiment

    Authors: Guofang Shen, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Zhixing Gao, Lisheng Geng, Karl Giboni, Xunan Guo, Xuyuan Guo, Zichao Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Houqi Huang, Junting Huang, Ruquan Hou, Yu Hou, Xiangdong Ji , et al. (78 additional authors not shown)

    Abstract: We present a novel constraint on light dark matter utilizing $1.54$ tonne$\cdot$year of data acquired from the PandaX-4T dual-phase xenon time projection chamber. This constraint is derived through detecting electronic recoil signals resulting from the interaction with solar-enhanced dark matter flux. Low-mass dark matter particles, lighter than a few MeV/$c^2$, can scatter with the thermal electr… ▽ More

    Submitted 12 May, 2025; v1 submitted 27 December, 2024; originally announced December 2024.

  50. arXiv:2412.18028  [pdf

    astro-ph.EP astro-ph.SR physics.space-ph

    Diverse dust populations in the near-Sun environment characterized by PSP/IS$\odot$IS

    Authors: M. M. Shen, J. R. Szalay, P. Pokorný, J. G. Mitchell, M. E. Hill, D. G. Mitchell, D. J. McComas, E. R. Christian, C. M. S. Cohen, N. A. Schwadron, S. D. Bale, D. M. Malaspina

    Abstract: The Integrated Science Investigation of the Sun (IS$\odot$IS) energetic particle instrument suite on Parker Solar Probe is dedicated to measuring energetic ions and electrons in the near-Sun environment. It includes a half-sky-viewing time-of-flight mass spectrometer (EPI-Lo) and five high-energy silicon solid-state detector-telescopes (EPI-Hi). To August 2024, eight of EPI-Lo's eighty separate te… ▽ More

    Submitted 28 December, 2024; v1 submitted 23 December, 2024; originally announced December 2024.