
Showing 51–100 of 10,762 results for author: Chen, J

  1. arXiv:2506.10395  [pdf, ps, other]

    cs.CV cs.AI

    Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation

    Authors: Zhiyang Xu, Jiuhai Chen, Zhaojiang Lin, Xichen Pan, Lifu Huang, Tianyi Zhou, Madian Khabsa, Qifan Wang, Di Jin, Michihiro Yasunaga, Lili Yu, Xi Victoria Lin, Shaoliang Nie

    Abstract: Recent advances in large language models (LLMs) have enabled multimodal foundation models to tackle both image understanding and generation within a unified framework. Despite these gains, unified models often underperform compared to specialized models in either task. A key challenge in developing unified models lies in the inherent differences between the visual features needed for image underst…

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Unified image understanding and generation model

  2. arXiv:2506.10316  [pdf, ps, other]

    hep-ex

    Search for sub-GeV invisible particles in inclusive decays of $J/ψ$ to $φ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (704 additional authors not shown)

    Abstract: A search for an invisible particle, $X$, with a mass between 0 and 0.96 $\textrm{GeV}/\textit{c}^{2}$, is performed in the process $J/ψ\rightarrowφ+ X$ using $(8774.0\pm39.4)\times10^{6}$ $J/ψ$ events collected with the BESIII detector from 2017 to 2019. The $φ$ meson is fully reconstructed and an efficient veto of photons, neutral and charged hadrons up to twice the $K_L^0$ mass is applied to the…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 10 pages, 3 figures

  3. arXiv:2506.10300  [pdf]

    physics.atom-ph

    Revisiting the electron affinity of selenium

    Authors: Rui Zhang, Wenru Jie, Jiayi Chen, Qihan Liu, Chuangang Ning

    Abstract: The electron affinity (EA) of atomic selenium, previously established as 16,297.276(9) cm-1 based on the laser photodetachment microscopy (LPM) measurements in 2012, exhibited a significant deviation from other earlier experimental values, yet it remained the accepted reference standard for over a decade. In this letter, we re-examined the EA of Se using the slow-electron velocity-map imaging meth…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 5 pages, 4 figures

  4. arXiv:2506.10035  [pdf, ps, other]

    cs.GR cs.AI

    FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training

    Authors: Fuhan Cai, Yong Guo, Jie Li, Wenbo Li, Xiangzhong Fang, Jian Chen

    Abstract: Recent advancements in text-to-image (T2I) generation have led to the emergence of highly expressive models such as diffusion transformers (DiTs), exemplified by FLUX. However, their massive parameter sizes lead to slow inference, high memory usage, and poor deployability. Existing acceleration methods (e.g., single-step distillation and attention pruning) often suffer from significant performance…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 14 pages

  5. arXiv:2506.09924  [pdf, ps, other]

    math.OC

    On the Linear Programming Model for Dynamic Stochastic Matching and Its Application on Pricing

    Authors: Junlin Chen, Chiwei Yan, Hai Jiang

    Abstract: Important pricing problems in centralized matching markets -- such as carpooling and food delivery platforms -- often exhibit a bi-level structure. At the upper level, the platform sets prices for different types of agents (e.g., riders with various origins and destinations, or delivery orders from diverse restaurants to customer locations). The lower level then matches converted agents to minimiz…

    Submitted 11 June, 2025; originally announced June 2025.

  6. arXiv:2506.09724  [pdf, ps, other]

    cs.CV

    The Four Color Theorem for Cell Instance Segmentation

    Authors: Ye Zhang, Yu Zhou, Yifeng Wang, Jun Xiao, Ziyue Wang, Yongbing Zhang, Jianxu Chen

    Abstract: Cell instance segmentation is critical to analyzing biomedical images, yet accurately distinguishing tightly touching cells remains a persistent challenge. Existing instance segmentation frameworks, including detection-based, contour-based, and distance mapping-based approaches, have made significant progress, but balancing model performance with computational efficiency remains an open problem. I…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: Accepted at ICML 2025

  7. arXiv:2506.09677  [pdf, ps, other]

    cs.CV cs.AI

    Reasoning Models Are More Easily Gaslighted Than You Think

    Authors: Bin Zhu, Hailong Yin, Jingjing Chen, Yu-Gang Jiang

    Abstract: Recent advances in reasoning-centric models promise improved robustness through mechanisms such as chain-of-thought prompting and test-time scaling. However, their ability to withstand misleading user input remains underexplored. In this paper, we conduct a systematic evaluation of three state-of-the-art reasoning models, i.e., OpenAI's o4-mini, Claude-3.7-Sonnet and Gemini-2.5-Flash, across three…

    Submitted 11 June, 2025; originally announced June 2025.

  8. arXiv:2506.09553  [pdf, ps, other]

    cs.CV

    GLD-Road: A global-local decoding road network extraction model for remote sensing images

    Authors: Ligao Deng, Yupeng Deng, Yu Meng, Jingbo Chen, Zhihao Xi, Diyou Liu, Qifeng Chu

    Abstract: Road networks are crucial for mapping, autonomous driving, and disaster response. While manual annotation is costly, deep learning offers efficient extraction. Current methods include postprocessing (prone to errors), global parallel (fast but misses nodes), and local iterative (accurate but slow). We propose GLD-Road, a two-stage model combining global efficiency and local precision. First, it de…

    Submitted 11 June, 2025; originally announced June 2025.

  9. arXiv:2506.09525  [pdf, ps, other]

    cs.CR

    Beyond Personalization: Federated Recommendation with Calibration via Low-rank Decomposition

    Authors: Jundong Chen, Honglei Zhang, Haoxuan Li, Chunxu Zhang, Zhiwei Li, Yidong Li

    Abstract: Federated recommendation (FR) is a promising paradigm to protect user privacy in recommender systems. Distinct from general federated scenarios, FR inherently needs to preserve client-specific parameters, i.e., user embeddings, for privacy and personalization. However, we empirically find that globally aggregated item embeddings can induce skew in user embeddings, resulting in suboptimal performan…

    Submitted 11 June, 2025; originally announced June 2025.

  10. arXiv:2506.09518  [pdf, other]

    cs.CV

    HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene

    Authors: Jianing Chen, Zehao Li, Yujun Cai, Hao Jiang, Chengxuan Qian, Juyuan Kang, Shuqin Gao, Honglong Zhao, Tianlu Mao, Yucheng Zhang

    Abstract: Reconstructing dynamic 3D scenes from monocular videos remains a fundamental challenge in 3D vision. While 3D Gaussian Splatting (3DGS) achieves real-time rendering in static settings, extending it to dynamic scenes is challenging due to the difficulty of learning structured and temporally consistent motion representations. This challenge often manifests as three limitations in existing methods: r…

    Submitted 11 June, 2025; originally announced June 2025.

  11. arXiv:2506.09454  [pdf, ps, other]

    cs.LG

    NDCG-Consistent Softmax Approximation with Accelerated Convergence

    Authors: Yuanhao Pu, Defu Lian, Xiaolong Chen, Xu Huang, Jin Chen, Enhong Chen

    Abstract: Ranking tasks constitute fundamental components of extreme similarity learning frameworks, where extremely large corpora of objects are modeled through relative similarity relationships adhering to predefined ordinal structures. Among various ranking surrogates, Softmax (SM) Loss has been widely adopted due to its natural capability to handle listwise ranking via global negative comparisons, along…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 35 pages

  12. arXiv:2506.09386  [pdf, ps, other]

    hep-ex

    Search for the charmonium weak decays $J/ψ\to D_{s}^{-}ρ^{+}+c.c.$ and $J/ψ\to D_{s}^{-}π^{+}+c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (705 additional authors not shown)

    Abstract: Based on $(10087\pm44)\times 10^6$ $J/ψ$ events recorded with the BESIII detector, we search for the rare charmonium weak decays $J/ψ\to D_{s}^{-}ρ^{+}+c.c.$ and $J/ψ\to D_{s}^{-}π^{+}+c.c.$ No signal is observed, and upper limits on the branching fractions at the $90\%$ confidence level are set as $\mathcal{B}(J/ψ\to D_{s}^{-}ρ^{+}+c.c.)<8.0\times10^{-7}$ and…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 18 pages, 3 figures

  13. New Symbiotic Stars from LAMOST DR10 Spectra and Multi-band Photometry

    Authors: Jing Chen, Liang Wang, Yin-Bi Li, Xiao-Xiao Ma, A-Li Luo, Zi-Chong Zhang, Ming-Yi Ding, Kai Zhang

    Abstract: A symbiotic star (SySt) is a long-period interacting binary system, typically consisting of a white dwarf and a red giant surrounded by a nebula. These systems are natural astrophysical laboratories for investigating binary star evolution. In this paper, we identified nine SySts from the LAMOST DR10 low-resolution spectra survey, seven of which were previously known, while two are newly identified. In…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 13 pages, 8 figures, Accepted by ApJ

  14. arXiv:2506.09344  [pdf, ps, other]

    cs.AI cs.CL cs.CV cs.LG cs.SD eess.AS

    Ming-Omni: A Unified Multimodal Model for Perception and Generation

    Authors: Inclusion AI, Biao Gong, Cheng Zou, Chuanyang Zheng, Chunluan Zhou, Canxiang Yan, Chunxiang Jin, Chunjie Shen, Dandan Zheng, Fudong Wang, Furong Xu, GuangMing Yao, Jun Zhou, Jingdong Chen, Jianxin Sun, Jiajia Liu, Jianjiang Zhu, Jun Peng, Kaixiang Ji, Kaiyou Song, Kaimeng Ren, Libin Wang, Lixiang Ru, Lele Xie, Longhua Tan , et al. (33 additional authors not shown)

    Abstract: We propose Ming-Omni, a unified multimodal model capable of processing images, text, audio, and video, while demonstrating strong proficiency in both speech and image generation. Ming-Omni employs dedicated encoders to extract tokens from different modalities, which are then processed by Ling, an MoE architecture equipped with newly proposed modality-specific routers. This design enables a single…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 18 pages, 8 figures

  15. arXiv:2506.09240  [pdf]

    cond-mat.mtrl-sci

    Electron mobility in AlN from first principles

    Authors: Amanda Wang, Nick Pant, Woncheol Lee, Jie-Cheng Chen, Feliciano Giustino, Emmanouil Kioupakis

    Abstract: Aluminum nitride is a promising ultra-wide band gap semiconductor for optoelectronics and power electronics. However, its practical applications have been limited by challenges with doping and achieving high electrical conductivity. Recent advances in crystal quality and defect control have led to improvements in experimentally measured mobilities. In this work, we apply first-principles calculati…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 18 pages, 2 figures in main text, 2 in supplementary

  16. arXiv:2506.09175  [pdf, ps, other]

    cs.CL cs.AI cs.SD eess.AS

    PHRASED: Phrase Dictionary Biasing for Speech Translation

    Authors: Peidong Wang, Jian Xue, Rui Zhao, Junkun Chen, Aswin Shanmugam Subramanian, Jinyu Li

    Abstract: Phrases are essential to understand the core concepts in conversations. However, due to their rare occurrence in training data, correct translation of phrases is challenging in speech translation tasks. In this paper, we propose a phrase dictionary biasing method to leverage pairs of phrases mapping from the source language to the target language. We apply the phrase dictionary biasing method to t…

    Submitted 10 June, 2025; originally announced June 2025.

  17. arXiv:2506.09114  [pdf, other]

    cs.LG

    TRACE: Grounding Time Series in Context for Multimodal Embedding and Retrieval

    Authors: Jialin Chen, Ziyu Zhao, Gaukhar Nurbek, Aosong Feng, Ali Maatouk, Leandros Tassiulas, Yifeng Gao, Rex Ying

    Abstract: The ubiquity of dynamic data in domains such as weather, healthcare, and energy underscores a growing need for effective interpretation and retrieval of time-series data. These data are inherently tied to domain-specific contexts, such as clinical notes or weather narratives, making cross-modal retrieval essential not only for downstream tasks but also for developing robust time-series foundation…

    Submitted 10 June, 2025; originally announced June 2025.

  18. arXiv:2506.09080  [pdf, other]

    cs.LG cs.AI q-fin.CP

    FinHEAR: Human Expertise and Adaptive Risk-Aware Temporal Reasoning for Financial Decision-Making

    Authors: Jiaxiang Chen, Mingxi Zou, Zhuo Wang, Qifan Wang, Dongning Sun, Chi Zhang, Zenglin Xu

    Abstract: Financial decision-making presents unique challenges for language models, demanding temporal reasoning, adaptive risk assessment, and responsiveness to dynamic events. While large language models (LLMs) show strong general reasoning capabilities, they often fail to capture behavioral patterns central to human financial decisions, such as expert reliance under information asymmetry, loss-averse sens…

    Submitted 10 June, 2025; originally announced June 2025.

  19. arXiv:2506.09052  [pdf]

    cs.LG cs.AI q-bio.QM

    Llama-Affinity: A Predictive Antibody Antigen Binding Model Integrating Antibody Sequences with Llama3 Backbone Architecture

    Authors: Delower Hossain, Ehsan Saghapour, Kevin Song, Jake Y. Chen

    Abstract: Antibody-facilitated immune responses are central to the body's defense against pathogens, viruses, and other foreign invaders. The ability of antibodies to specifically bind and neutralize antigens is vital for maintaining immunity. Over the past few decades, bioengineering advancements have significantly accelerated therapeutic antibody development. These antibody-derived drugs have shown remark…

    Submitted 17 May, 2025; originally announced June 2025.

    Comments: 7 Pages

  20. arXiv:2506.09022  [pdf, ps, other]

    cs.CV

    Do Multiple Instance Learning Models Transfer?

    Authors: Daniel Shao, Richard J. Chen, Andrew H. Song, Joel Runevic, Ming Y. Lu, Tong Ding, Faisal Mahmood

    Abstract: Multiple Instance Learning (MIL) is a cornerstone approach in computational pathology (CPath) for generating clinically meaningful slide-level embeddings from gigapixel tissue images. However, MIL often struggles with small, weakly supervised clinical datasets. In contrast to fields such as NLP and conventional computer vision, where transfer learning is widely used to address data scarcity, the t…

    Submitted 11 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: ICML 2025 (Spotlight). 20 pages, 8 figures

  21. arXiv:2506.08967  [pdf, ps, other]

    cs.SD cs.CL eess.AS

    Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

    Authors: Ailin Huang, Bingxin Li, Bruce Wang, Boyong Wu, Chao Yan, Chengli Feng, Heng Wang, Hongyu Zhou, Hongyuan Wang, Jingbei Li, Jianjian Sun, Joanna Wang, Mingrui Chen, Peng Liu, Ruihang Miao, Shilei Jiang, Tian Fei, Wang You, Xi Chen, Xuerui Yang, Yechang Huang, Yuxiang Zhang, Zheng Ge, Zheng Gong, Zhewei Huang , et al. (51 additional authors not shown)

    Abstract: Large Audio-Language Models (LALMs) have significantly advanced intelligent human-computer interaction, yet their reliance on text-based outputs limits their ability to generate natural speech responses directly, hindering seamless audio interactions. To address this, we introduce Step-Audio-AQAA, a fully end-to-end LALM designed for Audio Query-Audio Answer (AQAA) tasks. The model integrates a du…

    Submitted 13 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: 12 pages, 3 figures

  22. arXiv:2506.08898  [pdf, ps, other]

    cs.AI

    Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation

    Authors: Mingfeng Fan, Jianan Zhou, Yifeng Zhang, Yaoxin Wu, Jinbiao Chen, Guillaume Adrien Sartoretti

    Abstract: Recent deep reinforcement learning methods have achieved remarkable success in solving multi-objective combinatorial optimization problems (MOCOPs) by decomposing them into multiple subproblems, each associated with a specific weight vector. However, these methods typically treat all subproblems equally and solve them using a single model, hindering the effective exploration of the solution space…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 22 pages, 6 figures, under review

  23. arXiv:2506.08806  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Ferromagnetic Two-dimensional Electron Gases with Magnetic Doping and Proximity Effects

    Authors: Zixin Fan, Jiale Chen, Qiangtao Sui, Haoming Ling, Zihao Wang, Lingyuan Kong, Dingyi Li, Fang Yang, Run Zhao, Hanghui Chen, Pan Chen, Yan Liang, Jiandi Zhang

    Abstract: The advent of magnetic two-dimensional electron gases (2DEGs) at oxide interfaces has provided new opportunities in the field of spintronics. The enhancement of magnetism in 2DEGs at oxide interfaces continues to be a significant challenge, as exemplified by the relatively weak magnetism observed in the classical LaAlO3/SrTiO3 interface. Here, we present ferromagnetic (FM) 2DEGs at the interface f…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 21 pages and 5 figures

    Journal ref: Phys. Rev. B 111, 235415 (2025)

  24. arXiv:2506.08624  [pdf, ps, other]

    nucl-ex

    Measurement of $ψ(2S)$ to $J/ψ$ cross-section ratio as function of multiplicity in $p$Pb collisions at $\sqrt{s_{NN}} = 8.16$ TeV

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis, L. An , et al. (1137 additional authors not shown)

    Abstract: The production ratio of $ψ(2S)$ to $J/ψ$ charmonium states is presented as a function of multiplicity in proton-lead collisions at a centre-of-mass energy of $\sqrt{s_{NN}}=8.16$ TeV, for both prompt and nonprompt sources. The total luminosity recorded by the LHCb experiment corresponds to 13.6 $pb^{-1}$ for $p$Pb collisions and 20.8 $pb^{-1}$ for Pb$p$ collisions, where the first particle indicat…

    Submitted 12 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/4177/ (LHCb public pages)

    Report number: LHCb-PAPER-2025-011, CERN-EP-2025-114

  25. arXiv:2506.08576  [pdf, ps, other]

    hep-ex

    Measurement of the $η$ transition form factor through $η' \rightarrow π^+π^-η$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: Based on a sample of $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected at BESIII, the transition form factor of the $η$ meson is extracted by analyzing $J/ψ\toγη',~η'\toπ^+π^-η,~η\toγl^+l^-$ ($l$=$e$, $μ$) events. The measured slope of the transition form factor is $Λ^{-2}=1.645\pm0.093_{\rm stat.}\pm {0.024_{\rm sys.}}$ (GeV/$c^2$)$^{-2}$ for the di-electron channel and…

    Submitted 10 June, 2025; originally announced June 2025.

  26. arXiv:2506.08534  [pdf, ps, other]

    eess.IV cs.AI cs.CV

    DCD: A Semantic Segmentation Model for Fetal Ultrasound Four-Chamber View

    Authors: Donglian Li, Hui Guo, Minglang Chen, Huizhen Chen, Jialing Chen, Bocheng Liang, Pengchen Liang, Ying Tan

    Abstract: Accurate segmentation of anatomical structures in the apical four-chamber (A4C) view of fetal echocardiography is essential for early diagnosis and prenatal evaluation of congenital heart disease (CHD). However, precise segmentation remains challenging due to ultrasound artifacts, speckle noise, anatomical variability, and boundary ambiguity across different gestational stages. To reduce the workl…

    Submitted 10 June, 2025; originally announced June 2025.

  27. arXiv:2506.08527  [pdf, ps, other]

    astro-ph.GA astro-ph.HE

    The Compton-thick AGN Population and the $N_{\rm H}$ Distribution of Low-mass AGN in our Cosmic Backyard

    Authors: A. Annuar, D. M. Alexander, P. Gandhi, G. B. Lansbury, M. N. Rosli, D. Stern, D. Asmus, D. R. Ballantyne, M. Baloković, F. E. Bauer, P. G. Boorman, W. N. Brandt, M. Brightman, C. T. J. Chen, A. Del Moro, D. Farrah, F. A. Harrison, M. J. Koss, L. Lanz, S. Marchesi, P. Mohanadas, E. Nardini, C. Ricci, L. Zappacosta

    Abstract: We present a census of the Compton-thick (CT) active galactic nucleus (AGN) population and the column density ($N_{\rm{H}}$) distribution of AGN in our cosmic backyard using a mid-infrared selected AGN sample within 15 Mpc. The column densities are measured from broadband X-ray spectral analysis, mainly using data from $\textit{Chandra}$ and $\textit{NuSTAR}$. Our sample probes AGN with intrinsic…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 22 pages, 18 figures, 3 tables. Accepted for publication in MNRAS on 5 June 2025

  28. arXiv:2506.08403  [pdf, ps, other]

    cs.CL cs.AI

    TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration

    Authors: Weiya Li, Junjie Chen, Bei Li, Boyang Liu, Zichen Wen, Nuanqiao Shan, Xiaoqian Liu, Anping Liu, Huajie Liu, Hu Song, Linfeng Zhang

    Abstract: Machine translation has long been a central task in natural language processing. With the rapid advancement of large language models (LLMs), there has been remarkable progress in translation quality. However, fully realizing the translation potential of LLMs remains an open challenge. Recent studies have explored multi-agent systems to decompose complex translation tasks into collaborative subtask…

    Submitted 11 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

    Comments: 20 pages, 4 figures, Under review. Code: https://github.com/weiyali126/TACTIC

  29. arXiv:2506.08383  [pdf]

    cs.LG cs.CR

    Network Threat Detection: Addressing Class Imbalanced Data with Deep Forest

    Authors: Jiaqi Chen, Rongbin Ye

    Abstract: With the rapid expansion of Internet of Things (IoT) networks, detecting malicious traffic in real-time has become a critical cybersecurity challenge. This research addresses the detection challenges by presenting a comprehensive empirical analysis of machine learning techniques for malware detection using the IoT-23 dataset provided by the Stratosphere Laboratory. We address the significant class…

    Submitted 9 June, 2025; originally announced June 2025.

  30. arXiv:2506.07999  [pdf, ps, other]

    cs.CV cs.LG

    MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

    Authors: Junhao Chen, Yulia Tsvetkov, Xiaochuang Han

    Abstract: Recent progress in multimodal generation has increasingly combined autoregressive (AR) and diffusion-based approaches, leveraging their complementary strengths: AR models capture long-range dependencies and produce fluent, context-aware outputs, while diffusion models operate in continuous latent spaces to refine high-fidelity visual details. However, existing hybrids often lack systematic guidanc…

    Submitted 9 June, 2025; originally announced June 2025.

  31. arXiv:2506.07992  [pdf, ps, other]

    cs.CV

    PairEdit: Learning Semantic Variations for Exemplar-based Image Editing

    Authors: Haoguang Lu, Jiacheng Chen, Zhenguo Yang, Aurele Tohokantche Gnanha, Fu Lee Wang, Li Qing, Xudong Mao

    Abstract: Recent advancements in text-guided image editing have achieved notable success by leveraging natural language prompts for fine-grained semantic control. However, certain editing semantics are challenging to specify precisely using textual descriptions alone. A practical alternative involves learning editing semantics from paired source-target examples. Existing exemplar-based editing methods still…

    Submitted 9 June, 2025; originally announced June 2025.

  32. arXiv:2506.07907  [pdf, ps, other]

    hep-ex

    A novel measurement of the strong-phase difference between $D^0\to K^-π^+$ and $\bar{D}^0\to K^-π^+$ decays using $C$-even and $C$-odd quantum-correlated $D\bar{D}$ pairs

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (707 additional authors not shown)

    Abstract: A novel measurement technique of strong-phase differences between the decay amplitudes of $D^0$ and $\bar{D}^0$ mesons is introduced which exploits quantum-correlated $D\bar{D}$ pairs produced by $e^+e^-$ collisions at energies above the $ψ(3770)$ production threshold, where $D\bar{D}$ pairs are produced in both even and odd eigenstates of the charge-conjugation symmetry. Employing this technique,…

    Submitted 10 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  33. arXiv:2506.07906  [pdf, ps, other]

    hep-ex

    First observation of quantum correlations in $e^+e^-\to XD\bar{D}$ and $C$-even constrained $D\bar{D}$ pairs

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (707 additional authors not shown)

    Abstract: The study of meson pairs produced with quantum correlations gives direct access to parameters that are challenging to measure in other systems. In this Letter, the existence of quantum correlations due to charge-conjugation symmetry $C$ are demonstrated in $D\bar{D}$ pairs produced through the processes $e^+e^-\to D\bar{D}$, $e^+e^- \to D^{*}\bar{D}$, and $e^+e^- \to D^{*} \bar{D}^*$, where the la…

    Submitted 10 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  34. arXiv:2506.07820  [pdf, ps, other]

    cs.AI

    Guideline Forest: Experience-Induced Multi-Guideline Reasoning with Stepwise Aggregation

    Authors: Jiaxiang Chen, Zhuo Wang, Mingxi Zou, Qifan Wang, Zenglin Xu

    Abstract: Human reasoning is flexible, adaptive, and grounded in prior experience, qualities that large language models (LLMs) still struggle to emulate. Existing methods either explore diverse reasoning paths at inference time or search for optimal workflows through expensive operations, but both fall short in leveraging multiple reusable strategies in a structured, efficient manner. We propose Guideline Fo…

    Submitted 9 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  35. arXiv:2506.07652  [pdf, ps, other]

    cs.CV cs.AI

    FMaMIL: Frequency-Driven Mamba Multi-Instance Learning for Weakly Supervised Lesion Segmentation in Medical Images

    Authors: Hangbei Cheng, Xiaorong Dong, Xueyu Liu, Jianan Zhang, Xuetao Ma, Mingqiang Wei, Liansheng Wang, Junxin Chen, Yongfei Wu

    Abstract: Accurate lesion segmentation in histopathology images is essential for diagnostic interpretation and quantitative analysis, yet it remains challenging due to the limited availability of costly pixel-level annotations. To address this, we propose FMaMIL, a novel two-stage framework for weakly supervised lesion segmentation based solely on image-level labels. In the first stage, a lightweight Mamba-…

    Submitted 9 June, 2025; originally announced June 2025.

  36. Addressing Correlated Latent Exogenous Variables in Debiased Recommender Systems

    Authors: Shuqiang Zhang, Yuchao Zhang, Jinkun Chen, Haochen Sui

    Abstract: Recommendation systems (RS) aim to provide personalized content, but they face a challenge in unbiased learning due to selection bias, where users only interact with items they prefer. This bias leads to a distorted representation of user preferences, which hinders the accuracy and fairness of recommendations. To address the issue, various methods such as error imputation based, inverse propensity…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: In Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2 (KDD '25), August 3--7, 2025, Toronto, ON, Canada

  37. arXiv:2506.07503  [pdf]

    cs.SE

    Large Language Models for Multilingual Vulnerability Detection: How Far Are We?

    Authors: Honglin Shu, Michael Fu, Junji Yu, Dong Wang, Chakkrit Tantithamthavorn, Junjie Chen, Yasutaka Kamei

    Abstract: Various deep learning-based approaches utilizing pre-trained language models (PLMs) have been proposed for automated vulnerability detection. With recent advancements in large language models (LLMs), several studies have begun exploring their application to vulnerability detection tasks. However, existing studies primarily focus on specific programming languages (e.g., C/C++) and function-level de…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 33 pages, 9 figures

  38. arXiv:2506.07463  [pdf, ps, other]

    cs.CL cs.AI

    CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models

    Authors: Guang Liu, Liangdong Wang, Jijie Li, Yang Yu, Yao Xu, Jiabei Chen, Yu Bai, Feng Liao, Yonghua Lin

    Abstract: We introduce CCI4.0, a large-scale bilingual pre-training dataset engineered for superior data quality and diverse human-like reasoning trajectory. CCI4.0 occupies roughly $35$ TB of disk space and comprises two sub-datasets: CCI4.0-M2-Base and CCI4.0-M2-CoT. CCI4.0-M2-Base combines a $5.2$ TB carefully curated Chinese web corpus, a $22.5$ TB English subset from Nemotron-CC, and diverse sources fr…

    Submitted 9 June, 2025; originally announced June 2025.

  39. arXiv:2506.07431  [pdf, ps, other]

    cs.CV cs.AI

    FAMSeg: Fetal Femur and Cranial Ultrasound Segmentation Using Feature-Aware Attention and Mamba Enhancement

    Authors: Jie He, Minglang Chen, Minying Lu, Bocheng Liang, Junming Wei, Guiyan Peng, Jiaxi Chen, Ying Tan

    Abstract: Accurate ultrasound image segmentation is a prerequisite for precise biometrics and accurate assessment. Relying on manual delineation introduces significant errors and is time-consuming. However, existing segmentation models are designed based on objects in natural scenes, making them difficult to adapt to ultrasound objects with high noise and high similarity. This is particularly evident in sma…

    Submitted 14 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  40. arXiv:2506.07368  [pdf, other]

    cs.CV cs.AI

    C3S3: Complementary Competition and Contrastive Selection for Semi-Supervised Medical Image Segmentation

    Authors: Jiaying He, Yitong Lin, Jiahe Chen, Honghui Xu, Jianwei Zheng

    Abstract: For the immanent challenge of insufficiently annotated samples in the medical field, semi-supervised medical image segmentation (SSMIS) offers a promising solution. Despite achieving impressive results in delineating primary target areas, most current methodologies struggle to precisely capture the subtle details of boundaries. This deficiency often leads to significant diagnostic inaccuracies. To…

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: 6 pages, 4 figures, ICME2025

  41. arXiv:2506.07351  [pdf, ps, other]

    math.OC cs.LG eess.SY

    Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking

    Authors: Jun Chen, Lina Liu, Tianyi Zhu, Yong Liu, Guang Dai, Yunliang Jiang, Ivor W. Tsang

    Abstract: This paper considers the problem of decentralized optimization on compact submanifolds, where a finite sum of smooth (possibly non-convex) local functions is minimized by $n$ agents forming an undirected and connected graph. However, the efficiency of distributed optimization is often hindered by communication bottlenecks. To mitigate this, we propose the Quantized Riemannian Gradient Tracking (Q-…

    Submitted 8 June, 2025; originally announced June 2025.

  42. arXiv:2506.07346  [pdf, ps, other]

    math.AP

    Another look at quasilinear Schrödinger equations with prescribed mass via dual method

    Authors: Jianhua Chen, Vicentiu D. Radulescu, Jijiang Sun, Jian Zhang

    Abstract: In this paper, we aim to study the existence of ground state normalized solutions for the following quasilinear Schrödinger equation $-Δu-Δ(u^2)u=h(u)+λu,\,\, x\in\R^N$, under the mass constraint $\int_{\R^N}|u|^2\text{d}x=a,$ where $N\geq2$, $a>0$ is a given mass, $λ$ is a Lagrange multiplier and $h$ is a nonlinear reaction term with some suitable conditions. By employing a suitable transformatio…

    Submitted 8 June, 2025; originally announced June 2025.

  43. arXiv:2506.07165  [pdf, ps, other]

    cs.LG cs.AI

    AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models

    Authors: Qi Liu, Jingqing Ruan, Hao Li, Haodong Zhao, Desheng Wang, Jiansong Chen, Wan Guanglu, Xunliang Cai, Zhi Zheng, Tong Xu

    Abstract: Existing multi-objective preference alignment methods for large language models (LLMs) face limitations: (1) the inability to effectively balance various preference dimensions, and (2) reliance on auxiliary reward/reference models introduces computational complexity. To address these challenges, we propose Adaptive Multi-objective Preference Optimization (AMoPO), a novel framework that achieves dy…

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: Accepted by ACL 2025

  44. arXiv:2506.07084  [pdf, ps, other]

    math.NA

    The PML method for calculating the propagating modes of electromagnetic wave in periodic structures

    Authors: Lide Cai, Junqing Chen, Yanpeng Gao

    Abstract: When the electromagnetic wave is incident on the periodic structures, in addition to the scattering field, some propagating modes that are traveling in the periodic medium could be generated. In the present paper, we study the calculation of propagating modes. We formulate the problem as a nonlinear eigenvalue problem in an unbounded periodic domain. Then we use perfectly matched layers to truncat…

    Submitted 8 June, 2025; originally announced June 2025.

  45. arXiv:2506.06952  [pdf, ps, other]

    cs.CV

    LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

    Authors: Ying Shen, Zhiyang Xu, Jiuhai Chen, Shizhe Diao, Jiaxin Zhang, Yuguang Yao, Joy Rimchala, Ismini Lourentzou, Lifu Huang

    Abstract: Recent advances in multimodal foundation models unifying image understanding and generation have opened exciting avenues for tackling a wide range of vision-language tasks within a single framework. Despite progress, existing unified models typically require extensive pretraining and struggle to achieve the same level of performance compared to models dedicated to each task. Additionally, many of…

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: Unified multimodal model, Flow-matching

  46. arXiv:2506.06664  [pdf, ps, other]

    cs.RO cs.CV

    Generalized Trajectory Scoring for End-to-end Multimodal Planning

    Authors: Zhenxin Li, Wenhao Yao, Zi Wang, Xinglong Sun, Joshua Chen, Nadine Chang, Maying Shen, Zuxuan Wu, Shiyi Lan, Jose M. Alvarez

    Abstract: End-to-end multi-modal planning is a promising paradigm in autonomous driving, enabling decision-making with diverse trajectory candidates. A key component is a robust trajectory scorer capable of selecting the optimal trajectory from these candidates. While recent trajectory scorers focus on scoring either large sets of static trajectories or small sets of dynamically generated ones, both approac…

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: The 1st place solution of the End-to-end Driving Track at the CVPR 2025 Autonomous Grand Challenge

  47. arXiv:2506.06541  [pdf, ps, other]

    cs.DB cs.AI cs.MA

    KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes

    Authors: Eugenie Lai, Gerardo Vitagliano, Ziyu Zhang, Sivaprasad Sudhir, Om Chabra, Anna Zeng, Anton A. Zabreyko, Chenning Li, Ferdi Kossmann, Jialin Ding, Jun Chen, Markos Markakis, Matthew Russo, Weiyang Wang, Ziniu Wu, Michael J. Cafarella, Lei Cao, Samuel Madden, Tim Kraska

    Abstract: Constructing real-world data-to-insight pipelines often involves data extraction from data lakes, data integration across heterogeneous data sources, and diverse operations from data cleaning to analysis. The design and implementation of data science pipelines require domain knowledge, technical expertise, and even project-specific insights. AI systems have shown remarkable reasoning, coding, and…

    Submitted 6 June, 2025; originally announced June 2025.

  48. arXiv:2506.06362  [pdf, ps, other]

    cs.NE cs.AI cs.LG

    CR-BLEA: Contrastive Ranking for Adaptive Resource Allocation in Bilevel Evolutionary Algorithms

    Authors: Dejun Xu, Jijia Chen, Gary G. Yen, Min Jiang

    Abstract: Bilevel optimization poses a significant computational challenge due to its nested structure, where each upper-level candidate solution requires solving a corresponding lower-level problem. While evolutionary algorithms (EAs) are effective at navigating such complex landscapes, their high resource demands remain a key bottleneck -- particularly the redundant evaluation of numerous unpromising lowe…

    Submitted 3 June, 2025; originally announced June 2025.

  49. arXiv:2506.06295  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching

    Authors: Zhiyuan Liu, Yicun Yang, Yaojie Zhang, Junjie Chen, Chang Zou, Qingyuan Wei, Shaobo Wang, Linfeng Zhang

    Abstract: Autoregressive Models (ARMs) have long dominated the landscape of Large Language Models. Recently, a new paradigm has emerged in the form of diffusion-based Large Language Models (dLLMs), which generate text by iteratively denoising masked segments. This approach has shown significant advantages and potential. However, dLLMs suffer from high inference latency. Traditional ARM acceleration techniqu…

    Submitted 17 May, 2025; originally announced June 2025.

  50. arXiv:2506.06283  [pdf, other]

    cs.CV cs.AI

    Facial Foundational Model Advances Early Warning of Coronary Artery Disease from Live Videos with DigitalShadow

    Authors: Juexiao Zhou, Zhongyi Han, Mankun Xin, Xingwei He, Guotao Wang, Jiaoyan Song, Gongning Luo, Wenjia He, Xintong Li, Yuetan Chu, Juanwen Chen, Bo Wang, Xia Wu, Wenwen Duan, Zhixia Guo, Liyan Bai, Yilin Pan, Xuefei Bi, Lu Liu, Long Feng, Xiaonan He, Xin Gao

    Abstract: Global population aging presents increasing challenges to healthcare systems, with coronary artery disease (CAD) responsible for approximately 17.8 million deaths annually, making it a leading cause of global mortality. As CAD is largely preventable, early detection and proactive management are essential. In this work, we introduce DigitalShadow, an advanced early warning system for CAD, powered b…

    Submitted 23 April, 2025; originally announced June 2025.