Skip to main content

Showing 1–50 of 455 results for author: Ni, Y

.
  1. arXiv:2507.05249  [pdf, ps, other

    cs.CV cond-mat.str-el cs.LG physics.data-an

    Physics-Guided Dual Implicit Neural Representations for Source Separation

    Authors: Yuan Ni, Zhantao Chen, Alexander N. Petsch, Edmund Xu, Cheng Peng, Alexander I. Kolesnikov, Sugata Chowdhury, Arun Bansil, Jana B. Thayer, Joshua J. Turner

    Abstract: Significant challenges exist in efficient data analysis of most advanced experimental and observational techniques because the collected signals often include unwanted contributions--such as background and signal distortions--that can obscure the physically relevant information of interest. To address this, we have developed a self-supervised machine-learning approach for source separation using a… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

  2. arXiv:2506.18737  [pdf, ps, other

    cs.CV cs.RO

    USVTrack: USV-Based 4D Radar-Camera Tracking Dataset for Autonomous Driving in Inland Waterways

    Authors: Shanliang Yao, Runwei Guan, Yi Ni, Sen Xu, Yong Yue, Xiaohui Zhu, Ryan Wen Liu

    Abstract: Object tracking in inland waterways plays a crucial role in safe and cost-effective applications, including waterborne transportation, sightseeing tours, environmental monitoring and surface rescue. Our Unmanned Surface Vehicle (USV), equipped with a 4D radar, a monocular camera, a GPS, and an IMU, delivers robust tracking capabilities in complex waterborne environments. By leveraging these sensor… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Accepted by IROS

  3. arXiv:2506.17340  [pdf, ps, other

    physics.chem-ph

    Revisiting Sampling Strategies for Molecular Generation

    Authors: Yuyan Ni, Shikun Feng, Wei-Ying Ma, Zhi-Ming Ma, Yanyan Lan

    Abstract: Sampling strategies in diffusion models are critical to molecular generation yet remain relatively underexplored. In this work, we investigate a broad spectrum of sampling methods beyond conventional defaults and reveal that sampling choice substantially affects molecular generation performance. In particular, we identify a maximally stochastic sampling (StoMax), a simple yet underexplored strateg… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  4. arXiv:2506.13127  [pdf, ps, other

    cs.SD eess.AS

    I$^2$S-TFCKD: Intra-Inter Set Knowledge Distillation with Time-Frequency Calibration for Speech Enhancement

    Authors: Jiaming Cheng, Ruiyu Liang, Chao Xu, Ye Ni, Wei Zhou, Björn W. Schuller, Xiaoshuai Hao

    Abstract: In recent years, complexity compression of neural network (NN)-based speech enhancement (SE) models has gradually attracted the attention of researchers, especially in scenarios with limited hardware resources or strict latency requirements. The main difficulties and challenges lie in achieving a balance between complexity and performance according to the characteristics of the task. In this paper… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: submitted to IEEE Transactions on Neural Networks and Learning Systems

  5. arXiv:2506.12682  [pdf, ps, other

    eess.SP

    Conditional Diffusion Model-Driven Generative Channels for Double RIS-Aided Wireless Systems

    Authors: Yiyang Ni, Qi Zhang, Guangji Chen, Yan Cai, Jun Li, Shi Jin

    Abstract: With the development of the upcoming sixth-generation networks (6G), reconfigurable intelligent surfaces (RISs) have gained significant attention due to its ability of reconfiguring wireless channels via smart reflections. However, traditional channel state information (CSI) acquisition techniques for double-RIS systems face challenges (e.g., high pilot overhead or multipath interference). This pa… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: 5 pages, 4 figures

  6. arXiv:2506.12130  [pdf, ps, other

    astro-ph.GA

    Biases in stellar masses of JWST high-z quasar host galaxies caused by quasar subtraction

    Authors: Sabrina Berger, Madeline A. Marshall, J. Stuart B. Wyithe, Tiziana di Matteo, Yueying Ni, Stephen M. Wilkins, Minghao Yue

    Abstract: JWST has enabled a new era of understanding high-z galaxy and black hole evolution with more than 30 high-z quasar host galaxy detections. Many of these observations imply galaxies with black holes that are overmassive compared to their low-z counterparts. However, the bright quasar point source removal may cause significant biases in these stellar mass measurements. We develop a simulation-based… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: 30 pages, 23 figures (including appendices). Submitted to MNRAS. Comments welcome!

  7. arXiv:2506.11307  [pdf, ps, other

    physics.geo-ph

    A Review of Cloud Computing in Seismology

    Authors: Yiyu Ni, Marine A. Denolle, Jannes Munchmeyer, Yinzhi Wang, Kuan-Fu Feng, Carlos Garcia Jurado Suarez, Amanda M. Thomas, Chad Trabant, Alex Hamilton, David Mencin

    Abstract: Seismology has entered the petabyte era, driven by decades of continuous recordings of broadband networks, the increase in nodal seismic experiments, and the recent emergence of Distributed Acoustic Sensing (DAS). This review explains how commercial clouds - AWS, Google Cloud, and Azure - by providing object storage, elastic compute, and managed databases, enable researchers to "bring the code to… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  8. arXiv:2506.06483  [pdf, ps, other

    cs.GR cs.AI cs.CV cs.LG eess.IV

    Noise Consistency Regularization for Improved Subject-Driven Image Synthesis

    Authors: Yao Ni, Song Wen, Piotr Koniusz, Anoop Cherian

    Abstract: Fine-tuning Stable Diffusion enables subject-driven image synthesis by adapting the model to generate images containing specific subjects. However, existing fine-tuning methods suffer from two key issues: underfitting, where the model fails to reliably capture subject identity, and overfitting, where it memorizes the subject image and reduces background diversity. To address these challenges, we p… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  9. arXiv:2506.03930  [pdf, ps, other

    cs.SE cs.AI cs.CL

    VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

    Authors: Yuansheng Ni, Ping Nie, Kai Zou, Xiang Yue, Wenhu Chen

    Abstract: Large language models (LLMs) often struggle with visualization tasks like plotting diagrams, charts, where success depends on both code correctness and visual semantics. Existing instruction-tuning datasets lack execution-grounded supervision and offer limited support for iterative code correction, resulting in fragile and unreliable plot generation. We present VisCode-200K, a large-scale instruct… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  10. arXiv:2505.24586  [pdf, ps, other

    astro-ph.HE

    All-sky search for individual Primordial Black Hole bursts with LHAASO

    Authors: Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, G. H. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, S. H. Chen , et al. (293 additional authors not shown)

    Abstract: Primordial Black Holes~(PBHs) are hypothetical black holes with a wide range of masses that formed in the early universe. As a result, they may play an important cosmological role and provide a unique probe of the early universe. A PBH with an initial mass of approximately $10^{15}$~g is expected to explode today in a final burst of Hawking radiation. In this work, we conduct an all-sky search for… ▽ More

    Submitted 2 June, 2025; v1 submitted 30 May, 2025; originally announced May 2025.

    Comments: 8 pages, 2 figures

  11. arXiv:2505.22170  [pdf, ps, other

    eess.SP cs.IT

    Attention-Enhanced Prompt Decision Transformers for UAV-Assisted Communications with AoI

    Authors: Chi Lu, Yiyang Ni, Zhe Wang, Xiaoli Shi, Jun Li, Shi Jin

    Abstract: Decision Transformer (DT) has recently demonstrated strong generalizability in dynamic resource allocation within unmanned aerial vehicle (UAV) networks, compared to conventional deep reinforcement learning (DRL). However, its performance is hindered due to zero-padding for varying state dimensions, inability to manage long-term energy constraint, and challenges in acquiring expert samples for few… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  12. arXiv:2505.20900  [pdf, ps, other

    cs.IR

    Embed Progressive Implicit Preference in Unified Space for Deep Collaborative Filtering

    Authors: Zhongjin Zhang, Yu Liang, Cong Fu, Yuxuan Zhu, Kun Wang, Yabo Ni, Anxiang Zeng, Jiazhi Xia

    Abstract: Embedding-based collaborative filtering, often coupled with nearest neighbor search, is widely deployed in large-scale recommender systems for personalized content selection. Modern systems leverage multiple implicit feedback signals (e.g., clicks, add to cart, purchases) to model user preferences comprehensively. However, prevailing approaches adopt a feedback-wise modeling paradigm, which (1) fa… ▽ More

    Submitted 28 May, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  13. arXiv:2505.20439  [pdf, ps, other

    astro-ph.CO astro-ph.GA

    The Properties of Little Red Dot Galaxies in the ASTRID Simulation

    Authors: Patrick LaChance, Rupert A. C. Croft, Tiziana Di Matteo, Yihao Zhou, Fabio Pacucci, Yueying Ni, Nianyi Chen, Simeon Bird

    Abstract: We present simulated counterparts of the ``Little Red Dot'' (LRD) galaxies observed with JWST, using the large cosmological hydrodynamic simulation, ASTRID. We create mock observations of the galaxies ($5 \leq z \leq 8$) in ASTRID, and find seventeen which fit the color and size criteria of LRDs. These LRDs are galaxies with high stellar masses ($\rm log(M_*/M_{\odot}) \geq 9.7$), and massive blac… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 15 pages, 12 figures

  14. arXiv:2505.18874  [pdf, ps, other

    physics.geo-ph

    A Global-scale Database of Seismic Phases from Cloud-based Picking at Petabyte Scale

    Authors: Yiyu Ni, Marine A. Denolle, Amanda M. Thomas, Alex Hamilton, Jannes Münchmeyer, Yinzhi Wang, Loïc Bachelot, Chad Trabant, David Mencin

    Abstract: We present the first global-scale database of 4.3 billion P- and S-wave picks extracted from 1.3 PB continuous seismic data via a cloud-native workflow. Using cloud computing services on Amazon Web Services, we launched ~145,000 containerized jobs on continuous records from 47,354 stations spanning 2002-2025, completing in under three days. Phase arrivals were identified with a deep learning model… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  15. arXiv:2505.15929  [pdf, ps, other

    cs.AI

    PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

    Authors: Hui Shen, Taiqiang Wu, Qi Han, Yunta Hsieh, Jizhou Wang, Yuyue Zhang, Yuxin Cheng, Zijian Hao, Yuansheng Ni, Xin Wang, Zhongwei Wan, Kai Zhang, Wendong Xu, Jing Xiong, Ping Luo, Wenhu Chen, Chaofan Tao, Zhuoqing Mao, Ngai Wong

    Abstract: Existing benchmarks fail to capture a crucial aspect of intelligence: physical reasoning, the integrated ability to combine domain knowledge, symbolic reasoning, and understanding of real-world constraints. To address this gap, we introduce PhyX: the first large-scale benchmark designed to assess models capacity for physics-grounded reasoning in visual scenarios. PhyX includes 3K meticulously cura… ▽ More

    Submitted 29 May, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

  16. arXiv:2505.14447  [pdf, ps, other

    astro-ph.HE hep-ex

    First Identification and Precise Spectral Measurement of the Proton Component in the Cosmic-Ray `Knee'

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, G. H. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (292 additional authors not shown)

    Abstract: We report the first high-purity identification of cosmic-ray (CR) protons and a precise measurement of their energy spectrum from 0.15 to 12 PeV using the Large High Altitude Air Shower Observatory (LHAASO). Abundant event statistics, combined with the simultaneous detection of electrons/photons, muons, and Cherenkov light in air showers, enable spectroscopic measurements with statistical and syst… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  17. arXiv:2505.13142  [pdf, other

    cs.LG stat.ML

    Parallel Layer Normalization for Universal Approximation

    Authors: Yunhao Ni, Yuhe Liu, Wenxin Sun, Yitong Tang, Yuxin Guo, Peilin Feng, Wenjun Wu, Lei Huang

    Abstract: Universal approximation theorem (UAT) is a fundamental theory for deep neural networks (DNNs), demonstrating their powerful representation capacity to represent and approximate any function. The analyses and proofs of UAT are based on traditional network with only linear and nonlinear activation functions, but omitting normalization layers, which are commonly employed to enhance the training of mo… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 30 pages

  18. arXiv:2505.12483  [pdf, ps, other

    stat.ME

    Truncated Gaussian copula principal component analysis with application to pediatric acute lymphoblastic leukemia patients' gut microbiome

    Authors: Lei Wang, Yang Ni, Irina Gaynanova

    Abstract: Increasing epidemiologic evidence suggests that the diversity and composition of the gut microbiome can predict infection risk in cancer patients. Infections remain a major cause of morbidity and mortality during chemotherapy. Analyzing microbiome data to identify associations with infection pathogenesis for proactive treatment has become a critical research focus. However, the high-dimensional na… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  19. arXiv:2505.12259  [pdf, ps, other

    cs.CL

    Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches

    Authors: Yuhang Zhou, Xutian Chen, Yixin Cao, Yuchen Ni, Yu He, Siyu Tian, Xiang Liu, Jian Zhang, Chuanjun Ji, Guangnan Ye, Xipeng Qiu

    Abstract: Recent progress in large language models (LLMs) has outpaced the development of effective evaluation methods. Traditional benchmarks rely on task-specific metrics and static datasets, which often suffer from fairness issues, limited scalability, and contamination risks. In this paper, we introduce Teach2Eval, an indirect evaluation framework inspired by the Feynman Technique. Instead of directly t… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  20. arXiv:2505.11970  [pdf, ps, other

    cs.DC cs.AR

    A Survey of Real-time Scheduling on Accelerator-based Heterogeneous Architecture for Time Critical Applications

    Authors: An Zou, Yuankai Xu, Yinchen Ni, Jintao Chen, Yehan Ma, Jing Li, Christopher Gill, Xuan Zhang, Yier Jin

    Abstract: Accelerator-based heterogeneous architectures, such as CPU-GPU, CPU-TPU, and CPU-FPGA systems, are widely adopted to support the popular artificial intelligence (AI) algorithms that demand intensive computation. When deployed in real-time applications, such as robotics and autonomous vehicles, these architectures must meet stringent timing constraints. To summarize these achievements, this article… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  21. Dependence of the intensity of the nonwave component of EUV waves on coronal magnetic field configuration

    Authors: Yuwei Li, J. H. Guo, Y. W. Ni, Z. Y. Zhang, P. F. Chen

    Abstract: Context. Mounting evidence has shown that EUV waves consist of a fast-mode magnetohydrodynamic (MHD) wave (or shock wave) followed by a slower nonwave component, as predicted by the magnetic fieldline stretching model. However, not all observed events display both wavefronts, particularly the slower nonwave component. Even in case that the slower nonwave component is present, the intensity distrib… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 8 pages, 6 figures, accepted for publication in A&A

    Journal ref: A&A 698, A316 (2025)

  22. arXiv:2505.09702  [pdf, ps, other

    cs.LG

    Enabling Group Fairness in Graph Unlearning via Bi-level Debiasing

    Authors: Yezi Liu, Prathyush Poduval, Wenjun Huang, Yang Ni, Hanning Chen, Mohsen Imani

    Abstract: Graph unlearning is a crucial approach for protecting user privacy by erasing the influence of user data on trained graph models. Recent developments in graph unlearning methods have primarily focused on maintaining model prediction performance while removing user information. However, we have observed that when user information is deleted from the model, the prediction distribution across differe… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  23. arXiv:2505.05554  [pdf, ps, other

    astro-ph.GA astro-ph.CO

    The THESAN-ZOOM project: Star formation efficiency from giant molecular clouds to galactic scale in high-redshift starbursts

    Authors: Zihao Wang, Xuejian Shen, Mark Vogelsberger, Hui Li, Rahul Kannan, Ewald Puchwein, Aaron Smith, Josh Borrow, Enrico Garaldi, Laura Keating, Oliver Zier, William McClymont, Sandro Tacchella, Yang Ni, Lars Hernquist

    Abstract: Star formation in galaxies is inherently complex, involving the interplay of physical processes over a hierarchy of spatial scales. In this work, we investigate the connection between global (galaxy-scale) and local (cloud-scale) star formation efficiencies (SFEs) at high redshifts ($z\gtrsim 3$), using the state-of-the-art cosmological zoom-in simulation suite THESAN-ZOOM. We find that the galaxy… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: 17 pages, 16 figures. To be submitted to MNRAS. Comments are welcome!

  24. arXiv:2505.04519  [pdf, other

    cs.CL

    Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

    Authors: Yehui Tang, Yichun Yin, Yaoyuan Wang, Hang Zhou, Yu Pan, Wei Guo, Ziyang Zhang, Miao Rang, Fangcheng Liu, Naifu Zhang, Binghan Li, Yonghan Dong, Xiaojun Meng, Yasheng Wang, Dong Li, Yin Li, Dandan Tu, Can Chen, Youliang Yan, Fisher Yu, Ruiming Tang, Yunhe Wang, Botian Huang, Bo Wang, Boxiao Liu , et al. (49 additional authors not shown)

    Abstract: Sparse large language models (LLMs) with Mixture of Experts (MoE) and close to a trillion parameters are dominating the realm of most capable language models. However, the massive model scale poses significant challenges for the underlying software and hardware systems. In this paper, we aim to uncover a recipe to harness such scale on Ascend NPUs. The key goals are better usage of the computing r… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  25. arXiv:2505.03626  [pdf, other

    astro-ph.CO

    High-redshift Millennium and Astrid galaxies in effective field theory at the field level

    Authors: James M. Sullivan, Carolina Cuesta-Lazaro, Mikhail M. Ivanov, Yueying Ni, Sownak Bose, Boryana Hadzhiyska, César Hernández-Aguayo, Lars Hernquist, Rahul Kannan

    Abstract: Effective Field Theory (EFT) modeling is expected to be a useful tool in the era of future higher-redshift galaxy surveys such as DESI-II and Spec-S5 due to its robust description of various large-scale structure tracers. However, large values of EFT bias parameters of higher-redshift galaxies could jeopardize the convergence of the perturbative expansion. In this paper we measure the bias paramet… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 26 pages, 10 figures, 3 tables

    Report number: MIT-CTP/5867

  26. arXiv:2505.01073  [pdf, other

    cs.AI

    Retrieval Augmented Learning: A Retrial-based Large Language Model Self-Supervised Learning and Autonomous Knowledge Generation

    Authors: Zongyuan Li, Pengfei Li, Runnan Qi, Yanan Ni, Lumin Jiang, Hui Wu, Xuebo Zhang, Kuihua Huang, Xian Guo

    Abstract: The lack of domain-specific data in the pre-training of Large Language Models (LLMs) severely limits LLM-based decision systems in specialized applications, while post-training a model in the scenarios requires significant computational resources. In this paper, we present Retrial-Augmented Learning (RAL), a reward-free self-supervised learning framework for LLMs that operates without model traini… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  27. arXiv:2504.21275  [pdf, other

    stat.ME stat.AP

    Hurdle Network Model With Latent Dynamic Shrinkage For Enhanced Edge Prediction in Zero-Inflated Directed Network Time Series

    Authors: Sandipan Pramanik, Raymond Robertson, Yang Ni

    Abstract: This article aims to model international trade relationships among 29 countries in the apparel industry between 1994 and 2013. Bilateral trade flows can be represented as a directed network, where nodes correspond to countries and directed edges indicate trade flows (i.e., whether one country exported to another in a given year). Additionally, node (e.g., GDP) and edge-specific (e.g., labor provis… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  28. arXiv:2504.14861  [pdf, other

    cs.DB cs.IR

    Stitching Inner Product and Euclidean Metrics for Topology-aware Maximum Inner Product Search

    Authors: Tingyang Chen, Cong Fu, Xiangyu Ke, Yunjun Gao, Yabo Ni, Anxiang Zeng

    Abstract: Maximum Inner Product Search (MIPS) is a fundamental challenge in machine learning and information retrieval, particularly in high-dimensional data applications. Existing approaches to MIPS either rely solely on Inner Product (IP) similarity, which faces issues with local optima and redundant computations, or reduce the MIPS problem to the Nearest Neighbor Search under the Euclidean metric via spa… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Accepted by SIGIR 2025

  29. arXiv:2504.14214  [pdf, other

    cs.IR

    Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Calibration

    Authors: Hongji Li, Hanwen Du, Youhua Li, Junchen Fu, Chunxiao Li, Ziyi Zhuang, Jiakang Li, Yongxin Ni

    Abstract: The surge in multimedia content has led to the development of Multi-Modal Recommender Systems (MMRecs), which use diverse modalities such as text, images, videos, and audio for more personalized recommendations. However, MMRecs struggle with noisy data caused by misalignment among modal content and the gap between modal semantics and recommendation semantics. Traditional denoising methods are inad… ▽ More

    Submitted 19 April, 2025; originally announced April 2025.

    Comments: Accepted to ACM Web Search and Data Mining (WSDM) 2025

  30. arXiv:2504.12617  [pdf, other

    stat.ME stat.AP stat.CO stat.ML

    Bayesian Density-Density Regression with Application to Cell-Cell Communications

    Authors: Khai Nguyen, Yang Ni, Peter Mueller

    Abstract: We introduce a scalable framework for regressing multivariate distributions onto multivariate distributions, motivated by the application of inferring cell-cell communication from population-scale single-cell data. The observed data consist of pairs of multivariate distributions for ligands from one cell type and corresponding receptors from another. For each ordered pair $e=(l,r)$ of cell types… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 42 pages, 24 figures, 1 table

  31. arXiv:2504.10854  [pdf, other

    cs.CV

    LVLM_CSP: Accelerating Large Vision Language Models via Clustering, Scattering, and Pruning for Reasoning Segmentation

    Authors: Hanning Chen, Yang Ni, Wenjun Huang, Hyunwoo Oh, Yezi Liu, Tamoghno Das, Mohsen Imani

    Abstract: Large Vision Language Models (LVLMs) have been widely adopted to guide vision foundation models in performing reasoning segmentation tasks, achieving impressive performance. However, the substantial computational overhead associated with LVLMs presents a new challenge. The primary source of this computational cost arises from processing hundreds of image tokens. Therefore, an effective strategy to… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  32. arXiv:2504.10307  [pdf, other

    cs.IR

    CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation

    Authors: Junchen Fu, Yongxin Ni, Joemon M. Jose, Ioannis Arapakis, Kaiwen Zheng, Youhua Li, Xuri Ge

    Abstract: Multimodal Foundation Models (MFMs) excel at representing diverse raw modalities (e.g., text, images, audio, videos, etc.). As recommender systems increasingly incorporate these modalities, leveraging MFMs to generate better representations has great potential. However, their application in sequential recommendation remains largely unexplored. This is primarily because mainstream adaptation method… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  33. arXiv:2504.08100  [pdf, other

    cs.CV

    ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting

    Authors: Junbang Liu, Enpei Huang, Dongxing Mao, Hui Zhang, Xinyuan Song, Yongxin Ni

    Abstract: Creating 3D content from single-view images is a challenging problem that has attracted considerable attention in recent years. Current approaches typically utilize score distillation sampling (SDS) from pre-trained 2D diffusion models to generate multi-view 3D representations. Although some methods have made notable progress by balancing generation speed and model quality, their performance is of… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Code will be available at https://github.com/YaNLlan-ljb/ContrastiveGaussian

  34. arXiv:2504.07866  [pdf, ps, other

    cs.CL cs.AI

    Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

    Authors: Yichun Yin, Wenyong Huang, Kaikai Song, Yehui Tang, Xueyu Wu, Wei Guo, Peng Guo, Yaoyuan Wang, Xiaojun Meng, Yasheng Wang, Dong Li, Can Chen, Dandan Tu, Yin Li, Fisher Yu, Ruiming Tang, Yunhe Wang, Baojun Wang, Bin Wang, Bo Wang, Boxiao Liu, Changzheng Zhang, Duyu Tang, Fei Mi, Hui Jin , et al. (27 additional authors not shown)

    Abstract: We present Pangu Ultra, a Large Language Model (LLM) with 135 billion parameters and dense Transformer modules trained on Ascend Neural Processing Units (NPUs). Although the field of LLM has been witnessing unprecedented advances in pushing the scale and capability of LLM in recent years, training such a large-scale model still involves significant optimization and system challenges. To stabilize… ▽ More

    Submitted 11 April, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

    Comments: fix conflicts of latex pacakges

  35. arXiv:2504.04907  [pdf, other

    cs.CV cs.AI

    Video-Bench: Human-Aligned Video Generation Benchmark

    Authors: Hui Han, Siyuan Li, Jiaqi Chen, Yiwen Yuan, Yuling Wu, Chak Tou Leong, Hanwen Du, Junchen Fu, Youhua Li, Jie Zhang, Chi Zhang, Li-jia Li, Yongxin Ni

    Abstract: Video generation assessment is essential for ensuring that generative models produce visually realistic, high-quality videos while aligning with human expectations. Current video generation benchmarks fall into two main categories: traditional benchmarks, which use metrics and embeddings to evaluate generated video quality across multiple dimensions but often lack alignment with human judgments; a… ▽ More

    Submitted 29 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

    Comments: Accepted by CVPR'25

  36. arXiv:2504.03848  [pdf, other

    astro-ph.CO

    Large-scale surveys of the quasar proximity effect

    Authors: Rupert A. C. Croft, Patrick Shaw, Ann-Marsha Alexis, Nianyi Chen, Yihao Zhou, Tiziana Di Matteo, Simeon Bird, Patrick Lachance, Yueying Ni

    Abstract: The UV radiation from high redshift quasars causes a local deficit in the neutral hydrogen absorption (Lyman-alpha forest) in their spectra, known as the proximity effect. Measurements from small samples of tens to hundreds of quasars have been used to constrain the global intensity of the UV background radiation, but so far the power of large-scale surveys such as the Sloan Digital Sky Survey and… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 17 pages, 15 figures, to be submitted to OJA

  37. arXiv:2503.24304  [pdf, other

    astro-ph.GA

    Gravitational Waves from Massive Black Hole Mergers in ASTRID: Predictions for LISA

    Authors: Bonny Y. Wang, Yihao Zhou, William Chen, Nianyi Chen, Tiziana Di Matteo, Rupert Croft, Simeon Bird, Yueying Ni

    Abstract: We use the ASTRID cosmological simulation to forecast massive black hole (MBH) mergers detectable by LISA down to $z=0$. ASTRID directly models MBH dynamical friction, allowing a realistic tracking of their trajectory. It also incorporates relatively low-mass MBH seeds down to $5\times 10^{4}\mathrm{M}_{\odot}$, providing a more complete picture of LISA MBH mergers. We find that LISA MBH mergers i… ▽ More

    Submitted 26 April, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

    Comments: 19 Pages, 12 Figures; Submitted to ApJ

  38. arXiv:2503.23985  [pdf, other

    cs.PL

    An Empirical Study of Rust-Specific Bugs in the rustc Compiler

    Authors: Zixi Liu, Yang Feng, Yunbo Ni, Shaohua Li, Xizhe Yin, Qingkai Shi, Baowen Xu, Zhendong Su

    Abstract: Rust is gaining popularity for its well-known memory safety guarantees and high performance, distinguishing it from C/C++ and JVM-based languages. Its compiler, rustc, enforces these guarantees through specialized mechanisms such as trait solving, borrow checking, and specific optimizations. However, Rust's unique language mechanisms introduce complexity to its compiler, leading to Rust-specific c… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

  39. arXiv:2503.23074  [pdf, other

    astro-ph.HE astro-ph.IM

    Infant Core-collapse Supernovae with Circumstellar Interactions from KMTNet I: Luminous Transitional Case of KSP-SN-2022c

    Authors: Nan Jiang, Dae-Sik Moon, Yuan Qi Ni, Maria R. Drout, Hong Soo Park, Santiago González-Gaitán, Sang Chul Kim, Youngdae Lee, Ernest Chang

    Abstract: We present $BVi$ multi-band high-cadence observations of a Type II supernova (SN) KSP-SN-2022c from a star-forming galaxy at $z$ $\simeq$ 0.041 from its infant to nebular phase. Early light curve fitting with a single power-law is consistent with the first detection of roughly 15 minutes after shock breakout. The SN light curves feature a rapid rise and decline across its luminous ($V$ $\simeq$ -1… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

    Comments: Submitted to ApJ

  40. arXiv:2503.15558  [pdf, other

    cs.AI cs.CV cs.LG cs.RO

    Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

    Authors: NVIDIA, :, Alisson Azzolini, Junjie Bai, Hannah Brandon, Jiaxin Cao, Prithvijit Chattopadhyay, Huayu Chen, Jinju Chu, Yin Cui, Jenna Diamond, Yifan Ding, Liang Feng, Francesco Ferroni, Rama Govindaraju, Jinwei Gu, Siddharth Gururani, Imad El Hanafi, Zekun Hao, Jacob Huffman, Jingyi Jin, Brendan Johnson, Rizwan Khan, George Kurian, Elena Lantz , et al. (29 additional authors not shown)

    Abstract: Physical AI systems need to perceive, understand, and perform complex actions in the physical world. In this paper, we present the Cosmos-Reason1 models that can understand the physical world and generate appropriate embodied decisions (e.g., next step action) in natural language through long chain-of-thought reasoning processes. We begin by defining key capabilities for Physical AI reasoning, wit… ▽ More

    Submitted 19 May, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  41. arXiv:2503.11072  [pdf, ps, other

    cs.RO math.OC

    A High-Speed Time-Optimal Trajectory Generation Strategy via a Two-layer Planning Model

    Authors: Haotian Tan, Yuan-Hua Ni

    Abstract: Motion planning and trajectory generation are crucial technologies in various domains including the control of Unmanned Aerial Vehicles, manipulators, and rockets. However, optimization-based real-time motion planning becomes increasingly challenging due to the problem's probable non-convexity and the inherent limitations of non-linear programming algorithms. Highly nonlinear dynamics, obstacle av… ▽ More

    Submitted 6 April, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  42. arXiv:2503.10164   

    math.OC

    Safety Control of Impulsive Systems with Control Barrier Functions and Adaptive Gains

    Authors: Zihan Liu, Yuan-Hua Ni

    Abstract: This paper addresses the safety challenges in impulsive systems, where abrupt state jumps introduce significant complexities into system dynamics. A unified framework is proposed by integrating Quadratic Programming (QP), Control Barrier Functions (CBFs), and adaptive gain mechanisms to ensure system safety during impulsive events. The CBFs are constructed to enforce safety constraints by capturin… ▽ More

    Submitted 9 April, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: The authors have identified certain technical inaccuracies in the current version of the manuscript the require substantial revision. To ensure correctness and clarity, we have decided to withdraw the submission. A thoroughly revised version will be resubmitted in the future

  43. arXiv:2503.09127  [pdf, other

    cs.HC cs.GR

    Spiritus: An AI-Assisted Tool for Creating 2D Characters and Animations

    Authors: Qirui Sun, Yunyi Ni, Teli Yuan, Jingjing Zhang, Fan Yang, Zhihao Yao, Haipeng Mi

    Abstract: This research presents Spiritus, an AI-assisted creation tool designed to streamline 2D character animation creation while enhancing creative flexibility. By integrating natural language processing and diffusion models, users can efficiently transform natural language descriptions into personalized 2D characters and animations. The system employs automated segmentation, layered costume techniques,… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  44. arXiv:2503.06882  [pdf, other

    cs.DB

    Maximum Inner Product is Query-Scaled Nearest Neighbor

    Authors: Tingyang Chen, Cong Fu, Kun Wang, Xiangyu Ke, Yunjun Gao, Wenchao Zhou, Yabo Ni, Anxiang Zeng

    Abstract: Maximum Inner Product Search (MIPS) for high-dimensional vectors is pivotal across databases, information retrieval, and artificial intelligence. Existing methods either reduce MIPS to Nearest Neighbor Search (NNS) while suffering from harmful vector space transformations, or attempt to tackle MIPS directly but struggle to mitigate redundant computations due to the absence of the triangle inequali… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

    Comments: Accepted by VLDB 2025

  45. arXiv:2503.06689  [pdf, other

    cs.SE cs.CL

    DependEval: Benchmarking LLMs for Repository Dependency Understanding

    Authors: Junjia Du, Yadi Liu, Hongcheng Guo, Jiawei Wang, Haojian Huang, Yunyi Ni, Zhoujun Li

    Abstract: While large language models (LLMs) have shown considerable promise in code generation, real-world software development demands advanced repository-level reasoning. This includes understanding dependencies, project structures, and managing multi-file changes. However, the ability of LLMs to effectively comprehend and handle complex code repositories has yet to be fully explored. To address challeng… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

  46. arXiv:2503.03747  [pdf, other

    cs.CR cs.LG

    PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning

    Authors: Ryozo Masukawa, Sanggeon Yun, Sungheon Jeong, Wenjun Huang, Yang Ni, Ian Bryant, Nathaniel D. Bastian, Mohsen Imani

    Abstract: Traffic classification is vital for cybersecurity, yet encrypted traffic poses significant challenges. We present PacketCLIP, a multi-modal framework combining packet data with natural language semantics through contrastive pretraining and hierarchical Graph Neural Network (GNN) reasoning. PacketCLIP integrates semantic reasoning with efficient classification, enabling robust detection of anomalie… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: 7 pages, 7 figures

  47. arXiv:2503.02918  [pdf, ps, other

    cs.LG cs.AI

    Straight-Line Diffusion Model for Efficient 3D Molecular Generation

    Authors: Yuyan Ni, Shikun Feng, Haohan Chi, Bowen Zheng, Huan-ang Gao, Wei-Ying Ma, Zhi-Ming Ma, Yanyan Lan

    Abstract: Diffusion-based models have shown great promise in molecular generation but often require a large number of sampling steps to generate valid samples. In this paper, we introduce a novel Straight-Line Diffusion Model (SLDM) to tackle this problem, by formulating the diffusion process to follow a linear trajectory. The proposed process aligns well with the noise sensitivity characteristic of molecul… ▽ More

    Submitted 9 June, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

  48. arXiv:2502.18367  [pdf, other

    astro-ph.SR

    The Birth of a Major Coronal Mass Ejection with Intricate Magnetic Structure from Multiple Active Regions

    Authors: Jinhan Guo, Y. W. Ni, B. Schmieder, Y. Guo, C. Xia, P. Devi, R. Chandra, S. Poedts, R. Joshi, Y. H. Zhou, H. T. Li, P. F. Chen

    Abstract: Coronal mass ejections (CMEs) are the eruptions of magnetised plasma from the Sun and are considered the main driver of adverse space weather events. Hence, undrstanding its formation process, particularly the magnetic topology, is critical for accurate space weather prediction. Here, based on imaging observations and three-dimensional (3D) data-constrained thermodynamic magnetohydrodynamical (MHD… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: 19 pages, 8 figures, accepted for publication in ApJ

  49. arXiv:2502.15447  [pdf, other

    astro-ph.HE hep-ph

    Ultra-high-energy $γ$-ray emission associated with the tail of a bow-shock pulsar wind nebula

    Authors: Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, S. H. Chen, S. Z. Chen , et al. (274 additional authors not shown)

    Abstract: In this study, we present a comprehensive analysis of an unidentified point-like ultra-high-energy (UHE) $γ$-ray source, designated as 1LHAASO J1740+0948u, situated in the vicinity of the middle-aged pulsar PSR J1740+1000. The detection significance reached 17.1$σ$ (9.4$σ$) above 25$\,$TeV (100$\,$TeV). The source energy spectrum extended up to 300$\,$TeV, which was well fitted by a log-parabola f… ▽ More

    Submitted 24 February, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

    Comments: Corrected spelling errors in several author names

    Journal ref: The Innovation (2025), 100802

  50. arXiv:2502.14739  [pdf, other

    cs.CL

    SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

    Authors: M-A-P Team, Xinrun Du, Yifan Yao, Kaijing Ma, Bingli Wang, Tianyu Zheng, King Zhu, Minghao Liu, Yiming Liang, Xiaolong Jin, Zhenlin Wei, Chujie Zheng, Kaixin Deng, Shawn Gavin, Shian Jia, Sichao Jiang, Yiyan Liao, Rui Li, Qinrui Li, Sirun Li, Yizhi Li, Yunwen Li, David Ma, Yuansheng Ni, Haoran Que , et al. (72 additional authors not shown)

    Abstract: Large language models (LLMs) have demonstrated remarkable proficiency in mainstream academic disciplines such as mathematics, physics, and computer science. However, human knowledge encompasses over 200 specialized disciplines, far exceeding the scope of existing benchmarks. The capabilities of LLMs in many of these specialized fields-particularly in light industry, agriculture, and service-orient… ▽ More

    Submitted 28 March, 2025; v1 submitted 20 February, 2025; originally announced February 2025.