Showing 1–50 of 265 results for author: Pu, S

  1. arXiv:2505.10631  [pdf, other]

    math.OC

    Decentralized Min-Max Optimization with Gradient Tracking

    Authors: Runze You, Kun Huang, Shi Pu

    Abstract: This paper presents a novel distributed formulation of the min-max optimization problem. Such a formulation enables enhanced flexibility among agents when optimizing their maximization variables. To address the problem, we propose two distributed gradient methods over networks, termed Distributed Gradient Tracking Ascent (DGTA) and Distributed Stochastic Gradient Tracking Ascent (DSGTA). We demons…

    Submitted 15 May, 2025; originally announced May 2025.

  2. arXiv:2505.10322  [pdf, other]

    cs.LG math.OC

    Asynchronous Decentralized SGD under Non-Convexity: A Block-Coordinate Descent Framework

    Authors: Yijie Zhou, Shi Pu

    Abstract: Decentralized optimization has become vital for leveraging distributed data without central control, enhancing scalability and privacy. However, practical deployments face fundamental challenges due to heterogeneous computation speeds and unpredictable communication delays. This paper introduces a refined model of Asynchronous Decentralized Stochastic Gradient Descent (ADSGD) under practical assum…

    Submitted 15 May, 2025; originally announced May 2025.

  3. arXiv:2503.17489  [pdf, other]

    cs.CL cs.CV

    Judge Anything: MLLM as a Judge Across Any Modality

    Authors: Shu Pu, Yaochen Wang, Dongping Chen, Yuhang Chen, Guohao Wang, Qi Qin, Zhongyi Zhang, Zhiyuan Zhang, Zetong Zhou, Shuang Gong, Yi Gui, Yao Wan, Philip S. Yu

    Abstract: Evaluating generative foundation models on open-ended multimodal understanding (MMU) and generation (MMG) tasks across diverse modalities (e.g., images, audio, video) poses significant challenges due to the complexity of cross-modal interactions. To this end, the idea of utilizing Multimodal LLMs (MLLMs) as automated judges has emerged, with encouraging results in assessing vision-language underst…

    Submitted 21 March, 2025; originally announced March 2025.

  4. arXiv:2503.16123  [pdf, other]

    math.OC cs.LG

    Distributed Learning over Arbitrary Topology: Linear Speed-Up with Polynomial Transient Time

    Authors: Runze You, Shi Pu

    Abstract: We study a distributed learning problem in which $n$ agents, each with potentially heterogeneous local data, collaboratively minimize the sum of their local cost functions via peer-to-peer communication. We propose a novel algorithm, Spanning Tree Push-Pull (STPP), which employs two spanning trees extracted from a general communication graph to distribute both model parameters and stochastic gradi…

    Submitted 20 March, 2025; originally announced March 2025.

  5. arXiv:2503.13320  [pdf, other]

    hep-ph hep-th nucl-th

    Radiative corrections on vortical spin polarization in hot QCD matter

    Authors: Shuo Fang, Shi Pu, Di-Lun Yang

    Abstract: We investigate the radiative corrections on spin polarization of relativistic fermions induced by vortical fields in thermal-equilibrium QCD matter at weak coupling. Such corrections stem from the self-energy gradients in quantum kinetic theory, which are further obtained by a more systematic and general approach through the Keldysh equation. By applying the hard-thermal-loop approximation, we obt…

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: 14 pages

  6. arXiv:2501.07424  [pdf]

    physics.optics

    Photonic antiferromagnetic topological insulator with a single surface Dirac cone

    Authors: Fujia Chen, Ning Han, Songyang Pu, Rui Zhao, Li Zhang, Qiaolu Chen, Yuze Hu, Mingyu Tong, Wenhao Li, Junyao Wu, Yudong Ren, Xinrui Li, Wenyan Yin, Hongsheng Chen, Rui-Xing Zhang, Yihao Yang

    Abstract: Antiferromagnetism, characterized by magnetic moments aligned in alternating directions with a vanished ensemble average, has garnered renewed interest for its potential applications in spintronics and axion dynamics. The synergy between antiferromagnetism and topology can lead to the emergence of an exotic topological phase unique to certain magnetic order, termed antiferromagnetic topological in…

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: 13 pages, 4 figures

  7. arXiv:2501.03390  [pdf, ps, other]

    math.OC

    State-of-the-art Methods for Pseudo-Boolean Solving with SCIP

    Authors: Gioni Mexi, Dominik Kamp, Yuji Shinano, Shanwen Pu, Alexander Hoen, Ksenia Bestuzheva, Christopher Hojny, Matthias Walter, Marc E. Pfetsch, Sebastian Pokutta, Thorsten Koch

    Abstract: The Pseudo-Boolean problem deals with linear or polynomial constraints with integer coefficients over Boolean variables. The objective lies in optimizing a linear objective function, or finding a feasible solution, or finding a solution that satisfies as many constraints as possible. In the 2024 Pseudo-Boolean competition, solvers incorporating the SCIP framework won five out of six categories it…

    Submitted 8 January, 2025; v1 submitted 6 January, 2025; originally announced January 2025.

  8. arXiv:2412.19400  [pdf, ps, other]

    hep-ph nucl-th

    Spin alignment of vector mesons in local equilibrium by Zubarev's approach

    Authors: Shi-Zheng Yang, Xin-Qing Xie, Shi Pu, Jian-Hua Gao, Qun Wang

    Abstract: We compute the $00$ element of the spin density matrix, denoted as $ρ_{00}$ and called the spin alignment, up to the second order of the gradient expansion in local equilibrium by Zubarev's approach. In the first order, we obtain $ρ_{00}=1/3$, meaning that the contributions from thermal vorticity and shear stress tensor are vanishing. The non-vanishing contributions to $ρ_{00}-1/3$ appear in the s…

    Submitted 26 December, 2024; originally announced December 2024.

    Comments: 20 pages

  9. arXiv:2412.13054  [pdf, other]

    math.OC

    Distributed Normal Map-based Stochastic Proximal Gradient Methods over Networks

    Authors: Kun Huang, Shi Pu, Angelia Nedić

    Abstract: Consider $n$ agents connected over a network collaborate to minimize the average of their local cost functions combined with a common nonsmooth function. This paper introduces a unified algorithmic framework for solving such a problem through distributed stochastic proximal gradient methods, leveraging the normal map update scheme. Within this framework, we propose two new algorithms, termed Norma…

    Submitted 26 December, 2024; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: 34 pages, 5 figures

  10. arXiv:2412.02320  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall

    Simulating Composite Fermion Excitons by Density Functional Theory and Monte Carlo on a Disk

    Authors: Yi Yang, Songyang Pu, Yayun Hu, Zi-Xiang Hu

    Abstract: The Kohn-Sham density functional method for the fractional quantum Hall (FQH) effect has recently been developed by mapping the strongly interacting electrons into an auxiliary system of weakly interacting composite fermions (CFs) that experience a density-dependent effective magnetic field. This approach has been successfully applied to explore the edge reconstruction, fractional charge and fract…

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 11 pages, 6 figures

  11. arXiv:2412.00659  [pdf, other]

    math.OC

    Linear Convergence Analysis of Single-loop Algorithm for Bilevel Optimization via Small-gain Theorem

    Authors: Jianhui Li, Shi Pu, Jianqi Chen, Junfeng Wu

    Abstract: Bilevel optimization has gained considerable attention due to its broad applicability across various fields. While several studies have investigated the convergence rates in the strongly-convex-strongly-convex (SC-SC) setting, no prior work has proven that a single-loop algorithm can achieve linear convergence. This paper employs a small-gain theorem in robust control theory to demonstrate that…

    Submitted 30 November, 2024; originally announced December 2024.

  12. arXiv:2411.17285  [pdf, other]

    nucl-th

    A solvable model for spin polarizations with flow-momentum correspondence

    Authors: Anum Arslan, Wen-Bo Dong, Guo-Liang Ma, Shi Pu, Qun Wang

    Abstract: We present an analytically solvable model based on the blast-wave picture of heavy-ion collisions with flow-momentum correspondence. It can describe the key features of spin polarizations in heavy-ion collisions. With the analytical solution, we can clearly show that the spin polarization with respect to the reaction plane is governed by the directed flow, while the spin polarization along the bea…

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: RevTex 4, 12 pages, 8 figures, 2 tables

  13. arXiv:2411.17188  [pdf, other]

    cs.CV cs.CL

    Interleaved Scene Graphs for Interleaved Text-and-Image Generation Assessment

    Authors: Dongping Chen, Ruoxi Chen, Shu Pu, Zhaoyi Liu, Yanru Wu, Caixi Chen, Benlin Liu, Yue Huang, Yao Wan, Pan Zhou, Ranjay Krishna

    Abstract: Many real-world user queries (e.g. "How do to make egg fried rice?") could benefit from systems capable of generating responses with both textual steps with accompanying images, similar to a cookbook. Models designed to generate interleaved text and images face challenges in ensuring consistency within and across these modalities. To address these challenges, we present ISG, a comprehensive evalua…

    Submitted 24 March, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: Accepted by ICLR 2025 as Spotlight. Project homepage: https://interleave-eval.github.io/

  14. arXiv:2411.12591  [pdf, other]

    cs.CV cs.AI

    Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination

    Authors: Haojie Zheng, Tianyang Xu, Hanchi Sun, Shu Pu, Ruoxi Chen, Lichao Sun

    Abstract: Multimodal large language models (MLLMs) have advanced the integration of visual and linguistic modalities, establishing themselves as the dominant paradigm for visual-language tasks. Current approaches like chain of thought (CoT) reasoning have augmented the cognitive capabilities of large language models (LLMs), yet their adaptation to MLLMs is hindered by heightened risks of hallucination in cr…

    Submitted 15 November, 2024; originally announced November 2024.

  15. arXiv:2410.13491  [pdf, other]

    astro-ph.GA

    Progenitor diversity in the accreted stellar halos of Milky Way-like galaxies

    Authors: Sy-Yun Pu, Andrew P. Cooper, Robert J. J. Grand, Facundo A. Gómez, Antonela Monachesi

    Abstract: Ongoing large stellar spectroscopic surveys of the Milky Way seek to reconstruct the major events in the assembly history of the Galaxy. Chemical and kinematic observations can be used to separate the contributions of different progenitor galaxies to the present-day stellar halo. Here we compute the number of progenitors that contribute to the accreted stellar halos of simulated Milky Way-like gal…

    Submitted 25 February, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: 20 pages, 10 figures, published in ApJ. This paper describes the first public release of the particle-tagging stellar halo data from Cooper et al. (2010), see https://github.com/nthu-ga/aquarius-halos

  16. arXiv:2410.02524  [pdf, other]

    astro-ph.CO astro-ph.GA

    Constraining cosmology with N-body simulations for future spectroscopic galaxy surveys at $2\leq z\leq 3$

    Authors: Sy-Yun Pu, Teppei Okumura, Chian-Chou Chen, Takahiro Nishimichi, Kazuyuki Akitsu

    Abstract: Determining the spatial curvature ($Ω_k$) independent of cosmic microwave background observations plays a key role in revealing the physics of the early universe. The Hubble tension is one of the most serious issues in modern cosmology. We investigate halo catalogs identified from $N$-body simulations at $z=2$ and 3, mimicking high-redshift galaxy surveys. We measure redshift-space correlation fun…

    Submitted 28 October, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: 8 pages, 4 figures, 3 tables; references and results updated; typos corrected

  17. arXiv:2409.18971  [pdf, other]

    cs.MM cs.AI cs.SD eess.AS

    Early Joint Learning of Emotion Information Makes MultiModal Model Understand You Better

    Authors: Mengying Ge, Mingyang Li, Dongkai Tang, Pengbo Li, Kuo Liu, Shuhao Deng, Songbai Pu, Long Liu, Yang Song, Tao Zhang

    Abstract: In this paper, we present our solutions for emotion recognition in the sub-challenges of Multimodal Emotion Recognition Challenge (MER2024). To mitigate the modal competition issue between audio and text, we adopt an early fusion strategy based on a large language model, where joint training of audio and text is conducted initially. And the joint Audio-Text modal feature will be late-fused with ot…

    Submitted 12 September, 2024; originally announced September 2024.

  18. arXiv:2409.00456  [pdf, ps, other]

    hep-ph hep-th nucl-th

    Corrections from space-time dependent electromagnetic fields to Wigner functions and spin polarization

    Authors: Shi-Zheng Yang, Jian-Hua Gao, Shi Pu

    Abstract: We have derived the Wigner equations at global equilibrium with constant vorticity but space-time dependent electromagnetic fields up to second order in semiclassical expansion. We obtain the new second-order contributions to the charge currents and energy-momentum tensor from the varying electromagnetic fields. We also compute the new corrections to the spin polarization pseudo-vector from both c…

    Submitted 23 September, 2024; v1 submitted 31 August, 2024; originally announced September 2024.

    Comments: 20 pages, typos corrected, Sec.VI reorganized

  19. arXiv:2408.09877  [pdf, other]

    hep-ph hep-th nucl-th

    Collisional corrections to spin polarization from quantum kinetic theory using Chapman-Enskog expansion

    Authors: Shuo Fang, Shi Pu

    Abstract: We have investigated the collisional corrections to the spin polarization pseudo-vector, $δ\mathcal{P}^μ$, using quantum kinetic theory in Chapman-Enskog expansion. We derive the spin Boltzmann equation incorporating Møller scattering process. We further consider two distinct scenarios using hard thermal loop approximations for simplification. In scenario (I), the vector charge distribution functi…

    Submitted 10 March, 2025; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: 32 pages, 1 figure; version accepted for publication in PRD

    Journal ref: Phys.Rev.D 111 (2025) 3, 034015

  20. arXiv:2408.04296  [pdf, other]

    hep-ph nucl-th

    Spin polarization of $Λ$ hyperons along beam direction in p+Pb collisions at $\sqrt{s_{NN}}=8.16$ TeV using hydrodynamic approaches

    Authors: Cong Yi, Xiang-Yu Wu, Jie Zhu, Shi Pu, Guang-You Qin

    Abstract: We have implemented the 3+1 dimensional CLVisc hydrodynamics model with TRENTO-3D initial conditions to investigate the spin polarization of $Λ$ hyperons along the beam direction in p+Pb collisions at $\sqrt{s_{NN}} = 8.16$ TeV. Following our previous theoretical framework based on quantum kinetic theory, we consider three different scenarios: $Λ$ equilibrium, $s$ quark equilibrium, and iso-therma…

    Submitted 30 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

    Comments: 8 pages, 5 figures and 1 table. A new figure for local polarization as a function of pseudo-rapidity is added. Submitted to PRC

  21. arXiv:2408.03781  [pdf, other]

    hep-ph hep-th nucl-th

    Late-time asymptotic solutions, attractor, and focusing behavior of spin hydrodynamics

    Authors: Dong-Lin Wang, Li Yan, Shi Pu

    Abstract: We have investigated the late-time asymptotic solutions, attractor, and focusing behavior of minimal causal spin hydrodynamics in Bjorken expansion. Using the method of dominant balance, we derive the late-time asymptotic solutions of the evolution equation for spin density and identify the specific conditions necessary for the spin density to exhibit a power-law decay. We then analyze both the la…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 34 pages, 6 figures

  22. arXiv:2408.01727  [pdf, other]

    math.OC

    A Robust Compressed Push-Pull Method for Decentralized Nonconvex Optimization

    Authors: Yiwei Liao, Zhuorui Li, Shi Pu, Tsung-Hui Chang

    Abstract: In the modern paradigm of multi-agent networks, communication has become one of the main bottlenecks for decentralized optimization, where a large number of agents are involved in minimizing the average of the local cost functions. In this paper, we propose a robust compressed push-pull algorithm (RCPP) that combines gradient tracking with communication compression. In particular, RCPP is robust u…

    Submitted 3 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2303.07091

  23. Understanding the Ising zigzag antiferromagnetism of FePS3 and FePSe3 monolayers

    Authors: Ke Yang, Yueyue Ning, Yuxuan Zhou, Di Lu, Yaozhenghang Ma, Lu Liu, Shengli Pu, Hua Wu

    Abstract: This study investigates the spin-orbital states of FePS3 and FePSe3 monolayers and the origin of their Ising zigzag AFM, using DFT, crystal field level diagrams, superexchange analyses, and parallel tempering MC simulations. Our calculations show that under the trigonal elongation of the FeS6 (FeSe6) octahedra, the $e_g^π$ doublet of the Fe 3d crystal field levels lies lower than the $a_{1g}$ sing…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures, 3 tables

    Journal ref: Phys. Rev. B 110, 024427 (2024)

  24. arXiv:2407.11119  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall quant-ph

    Entanglement scaling and charge fluctuations in a Fermi liquid of composite fermions

    Authors: Cristian Voinea, Songyang Pu, Ajit C. Balram, Zlatko Papić

    Abstract: The composite fermion Fermi liquid (CFL) state at $ν=1/2$ filling of a Landau level is a paradigmatic non-Fermi liquid borne out purely by Coulomb interactions. But in what ways is this exotic state of matter different from a Fermi liquid? The CFL entanglement entropy was indeed found to exhibit a significant enhancement compared to free electrons [Shao et al., Phys. Rev. Lett. 114, 206402 (2015)]…

    Submitted 28 March, 2025; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 13 pages, 9 figures; changed format, added data

    Journal ref: Phys. Rev. B 111, 115119 (2025)

  25. arXiv:2407.06091  [pdf, other]

    hep-ph nucl-th

    Light nuclei photoproduction in relativistic heavy ion ultraperipheral collisions

    Authors: Jin-Yu Hu, Shuo Lin, Shi Pu, Qun Wang

    Abstract: We have investigated light nuclei pair photoproduction in relativistic heavy ion ultraperipheral collisions. As a first attempt, we employ our previously developed quantum electrodynamics model, which incorporates a wave-packet description of initial nuclei, to compute the cross section for proton-antiproton pair photoproduction. The effective vertex for the photon and proton interaction is chosen…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures

  26. arXiv:2406.19605  [pdf, other]

    math.OC

    A Customized Augmented Lagrangian Method for Block-Structured Integer Programming

    Authors: Rui Wang, Chuwen Zhang, Shanwen Pu, Jianjun Gao, Zaiwen Wen

    Abstract: Integer programming with block structures has received considerable attention recently and is widely used in many practical applications such as train timetabling and vehicle routing problems. It is known to be NP-hard due to the presence of integer variables. We define a novel augmented Lagrangian function by directly penalizing the inequality constraints and establish the strong duality between…

    Submitted 27 June, 2024; originally announced June 2024.

  27. arXiv:2406.11410  [pdf, other]

    cs.CL cs.AI

    HARE: HumAn pRiors, a key to small language model Efficiency

    Authors: Lingyun Zhang, Bin jin, Gaojian Ge, Lunhui Liu, Xuewen Shen, Mingyong Wu, Houqian Zhang, Yongneng Jiang, Shiqi Chen, Shi Pu

    Abstract: Human priors play a crucial role in efficiently utilizing data in deep learning. However, with the development of large language models (LLMs), there is an increasing emphasis on scaling both model size and data volume, which often diminishes the importance of human priors in data construction. Influenced by these trends, existing Small Language Models (SLMs) mainly rely on web-scraped large-scale…

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  28. Nuclear deformation effects in photoproduction of $ρ$ mesons in ultraperipheral isobaric collisions

    Authors: Shuo Lin, Jin-Yu Hu, Hao-Jie Xu, Shi Pu, Qun Wang

    Abstract: We have investigated the $ρ^{0}$ meson photoproduction in ultraperipheral isobaric collisions between $_{44}^{96}\textrm{Ru}+_{44}^{96}\textrm{Ru}$ and $_{40}^{96}\textrm{Zr}+_{40}^{96}\textrm{Zr}$ at $\sqrt{s_{NN}}=200$ GeV, employing the dipole model with the equivalent photon approximation. By implementing the Woods-Saxon distribution to represent the nuclear mass density, which is derived from…

    Submitted 19 April, 2025; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures

  29. arXiv:2405.03105  [pdf, ps, other]

    nucl-th hep-ph hep-th

    Thermodynamic stability in relativistic viscous and spin hydrodynamics

    Authors: Xiang Ren, Chen Yang, Dong-Lin Wang, Shi Pu

    Abstract: We have applied thermodynamic stability analysis to derive the stability and causality conditions for conventional relativistic viscous hydrodynamics and spin hydrodynamics. We obtain the thermodynamic stability conditions for second-order relativistic hydrodynamics with shear and bulk viscous tensors, finding them identical to those derived from linear mode analysis. We then derive the thermodyna…

    Submitted 11 August, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: 30 pages; published version

    Journal ref: Phys.Rev.D 110 (2024) 3, 034010

  30. arXiv:2404.05454  [pdf, other]

    math.OC

    B-ary Tree Push-Pull Method is Provably Efficient for Distributed Learning on Heterogeneous Data

    Authors: Runze You, Shi Pu

    Abstract: This paper considers the distributed learning problem where a group of agents cooperatively minimizes the summation of their local cost functions based on peer-to-peer communication. Particularly, we propose a highly efficient algorithm, termed ``B-ary Tree Push-Pull'' (BTPP), that employs two B-ary spanning trees for distributing the information related to the parameters and stochastic gradients…

    Submitted 21 November, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  31. arXiv:2403.05172  [pdf, other]

    cs.CV

    Learning Expressive And Generalizable Motion Features For Face Forgery Detection

    Authors: Jingyi Zhang, Peng Zhang, Jingjing Wang, Di Xie, Shiliang Pu

    Abstract: Previous face forgery detection methods mainly focus on appearance features, which may be easily attacked by sophisticated manipulation. Considering the majority of current face manipulation methods generate fake faces based on a single frame, which do not take frame consistency and coordination into consideration, artifacts on frame sequences are more effective for face forgery detection. However…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted to ICASSP 2023

  32. arXiv:2403.05117  [pdf, other]

    cs.CV

    Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning

    Authors: Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, Shiliang Pu

    Abstract: Recently, arbitrary-scale point cloud upsampling mechanism became increasingly popular due to its efficiency and convenience for practical applications. To achieve this, most previous approaches formulate it as a problem of surface approximation and employ point-based networks to learn surface representations. However, learning surfaces from sparse point clouds is more challenging, and thus they o…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted to AAAI 2024. The source code is available at https://github.com/hikvision-research/3DVision

  33. arXiv:2403.02806  [pdf, other]

    astro-ph.CO astro-ph.GA astro-ph.IM

    Atacama Large Aperture Submillimeter Telescope (AtLAST) Science: Surveying the distant Universe

    Authors: Eelco van Kampen, Tom Bakx, Carlos De Breuck, Chian-Chou Chen, Helmut Dannerbauer, Benjamin Magnelli, Francisco Miguel Montenegro-Montes, Teppei Okumura, Sy-Yun Pu, Matus Rybak, Amelie Saintonge, Claudia Cicone, Evanthia Hatziminaoglou, Juliette Hilhorst, Pamela Klaassen, Minju Lee, Christopher C. Lovell, Andreas Lundgren, Luca Di Mascolo, Tony Mroczkowski, Laura Sommovigo, Mark Booth, Martin A. Cordiner, Rob Ivison, Doug Johnstone, et al. (5 additional authors not shown)

    Abstract: During the most active period of star formation in galaxies, which occurs in the redshift range 1<z<3, strong bursts of star formation result in significant quantities of dust, which obscures new stars being formed as their UV/optical light is absorbed and then re-emitted in the infrared, which redshifts into the mm/sub-mm bands for these early times. To get a complete picture of the high-z galaxy…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 17 pages, 10 figures, submitted to Open Research Europe as part of the AtLAST collection

  34. arXiv:2403.00258  [pdf, ps, other]

    stat.ML cs.LG

    "Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach

    Authors: Lingyu Gu, Yongqi Du, Yuan Zhang, Di Xie, Shiliang Pu, Robert C. Qiu, Zhenyu Liao

    Abstract: Modern deep neural networks (DNNs) are extremely powerful; however, this comes at the price of increased depth and having more parameters per layer, making their training and inference more computationally challenging. In an attempt to address this key limitation, efforts have been devoted to the compression (e.g., sparsification and/or quantization) of these large-scale machine learning models, s…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 32 pages, 4 figures, and 2 tables. Fixing typos in Theorems 1 and 2 from NeurIPS 2022 proceeding (https://proceedings.neurips.cc/paper_files/paper/2022/hash/185087ea328b4f03ea8fd0c8aa96f747-Abstract-Conference.html)

  35. arXiv:2402.18627  [pdf, other]

    cond-mat.supr-con cond-mat.mes-hall cond-mat.str-el

    Topologically protected emergent Fermi surface in an Abrikosov vortex lattice

    Authors: Songyang Pu, Jay D. Sau, Rui-Xing Zhang

    Abstract: We show that a three-dimensional (3D) fully gapped type-II superconductor can feature emergent in-gap Fermi surfaces of Caroli-de Gennes Matricon (CdGM) quasiparticles in the presence of an Abrikosov vortex lattice. In particular, these CdGM Fermi surfaces manifest in the emergent 3D band structure enabled by the intervortex tunneling physics, and their stability is guaranteed by a $\mathbb{Z}_2$…

    Submitted 17 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 6 + 9 pages, 3 + 6 figures

  36. arXiv:2402.17294  [pdf, ps, other]

    math.ST

    Advancing Continuous Distribution Generation: An Exponentiated Odds Ratio Generator Approach

    Authors: Xinyu Chen, Yuanqi Xie, Achraf Cohen, Shusen Pu

    Abstract: This paper presents a new methodology for generating continuous statistical distributions, integrating the exponentiated odds ratio within the framework of survival analysis. This new method enhances the flexibility and adaptability of distribution models to effectively address the complexities inherent in contemporary datasets. The core of this advancement is illustrated by introducing a particul…

    Submitted 27 February, 2024; originally announced February 2024.

    MSC Class: 62E99; 60E05

  37. arXiv:2402.09714  [pdf, other]

    math.OC cs.DC cs.MA

    An Accelerated Distributed Stochastic Gradient Method with Momentum

    Authors: Kun Huang, Shi Pu, Angelia Nedić

    Abstract: In this paper, we introduce an accelerated distributed stochastic gradient method with momentum for solving the distributed optimization problem, where a group of $n$ agents collaboratively minimize the average of the local objective functions over a connected network. The method, termed ``Distributed Stochastic Momentum Tracking (DSMT)'', is a single-loop algorithm that utilizes the momentum trac…

    Submitted 26 March, 2025; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 45 pages, 5 figures

  38. arXiv:2402.04540  [pdf, other]

    nucl-th

    Spin polarization in relativistic heavy-ion collisions

    Authors: Francesco Becattini, Matteo Buzzegoli, Takafumi Niida, Shi Pu, Ai-Hong Tang, Qun Wang

    Abstract: Polarization has opened a new physics chapter in relativistic heavy-ion collisions. Since the first prediction and experimental observation of global spin polarization, a lot of progress has been made in understanding its features, both at experimental and theoretical level. In this paper, we give an overview on the recent advances in this field. The covered topics include a review of measurements…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: RevTeX 4, 41 pages, 12 figures, review article as a book chapter for QGP6

  39. arXiv:2402.03672  [pdf, other]

    nucl-th

    The spin alignment of rho mesons in a pion gas

    Authors: Yi-Liang Yin, Wen-Bo Dong, Jin-Yi Pang, Shi Pu, Qun Wang

    Abstract: We study the spin alignment of neutral rho mesons in a pion gas using spin kinetic or Boltzmann equations. The $ρππ$ coupling is given by the chiral effective theory. The collision terms at the leading and next-to-leading order in spin Boltzmann equations are derived. The evolution of the spin density matrix of the neutral rho meson is simulated with different initial conditions. The numerical res…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: RevTex 4, 17 pages, 12 figures

  40. arXiv:2401.17352  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall

    Microscopic Model for Fractional Quantum Hall Nematics

    Authors: Songyang Pu, Ajit C. Balram, Joseph Taylor, Eduardo Fradkin, Zlatko Papić

    Abstract: Geometric fluctuations of the density mode in a fractional quantum Hall (FQH) state can give rise to a nematic FQH phase, a topological state with a spontaneously broken rotational symmetry. While experiments on FQH states in the second Landau level have reported signatures of putative FQH nematics in anisotropic transport, a realistic model for this state has been lacking. We show that the standa…

    Submitted 9 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Four figures in main text with supplementary information

    Journal ref: Phys. Rev. Lett. 132, 236503 (2024)

  41. arXiv:2401.09703  [pdf, other]

    math.NA

    Fast Updating Truncated SVD for Representation Learning with Sparse Matrices

    Authors: Haoran Deng, Yang Yang, Jiahe Li, Cheng Chen, Weihao Jiang, Shiliang Pu

    Abstract: Updating a truncated Singular Value Decomposition (SVD) is crucial in representation learning, especially when dealing with large-scale data matrices that continuously evolve in practical scenarios. Aligning SVD-based models with fast-paced updates becomes increasingly important. Existing methods for updating truncated SVDs employ Rayleigh-Ritz projection procedures, where projection matrices are…

    Submitted 17 January, 2024; originally announced January 2024.

  42. arXiv:2312.09979  [pdf, other]

    cs.CL

    LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin

    Authors: Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Jun Zhao, Wei Shen, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Supervised fine-tuning (SFT) is a crucial step for large language models (LLMs), enabling them to align with human instructions and enhance their capabilities in downstream tasks. Increasing instruction data substantially is a direct solution to align the model with a broader range of downstream tasks or notably improve its performance on a specific task. However, we find that large-scale increase…

    Submitted 8 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures

  43. arXiv:2312.09068  [pdf, other]

    nucl-th hep-ex hep-ph nucl-ex

    Global and local polarization of $Λ$ hyperons across RHIC-BES energies

    Authors: Xiang-Yu Wu, Cong Yi, Guang-You Qin, Shi Pu

    Abstract: We report our recent study on the global and local polarization of $Λ$ hyperons in Au+Au collisions at RHIC-BES energies within the (3+1)-dimensional CLVisc hydrodynamics framework. We present our numerical results for the global polarization as the function of collision energies and the local polarization along the beam direction as functions of azimuthal angle in $20-50$% centrality at…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 4 pages, 3 figures. Contribution to the proceedings of Quark Matter 2023 (Houston, TX, 3-9 Sep. 2023)

  44. arXiv:2312.06779  [pdf, other]

    cond-mat.mes-hall cond-mat.str-el

    Fingerprints of Composite Fermion Lambda Levels in Scanning Tunneling Microscopy

    Authors: Songyang Pu, Ajit C. Balram, Yuwen Hu, Yen-Chen Tsui, Minhao He, Nicolas Regnault, Michael P. Zaletel, Ali Yazdani, Zlatko Papić

    Abstract: Composite fermion (CF) is a topological quasiparticle that emerges from a non-perturbative attachment of vortices to electrons in strongly correlated two-dimensional materials. Similar to non-interacting fermions that form Landau levels in a magnetic field, CFs can fill analogous ``Lambda'' levels, giving rise to the fractional quantum Hall (FQH) effect of electrons. Here, we show that Lambda leve…

    Submitted 15 August, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Seven figures including supplementary materials

    Journal ref: Phys. Rev. B 110, L081107 (2024)

  45. arXiv:2311.15197  [pdf, other]

    hep-ph hep-th nucl-th

    Spin polarization and spin alignment from quantum kinetic theory with self-energy corrections

    Authors: Shuo Fang, Shi Pu, Di-Lun Yang

    Abstract: We derive the quantum kinetic theory for massive fermions with collision terms and self-energy corrections based on quantum field theory. We adopt an effective power counting scheme with $\hbar$ expansion to obtain the leading-order perturbative solutions of the vector and axial Wigner functions and the corresponding kinetic equations. We observe that both the onshell relation and the structure of…

    Submitted 27 March, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 52 pages, 1 table

    Journal ref: Phys.Rev.D 109 (2024) 3, 034034

  46. arXiv:2310.08298  [pdf, other]

    cs.CL

    MProto: Multi-Prototype Network with Denoised Optimal Transport for Distantly Supervised Named Entity Recognition

    Authors: Shuhui Wu, Yongliang Shen, Zeqi Tan, Wenqi Ren, Jietian Guo, Shiliang Pu, Weiming Lu

    Abstract: Distantly supervised named entity recognition (DS-NER) aims to locate entity mentions and classify their types with only knowledge bases or gazetteers and unlabeled corpus. However, distant annotations are noisy and degrade the performance of NER models. In this paper, we propose a noise-robust prototype network named MProto for the DS-NER task. Different from previous prototype-based NER methods,…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP-2023, camera ready version

  47. arXiv:2309.11708  [pdf, other]

    hep-th hep-ph nucl-th

    Stability and causality criteria in linear mode analysis: stability means causality

    Authors: Dong-Lin Wang, Shi Pu

    Abstract: Causality and stability are fundamental requirements for the differential equations describing predictable relativistic many-body systems. In this work, we investigate the stability and causality criteria in linear mode analysis. We discuss the updated stability criterion in 3+1 dimensional systems and introduce the improved sufficient criterion for causality. Our findings clearly demonstrate that…

    Submitted 20 February, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: 6+8 pages, 1 figure; references added, typos corrected

    Journal ref: Phys. Rev. D 109, L031504 (2024)

  48. arXiv:2309.04527  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall quant-ph

    Deformed Fredkin model for the $ν{=}5/2$ Moore-Read state on thin cylinders

    Authors: Cristian Voinea, Songyang Pu, Ammar Kirmani, Pouyan Ghaemi, Armin Rahmani, Zlatko Papić

    Abstract: We propose a frustration-free model for the Moore-Read quantum Hall state on sufficiently thin cylinders with circumferences $\lesssim 7$ magnetic lengths. While the Moore-Read Hamiltonian involves complicated long-range interactions between triplets of electrons in a Landau level, our effective model is a simpler one-dimensional chain of qubits with deformed Fredkin gates. We show that the ground…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 18 pages, 15 figures

    Journal ref: Phys. Rev. Research 6, 013105 (2024)

  49. arXiv:2308.14038  [pdf, other]

    nucl-th hep-ph

    Momentum dependence of $φ$ meson's spin alignment

    Authors: Xin-Li Sheng, Shi Pu, Qun Wang

    Abstract: We study the rapidity and azimuthal angle dependences of the global spin alignment $ρ_{00}$ for $φ$ mesons with respect to the reaction plane in Au+Au collisions at RHIC by the relativistic coalescence model in the spin transport theory. The global spin alignment of $φ$ mesons arises from local fluctuations of strong force fields whose values are extracted from the STAR's data. The calculated resu…

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: RevTex 4, 5 pages, 4 figures

  50. Learning to Pivot as a Smart Expert

    Authors: Tianhao Liu, Shanwen Pu, Dongdong Ge, Yinyu Ye

    Abstract: Linear programming has been practically solved mainly by simplex and interior point methods. Compared with the weakly polynomial complexity obtained by the interior point methods, the existence of strongly polynomial bounds for the length of the pivot path generated by the simplex methods remains a mystery. In this paper, we propose two novel pivot experts that leverage both global and local infor…

    Submitted 31 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.