Search | arXiv e-print repository

Quantum state preparation with optimal T-count

Authors: David Gosset, Robin Kothari, Kewen Wu

Abstract: How many T gates are needed to approximate an arbitrary $n$-qubit quantum state to within error $\varepsilon$? Improving prior work of Low, Kliuchnikov, and Schaeffer, we show that the optimal asymptotic scaling is $Θ\left(\sqrt{2^n\log(1/\varepsilon)}+\log(1/\varepsilon)\right)$ if we allow ancilla qubits. We also show that this is the optimal T-count for implementing an arbitrary diagonal $n$-qu… ▽ More How many T gates are needed to approximate an arbitrary $n$-qubit quantum state to within error $\varepsilon$? Improving prior work of Low, Kliuchnikov, and Schaeffer, we show that the optimal asymptotic scaling is $Θ\left(\sqrt{2^n\log(1/\varepsilon)}+\log(1/\varepsilon)\right)$ if we allow ancilla qubits. We also show that this is the optimal T-count for implementing an arbitrary diagonal $n$-qubit unitary to within error $\varepsilon$. We describe applications in which a tensor product of many single-qubit unitaries can be synthesized in parallel for the price of one. △ Less

Submitted 7 November, 2024; originally announced November 2024.

arXiv:2411.04643 [pdf, other]

A Micro-Macro Decomposition-Based Asymptotic-Preserving Random Feature Method for Multiscale Radiative Transfer Equations

Authors: Jingrun Chen, Zheng Ma, Keke Wu

Abstract: This paper introduces the Asymptotic-Preserving Random Feature Method (APRFM) for the efficient resolution of multiscale radiative transfer equations. The APRFM effectively addresses the challenges posed by stiffness and multiscale characteristics inherent in radiative transfer equations through the application of a micro-macro decomposition strategy. This approach decomposes the distribution func… ▽ More This paper introduces the Asymptotic-Preserving Random Feature Method (APRFM) for the efficient resolution of multiscale radiative transfer equations. The APRFM effectively addresses the challenges posed by stiffness and multiscale characteristics inherent in radiative transfer equations through the application of a micro-macro decomposition strategy. This approach decomposes the distribution function into equilibrium and non-equilibrium components, allowing for the approximation of both parts through the random feature method (RFM) within a least squares minimization framework. The proposed method exhibits remarkable robustness across different scales and achieves high accuracy with fewer degrees of freedom and collocation points than the vanilla RFM. Additionally, compared to the deep neural network-based method, our approach offers significant advantages in terms of parameter efficiency and computational speed. These benefits have been substantiated through numerous numerical experiments conducted on both one- and two-dimensional problems. △ Less

Submitted 18 May, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

arXiv:2411.04065 [pdf, other]

Imaging heat transport in suspended diamond nanostructures with integrated spin defect thermometers

Authors: Valentin Goblot, Kexin Wu, Enrico Di Lucente, Yuchun Zhu, Elena Losero, Quentin Jobert, Claudio Jaramillo Concha, Nicola Marzari, Michele Simoncelli, Christophe Galland

Abstract: Among all materials, mono-crystalline diamond has one of the highest measured thermal conductivities, with values above 2000 W/m/K at room temperature. This stems from momentum-conserving `normal' phonon-phonon scattering processes dominating over momentum-dissipating `Umklapp' processes, a feature that also suggests diamond as an ideal platform to experimentally investigate phonon heat transport… ▽ More Among all materials, mono-crystalline diamond has one of the highest measured thermal conductivities, with values above 2000 W/m/K at room temperature. This stems from momentum-conserving `normal' phonon-phonon scattering processes dominating over momentum-dissipating `Umklapp' processes, a feature that also suggests diamond as an ideal platform to experimentally investigate phonon heat transport phenomena that violate Fourier's law. Here, we introduce dilute nitrogen-vacancy color centers as in-situ, highly precise spin defect thermometers to image temperature inhomogeneities in single-crystal diamond microstructures heated from ambient conditions. We analyze cantilevers with cross-sections in the range from about 0.2 to 2.6 $\mathrm{μm}^2$, observing a relation between cross-section and heat flux that departs from Fourier's law predictions. We rationalize such behavior relying on first-principles simulations based on the linearized phonon Boltzmann transport equation, also discussing how fabrication-induced impurities influence conduction. Our temperature-imaging method can be applied to diamond devices of arbitrary geometry, paving the way for the exploration of unconventional, non-diffusive heat transport phenomena. △ Less

Submitted 26 November, 2024; v1 submitted 6 November, 2024; originally announced November 2024.

Comments: 8 pages, 4 figures + supplementary materials (8 pages, 7 figures)

arXiv:2411.03787 [pdf, other]

doi 10.1051/0004-6361/202452962

Unveiling the Binary Nature of NGC 2323

Authors: Songmei Qin, Jing Zhong, Tong Tang, Yueyue Jiang, Long Wang, Kai Wu, Friedrich Anders, Lola Balaguer-Núñez, Guimei Liu, Chunyan Li, Jinliang Hou, Li Chen

Abstract: As a well-known open cluster, NGC 2323 (also called M50) has been widely investigated for over a hundred years and has always been considered a classical single cluster. In this work, with the help of Gaia DR3, we study the binary structure nature of this cluster. Although indistinguishable in the spatial space, the small but undeniable difference in the proper motion indicates that they may be tw… ▽ More As a well-known open cluster, NGC 2323 (also called M50) has been widely investigated for over a hundred years and has always been considered a classical single cluster. In this work, with the help of Gaia DR3, we study the binary structure nature of this cluster. Although indistinguishable in the spatial space, the small but undeniable difference in the proper motion indicates that they may be two individual clusters. After investigating the properties of the two clusters, it is found that they have very close positions (three-dimensional $Δ$pos = 12.3 pc, $σ_{Δ\mathrm{pos}} = 3.4$ pc) and similar tangential velocities (two-dimensional $Δ$V = 2.2 km s$^{-1}$, $σ_{Δ\mathrm{V}} = 0.02$ km s$^{-1}$), indicating the existence of their physical association. Moreover, the best isochrone fitting ages of the two clusters are the same (158 Myr), further proving their possibly common origin. To comprehensively understand the formation and evolution of this binary cluster, we employ the PETAR $N$-body code to trace back their birthplace and deduce their dynamical evolutionary fate. With observational mean cluster properties, the simulations suggest that they may form together, and then orbit each other as a binary cluster for over 200 Myr. After that, because of their gradual mass loss, the two clusters will eventually separate and evolve into two independent clusters. Meanwhile, the numerical $N$-body simulation suggests that the less massive cluster is unlikely to be the cluster tidal tails created by the differential rotation of the Milky Way. △ Less

Submitted 6 November, 2024; originally announced November 2024.

Comments: 14 pages, 8 figures

Journal ref: A&A 693, A317 (2025)

arXiv:2411.03374 [pdf, other]

doi 10.33232/001c.137526

Detection of Thermal Emission at Millimeter Wavelengths from Low-Earth Orbit Satellites

Authors: A. Foster, A. Chokshi, A. J. Anderson, B. Ansarinejad, M. Archipley, L. Balkenhol, K. Benabed, A. N. Bender, D. R. Barron, B. A. Benson, F. Bianchini, L. E. Bleem, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, C. L. Chang, P. Chaubal, P. M. Chichura, T. -L. Chou, A. Coerver, T. M. Crawford, C. Daley, T. de Haan, K. R. Dibert , et al. (66 additional authors not shown)

Abstract: The detection of artificial satellite thermal emission at millimeter wavelengths is presented using data from the 3rd-Generation receiver on the South Pole Telescope (SPT-3G). This represents the first reported detection of thermal emission from artificial satellites at millimeter wavelengths. Satellite thermal emission is shown to be detectable at high signal-to-noise ratios on timescales as shor… ▽ More The detection of artificial satellite thermal emission at millimeter wavelengths is presented using data from the 3rd-Generation receiver on the South Pole Telescope (SPT-3G). This represents the first reported detection of thermal emission from artificial satellites at millimeter wavelengths. Satellite thermal emission is shown to be detectable at high signal-to-noise ratios on timescales as short as a few tens of milliseconds. An algorithm for downloading orbital information and tracking known satellites given observer constraints and time-ordered observatory pointing is described. Consequences for cosmological surveys and short-duration transient searches are discussed, revealing that the integrated thermal emission from all large satellites does not contribute significantly to the SPT-3G survey intensity map. Measured satellite positions are found to be discrepant from their two-line element (TLE) derived ephemerides up to several arcminutes which may present a difficulty in cross-checking or masking satellites from short-duration transient searches. △ Less

Submitted 29 April, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

arXiv:2411.02265 [pdf, other]

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Authors: Xingwu Sun, Yanfeng Chen, Yiqing Huang, Ruobing Xie, Jiaqi Zhu, Kai Zhang, Shuaipeng Li, Zhen Yang, Jonny Han, Xiaobo Shu, Jiahao Bu, Zhongzhi Chen, Xuemeng Huang, Fengzong Lian, Saiyong Yang, Jianfeng Yan, Yuyuan Zeng, Xiaoqin Ren, Chao Yu, Lulu Wu, Yue Mao, Jun Xia, Tao Yang, Suncong Zheng, Kan Wu , et al. (83 additional authors not shown)

Abstract: In this paper, we introduce Hunyuan-Large, which is currently the largest open-source Transformer-based mixture of experts model, with a total of 389 billion parameters and 52 billion activation parameters, capable of handling up to 256K tokens. We conduct a thorough evaluation of Hunyuan-Large's superior performance across various benchmarks including language understanding and generation, logica… ▽ More In this paper, we introduce Hunyuan-Large, which is currently the largest open-source Transformer-based mixture of experts model, with a total of 389 billion parameters and 52 billion activation parameters, capable of handling up to 256K tokens. We conduct a thorough evaluation of Hunyuan-Large's superior performance across various benchmarks including language understanding and generation, logical reasoning, mathematical problem-solving, coding, long-context, and aggregated tasks, where it outperforms LLama3.1-70B and exhibits comparable performance when compared to the significantly larger LLama3.1-405B model. Key practice of Hunyuan-Large include large-scale synthetic data that is orders larger than in previous literature, a mixed expert routing strategy, a key-value cache compression technique, and an expert-specific learning rate strategy. Additionally, we also investigate the scaling laws and learning rate schedule of mixture of experts models, providing valuable insights and guidances for future model development and optimization. The code and checkpoints of Hunyuan-Large are released to facilitate future innovations and applications. Codes: https://github.com/Tencent/Hunyuan-Large Models: https://huggingface.co/tencent/Tencent-Hunyuan-Large △ Less

Submitted 6 November, 2024; v1 submitted 4 November, 2024; originally announced November 2024.

Comments: 17 pages, 4 Figures

arXiv:2411.01230 [pdf, other]

Strengthening DeFi Security: A Static Analysis Approach to Flash Loan Vulnerabilities

Authors: Ka Wai Wu

Abstract: The rise of Decentralized Finance (DeFi) has brought novel financial opportunities but also exposed serious security vulnerabilities, with flash loans frequently exploited for price manipulation attacks. These attacks, leveraging the atomic nature of flash loans, allow malicious actors to manipulate DeFi protocol oracles and pricing mechanisms within a single transaction, causing substantial finan… ▽ More The rise of Decentralized Finance (DeFi) has brought novel financial opportunities but also exposed serious security vulnerabilities, with flash loans frequently exploited for price manipulation attacks. These attacks, leveraging the atomic nature of flash loans, allow malicious actors to manipulate DeFi protocol oracles and pricing mechanisms within a single transaction, causing substantial financial losses. Traditional smart contract analysis tools address some security risks but often struggle to detect the complex, inter-contract dependencies that make flash loan attacks challenging to identify. In response, we introduce FlashDeFier, an advanced detection framework that enhances static taint analysis to target price manipulation vulnerabilities arising from flash loans. FlashDeFier expands the scope of taint sources and sinks, enabling comprehensive analysis of data flows across DeFi protocols. The framework constructs detailed inter-contract call graphs to capture sophisticated data flow patterns, significantly improving detection accuracy. Tested against a dataset of high-profile DeFi incidents, FlashDeFier identifies 76.4% of price manipulation vulnerabilities, marking a 30% improvement over DeFiTainter. These results highlight the importance of adaptive detection frameworks that evolve alongside DeFi threats, underscoring the need for hybrid approaches combining static, dynamic, and symbolic analysis methods for resilient DeFi security. △ Less

Submitted 23 February, 2025; v1 submitted 2 November, 2024; originally announced November 2024.

arXiv:2411.01036 [pdf, ps, other]

Computation-Aware Gaussian Processes: Model Selection And Linear-Time Inference

Authors: Jonathan Wenger, Kaiwen Wu, Philipp Hennig, Jacob R. Gardner, Geoff Pleiss, John P. Cunningham

Abstract: Model selection in Gaussian processes scales prohibitively with the size of the training dataset, both in time and memory. While many approximations exist, all incur inevitable approximation error. Recent work accounts for this error in the form of computational uncertainty, which enables -- at the cost of quadratic complexity -- an explicit tradeoff between computation and precision. Here we exte… ▽ More Model selection in Gaussian processes scales prohibitively with the size of the training dataset, both in time and memory. While many approximations exist, all incur inevitable approximation error. Recent work accounts for this error in the form of computational uncertainty, which enables -- at the cost of quadratic complexity -- an explicit tradeoff between computation and precision. Here we extend this development to model selection, which requires significant enhancements to the existing approach, including linear-time scaling in the size of the dataset. We propose a novel training loss for hyperparameter optimization and demonstrate empirically that the resulting method can outperform SGPR, CGGP and SVGP, state-of-the-art methods for GP model selection, on medium to large-scale datasets. Our experiments show that model selection for computation-aware GPs trained on 1.8 million data points can be done within a few hours on a single GPU. As a result of this work, Gaussian processes can be trained on large-scale datasets without significantly compromising their ability to quantify uncertainty -- a fundamental prerequisite for optimal decision-making. △ Less

Submitted 7 July, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

Comments: Advances in Neural Information Processing Systems (NeurIPS 2024)

arXiv:2411.00419 [pdf, other]

Argus: Multi-View Egocentric Human Mesh Reconstruction Based on Stripped-Down Wearable mmWave Add-on

Authors: Di Duan, Shengzhe Lyu, Mu Yuan, Hongfei Xue, Tianxing Li, Weitao Xu, Kaishun Wu, Guoliang Xing

Abstract: In this paper, we propose Argus, a wearable add-on system based on stripped-down (i.e., compact, lightweight, low-power, limited-capability) mmWave radars. It is the first to achieve egocentric human mesh reconstruction in a multi-view manner. Compared with conventional frontal-view mmWave sensing solutions, it addresses several pain points, such as restricted sensing range, occlusion, and the mul… ▽ More In this paper, we propose Argus, a wearable add-on system based on stripped-down (i.e., compact, lightweight, low-power, limited-capability) mmWave radars. It is the first to achieve egocentric human mesh reconstruction in a multi-view manner. Compared with conventional frontal-view mmWave sensing solutions, it addresses several pain points, such as restricted sensing range, occlusion, and the multipath effect caused by surroundings. To overcome the limited capabilities of the stripped-down mmWave radars (with only one transmit antenna and three receive antennas), we tackle three main challenges and propose a holistic solution, including tailored hardware design, sophisticated signal processing, and a deep neural network optimized for high-dimensional complex point clouds. Extensive evaluation shows that Argus achieves performance comparable to traditional solutions based on high-capability mmWave radars, with an average vertex error of 6.5 cm, solely using stripped-down radars deployed in a multi-view configuration. It presents robustness and practicality across conditions, such as with unseen users and different host devices. △ Less

Submitted 1 November, 2024; originally announced November 2024.

Comments: 15 pages, 25 figures

ACM Class: C.3

arXiv:2410.21086 [pdf, ps, other]

Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving

Authors: Jiyao Wang, Xiao Yang, Zhenyu Wang, Ximeng Wei, Ange Wang, Dengbo He, Kaishun Wu

Abstract: Road safety remains a critical challenge worldwide, with approximately 1.35 million fatalities annually attributed to traffic accidents, often due to human errors. As we advance towards higher levels of vehicle automation, challenges still exist, as driving with automation can cognitively over-demand drivers if they engage in non-driving-related tasks (NDRTs), or lead to drowsiness if driving was… ▽ More Road safety remains a critical challenge worldwide, with approximately 1.35 million fatalities annually attributed to traffic accidents, often due to human errors. As we advance towards higher levels of vehicle automation, challenges still exist, as driving with automation can cognitively over-demand drivers if they engage in non-driving-related tasks (NDRTs), or lead to drowsiness if driving was the sole task. This calls for the urgent need for an effective Driver Monitoring System (DMS) that can evaluate cognitive load and drowsiness in SAE Level-2/3 autonomous driving contexts. In this study, we propose a novel multi-task DMS, termed VDMoE, which leverages RGB video input to monitor driver states non-invasively. By utilizing key facial features to minimize computational load and integrating remote Photoplethysmography (rPPG) for physiological insights, our approach enhances detection accuracy while maintaining efficiency. Additionally, we optimize the Mixture-of-Experts (MoE) framework to accommodate multi-modal inputs and improve performance across different tasks. A novel prior-inclusive regularization method is introduced to align model outputs with statistical priors, thus accelerating convergence and mitigating overfitting risks. We validate our method with the creation of a new dataset (MCDD), which comprises RGB video and physiological indicators from 42 participants, and two public datasets. Our findings demonstrate the effectiveness of VDMoE in monitoring driver states, contributing to safer autonomous driving systems. The code and data will be released. △ Less

Submitted 19 June, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

arXiv:2410.20855 [pdf, other]

ByteNet: Rethinking Multimedia File Fragment Classification through Visual Perspectives

Authors: Wenyang Liu, Kejun Wu, Tianyi Liu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau

Abstract: Multimedia file fragment classification (MFFC) aims to identify file fragment types, e.g., image/video, audio, and text without system metadata. It is of vital importance in multimedia storage and communication. Existing MFFC methods typically treat fragments as 1D byte sequences and emphasize the relations between separate bytes (interbytes) for classification. However, the more informative relat… ▽ More Multimedia file fragment classification (MFFC) aims to identify file fragment types, e.g., image/video, audio, and text without system metadata. It is of vital importance in multimedia storage and communication. Existing MFFC methods typically treat fragments as 1D byte sequences and emphasize the relations between separate bytes (interbytes) for classification. However, the more informative relations inside bytes (intrabytes) are overlooked and seldom investigated. By looking inside bytes, the bit-level details of file fragments can be accessed, enabling a more accurate classification. Motivated by this, we first propose Byte2Image, a novel visual representation model that incorporates previously overlooked intrabyte information into file fragments and reinterprets these fragments as 2D grayscale images. This model involves a sliding byte window to reveal the intrabyte information and a rowwise stacking of intrabyte ngrams for embedding fragments into a 2D space. Thus, complex interbyte and intrabyte correlations can be mined simultaneously using powerful vision networks. Additionally, we propose an end-to-end dual-branch network ByteNet to enhance robust correlation mining and feature representation. ByteNet makes full use of the raw 1D byte sequence and the converted 2D image through a shallow byte branch feature extraction (BBFE) and a deep image branch feature extraction (IBFE) network. In particular, the BBFE, composed of a single fully-connected layer, adaptively recognizes the co-occurrence of several some specific bytes within the raw byte sequence, while the IBFE, built on a vision Transformer, effectively mines the complex interbyte and intrabyte correlations from the converted image. Experiments on the two representative benchmarks, including 14 cases, validate that our proposed method outperforms state-of-the-art approaches on different cases by up to 12.2%. △ Less

Submitted 28 October, 2024; originally announced October 2024.

Comments: Accepted in TMM

arXiv:2410.20582 [pdf, other]

Evidence for a shock-compressed magnetic field in the northwestern rim of Vela Jr. from X-ray polarimetry

Authors: Dmitry A. Prokhorov, Yi-Jung Yang, Riccardo Ferrazzoli, Jacco Vink, Patrick Slane, Enrico Costa, Stefano Silvestri, Ping Zhou, Niccolò Bucciantini, Alessandro Di Marco, Martin C. Weisskopf, Luca Baldini, Victor Doroshenko, Steven R. Ehlert, Jeremy Heyl, Philip Kaaret, Dawoon E. Kim, Frédéric Marin, Tsunefumi Mizuno, Chi-Yung Ng, Melissa Pesce-Rollins, Carmelo Sgrò, Paolo Soffitta, Douglas A. Swartz, Toru Tamagawa , et al. (75 additional authors not shown)

Abstract: Synchrotron X-ray emission has been detected from nearly a dozen young supernova remnants (SNRs). X-rays of synchrotron origin exhibit linear polarization in a regular, non-randomly oriented magnetic field. The significant polarized X-ray emission from four such SNRs has already been reported on the basis of observations with the Imaging X-ray Polarimetry Explorer (IXPE). The magnetic-field struct… ▽ More Synchrotron X-ray emission has been detected from nearly a dozen young supernova remnants (SNRs). X-rays of synchrotron origin exhibit linear polarization in a regular, non-randomly oriented magnetic field. The significant polarized X-ray emission from four such SNRs has already been reported on the basis of observations with the Imaging X-ray Polarimetry Explorer (IXPE). The magnetic-field structure as derived from IXPE observations is radial for Cassiopeia A, Tycho's SNR, and SN 1006, and tangential for RX J1713.7-3946. The latter together with the recent detection of a tangential magnetic field in SNR 1E 0102.2-7219 by the Australia Telescope Compact Array in the radio band shows that tangential magnetic fields can also be present in young SNRs. Thus, the dichotomy in polarization between young and middle-aged SNRs (radial magnetic fields in young SNRs, but tangential magnetic fields in middle-aged SNRs), previously noticed in the radio band, deserves additional attention. The present analysis of IXPE observations determines, for the first time, a magnetic-field structure in the northwestern rim of Vela Jr, also known as RX J0852.0-4622, and provides a new example of a young SNR with a tangential magnetic field. △ Less

Submitted 27 October, 2024; originally announced October 2024.

Comments: Accepted for publication in Astronomy and Astrophysics

arXiv:2410.20121 [pdf, other]

Postprocessing of tilt-to-length noise with coefficient drifts in TianQin using a null time-delay interferometry channel

Authors: Zhizhao Wang, Shuju Yang, Kaihang Wu, Xiaojie Wang, Huizong Duan, Yurong Liang, Xuefeng Zhang, Hsien-Chi Yeh

Abstract: Tilt-to-length (TTL) coupling is expected to be one of the major noise sources in the interferometric phase readouts in TianQin mission. Arising from the angular motion of spacecraft (SC) and the onboard movable optical subassemblies (MOSAs), TTL noise needs to be removed in postprocessing after suppressing the laser phase noise with time-delay interferometry (TDI) technique. In this article, we s… ▽ More Tilt-to-length (TTL) coupling is expected to be one of the major noise sources in the interferometric phase readouts in TianQin mission. Arising from the angular motion of spacecraft (SC) and the onboard movable optical subassemblies (MOSAs), TTL noise needs to be removed in postprocessing after suppressing the laser phase noise with time-delay interferometry (TDI) technique. In this article, we show that we can estimate the TTL coupling coefficients using the null TDI channel ζ and remove the TTL noise in the commonly used Michelson variables with the estimated coefficients. We introduce the theoretical model of TTL noise in TDI and consider linear drifts in the linear TTL coefficients for noise estimation and subtraction. The TTL coefficients with drifts are estimated successfully with an accuracy of 10 μm/rad in our numerical simulation. We discuss the impact of point-ahead angle compensation error and wavefront error, and find it necessary to estimate linear drift coefficients and quadratic TTL coefficients to keep TTL noise residuals below the 0.3 pm noise reference curve. However, the estimation accuracy suffers greatly from the correlation between yaw jitter measurements that contain the same SC jitter. Assuming all angular jitters induced by MOSAs are independent, choosing a frequency range with relatively higher MOSA yaw jitter noise levels is beneficial to the TTL coefficient estimation. △ Less

Submitted 26 October, 2024; originally announced October 2024.

arXiv:2410.19983 [pdf, other]

A Two-Week $IXPE$ Monitoring Campaign on Mrk 421

Authors: W. Peter Maksym, Ioannis Liodakis, M. Lynne Saade, Dawoon E. Kim, Riccardo Middei, Laura Di Gesu, Sebastian Kiehlmann, Gabriele Matzeu, Iván Agudo, Alan P. Marscher, Steven R. Ehlert, Svetlana G. Jorstad, Philip Kaaret, Herman L. Marshall, Luigi Pacciani, Matteo Perri, Simonetta Puccetti, Pouya M. Kouch, Elina Lindfors, Francisco José Aceituno, Giacomo Bonnoli, Víctor Casanova, Juan Escudero, Beatriz Agís-González, César Husillos , et al. (131 additional authors not shown)

Abstract: X-ray polarization is a unique new probe of the particle acceleration in astrophysical jets made possible through the Imaging X-ray Polarimetry Explorer. Here we report on the first dense X-ray polarization monitoring campaign on the blazar Mrk 421. Our observations were accompanied by an even denser radio and optical polarization campaign. We find significant short-timescale variability in both X… ▽ More X-ray polarization is a unique new probe of the particle acceleration in astrophysical jets made possible through the Imaging X-ray Polarimetry Explorer. Here we report on the first dense X-ray polarization monitoring campaign on the blazar Mrk 421. Our observations were accompanied by an even denser radio and optical polarization campaign. We find significant short-timescale variability in both X-ray polarization degree and angle, including a $\sim90^\circ$ angle rotation about the jet axis. We attribute this to random variations of the magnetic field, consistent with the presence of turbulence but also unlikely to be explained by turbulence alone. At the same time, the degree of lower-energy polarization is significantly lower and shows no more than mild variability. Our campaign provides further evidence for a scenario in which energy-stratified shock-acceleration of relativistic electrons, combined with a turbulent magnetic field, is responsible for optical to X-ray synchrotron emission in blazar jets. △ Less

Submitted 25 October, 2024; originally announced October 2024.

Comments: 23 pages, including 8 pages of appendices. 12 figures, 3 tables. Submitted to ApJ

arXiv:2410.15252 [pdf, other]

Lossless KV Cache Compression to 2%

Authors: Zhen Yang, J. N. Han, Kan Wu, Ruobing Xie, An Wang, Xingwu Sun, Zhanhui Kang

Abstract: Large language models have revolutionized data processing in numerous domains, with their ability to handle extended context reasoning receiving notable recognition. To speed up inference, maintaining a key-value (KV) cache memory is essential. Nonetheless, the growing demands for KV cache memory create significant hurdles for efficient implementation. This work introduces a novel architecture, Cr… ▽ More Large language models have revolutionized data processing in numerous domains, with their ability to handle extended context reasoning receiving notable recognition. To speed up inference, maintaining a key-value (KV) cache memory is essential. Nonetheless, the growing demands for KV cache memory create significant hurdles for efficient implementation. This work introduces a novel architecture, Cross-Layer Latent Attention (CLLA), aimed at compressing the KV cache to less than 2% of its original size while maintaining comparable performance levels. CLLA integrates multiple aspects of KV cache compression, including attention head/dimension reduction, layer sharing, and quantization techniques, into a cohesive framework. Our extensive experiments demonstrate that CLLA achieves lossless performance on most tasks while utilizing minimal KV cache, marking a significant advancement in practical KV cache compression. △ Less

Submitted 19 October, 2024; originally announced October 2024.

arXiv:2410.14292 [pdf, ps, other]

Bound preserving Point-Average-Moment PolynomiAl-interpreted (PAMPA) scheme: one-dimensional case

Authors: Rémi Abgrall, Miaosen Jiao, Yongle Liu, Kailiang Wu

Abstract: We propose a bound-preserving (BP) Point-Average-Moment PolynomiAl-interpreted (PAMPA) scheme by blending third-order and first-order constructions. The originality of the present construction is that it does not need any explicit reconstruction within each element, and therefore the construction is very flexible. The scheme employs a classical blending approach between a first-order BP scheme and… ▽ More We propose a bound-preserving (BP) Point-Average-Moment PolynomiAl-interpreted (PAMPA) scheme by blending third-order and first-order constructions. The originality of the present construction is that it does not need any explicit reconstruction within each element, and therefore the construction is very flexible. The scheme employs a classical blending approach between a first-order BP scheme and a high-order scheme that does not inherently preserve bounds. The proposed BP PAMPA scheme demonstrates effectiveness across a range of problems, from scalar cases to systems such as the Euler equations of gas dynamics. We derive optimal blending parameters for both scalar and system cases, with the latter based on the recent geometric quasi-linearization (GQL) framework of [Wu \& Shu, {\em SIAM Review}, 65 (2023), pp. 1031--1073]. This yields explicit, optimal blending coefficients that ensure positivity and control spurious oscillations in both point values and cell averages. This framework incorporates a convex blending of fluxes and residuals from both high-order and first-order updates, facilitating a rigorous BP property analysis. Sufficient conditions for the BP property are established, ensuring robustness while preserving high-order accuracy. Numerical tests confirm the effectiveness of the BP PAMPA scheme on several challenging problems. △ Less

Submitted 16 June, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

arXiv:2410.13229 [pdf, other]

Quamba: A Post-Training Quantization Recipe for Selective State Space Models

Authors: Hung-Yueh Chiang, Chi-Chih Chang, Natalia Frumkin, Kai-Chiang Wu, Diana Marculescu

Abstract: State Space Models (SSMs) have emerged as an appealing alternative to Transformers for large language models, achieving state-of-the-art accuracy with constant memory complexity which allows for holding longer context lengths than attention-based networks. The superior computational efficiency of SSMs in long sequence modeling positions them favorably over Transformers in many scenarios. However,… ▽ More State Space Models (SSMs) have emerged as an appealing alternative to Transformers for large language models, achieving state-of-the-art accuracy with constant memory complexity which allows for holding longer context lengths than attention-based networks. The superior computational efficiency of SSMs in long sequence modeling positions them favorably over Transformers in many scenarios. However, improving the efficiency of SSMs on request-intensive cloud-serving and resource-limited edge applications is still a formidable task. SSM quantization is a possible solution to this problem, making SSMs more suitable for wide deployment, while still maintaining their accuracy. Quantization is a common technique to reduce the model size and to utilize the low bit-width acceleration features on modern computing units, yet existing quantization techniques are poorly suited for SSMs. Most notably, SSMs have highly sensitive feature maps within the selective scan mechanism (i.e., linear recurrence) and massive outliers in the output activations which are not present in the output of token-mixing in the self-attention modules. To address this issue, we propose a static 8-bit per-tensor SSM quantization method which suppresses the maximum values of the input activations to the selective SSM for finer quantization precision and quantizes the output activations in an outlier-free space with Hadamard transform. Our 8-bit weight-activation quantized Mamba 2.8B SSM benefits from hardware acceleration and achieves a 1.72x lower generation latency on an Nvidia Orin Nano 8G, with only a 0.9% drop in average accuracy on zero-shot tasks. The experiments demonstrate the effectiveness and practical applicability of our approach for deploying SSM-based models of all sizes on both cloud and edge platforms. △ Less

Submitted 7 December, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

arXiv:2410.12089 [pdf, other]

BICEP/Keck XVIII: Measurement of BICEP3 polarization angles and consequences for constraining cosmic birefringence and inflation

Authors: BICEP/Keck Collaboration, :, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, D. Beck, J. J. Bock, H. Boenish, V. Buza, J. R. Cheshire IV, J. Connors, J. Cornelison, M. Crumrine, A. J. Cukierman, E. Denison, L. Duband, M. Eiben, B. D. Elwood, S. Fatigoni, J. P. Filippini, A. Fortes, M. Gao , et al. (62 additional authors not shown)

Abstract: We use a custom-made calibrator to measure individual detectors' polarization angles of BICEP3, a small aperture telescope observing the cosmic microwave background (CMB) at 95GHz from the South Pole. We describe our calibration strategy and the statistical and systematic uncertainties associated with the measurement. We reach an unprecedented precision for such measurement on a CMB experiment, wi… ▽ More We use a custom-made calibrator to measure individual detectors' polarization angles of BICEP3, a small aperture telescope observing the cosmic microwave background (CMB) at 95GHz from the South Pole. We describe our calibration strategy and the statistical and systematic uncertainties associated with the measurement. We reach an unprecedented precision for such measurement on a CMB experiment, with a repeatability for each detector pair of $0.02°$. We show that the relative angles measured using this method are in excellent agreement with those extracted from CMB data. Because the absolute measurement is currently limited by a systematic uncertainty, we do not derive cosmic birefringence constraints from BICEP3 data in this work. Rather, we forecast the sensitivity of BICEP3 sky maps for such analysis. We investigate the relative contributions of instrument noise, lensing, and dust, as well as astrophysical and instrumental systematics. We also explore the constraining power of different angle estimators, depending on analysis choices. We establish that the BICEP3 2-year dataset (2017--2018) has an on-sky sensitivity to the cosmic birefringence angle of $σ= 0.078°$, which could be improved to $σ= 0.055°$ by adding all of the existing BICEP3 data (through 2023). Furthermore, we emphasize the possibility of using the BICEP3 sky patch as a polarization calibration source for CMB experiments, which with the present data could reach a precision of $0.035°$. Finally, in the context of inflation searches, we investigate the impact of detector-to-detector variations in polarization angles as they may bias the tensor-to-scalar ratio r. We show that while the effect is expected to remain subdominant to other sources of systematic uncertainty, it can be reliably calibrated using polarization angle measurements such as the ones we present in this paper. △ Less

Submitted 17 February, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

Comments: 29 Pages, 17 Figures, 6 Tables, as submitted to PRD. Visit bicepkeck.org for figure pdfs/pngs

arXiv:2410.07903 [pdf, other]

doi 10.1051/0004-6361/202451390

Forgotten treasures in the HST/FOC UV imaging polarimetric archives of active galactic nuclei III. Five years monitoring of M87

Authors: F. Marin, T. Barnouin, K. Wu, E. Lopez-Rodriguez

Abstract: The active galactic nucleus (AGN) within M87, a giant elliptical galaxy, is responsible for one of the closest kiloparsec-scale relativistic jet to Earth. We unearthed unpublished M87 polarization maps taken with the Faint Object Camera (FOC) aboard the Hubble Space Telescope (HST), obtained between 1995 and 1999. At a rate of one observation per year, we can follow the evolution of the polarized… ▽ More The active galactic nucleus (AGN) within M87, a giant elliptical galaxy, is responsible for one of the closest kiloparsec-scale relativistic jet to Earth. We unearthed unpublished M87 polarization maps taken with the Faint Object Camera (FOC) aboard the Hubble Space Telescope (HST), obtained between 1995 and 1999. At a rate of one observation per year, we can follow the evolution of the polarized flux knots in the jet. We can thus constrain the time scale of variation of the magnetic field up to a spatial resolution of one tenth of an arcsecond (11.5 pc). After coherently reducing the five observations using the same methodology presented in the first paper of this series, the analysis of polarized maps from POS 1 (base of the jet) and POS 3 (end of the jet) reveals significant temporal and spatial dynamics in the jet's magnetic fields morphology. Despite minimal changes in overall intensity structure, notable fluctuations in polarization degrees and angles are detected across various knots and inter-knot regions. In addition, the emission and polarization characteristics of M87's jet differ significantly between POS1 and POS3. POS1 shows a more collimated jet with strong variability in polarization, while POS3 reveals a thicker structure, a quasi-absence of variability and complex magnetic field interactions. This suggests that the jet may have co-axial structures with distinct kinetic properties. Theoretical models like the jet-in-jet scenario, featuring double helical magnetic flux ropes, help explain these observations, indicating a strong density contrast and higher speeds in the inner jet. △ Less

Submitted 10 October, 2024; originally announced October 2024.

Comments: 26 pages, 16 figures, 5 tables, accepted for publication in A&A

MSC Class: 85-06 ACM Class: J.2.3; J.2.9

arXiv:2410.06446

Machine Unlearning in Forgettability Sequence

Authors: Junjie Chen, Qian Chen, Jian Lou, Xiaoyu Zhang, Kai Wu, Zilong Wang

Abstract: Machine unlearning (MU) is becoming a promising paradigm to achieve the "right to be forgotten", where the training trace of any chosen data points could be eliminated, while maintaining the model utility on general testing samples after unlearning. With the advancement of forgetting research, many fundamental open questions remain unanswered: do different samples exhibit varying levels of difficu… ▽ More Machine unlearning (MU) is becoming a promising paradigm to achieve the "right to be forgotten", where the training trace of any chosen data points could be eliminated, while maintaining the model utility on general testing samples after unlearning. With the advancement of forgetting research, many fundamental open questions remain unanswered: do different samples exhibit varying levels of difficulty in being forgotten? Further, does the sequence in which samples are forgotten, determined by their respective difficulty levels, influence the performance of forgetting algorithms? In this paper, we identify key factor affecting unlearning difficulty and the performance of unlearning algorithms. We find that samples with higher privacy risks are more likely to be unlearning, indicating that the unlearning difficulty varies among different samples which motives a more precise unlearning mode. Built upon this insight, we propose a general unlearning framework, dubbed RSU, which consists of Ranking module and SeqUnlearn module. △ Less

Submitted 21 October, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

Comments: The senior authors of the draft are not fully convinced that the novelty is significant enough for this submission compared to the latest research progress in this area. Additionally, the senior authors have identified writing issues. Based on these two reasons, we have decided to withdraw the draft from arXiv

arXiv:2410.05173 [pdf, other]

Provably Positivity-Preserving Constrained Transport (PPCT) Second-Order Scheme for Ideal Magnetohydrodynamics

Authors: Dongwen Pang, Kailiang Wu

Abstract: This paper proposes and analyzes a robust and efficient second-order positivity-preserving constrained transport (PPCT) scheme for ideal magnetohydrodynamics (MHD) on non-staggered Cartesian meshes. The PPCT scheme ensures two critical physical constraints: a globally discrete divergence-free (DDF) condition on the magnetic field and the positivity of density and pressure. The method is inspired b… ▽ More This paper proposes and analyzes a robust and efficient second-order positivity-preserving constrained transport (PPCT) scheme for ideal magnetohydrodynamics (MHD) on non-staggered Cartesian meshes. The PPCT scheme ensures two critical physical constraints: a globally discrete divergence-free (DDF) condition on the magnetic field and the positivity of density and pressure. The method is inspired by a novel splitting technique from [T.A. Dao, M. Nazarov and I. Tomas, J. Comput. Phys., 508:113009, 2024], which divides the MHD system into an Euler subsystem with steady magnetic fields and a magnetic subsystem with steady density and internal energy. To achieve these structure-preserving properties, the PPCT scheme combines a positivity-preserving (PP) finite volume method for the Euler subsystem with a finite difference constrained transport (CT) method for the magnetic subsystem via Strang splitting. The finite volume method employs a new PP limiter that retains second-order accuracy and enforces the positivity of density and pressure, with rigorous proof provided using the geometric quasilinearization (GQL) approach [K. Wu and C.-W. Shu, SIAM Review, 65:1031-1073, 2023]. For the magnetic subsystem, we develop an implicit finite difference CT method that conserves energy and maintains a globally DDF constraint. This nonlinear system is efficiently solved to machine precision using an iterative algorithm. Since the CT method is unconditionally energy-stable and conserves steady density and internal energy, the PPCT scheme requires only a mild CFL condition for the finite volume method to ensure stability and the PP property. While the focus is on 2D cases for clarity, the extension to 3D is discussed. Several challenging numerical experiments, including highly magnetized MHD jets with high Mach numbers, validate the PPCT scheme's accuracy, robustness, and high resolution. △ Less

Submitted 7 October, 2024; originally announced October 2024.

Comments: 47 pages

arXiv:2410.05000 [pdf, other]

Robust Discontinuous Galerkin Methods Maintaining Physical Constraints for General Relativistic Hydrodynamics

Authors: Huihui Cao, Manting Peng, Kailiang Wu

Abstract: Simulating general relativistic hydrodynamics (GRHD) presents challenges such as handling curved spacetime, achieving high-order shock-capturing accuracy, and preserving key physical constraints (positive density, pressure, and subluminal velocity) under nonlinear coupling. This paper introduces high-order, physical-constraint-preserving, oscillation-eliminating discontinuous Galerkin (PCP-OEDG) s… ▽ More Simulating general relativistic hydrodynamics (GRHD) presents challenges such as handling curved spacetime, achieving high-order shock-capturing accuracy, and preserving key physical constraints (positive density, pressure, and subluminal velocity) under nonlinear coupling. This paper introduces high-order, physical-constraint-preserving, oscillation-eliminating discontinuous Galerkin (PCP-OEDG) schemes with Harten-Lax-van Leer flux for GRHD. To suppress spurious oscillations near discontinuities, we incorporate a computationally efficient oscillation-eliminating (OE) procedure based on a linear damping equation, maintaining accuracy and avoiding complex characteristic decomposition. To enhance stability and robustness, we construct PCP schemes using the W-form of GRHD equations with Cholesky decomposition of the spatial metric, addressing the non-equivalence of admissible state sets in curved spacetime. We rigorously prove the PCP property of cell averages via technical estimates and the Geometric Quasi-Linearization (GQL) approach, which transforms nonlinear constraints into linear forms. Additionally, we present provably convergent PCP iterative algorithms for robust recovery of primitive variables, ensuring physical constraints are satisfied throughout. The PCP-OEDG method is validated through extensive tests, demonstrating its robustness, accuracy, and capability to handle extreme GRHD scenarios involving strong shocks, high Lorentz factors, and intense gravitational fields. △ Less

Submitted 7 October, 2024; originally announced October 2024.

Comments: 54 pages, 18 figures

arXiv:2410.01285 [pdf, other]

Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration

Authors: Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng

Abstract: The black-box nature of large language models (LLMs) poses challenges in interpreting results, impacting issues such as data intellectual property protection and hallucination tracing. Training data attribution (TDA) methods are considered effective solutions to address these challenges. Most recent TDA methods rely on influence functions, assuming the model achieves minimized empirical risk. Howe… ▽ More The black-box nature of large language models (LLMs) poses challenges in interpreting results, impacting issues such as data intellectual property protection and hallucination tracing. Training data attribution (TDA) methods are considered effective solutions to address these challenges. Most recent TDA methods rely on influence functions, assuming the model achieves minimized empirical risk. However, achieving this criterion is difficult, and sourcing accuracy can be compromised by fitting errors during model training. In this paper, we introduce a novel TDA method called Debias and Denoise Attribution (DDA), which enhances influence functions by addressing fitting errors. Specifically, the debias strategy seeks to improve the performance of influence functions by eliminating the knowledge bias present in the base model before fine-tuning, while the denoise strategy aims to reduce discrepancies in influence scores arising from varying degrees of fitting during the training process through smoothing techniques. Experimental results demonstrate that our method significantly outperforms existing approaches, achieving an averaged AUC of 91.64%. Moreover, DDA exhibits strong generality and scalability across various sources and different-scale models like LLaMA2, QWEN2, and Mistral. △ Less

Submitted 19 November, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

Comments: Accepted to the EMNLP 2024 main

arXiv:2410.00710 [pdf, ps, other]

A potential theory for the Wess--Zumino--Witten equation in the space of Kähler potentials

Authors: Kuang-Ru Wu

Abstract: We develop a potential theory for the Wess--Zumino--Witten (WZW) equation in the space of Kähler potentials which is parallel to the potential theory for the Hermitian--Yang--Mills equation. A concept called $ω$-harmonicity on graphs is introduced which characterizes the WZW equation. We also show that, with respect to a Banach--Mazur type distance function, the distance between two solutions of t… ▽ More We develop a potential theory for the Wess--Zumino--Witten (WZW) equation in the space of Kähler potentials which is parallel to the potential theory for the Hermitian--Yang--Mills equation. A concept called $ω$-harmonicity on graphs is introduced which characterizes the WZW equation. We also show that, with respect to a Banach--Mazur type distance function, the distance between two solutions of the WZW equation is subharmonic. The harmonic map into the space of Kähler potentials, as a special case of the WZW equation, is also investigated. In particular, we show the solvability of the Dirichlet problem for the harmonic map, and the approximation/quantization by its finite dimensional counterparts. △ Less

Submitted 1 October, 2024; originally announced October 2024.

Comments: 20 pages

MSC Class: 32Q15; 32U05

arXiv:2410.00356 [pdf, other]

A Digital Twin Framework for Physical-Virtual Integration in V2X-Enabled Connected Vehicle Corridors

Authors: Keshu Wu, Pei Li, Yang Cheng, Steven T. Parker, Bin Ran, David A. Noyce, Xinyue Ye

Abstract: Transportation Cyber-Physical Systems (T-CPS) enhance safety and mobility by integrating cyber and physical transportation systems. A key component of T-CPS is the Digital Twin (DT), a virtual representation that enables simulation, analysis, and optimization through real-time data exchange and communication. Although existing studies have explored DTs for vehicles, communications, pedestrians, an… ▽ More Transportation Cyber-Physical Systems (T-CPS) enhance safety and mobility by integrating cyber and physical transportation systems. A key component of T-CPS is the Digital Twin (DT), a virtual representation that enables simulation, analysis, and optimization through real-time data exchange and communication. Although existing studies have explored DTs for vehicles, communications, pedestrians, and traffic, real-world validations and implementations of DTs that encompass infrastructure, vehicles, signals, communications, and more remain limited due to several challenges. These include accessing real-world connected infrastructure, integrating heterogeneous, multi-sourced data, ensuring real-time data processing, and synchronizing the digital and physical systems. To address these challenges, this study develops a traffic DT based on a real-world connected vehicle corridor. Leveraging the Cellular Vehicle-to-Everything (C-V2X) infrastructure in the corridor, along with communication, computing, and simulation technologies, the proposed DT accurately replicates physical vehicle behaviors, signal timing, communications, and traffic patterns within the virtual environment. Building upon the previous data pipeline, the digital system ensures robust synchronization with the physical environment. Moreover, the DT's scalable and redundant architecture enhances data integrity, making it capable of supporting future large-scale C-V2X deployments. Furthermore, its ability to provide feedback to the physical system is demonstrated through applications such as signal timing adjustments, vehicle advisory messages, and incident notifications. The proposed DT is a vital tool in T-CPS, enabling real-time traffic monitoring, prediction, and optimization to enhance the reliability and safety of transportation systems. △ Less

Submitted 26 February, 2025; v1 submitted 30 September, 2024; originally announced October 2024.

arXiv:2409.20414 [pdf]

KANDU-Net:A Dual-Channel U-Net with KAN for Medical Image Segmentation

Authors: Chenglin Fang, Kaigui Wu

Abstract: The U-Net model has consistently demonstrated strong performance in the field of medical image segmentation, with various improvements and enhancements made since its introduction. This paper presents a novel architecture that integrates KAN networks with U-Net, leveraging the powerful nonlinear representation capabilities of KAN networks alongside the established strengths of U-Net. We introduce… ▽ More The U-Net model has consistently demonstrated strong performance in the field of medical image segmentation, with various improvements and enhancements made since its introduction. This paper presents a novel architecture that integrates KAN networks with U-Net, leveraging the powerful nonlinear representation capabilities of KAN networks alongside the established strengths of U-Net. We introduce a KAN-convolution dual-channel structure that enables the model to more effectively capture both local and global features. We explore effective methods for fusing features extracted by KAN with those obtained through convolutional layers, utilizing an auxiliary network to facilitate this integration process. Experiments conducted across multiple datasets show that our model performs well in terms of accuracy, indicating that the KAN-convolution dual-channel approach has significant potential in medical image segmentation tasks. △ Less

Submitted 30 September, 2024; originally announced September 2024.

arXiv:2409.19220 [pdf]

Extending Depth of Field for Varifocal Multiview Images

Authors: Zhilong Li, Kejun Wu, Qiong Liu, You Yang

Abstract: Optical imaging systems are generally limited by the depth of field because of the nature of the optics. Therefore, extending depth of field (EDoF) is a fundamental task for meeting the requirements of emerging visual applications. To solve this task, the common practice is using multi-focus images from a single viewpoint. This method can obtain acceptable quality of EDoF under the condition of fi… ▽ More Optical imaging systems are generally limited by the depth of field because of the nature of the optics. Therefore, extending depth of field (EDoF) is a fundamental task for meeting the requirements of emerging visual applications. To solve this task, the common practice is using multi-focus images from a single viewpoint. This method can obtain acceptable quality of EDoF under the condition of fixed field of view, but it is only applicable to static scenes and the field of view is limited and fixed. An emerging data type, varifocal multiview images have the potential to become a new paradigm for solving the EDoF, because the data contains more field of view information than multi-focus images. To realize EDoF of varifocal multiview images, we propose an end-to-end method for the EDoF, including image alignment, image optimization and image fusion. Experimental results demonstrate the efficiency of the proposed method. △ Less

Submitted 27 September, 2024; originally announced September 2024.

arXiv:2409.18707 [pdf, other]

Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation

Authors: Kun Wu, Yichen Zhu, Jinming Li, Junjie Wen, Ning Liu, Zhiyuan Xu, Jian Tang

Abstract: Learning visuomotor policy for multi-task robotic manipulation has been a long-standing challenge for the robotics community. The difficulty lies in the diversity of action space: typically, a goal can be accomplished in multiple ways, resulting in a multimodal action distribution for a single task. The complexity of action distribution escalates as the number of tasks increases. In this work, we… ▽ More Learning visuomotor policy for multi-task robotic manipulation has been a long-standing challenge for the robotics community. The difficulty lies in the diversity of action space: typically, a goal can be accomplished in multiple ways, resulting in a multimodal action distribution for a single task. The complexity of action distribution escalates as the number of tasks increases. In this work, we propose \textbf{Discrete Policy}, a robot learning method for training universal agents capable of multi-task manipulation skills. Discrete Policy employs vector quantization to map action sequences into a discrete latent space, facilitating the learning of task-specific codes. These codes are then reconstructed into the action space conditioned on observations and language instruction. We evaluate our method on both simulation and multiple real-world embodiments, including both single-arm and bimanual robot settings. We demonstrate that our proposed Discrete Policy outperforms a well-established Diffusion Policy baseline and many state-of-the-art approaches, including ACT, Octo, and OpenVLA. For example, in a real-world multi-task training setting with five tasks, Discrete Policy achieves an average success rate that is 26\% higher than Diffusion Policy and 15\% higher than OpenVLA. As the number of tasks increases to 12, the performance gap between Discrete Policy and Diffusion Policy widens to 32.5\%, further showcasing the advantages of our approach. Our work empirically demonstrates that learning multi-task policies within the latent space is a vital step toward achieving general-purpose agents. △ Less

Submitted 21 March, 2025; v1 submitted 27 September, 2024; originally announced September 2024.

Comments: Accept to ICRA 2025

arXiv:2409.17624 [pdf, other]

HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting

Authors: Zijun Xu, Rui Jin, Ke Wu, Yi Zhao, Zhiwei Zhang, Jieru Zhao, Fei Gao, Zhongxue Gan, Wenchao Ding

Abstract: In complex missions such as search and rescue,robots must make intelligent decisions in unknown environments, relying on their ability to perceive and understand their surroundings. High-quality and real-time reconstruction enhances situational awareness and is crucial for intelligent robotics. Traditional methods often struggle with poor scene representation or are too slow for real-time use. Ins… ▽ More In complex missions such as search and rescue,robots must make intelligent decisions in unknown environments, relying on their ability to perceive and understand their surroundings. High-quality and real-time reconstruction enhances situational awareness and is crucial for intelligent robotics. Traditional methods often struggle with poor scene representation or are too slow for real-time use. Inspired by the efficacy of 3D Gaussian Splatting (3DGS), we propose a hierarchical planning framework for fast and high-fidelity active reconstruction. Our method evaluates completion and quality gain to adaptively guide reconstruction, integrating global and local planning for efficiency. Experiments in simulated and real-world environments show our approach outperforms existing real-time methods. △ Less

Submitted 9 October, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

arXiv:2409.17429 [pdf, other]

Real-World Data Inspired Interactive Connected Traffic Scenario Generation

Authors: Junwei You, Pei Li, Yang Cheng, Keshu Wu, Rui Gan, Steven T. Parker, Bin Ran

Abstract: Simulation is a crucial step in ensuring accurate, efficient, and realistic Connected and Autonomous Vehicles (CAVs) testing and validation. As the adoption of CAV accelerates, the integration of real-world data into simulation environments becomes increasingly critical. Among various technologies utilized by CAVs, Vehicle-to-Everything (V2X) communication plays a crucial role in ensuring a seamle… ▽ More Simulation is a crucial step in ensuring accurate, efficient, and realistic Connected and Autonomous Vehicles (CAVs) testing and validation. As the adoption of CAV accelerates, the integration of real-world data into simulation environments becomes increasingly critical. Among various technologies utilized by CAVs, Vehicle-to-Everything (V2X) communication plays a crucial role in ensuring a seamless transmission of information between CAVs, infrastructure, and other road users. However, most existing studies have focused on developing and testing communication protocols, resource allocation strategies, and data dissemination techniques in V2X. There is a gap where real-world V2X data is integrated into simulations to generate diverse and high-fidelity traffic scenarios. To fulfill this research gap, we leverage real-world Signal Phase and Timing (SPaT) data from Roadside Units (RSUs) to enhance the fidelity of CAV simulations. Moreover, we developed an algorithm that enables Autonomous Vehicles (AVs) to respond dynamically to real-time traffic signal data, simulating realistic V2X communication scenarios. Such high-fidelity simulation environments can generate multimodal data, including trajectory, semantic camera, depth camera, and bird's eye view data for various traffic scenarios. The generated scenarios and data provide invaluable insights into AVs' interactions with traffic infrastructure and other road users. This work aims to bridge the gap between theoretical research and practical deployment of CAVs, facilitating the development of smarter and safer transportation systems. △ Less

Submitted 25 September, 2024; originally announced September 2024.

arXiv:2409.16440 [pdf, other]

Calibration Measurements of the BICEP3 and BICEP Array CMB Polarimeters from 2017 to 2024

Authors: Christos Giannakopoulos, Clara Vergès, P. A. R. Ade, Zeeshan Ahmed, Mandana Amiri, Denis Barkats, Ritoban Basu Thakur, Colin A. Bischoff, Dominic Beck, James J. Bock, Hans Boenish, Victor Buza, James R. Cheshire IV, Jake Connors, James Cornelison, Michael Crumrine, Ari Jozef Cukierman, Edward Denison, Marion Dierickx, Lionel Duband, Miranda Eiben, Brodi D. Elwood, Sofia Fatigoni, Jeff P. Filippini, Antonio Fortes , et al. (61 additional authors not shown)

Abstract: The BICEP3 and BICEP Array polarimeters are small-aperture refracting telescopes located at the South Pole designed to measure primordial gravitational wave signatures in the Cosmic Microwave Background (CMB) polarization, predicted by inflation. Constraining the inflationary signal requires not only excellent sensitivity, but also careful control of instrumental systematics. Both instruments use… ▽ More The BICEP3 and BICEP Array polarimeters are small-aperture refracting telescopes located at the South Pole designed to measure primordial gravitational wave signatures in the Cosmic Microwave Background (CMB) polarization, predicted by inflation. Constraining the inflationary signal requires not only excellent sensitivity, but also careful control of instrumental systematics. Both instruments use antenna-coupled orthogonally polarized detector pairs, and the polarized sky signal is reconstructed by taking the difference in each detector pair. As a result, the differential response between detectors within a pair becomes an important systematic effect we must control. Additionally, mapping the intensity and polarization response in regions away from the main beam can inform how sidelobe levels affect CMB measurements. Extensive calibration measurements are taken in situ every austral summer for control of instrumental systematics and instrument characterisation. In this work, we detail the set of beam calibration measurements that we conduct on the BICEP receivers, from deep measurements of main beam response to polarized beam response and sidelobe mapping. We discuss the impact of these measurements for instrumental systematics studies and design choices for future CMB receivers. △ Less

Submitted 24 September, 2024; originally announced September 2024.

Comments: 13 pages, 7 figures, 1 table, Proceedings paper SPIE 2024

arXiv:2409.16269 [pdf, other]

Bound-preserving OEDG schemes for Aw-Rascle-Zhang traffic models on networks

Authors: Wei Chen, Shumo Cui, Kailiang Wu, Tao Xiong

Abstract: Physical solutions to the widely used Aw-Rascle-Zhang (ARZ) traffic model and the adapted pressure (AP) ARZ model should satisfy the positivity of density, the minimum and maximum principles with respect to the velocity $v$ and other Riemann invariants. Many numerical schemes suffer from instabilities caused by violating these bounds, and the only existing bound-preserving (BP) numerical scheme (f… ▽ More Physical solutions to the widely used Aw-Rascle-Zhang (ARZ) traffic model and the adapted pressure (AP) ARZ model should satisfy the positivity of density, the minimum and maximum principles with respect to the velocity $v$ and other Riemann invariants. Many numerical schemes suffer from instabilities caused by violating these bounds, and the only existing bound-preserving (BP) numerical scheme (for ARZ model) is random, only first-order accurate, and not strictly conservative. This paper introduces arbitrarily high-order provably BP DG schemes for these two models, preserving all the aforementioned bounds except the maximum principle of $v$, which has been rigorously proven to conflict with the consistency and conservation of numerical schemes. Although the maximum principle of $v$ is not directly enforced, we find that the strictly preserved maximum principle of another Riemann invariant $w$ actually enforces an alternative upper bound on $v$. At the core of this work, analyzing and rigorously proving the BP property is a particularly nontrivial task: the Lax-Friedrichs (LF) splitting property, usually expected for hyperbolic conservation laws and employed to construct BP schemes, does not hold for these two models. To overcome this challenge, we formulate a generalized version of the LF splitting property, and prove it via the geometric quasilinearization (GQL) approach [Kailiang Wu and Chi-Wang Shu, SIAM Review, 65: 1031-1073, 2023]. To suppress spurious oscillations in the DG solutions, we employ the oscillation-eliminating (OE) technique, recently proposed in [Manting Peng, Zheng Sun, and Kailiang Wu, Mathematics of Computation, in press], which is based on the solution operator of a novel damping equation. Several numerical examples are included to demonstrate the effectiveness, accuracy, and BP properties of our schemes, with applications to traffic simulations on road networks. △ Less

Submitted 24 September, 2024; originally announced September 2024.

arXiv:2409.15182 [pdf, other]

Goal-based Neural Physics Vehicle Trajectory Prediction Model

Authors: Rui Gan, Haotian Shi, Pei Li, Keshu Wu, Bocheng An, Linheng Li, Junyi Ma, Chengyuan Ma, Bin Ran

Abstract: Vehicle trajectory prediction plays a vital role in intelligent transportation systems and autonomous driving, as it significantly affects vehicle behavior planning and control, thereby influencing traffic safety and efficiency. Numerous studies have been conducted to predict short-term vehicle trajectories in the immediate future. However, long-term trajectory prediction remains a major challenge… ▽ More Vehicle trajectory prediction plays a vital role in intelligent transportation systems and autonomous driving, as it significantly affects vehicle behavior planning and control, thereby influencing traffic safety and efficiency. Numerous studies have been conducted to predict short-term vehicle trajectories in the immediate future. However, long-term trajectory prediction remains a major challenge due to accumulated errors and uncertainties. Additionally, balancing accuracy with interpretability in the prediction is another challenging issue in predicting vehicle trajectory. To address these challenges, this paper proposes a Goal-based Neural Physics Vehicle Trajectory Prediction Model (GNP). The GNP model simplifies vehicle trajectory prediction into a two-stage process: determining the vehicle's goal and then choosing the appropriate trajectory to reach this goal. The GNP model contains two sub-modules to achieve this process. The first sub-module employs a multi-head attention mechanism to accurately predict goals. The second sub-module integrates a deep learning model with a physics-based social force model to progressively predict the complete trajectory using the generated goals. The GNP demonstrates state-of-the-art long-term prediction accuracy compared to four baseline models. We provide interpretable visualization results to highlight the multi-modality and inherent nature of our neural physics framework. Additionally, ablation studies are performed to validate the effectiveness of our key designs. △ Less

Submitted 25 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

arXiv:2409.13896 [pdf, ps, other]

doi 10.1145/3689788

Semantic-Type-Guided Bug Finding

Authors: Kelvin Qian, Scott Smith, Brandon Stride, Shiwei Weng, Ke Wu

Abstract: In recent years, there has been an increased interest in tools that establish \emph{incorrectness} rather than correctness of program properties. In this work we build on this approach by developing a novel methodology to prove incorrectness of \emph{semantic typing} properties of functional programs, extending the incorrectness approach to the model theory of functional program typing. We define… ▽ More In recent years, there has been an increased interest in tools that establish \emph{incorrectness} rather than correctness of program properties. In this work we build on this approach by developing a novel methodology to prove incorrectness of \emph{semantic typing} properties of functional programs, extending the incorrectness approach to the model theory of functional program typing. We define a semantic type refuter which refutes semantic typings for a simple functional language. We prove our refuter is co-recursively enumerable, and that it is sound and complete with respect to a semantic typing notion. An initial implementation is described which uses symbolic evaluation to efficiently find type errors over a functional language with a rich type system. △ Less

Submitted 20 September, 2024; originally announced September 2024.

arXiv:2409.13638 [pdf, other]

On-chip pulse shaping of entangled photons

Authors: Kaiyi Wu, Lucas M. Cohen, Karthik V. Myilswamy, Navin B. Lingaraju, Hsuan-Hao Lu, Joseph M. Lukens, Andrew M. Weiner

Abstract: We demonstrate spectral shaping of entangled photons with a six-channel microring-resonator-based silicon photonic pulse shaper. Through precise calibration of thermal phase shifters in a microresonator-based pulse shaper, we demonstrate line-by-line phase control on a 3~GHz grid for two frequency-bin-entangled qudits, corresponding to Hilbert spaces of up to $6\times 6$ ($3\times 3$) dimensions f… ▽ More We demonstrate spectral shaping of entangled photons with a six-channel microring-resonator-based silicon photonic pulse shaper. Through precise calibration of thermal phase shifters in a microresonator-based pulse shaper, we demonstrate line-by-line phase control on a 3~GHz grid for two frequency-bin-entangled qudits, corresponding to Hilbert spaces of up to $6\times 6$ ($3\times 3$) dimensions for shared (independent) signal-idler filters. The pulse shaper's fine spectral resolution enables control of nanosecond-scale temporal features, which are observed by direct coincidence detection of biphoton correlation functions that show excellent agreement with theory. This work marks, to our knowledge, the first demonstration of biphoton pulse shaping using an integrated spectral shaper and holds significant promise for applications in quantum information processing. △ Less

Submitted 20 September, 2024; originally announced September 2024.

arXiv:2409.12514 [pdf, other]

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation

Authors: Junjie Wen, Yichen Zhu, Jinming Li, Minjie Zhu, Kun Wu, Zhiyuan Xu, Ning Liu, Ran Cheng, Chaomin Shen, Yaxin Peng, Feifei Feng, Jian Tang

Abstract: Vision-Language-Action (VLA) models have shown remarkable potential in visuomotor control and instruction comprehension through end-to-end learning processes. However, current VLA models face significant challenges: they are slow during inference and require extensive pre-training on large amounts of robotic data, making real-world deployment difficult. In this paper, we introduce a new family of… ▽ More Vision-Language-Action (VLA) models have shown remarkable potential in visuomotor control and instruction comprehension through end-to-end learning processes. However, current VLA models face significant challenges: they are slow during inference and require extensive pre-training on large amounts of robotic data, making real-world deployment difficult. In this paper, we introduce a new family of compact vision-language-action models, called TinyVLA, which offers two key advantages over existing VLA models: (1) faster inference speeds, and (2) improved data efficiency, eliminating the need for pre-training stage. Our framework incorporates two essential components to build TinyVLA: (1) initializing the policy backbone with robust, high-speed multimodal models, and (2) integrating a diffusion policy decoder during fine-tuning to enable precise robot actions. We conducted extensive evaluations of TinyVLA in both simulation and on real robots, demonstrating that our approach significantly outperforms the state-of-the-art VLA model, OpenVLA, in terms of speed and data efficiency, while delivering comparable or superior performance. Additionally, TinyVLA exhibits strong generalization capabilities across various dimensions, including language instructions, novel objects, unseen positions, changes in object appearance, background variations, and environmental shifts, often matching or exceeding the performance of OpenVLA. We believe that \methodname offers an interesting perspective on utilizing pre-trained multimodal models for policy learning. Our project is at https://tiny-vla.github.io. △ Less

Submitted 13 May, 2025; v1 submitted 19 September, 2024; originally announced September 2024.

Comments: add more citations

arXiv:2409.12513 [pdf, ps, other]

Hölder regularity of solutions of the steady Boltzmann equation with soft potentials

Authors: Kung-Chien Wu, Kuan-Hsiang Wang

Abstract: We consider the Hölder regularity of solutions to the steady Boltzmann equation with in-flow boundary condition in bounded and strictly convex domains $Ω\subset\mathbb{R}^{3}$ for gases with cutoff soft potential $(-3<γ<0)$. We prove that there is a unique solution with a bounded $L^{\infty}$ norm in space and velocity. This solution is Hölder continuous, and it's order depends not only on the reg… ▽ More We consider the Hölder regularity of solutions to the steady Boltzmann equation with in-flow boundary condition in bounded and strictly convex domains $Ω\subset\mathbb{R}^{3}$ for gases with cutoff soft potential $(-3<γ<0)$. We prove that there is a unique solution with a bounded $L^{\infty}$ norm in space and velocity. This solution is Hölder continuous, and it's order depends not only on the regularity of the incoming boundary data, but also on the potential power $γ$. The result for modulated soft potential case $-2<γ<0$ is similar to hard potential case $(0\leqγ<1)$ since we have $C^{1}$ velocity regularity from collision part. However, we observe that for very soft potential case $(-3<γ\leq -2)$, the regularity in velocity obtained by the collision part is lower (Hölder only), but the boundary regularity still can transfer to solution (in both space and velocity) by transport and collision part under the restriction of $γ$. △ Less

Submitted 26 September, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

arXiv:2409.11676 [pdf, other]

Hypergraph-based Motion Generation with Multi-modal Interaction Relational Reasoning

Authors: Keshu Wu, Yang Zhou, Haotian Shi, Dominique Lord, Bin Ran, Xinyue Ye

Abstract: The intricate nature of real-world driving environments, characterized by dynamic and diverse interactions among multiple vehicles and their possible future states, presents considerable challenges in accurately predicting the motion states of vehicles and handling the uncertainty inherent in the predictions. Addressing these challenges requires comprehensive modeling and reasoning to capture the… ▽ More The intricate nature of real-world driving environments, characterized by dynamic and diverse interactions among multiple vehicles and their possible future states, presents considerable challenges in accurately predicting the motion states of vehicles and handling the uncertainty inherent in the predictions. Addressing these challenges requires comprehensive modeling and reasoning to capture the implicit relations among vehicles and the corresponding diverse behaviors. This research introduces an integrated framework for autonomous vehicles (AVs) motion prediction to address these complexities, utilizing a novel Relational Hypergraph Interaction-informed Neural mOtion generator (RHINO). RHINO leverages hypergraph-based relational reasoning by integrating a multi-scale hypergraph neural network to model group-wise interactions among multiple vehicles and their multi-modal driving behaviors, thereby enhancing motion prediction accuracy and reliability. Experimental validation using real-world datasets demonstrates the superior performance of this framework in improving predictive accuracy and fostering socially aware automated driving in dynamic traffic scenarios. △ Less

Submitted 17 September, 2024; originally announced September 2024.

arXiv:2409.10872 [pdf, other]

High-order Accurate Entropy Stable Schemes for Relativistic Hydrodynamics with General Synge-type Equation of State

Authors: Linfeng Xu, Shengrong Ding, Kailiang Wu

Abstract: All the existing entropy stable (ES) schemes for relativistic hydrodynamics (RHD) in the literature were restricted to the ideal equation of state (EOS), which however is often a poor approximation for most relativistic flows due to its inconsistency with the relativistic kinetic theory. This paper develops high-order ES finite difference schemes for RHD with general Synge-type EOS, which encompas… ▽ More All the existing entropy stable (ES) schemes for relativistic hydrodynamics (RHD) in the literature were restricted to the ideal equation of state (EOS), which however is often a poor approximation for most relativistic flows due to its inconsistency with the relativistic kinetic theory. This paper develops high-order ES finite difference schemes for RHD with general Synge-type EOS, which encompasses a range of special EOSs. We first establish an entropy pair for the RHD equations with general Synge-type EOS in any space dimensions. We rigorously prove that the found entropy function is strictly convex and derive the associated entropy variables, laying the foundation for designing entropy conservative (EC) and ES schemes. Due to relativistic effects, one cannot explicitly express primitive variables, fluxes, and entropy variables in terms of conservative variables. Consequently, this highly complicates the analysis of the entropy structure of the RHD equations, the investigation of entropy convexity, and the construction of EC numerical fluxes. By using a suitable set of parameter variables, we construct novel two-point EC fluxes in a unified form for general Synge-type EOS. We obtain high-order EC schemes through linear combinations of the two-point EC fluxes. Arbitrarily high-order accurate ES schemes are achieved by incorporating dissipation terms into the EC schemes, based on (weighted) essentially non-oscillatory reconstructions. Additionally, we derive the general dissipation matrix for general Synge-type EOS based on the scaled eigenvectors of the RHD system. We also define a suitable average of the dissipation matrix at the cell interfaces to ensure that the resulting ES schemes can resolve stationary contact discontinuities accurately. Several numerical examples are provided to validate the accuracy and effectiveness of our schemes for RHD with four special EOSs. △ Less

Submitted 16 September, 2024; originally announced September 2024.

Comments: 49 pages, 20 figures

arXiv:2409.10871 [pdf, other]

Spectral Volume from a DG perspective: Oscillation Elimination, Stability, and Optimal Error Estimates

Authors: Zhuoyun Li, Kailiang Wu

Abstract: The discontinuous Galerkin (DG) method and the spectral volume (SV) method are two widely-used numerical methodologies for solving hyperbolic conservation laws. In this paper, we demonstrate that under specific subdivision assumptions, the SV method can be represented in a DG form with a different inner product. Building on this insight, we extend the oscillation-eliminating (OE) technique, recent… ▽ More The discontinuous Galerkin (DG) method and the spectral volume (SV) method are two widely-used numerical methodologies for solving hyperbolic conservation laws. In this paper, we demonstrate that under specific subdivision assumptions, the SV method can be represented in a DG form with a different inner product. Building on this insight, we extend the oscillation-eliminating (OE) technique, recently proposed in [M. Peng, Z. Sun, and K. Wu, {\it Mathematics of Computation}, https://doi.org/10.1090/mcom/3998], to develop a new fully-discrete OESV method. The OE technique is non-intrusive, efficient, and straightforward to implement, acting as a simple post-processing filter to effectively suppress spurious oscillations. From a DG perspective, we present a comprehensive framework to theoretically analyze the stability and accuracy of both general Runge-Kutta SV (RKSV) schemes and the novel OESV method. For the linear advection equation, we conduct an energy analysis of the fully-discrete RKSV method, identifying an upwind condition crucial for stability. Furthermore, we establish optimal error estimates for the OESV schemes, overcoming nonlinear challenges through error decomposition and treating the OE procedure as additional source terms in the RKSV schemes. Extensive numerical experiments validate our theoretical findings and demonstrate the effectiveness and robustness of the proposed OESV method. This work enhances the theoretical understanding and practical application of SV schemes for hyperbolic conservation laws, making the OESV method a promising approach for high-resolution simulations. △ Less

Submitted 16 September, 2024; originally announced September 2024.

Comments: 32 pages, 8 figures

arXiv:2409.09708 [pdf, other]

ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration

Authors: Ning-Chi Huang, Chi-Chih Chang, Wei-Cheng Lin, Endri Taka, Diana Marculescu, Kai-Chiang Wu

Abstract: $N{:}M$ sparsity is an emerging model compression method supported by more and more accelerators to speed up sparse matrix multiplication in deep neural networks. Most existing $N{:}M… ▽ More $N{:}M$ sparsity is an emerging model compression method supported by more and more accelerators to speed up sparse matrix multiplication in deep neural networks. Most existing $N{:}M$ sparsity methods compress neural networks with a uniform setting for all layers in a network or heuristically determine the layer-wise configuration by considering the number of parameters in each layer. However, very few methods have been designed for obtaining a layer-wise customized $N{:}M$ sparse configuration for vision transformers (ViTs), which usually consist of transformer blocks involving the same number of parameters. In this work, to address the challenge of selecting suitable sparse configuration for ViTs on $N{:}M$ sparsity-supporting accelerators, we propose ELSA, Exploiting Layer-wise $N{:}M$ Sparsity for ViTs. Considering not only all $N{:}M$ sparsity levels supported by a given accelerator but also the expected throughput improvement, our methodology can reap the benefits of accelerators supporting mixed sparsity by trading off negligible accuracy loss with both memory usage and inference time reduction for ViT models. For instance, our approach achieves a noteworthy 2.9$\times$ reduction in FLOPs for both Swin-B and DeiT-B with only a marginal degradation of accuracy on ImageNet. Our code will be released upon paper acceptance. △ Less

Submitted 15 September, 2024; originally announced September 2024.

arXiv:2409.09632 [pdf, other]

High-Order Oscillation-Eliminating Hermite WENO Method for Hyperbolic Conservation Laws

Authors: Chuan Fan, Kailiang Wu

Abstract: This paper proposes high-order accurate, oscillation-eliminating Hermite weighted essentially non-oscillatory (OE-HWENO) finite volume schemes for hyperbolic conservation laws. The OE-HWENO schemes apply an OE procedure after each Runge--Kutta stage, dampening the first-order moments of the HWENO solution to suppress spurious oscillations without any problem-dependent parameters. This OE procedure… ▽ More This paper proposes high-order accurate, oscillation-eliminating Hermite weighted essentially non-oscillatory (OE-HWENO) finite volume schemes for hyperbolic conservation laws. The OE-HWENO schemes apply an OE procedure after each Runge--Kutta stage, dampening the first-order moments of the HWENO solution to suppress spurious oscillations without any problem-dependent parameters. This OE procedure acts as a filter, derived from the solution operator of a novel damping equation, solved exactly without discretization. As a result, the OE-HWENO method remains stable with a normal CFL number, even for strong shocks producing highly stiff damping terms. To ensure the method's non-oscillatory property across varying scales and wave speeds, we design a scale- and evolution-invariant damping equation and propose a dimensionless transformation for HWENO reconstruction. The OE-HWENO method offers several advantages over existing HWENO methods: the OE procedure is efficient and easy to implement, requiring only simple multiplication of first-order moments; it preserves high-order accuracy, local compactness, and spectral properties. The non-intrusive OE procedure can be integrated seamlessly into existing HWENO codes. Finally, we analyze the bound-preserving (BP) property using optimal cell average decomposition, relaxing the BP time step-size constraint and reducing decomposition points, improving efficiency. Extensive benchmarks validate the method's accuracy, efficiency, resolution, and robustness. △ Less

Submitted 15 September, 2024; originally announced September 2024.

Comments: 54 pages, 13 figures

arXiv:2409.09620 [pdf, other]

Robust DG Schemes on Unstructured Triangular Meshes: Oscillation Elimination and Bound Preservation via Optimal Convex Decomposition

Authors: Shengrong Ding, Shumo Cui, Kailiang Wu

Abstract: Discontinuous Galerkin (DG) schemes on unstructured meshes offer the advantages of compactness and the ability to handle complex computational domains. However, their robustness and reliability in solving hyperbolic conservation laws depend on two critical abilities: suppressing spurious oscillations and preserving intrinsic bounds or constraints. This paper introduces two significant advancements… ▽ More Discontinuous Galerkin (DG) schemes on unstructured meshes offer the advantages of compactness and the ability to handle complex computational domains. However, their robustness and reliability in solving hyperbolic conservation laws depend on two critical abilities: suppressing spurious oscillations and preserving intrinsic bounds or constraints. This paper introduces two significant advancements in enhancing the robustness and efficiency of DG methods on unstructured meshes for general hyperbolic conservation laws, while maintaining their accuracy and compactness. First, we investigate the oscillation-eliminating (OE) DG methods on unstructured meshes. These methods not only maintain key features such as conservation, scale invariance, and evolution invariance but also achieve rotation invariance through a novel rotation-invariant OE (RIOE) procedure. Second, we propose, for the first time, the optimal convex decomposition for designing efficient bound-preserving (BP) DG schemes on unstructured meshes. Finding the optimal convex decomposition that maximizes the BP CFL number is an important yet challenging problem.While this challenge was addressed for rectangular meshes, it remains an open problem for triangular meshes. This paper successfully constructs the optimal convex decomposition for the widely used $P^1$ and $P^2$ spaces on triangular cells, significantly improving the efficiency of BP DG methods.The maximum BP CFL numbers are increased by 100%--200% for $P^1$ and 280.38%--350% for $P^2$, compared to classic decomposition. Furthermore, our RIOE procedure and optimal decomposition technique can be integrated into existing DG codes with little and localized modifications. These techniques require only edge-neighboring cell information, thereby retaining the compactness and high parallel efficiency of original DG methods. △ Less

Submitted 15 September, 2024; originally announced September 2024.

Comments: 48 pages, 22 figures

arXiv:2409.09600 [pdf, other]

High-order accurate structure-preserving finite volume schemes on adaptive moving meshes for shallow water equations: Well-balancedness and positivity

Authors: Zhihao Zhang, Huazhong Tang, Kailiang Wu

Abstract: This paper develops high-order accurate, well-balanced (WB), and positivity-preserving (PP) finite volume schemes for shallow water equations on adaptive moving structured meshes. The mesh movement poses new challenges in maintaining the WB property, which not only depends on the balance between flux gradients and source terms but is also affected by the mesh movement. To address these complexitie… ▽ More This paper develops high-order accurate, well-balanced (WB), and positivity-preserving (PP) finite volume schemes for shallow water equations on adaptive moving structured meshes. The mesh movement poses new challenges in maintaining the WB property, which not only depends on the balance between flux gradients and source terms but is also affected by the mesh movement. To address these complexities, the WB property in curvilinear coordinates is decomposed into flux source balance and mesh movement balance. The flux source balance is achieved by suitable decomposition of the source terms, the numerical fluxes based on hydrostatic reconstruction, and appropriate discretization of the geometric conservation laws (GCLs). Concurrently, the mesh movement balance is maintained by integrating additional schemes to update the bottom topography during mesh adjustments. The proposed schemes are rigorously proven to maintain the WB property by using the discrete GCLs and these two balances. We provide rigorous analyses of the PP property under a sufficient condition enforced by a PP limiter. Due to the involvement of mesh metrics and movement, the analyses are nontrivial, while some standard techniques, such as splitting high-order schemes into convex combinations of formally first-order PP schemes, are not directly applicable. Various numerical examples validate the high-order accuracy, high efficiency, WB, and PP properties of the proposed schemes. △ Less

Submitted 14 September, 2024; originally announced September 2024.

Comments: 50 pages, 13 figures, 3 tables

arXiv:2409.09572 [pdf, other]

A Novel Aerial-Aquatic Locomotion Robot with Variable Stiffness Propulsion Module

Authors: Junzhe Hu, Pengyu Chen, Tianxiang Feng, Yuxuan Wen, Ke Wu, Janet Dong

Abstract: In recent years, the development of robots capable of operating in both aerial and aquatic environments has gained significant attention. This study presents the design and fabrication of a novel aerial-aquatic locomotion robot (AALR). Inspired by the diving beetle, the AALR incorporates a biomimetic propulsion mechanism with power and recovery strokes. The variable stiffness propulsion module (VS… ▽ More In recent years, the development of robots capable of operating in both aerial and aquatic environments has gained significant attention. This study presents the design and fabrication of a novel aerial-aquatic locomotion robot (AALR). Inspired by the diving beetle, the AALR incorporates a biomimetic propulsion mechanism with power and recovery strokes. The variable stiffness propulsion module (VSPM) uses low melting point alloy (LMPA) and variable stiffness joints (VSJ) to achieve efficient aquatic locomotion while reduce harm to marine life. The AALR's innovative design integrates the VSPM into the arms of a traditional quadrotor, allowing for effective aerial-aquatic locomotion. The VSPM adjusts joint stiffness through temperature control, meeting locomotion requirements in both aerial and aquatic modes. A dynamic model for the VSPM was developed, with optimized dimensional parameters to increase propulsion force. Experiments focused on aquatic mode analysis and demonstrated the AALR's swimming capability, achieving a maximum swimming speed of 77 mm/s underwater. The results confirm the AALR's effective performance in water environment, highlighting its potential for versatile, eco-friendly operations. △ Less

Submitted 14 September, 2024; originally announced September 2024.

Comments: 8 pages, 10 figures, ICRA

arXiv:2409.07163 [pdf, ps, other]

Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models

Authors: Jiahang Cao, Qiang Zhang, Jingkai Sun, Jiaxu Wang, Hao Cheng, Yulin Li, Jun Ma, Kun Wu, Zhiyuan Xu, Yecheng Shao, Wen Zhao, Gang Han, Yijie Guo, Renjing Xu

Abstract: Diffusion models have been widely employed in the field of 3D manipulation due to their efficient capability to learn distributions, allowing for precise prediction of action trajectories. However, diffusion models typically rely on large parameter UNet backbones as policy networks, which can be challenging to deploy on resource-constrained devices. Recently, the Mamba model has emerged as a promi… ▽ More Diffusion models have been widely employed in the field of 3D manipulation due to their efficient capability to learn distributions, allowing for precise prediction of action trajectories. However, diffusion models typically rely on large parameter UNet backbones as policy networks, which can be challenging to deploy on resource-constrained devices. Recently, the Mamba model has emerged as a promising solution for efficient modeling, offering low computational complexity and strong performance in sequence modeling. In this work, we propose the Mamba Policy, a lighter but stronger policy that reduces the parameter count by over 80% compared to the original policy network while achieving superior performance. Specifically, we introduce the XMamba Block, which effectively integrates input information with conditional features and leverages a combination of Mamba and Attention mechanisms for deep feature extraction. Extensive experiments demonstrate that the Mamba Policy excels on the Adroit, Dexart, and MetaWorld datasets, requiring significantly fewer computational resources. Additionally, we highlight the Mamba Policy's enhanced robustness in long-horizon scenarios compared to baseline methods and explore the performance of various Mamba variants within the Mamba Policy framework. Real-world experiments are also conducted to further validate its effectiveness. Our open-source project page can be found at https://andycao1125.github.io/mamba_policy/. △ Less

Submitted 25 June, 2025; v1 submitted 11 September, 2024; originally announced September 2024.

Comments: Accepted to IROS 2025

arXiv:2409.02704 [pdf, other]

doi 10.1140/epja/s10050-025-01487-8

Calculation of The Abundance of $^{187}$Re-$^{187}$Os Nuclear Clock Nuclides in S-process and Sensitivity Analysis of Maxwellian-Averaged Neutron Capture Cross Sections

Authors: Xinyu Dong, Yixuan Qiu, Kaisu Wu

Abstract: In this paper, the network equations calculation of $^{187}$Re-$^{187}$Os clock-related nuclide abundance in s-process is studied, and the sensitivities of Maxwellian-Averaged neutron capture cross sections for each nuclide are analyzed in detail. Firstly, basing nuclear physical parameters, we give the branching s-process reaction network from $^{184}$W to $^{190}$Os, and establish the correspond… ▽ More In this paper, the network equations calculation of $^{187}$Re-$^{187}$Os clock-related nuclide abundance in s-process is studied, and the sensitivities of Maxwellian-Averaged neutron capture cross sections for each nuclide are analyzed in detail. Firstly, basing nuclear physical parameters, we give the branching s-process reaction network from $^{184}$W to $^{190}$Os, and establish the corresponding network equations. Using a single path s-process approximation, we obtain an analytical expression of the seed nuclide $^{183}$W abundance of our branching network. Because of the stiffness of the system of network equations, we use the semi-implicit Runge-Kutta method to give the numerical solution of the network equations, and thus obtain the abundance of each nuclide related to the $^{187}$Re-$^{187}$Os nuclear clock in the s-process. Finally, with the numerical solution, the sensitivity analysis of the Maxwellian-Averaged neutron capture cross sections of the nuclear reaction involved in the $^{187}$Re-$^{187}$Os nuclear clock network equations is carried out. Therefore, we find that in s-process, the neutron capture reaction $^{184}$W + n $\to$ $^{185}$W has the greatest influence on the $^{187}$Re-$^{187}$Os nuclear clock reaction network, and the neutron capture reaction $^{186}$W + n $\to$ $^{187}$W has the greatest effect on the particular nuclides $^{187}$Re and $^{187}$Os. So the measurements of these two Maxwellian-Averaged neutron capture cross sections deserve the attention of experimental nuclear physicists. △ Less

Submitted 4 September, 2024; originally announced September 2024.

MSC Class: 02.30.Hq; 02.60.Cb; 25.40.Lw; 25.40.Hs; 29.85.Fj

arXiv:2409.02430 [pdf, other]

Transfer-based Adversarial Poisoning Attacks for Online (MIMO-)Deep Receviers

Authors: Kunze Wu, Weiheng Jiang, Dusit Niyato, Yinghuan Li, Chuang Luo

Abstract: Recently, the design of wireless receivers using deep neural networks (DNNs), known as deep receivers, has attracted extensive attention for ensuring reliable communication in complex channel environments. To adapt quickly to dynamic channels, online learning has been adopted to update the weights of deep receivers with over-the-air data (e.g., pilots). However, the fragility of neural models and… ▽ More Recently, the design of wireless receivers using deep neural networks (DNNs), known as deep receivers, has attracted extensive attention for ensuring reliable communication in complex channel environments. To adapt quickly to dynamic channels, online learning has been adopted to update the weights of deep receivers with over-the-air data (e.g., pilots). However, the fragility of neural models and the openness of wireless channels expose these systems to malicious attacks. To this end, understanding these attack methods is essential for robust receiver design. In this paper, we propose a transfer-based adversarial poisoning attack method for online receivers. Without knowledge of the attack target, adversarial perturbations are injected to the pilots, poisoning the online deep receiver and impairing its ability to adapt to dynamic channels and nonlinear effects. In particular, our attack method targets Deep Soft Interference Cancellation (DeepSIC)[1] using online meta-learning. As a classical model-driven deep receiver, DeepSIC incorporates wireless domain knowledge into its architecture. This integration allows it to adapt efficiently to time-varying channels with only a small number of pilots, achieving optimal performance in a multi-input and multi-output (MIMO) scenario. The deep receiver in this scenario has a number of applications in the field of wireless communication, which motivates our study of the attack methods targeting it. Specifically, we demonstrate the effectiveness of our attack in simulations on synthetic linear, synthetic nonlinear, static, and COST 2100 channels. Simulation results indicate that the proposed poisoning attack significantly reduces the performance of online receivers in rapidly changing scenarios. △ Less

Submitted 23 September, 2024; v1 submitted 4 September, 2024; originally announced September 2024.

Comments: 15 pages, 14 figures

arXiv:2409.02296 [pdf, other]

Development of the 220/270 GHz Receiver of BICEP Array

Authors: The BICEP/Keck Collaboration, :, Y. Nakato, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, D. Beck, J. J. Bock, V. Buza, B. Cantrall, J. R. Cheshire IV, J. Cornelison, M. Crumrine, A. J. Cukierman, E. Denison, M. Dierickx, L. Duband, M. Eiben, B. D. Elwood, S. Fatigoni, J. P. Filippini, A. Fortes , et al. (61 additional authors not shown)

Abstract: Measurements of B-mode polarization in the CMB sourced from primordial gravitational waves would provide information on the energy scale of inflation and its potential form. To achieve these goals, one must carefully characterize the Galactic foregrounds, which can be distinguished from the CMB by conducting measurements at multiple frequencies. BICEP Array is the latest-generation multi-frequency… ▽ More Measurements of B-mode polarization in the CMB sourced from primordial gravitational waves would provide information on the energy scale of inflation and its potential form. To achieve these goals, one must carefully characterize the Galactic foregrounds, which can be distinguished from the CMB by conducting measurements at multiple frequencies. BICEP Array is the latest-generation multi-frequency instrument of the BICEP/Keck program, which specifically targets degree-scale primordial B-modes in the CMB. In its final configuration, this telescope will consist of four small-aperture receivers, spanning frequency bands from 30 to 270 GHz. The 220/270 GHz receiver designed to characterize Galactic dust is currently undergoing commissioning at Stanford University and is scheduled to deploy to the South Pole during the 2024--2025 austral summer. Here, we will provide an overview of this high-frequency receiver and discuss the integration status and test results as it is being commissioned. △ Less

Submitted 3 September, 2024; originally announced September 2024.

arXiv:2409.00996 [pdf, other]

Influence of planets on debris discs in star clusters -- II. The impact of stellar density

Authors: Kai Wu, M. B. N. Kouwenhoven, Francesco Flammini Dotti, Rainer Spurzem

Abstract: We present numerical simulations of planetary systems in star clusters with different initial stellar densities, to investigate the impact of the density on debris disc dynamics. We use LPS+ to combine N-body codes NBODY6++GPU and REBOUND for simulations. We simulate debris discs with and without a Jupiter-mass planet at 50 au, in star clusters with N = 1k - 64k stars. The spatial range of the rem… ▽ More We present numerical simulations of planetary systems in star clusters with different initial stellar densities, to investigate the impact of the density on debris disc dynamics. We use LPS+ to combine N-body codes NBODY6++GPU and REBOUND for simulations. We simulate debris discs with and without a Jupiter-mass planet at 50 au, in star clusters with N = 1k - 64k stars. The spatial range of the remaining planetary systems decreases with increasing N. As cluster density increases, the planet's influence range first increases and then decreases. For debris particles escaping from planetary systems, the probability of their direct ejection from the star cluster decreases as their initial semi-major axis (a0) or the cluster density increases. The eccentricity and inclination of surviving particles increase as cluster density increases. The presence of a planet leads to lower eccentricities and inclinations of surviving particles. The radial density distribution of the remaining discs decays exponentially in sparse clusters. We derive a general expression of the gravitational encounter rate. Our results are unable to directly explain the scarcity of debris discs in star clusters. Nevertheless, given that many planetary systems have multiple planets, the mechanism of the planet-cluster combined gravitational influence on the disc remains appealing as a potential explanation. △ Less

Submitted 2 September, 2024; originally announced September 2024.

Comments: 15 pages, 15 figures, and 4 tables. Accepted for publication in MNRAS

Showing 201–250 of 1,591 results for author: Wu, K