-
The Massive and Quiescent Elliptical Host Galaxy of the Repeating Fast Radio Burst FRB20240209A
Authors:
T. Eftekhari,
Y. Dong,
W. Fong,
V. Shah,
S. Simha,
B. C. Andersen,
S. Andrew,
M. Bhardwaj,
T. Cassanelli,
S. Chatterjee,
D. A. Coulter,
E. Fonseca,
B. M. Gaensler,
A. C. Gordon,
J. W. T. Hessels,
A. L. Ibik,
R. C. Joseph,
L. A. Kahinga,
V. Kaspi,
B. Kharel,
C. D. Kilpatrick,
A. E. Lanman,
M. Lazda,
C. Leung,
C. Liu
, et al. (17 additional authors not shown)
Abstract:
The discovery and localization of FRB20240209A by the Canadian Hydrogen Intensity Mapping Fast Radio Burst (CHIME/FRB) experiment marks the first repeating FRB localized with the CHIME/FRB Outriggers and adds to the small sample of repeating FRBs with associated host galaxies. Here we present Keck and Gemini observations of the host that reveal a redshift $z=0.1384\pm0.0004$. We perform stellar po…
▽ More
The discovery and localization of FRB20240209A by the Canadian Hydrogen Intensity Mapping Fast Radio Burst (CHIME/FRB) experiment marks the first repeating FRB localized with the CHIME/FRB Outriggers and adds to the small sample of repeating FRBs with associated host galaxies. Here we present Keck and Gemini observations of the host that reveal a redshift $z=0.1384\pm0.0004$. We perform stellar population modeling to jointly fit the optical through mid-infrared data of the host and infer a median stellar mass log$(M_*/{\rm M_{\odot}})=11.34\pm0.01$ and a mass-weighted stellar population age $\sim11$Gyr, corresponding to the most massive and oldest FRB host discovered to date. Coupled with a star formation rate $<0.36\,{\rm M_{\odot}\ yr^{-1}}$, the specific star formation rate $<10^{-11.8}\rm\ yr^{-1}$ classifies the host as quiescent. Through surface brightness profile modeling, we determine an elliptical galaxy morphology, marking the host as the first confirmed elliptical FRB host. The discovery of a quiescent early-type host galaxy within a transient class predominantly characterized by late-type star-forming hosts is reminiscent of short-duration gamma-ray bursts, Type Ia supernovae, and ultraluminous X-ray sources. Based on these shared host demographics, coupled with a large offset as demonstrated in our companion paper, we conclude that preferred progenitors for FRB20240209A include magnetars formed through merging binary neutron stars/white dwarfs or the accretion-induced collapse of a white dwarf, or a luminous X-ray binary. Together with FRB20200120E localized to a globular cluster in M81, our findings provide strong evidence that some fraction of FRBs may arise from a process distinct from the core collapse of massive stars.
△ Less
Submitted 30 October, 2024;
originally announced October 2024.
-
SoK: Prompt Hacking of Large Language Models
Authors:
Baha Rababah,
Shang,
Wu,
Matthew Kwiatkowski,
Carson Leung,
Cuneyt Gurcan Akcora
Abstract:
The safety and robustness of large language models (LLMs) based applications remain critical challenges in artificial intelligence. Among the key threats to these applications are prompt hacking attacks, which can significantly undermine the security and reliability of LLM-based systems. In this work, we offer a comprehensive and systematic overview of three distinct types of prompt hacking: jailb…
▽ More
The safety and robustness of large language models (LLMs) based applications remain critical challenges in artificial intelligence. Among the key threats to these applications are prompt hacking attacks, which can significantly undermine the security and reliability of LLM-based systems. In this work, we offer a comprehensive and systematic overview of three distinct types of prompt hacking: jailbreaking, leaking, and injection, addressing the nuances that differentiate them despite their overlapping characteristics. To enhance the evaluation of LLM-based applications, we propose a novel framework that categorizes LLM responses into five distinct classes, moving beyond the traditional binary classification. This approach provides more granular insights into the AI's behavior, improving diagnostic precision and enabling more targeted enhancements to the system's safety and robustness.
△ Less
Submitted 15 October, 2024;
originally announced October 2024.
-
Symmetry in Deformation quantization and Geometric quantization
Authors:
Naichung Conan Leung,
Qin Li,
Ziming Nikolas Ma
Abstract:
In this paper, we explore the quantization of Kähler manifolds, focusing on the relationship between deformation quantization and geometric quantization. We provide a classification of degree 1 formal quantizable functions in the Berezin-Toeplitz deformation quantization, establishing that these formal functions are of the form $f = f_0 - \frac{\hbar}{4π}(Δf_0 + c)$ for a certain smooth (non-forma…
▽ More
In this paper, we explore the quantization of Kähler manifolds, focusing on the relationship between deformation quantization and geometric quantization. We provide a classification of degree 1 formal quantizable functions in the Berezin-Toeplitz deformation quantization, establishing that these formal functions are of the form $f = f_0 - \frac{\hbar}{4π}(Δf_0 + c)$ for a certain smooth (non-formal) function $f_0$. If $f_0$ is real-valued then $f_0$ corresponds to a Hamiltonian Killing vector field. In the presence of Hamiltonian $G$-symmetry, we address the compatibility between the infinitesimal symmetry for deformation quantization via quantum moment map and infinitesimal symmetry on geometric quantization acting on Hilbert spaces of holomorphic sections via Berezin-Toeplitz quantization.
△ Less
Submitted 15 October, 2024;
originally announced October 2024.
-
Tuning the mechanical behaviour of additively manufactured metamaterials with twinning and meta-harmonics
Authors:
David McArthur,
PJ Tan,
Chu Lun Alex Leung
Abstract:
Body-Centred Cubic (BCC) lattices with twinned meta-crystal architecture inspired by the strengthening of bulk metals have significantly improved mechanical performance; however, their deformation behaviour and underlying strengthening mechanisms remain unclear. Here, we reveal that twinning causes a transition from bending to stretch-dominated behaviour in BCC lattices, eliciting vast improvement…
▽ More
Body-Centred Cubic (BCC) lattices with twinned meta-crystal architecture inspired by the strengthening of bulk metals have significantly improved mechanical performance; however, their deformation behaviour and underlying strengthening mechanisms remain unclear. Here, we reveal that twinning causes a transition from bending to stretch-dominated behaviour in BCC lattices, eliciting vast improvements in stiffness (+162%) and strength (+95%) without changing nodal connectivity. We designed meta-harmonic lattices by controlling a heterogenous distribution of twinned grain boundaries, inspired by bimodal harmonic microstructure, and we amplified the axial strain energy at location specific sites to further enhance the stiffness of BCC lattices by 206%. Our lattice design philosophy unleashes the potential of cellular materials for high-performance engineering applications.
△ Less
Submitted 5 March, 2025; v1 submitted 10 October, 2024;
originally announced October 2024.
-
Investigating the sightline of a highly scattered FRB through a filamentary structure in the local Universe
Authors:
Kaitlyn Shin,
Calvin Leung,
Sunil Simha,
Bridget C. Andersen,
Emmanuel Fonseca,
Kenzie Nimmo,
Mohit Bhardwaj,
Charanjot Brar,
Shami Chatterjee,
Amanda M. Cook,
B. M. Gaensler,
Ronniy C. Joseph,
Dylan Jow,
Jane Kaczmarek,
Lordrick Kahinga,
Victoria M. Kaspi,
Bikash Kharel,
Adam E. Lanman,
Mattias Lazda,
Robert A. Main,
Lluis Mas-Ribas,
Kiyoshi W. Masui,
Juan Mena-Parra,
Daniele Michilli,
Ayush Pandhi
, et al. (9 additional authors not shown)
Abstract:
Fast radio bursts (FRBs) are unique probes of extragalactic ionized baryonic structure as each signal, through its burst properties, holds information about the ionized matter it encounters along its sightline. FRB 20200723B is a burst with a scattering timescale of $τ_\mathrm{400\,MHz} >$1 second at 400 MHz and a dispersion measure of DM $\sim$ 244 pc cm$^{-3}$. Observed across the entire CHIME/F…
▽ More
Fast radio bursts (FRBs) are unique probes of extragalactic ionized baryonic structure as each signal, through its burst properties, holds information about the ionized matter it encounters along its sightline. FRB 20200723B is a burst with a scattering timescale of $τ_\mathrm{400\,MHz} >$1 second at 400 MHz and a dispersion measure of DM $\sim$ 244 pc cm$^{-3}$. Observed across the entire CHIME/FRB frequency band, it is the single-component burst with the largest scattering timescale yet observed by CHIME/FRB. The combination of its high scattering timescale and relatively low dispersion measure present an uncommon opportunity to use FRB 20200723B to explore the properties of the cosmic web it traversed. With an $\sim$arcminute-scale localization region, we find the most likely host galaxy is NGC 4602 (with PATH probability $P(O|x)=0.985$), which resides $\sim$30 Mpc away within a sheet filamentary structure on the outskirts of the Virgo Cluster. We place an upper limit on the average free electron density of this filamentary structure of $\langle n_e \rangle < 4.6^{+9.6}_{-2.0} \times 10^{-5}$ cm$^{-3}$, broadly consistent with expectations from cosmological simulations. We investigate whether the source of scattering lies within the same galaxy as the FRB, or at a farther distance from an intervening structure along the line of sight. Comparing with Milky Way pulsar observations, we suggest the scattering may originate from within the host galaxy of FRB 20200723B.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
Collaborative Safety-Critical Formation Control with Obstacle Avoidance
Authors:
Brooks A. Butler,
Chi Ho Leung,
Philip E. Paré
Abstract:
This work explores a collaborative method for ensuring safety in multi-agent formation control problems. We formulate a control barrier function (CBF) based safety filter control law for a generic distributed formation controller and extend our previously developed collaborative safety framework to an obstacle avoidance problem for agents with acceleration control inputs. We then incorporate multi…
▽ More
This work explores a collaborative method for ensuring safety in multi-agent formation control problems. We formulate a control barrier function (CBF) based safety filter control law for a generic distributed formation controller and extend our previously developed collaborative safety framework to an obstacle avoidance problem for agents with acceleration control inputs. We then incorporate multi-obstacle collision avoidance into the collaborative safety framework. This framework includes a method for computing the maximum capability of agents to satisfy their individual safety requirements. We analyze the convergence rate of our collaborative safety algorithm, and prove the linear-time convergence of cooperating agents to a jointly feasible safe action for all agents under the special case of a tree-structured communication network with a single obstacle for each agent. We illustrate the analytical results via simulation on a mass-spring kinematics-based formation controller and demonstrate the finite-time convergence of the collaborative safety algorithm in the simple proven case, the more general case of a fully-connected system with multiple static obstacles, and with dynamic obstacles.
△ Less
Submitted 4 October, 2024;
originally announced October 2024.
-
3d Mirror Symmetry is Mirror Symmetry
Authors:
Ki Fung Chan,
Naichung Conan Leung
Abstract:
3d mirror symmetry is a mysterious duality for certian pairs of hyperkähler manifolds, or more generally complex symplectic manifolds/stacks. In this paper, we will describe its relationships with 2d mirror symmetry. This could be regarded as a 3d analog of the paper "Mirror Symmetry is T-Duality" by Strominger, Yau and Zaslow which described 2d mirror symmetry via 1d dualities.
3d mirror symmetry is a mysterious duality for certian pairs of hyperkähler manifolds, or more generally complex symplectic manifolds/stacks. In this paper, we will describe its relationships with 2d mirror symmetry. This could be regarded as a 3d analog of the paper "Mirror Symmetry is T-Duality" by Strominger, Yau and Zaslow which described 2d mirror symmetry via 1d dualities.
△ Less
Submitted 4 October, 2024;
originally announced October 2024.
-
Coastal Underwater Evidence Search System with Surface-Underwater Collaboration
Authors:
Hin Wang Lin,
Pengyu Wang,
Zhaohua Yang,
Ka Chun Leung,
Fangming Bao,
Ka Yu Kui,
Jian Xiang Erik Xu,
Ling Shi
Abstract:
The Coastal underwater evidence search system with surface-underwater collaboration is designed to revolutionize the search for artificial objects in coastal underwater environments, overcoming limitations associated with traditional methods such as divers and tethered remotely operated vehicles. Our innovative multi-robot collaborative system consists of three parts, an autonomous surface vehicle…
▽ More
The Coastal underwater evidence search system with surface-underwater collaboration is designed to revolutionize the search for artificial objects in coastal underwater environments, overcoming limitations associated with traditional methods such as divers and tethered remotely operated vehicles. Our innovative multi-robot collaborative system consists of three parts, an autonomous surface vehicle as a mission control center, a towed underwater vehicle for wide-area search, and a biomimetic underwater robot inspired by marine organisms for detailed inspections of identified areas. We conduct extensive simulations and real-world experiments in pond environments and coastal fields to demonstrate the system potential to surpass the limitations of conventional underwater search methods, offering a robust and efficient solution for law enforcement and recovery operations in marine settings.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
A Database Engineered System for Big Data Analytics on Tornado Climatology
Authors:
Fengfan Bian,
Carson K. Leung,
Piers Grenier,
Harry Pu,
Samuel Ning,
Alfredo Cuzzocrea
Abstract:
Recognizing the challenges with current tornado warning systems, we investigate alternative approaches. In particular, we present a database engi-neered system that integrates information from heterogeneous rich data sources, including climatology data for tornadoes and data just before a tornado warning. The system aids in predicting tornado occurrences by identifying the data points that form th…
▽ More
Recognizing the challenges with current tornado warning systems, we investigate alternative approaches. In particular, we present a database engi-neered system that integrates information from heterogeneous rich data sources, including climatology data for tornadoes and data just before a tornado warning. The system aids in predicting tornado occurrences by identifying the data points that form the basis of a tornado warning. Evaluation on US data highlights the advantages of using a classification forecasting recurrent neural network (RNN) model. The results highlight the effectiveness of our database engineered system for big data analytics on tornado climatology-especially, in accurately predict-ing tornado lead-time, magnitude, and location, contributing to the development of sustainable cities.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
A VLBI Calibrator Grid at 600MHz for Fast Radio Transient Localizations with CHIME/FRB Outriggers
Authors:
Shion Andrew,
Calvin Leung,
Alexander Li,
Kiyoshi W. Masui,
Bridget C. Andersen,
Kevin Bandura,
Alice P. Curtin,
Jane Kaczmarek,
Adam E. Lanman,
Mattias Lazda,
Juan Mena-Parra,
Daniele Michilli,
Kenzie Nimmo,
Aaron B. Pearlman,
Mubdi Rahman,
Vishwangi Shah,
Kaitlyn Shin,
Haochen Wang
Abstract:
The Canadian Hydrogen Intensity Mapping Experiment Fast Radio Burst (CHIME/FRB) Project has a new VLBI Outrigger at the Green Bank Observatory (GBO), which forms a 3300km baseline with CHIME operating at 400-800MHz. Using 100ms long full-array baseband "snapshots" collected commensally during FRB and pulsar triggers, we perform a shallow, wide-area VLBI survey covering a significant fraction of th…
▽ More
The Canadian Hydrogen Intensity Mapping Experiment Fast Radio Burst (CHIME/FRB) Project has a new VLBI Outrigger at the Green Bank Observatory (GBO), which forms a 3300km baseline with CHIME operating at 400-800MHz. Using 100ms long full-array baseband "snapshots" collected commensally during FRB and pulsar triggers, we perform a shallow, wide-area VLBI survey covering a significant fraction of the Northern sky targeted at the positions of compact sources from the Radio Fundamental Catalog. In addition, our survey contains calibrators detected from two 1s long trial baseband snapshots for a deeper survey with CHIME and GBO. In this paper, we present the largest catalog of compact calibrators suitable for 30-milliarcsecond-scale VLBI observations at sub-GHz frequencies to date. Our catalog consists of 200 total calibrators in the Northern Hemisphere that are compact on 30-milliarcsecond scales with fluxes above 100mJy. This calibrator grid will enable the precise localization of hundreds of FRBs a year with CHIME/FRB-Outriggers.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Broad-Line AGN at 3.5<z<6: The Black Hole Mass Function and a Connection with Little Red Dots
Authors:
Anthony J. Taylor,
Steven L. Finkelstein,
Dale D. Kocevski,
Junehyoung Jeon,
Volker Bromm,
Ricardo O. Amorin,
Pablo Arrabal Haro,
Bren E. Backhaus,
Micaela B. Bagley,
Eduardo Bañados,
Rachana Bhatawdekar,
Madisyn Brooks,
Antonello Calabro,
Oscar A. Chavez Ortiz,
Yingjie Cheng,
Nikko J. Cleri,
Justin W. Cole,
Kelcey Davis,
Mark Dickinson,
Callum Donnan,
James S. Dunlop,
Richard S. Ellis,
Vital Fernandez,
Adriano Fontana,
Seiji Fujimoto
, et al. (26 additional authors not shown)
Abstract:
We present a sample of 50 H-alpha detected broad-line active galactic nuclei (BLAGN) at redshifts 3.5<z<6.8 using data from the CEERS and RUBIES surveys. We select these sources directly from JWST/NIRSpec G395M/F290LP spectra. We use a multi-step pre-selection and a Bayesian fitting procedure to ensure a high-quality sample of sources with broad Balmer lines and narrow forbidden lines. We compute…
▽ More
We present a sample of 50 H-alpha detected broad-line active galactic nuclei (BLAGN) at redshifts 3.5<z<6.8 using data from the CEERS and RUBIES surveys. We select these sources directly from JWST/NIRSpec G395M/F290LP spectra. We use a multi-step pre-selection and a Bayesian fitting procedure to ensure a high-quality sample of sources with broad Balmer lines and narrow forbidden lines. We compute rest-frame ultraviolet and optical spectral slopes for these objects, and determine that 10 BLAGN in our sample are also little red dots (LRDs). These LRD BLAGN, when examined in aggregate, show broader H-alpha line profiles and a higher fraction of broad-to-narrow component H-alpha emission than non-LRD BLAGN. Moreover, we find that ~66% of these objects are intrinsically reddened (beta (optical)>0), independent of the contributions of emission lines to the broadband photometry. We construct the black hole (BH) mass function at 3.5<z<6 after computing robust observational and line detection completeness corrections. This BH mass function shows broad agreement with both recent JWST/NIRSpec and JWST/NIRCam WFSS based BH mass functions, though we extend these earlier results to log(M(BH)/M(sun)) < 7. The derived BH mass function is consistent with a variety of theoretical models, indicating that the observed abundance of black holes in the early universe is not discrepant with physically-motivated predictions. The BH mass function shape resembles a largely featureless power-law, suggesting that any signature from black-hole seeding has been lost by redshift z~5-6. Finally, we compute the BLAGN UV luminosity function and find good agreement with JWST-detected BLAGN samples from recent works, finding that BLAGN hosts constitute <10% of the total observed UV luminosity at all but the brightest luminosities.
△ Less
Submitted 14 May, 2025; v1 submitted 10 September, 2024;
originally announced September 2024.
-
Morphology of 137 Fast Radio Bursts down to Microseconds Timescales from The First CHIME/FRB Baseband Catalog
Authors:
Ketan R. Sand,
Alice P. Curtin,
Daniele Michilli,
Victoria M. Kaspi,
Emmanuel Fonseca,
Kenzie Nimmo,
Ziggy Pleunis,
Kaitlyn Shin,
Mohit Bhardwaj,
Charanjot Brar,
Matt Dobbs,
Gwendolyn Eadie,
B. M. Gaensler,
Ronniy C. Joseph,
Calvin Leung,
Robert Main,
Kiyoshi W. Masui,
Ryan Mckinven,
Ayush Pandhi,
Aaron B. Pearlman,
Masoud Rafiei-Ravandi,
Mawson W. Sammons,
Kendrick Smith,
Ingrid H. Stairs
Abstract:
We present a spectro-temporal analysis of 137 fast radio bursts (FRBs) from the first CHIME/FRB baseband catalog, including 125 one-off bursts and 12 repeat bursts, down to microsecond resolution using the least-squares optimization fitting routine: fitburst. Our measured values are compared with those in the first CHIME/FRB intensity catalog, revealing that nearly one-third of our sample exhibits…
▽ More
We present a spectro-temporal analysis of 137 fast radio bursts (FRBs) from the first CHIME/FRB baseband catalog, including 125 one-off bursts and 12 repeat bursts, down to microsecond resolution using the least-squares optimization fitting routine: fitburst. Our measured values are compared with those in the first CHIME/FRB intensity catalog, revealing that nearly one-third of our sample exhibits additional burst components at higher time resolutions. We measure sub-burst components within burst envelopes as narrow as $\sim$23 $μ$s (FWHM), with 20% of the sample displaying sub-structures narrower than 100 $μ$s, offering constraints on emission mechanisms. Scattering timescales in the sample range from 30 $μ$s to 13 ms at 600 MHz. We observe no correlations between scattering time and dispersion measure, rotation measure, or linear polarization fraction, with the latter suggesting that depolarization due to multipath propagation is negligible in our sample. Bursts with narrower envelopes ($\leq$ 1 ms) in our sample exhibit higher flux densities, indicating the potential presence of sub-ms FRBs that are being missed by our real-time system below a brightness threshold. Most multicomponent bursts in our sample exhibit sub-burst separations of $\leq$ 1 ms, with no bursts showing separations $<$41 $μ$s, even at a time resolution of 2.56 $μ$s, but both scattering and low signal-to-noise ratio can hinder detection of additional components. Lastly, given the morphological diversity of our sample, we suggest that one-off and repeating FRBs can come from different classes but have overlapping property distributions.
△ Less
Submitted 23 August, 2024;
originally announced August 2024.
-
Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches
Authors:
Yanjie Dong,
Haijun Zhang,
Chengming Li,
Song Guo,
Victor C. M. Leung,
Xiping Hu
Abstract:
Since the invention of GPT2--1.5B in 2019, large language models (LLMs) have transitioned from specialized models to versatile foundation models. The LLMs exhibit impressive zero-shot ability, however, require fine-tuning on local datasets and significant resources for deployment. Traditional fine-tuning techniques with the first-order optimizers require substantial GPU memory that exceeds mainstr…
▽ More
Since the invention of GPT2--1.5B in 2019, large language models (LLMs) have transitioned from specialized models to versatile foundation models. The LLMs exhibit impressive zero-shot ability, however, require fine-tuning on local datasets and significant resources for deployment. Traditional fine-tuning techniques with the first-order optimizers require substantial GPU memory that exceeds mainstream hardware capability. Therefore, memory-efficient methods are motivated to be investigated. Model compression techniques can reduce energy consumption, operational costs, and environmental impact so that to support sustainable artificial intelligence advancements. Additionally, large-scale foundation models have expanded to create images, audio, videos, and multi-modal contents, further emphasizing the need for efficient deployment. Therefore, we are motivated to present a comprehensive overview of the prevalent memory-efficient fine-tuning methods over the network edge. We also review the state-of-the-art literatures on model compression to provide a vision on deploying LLMs over the network edge.
△ Less
Submitted 1 October, 2024; v1 submitted 20 August, 2024;
originally announced August 2024.
-
SYZ Mirrors in non-Abelian 3d Mirror Symmetry
Authors:
Ki Fung Chan,
Naichung Conan Leung
Abstract:
In the SYZ program, the mirror of $Y$ is the moduli space of Lagrangian branes in $Y$. When $Y$ is equipped with a Hamiltonian $G$-action, we prove that its mirror determines a canonical complex Lagrangian subvariety in the Coulomb branch of the 3d $\mathcal{N}=4$ pure $G$-gauge theory.
In the SYZ program, the mirror of $Y$ is the moduli space of Lagrangian branes in $Y$. When $Y$ is equipped with a Hamiltonian $G$-action, we prove that its mirror determines a canonical complex Lagrangian subvariety in the Coulomb branch of the 3d $\mathcal{N}=4$ pure $G$-gauge theory.
△ Less
Submitted 6 October, 2024; v1 submitted 18 August, 2024;
originally announced August 2024.
-
Theoretical framework for enhancing or enabling cooling of a mechanical resonator via the anti-Stokes or Stokes interaction and zero-photon detection
Authors:
Jack Clarke,
Evan A. Cryer-Jenkins,
Arjun Gupta,
Kyle D. Major,
Jinglei Zhang,
Georg Enzian,
Magdalena Szczykulska,
Anthony C. Leung,
Harsh Rathee,
Andreas Ø. Svela,
Anthony K. C. Tan,
Almut Beige,
Klaus Mølmer,
Michael R. Vanner
Abstract:
We develop a theoretical framework to describe how zero-photon detection may be utilized to enhance laser cooling via the anti-Stokes interaction and, somewhat surprisingly, enable cooling via the Stokes interaction commonly associated with heating. Our description includes both pulsed and continuous measurements as well as optical detection efficiency and open-system dynamics. For both cases, we…
▽ More
We develop a theoretical framework to describe how zero-photon detection may be utilized to enhance laser cooling via the anti-Stokes interaction and, somewhat surprisingly, enable cooling via the Stokes interaction commonly associated with heating. Our description includes both pulsed and continuous measurements as well as optical detection efficiency and open-system dynamics. For both cases, we discuss how the cooling depends on the system parameters such as detection efficiency and optomechanical cooperativity, and we study the continuous-measurement-induced dynamics, contrasting to single-photon detection events. For the Stokes case, we explore the interplay between cooling and heating via optomechanical parametric amplification, and we find the efficiency required to cool a mechanical oscillator via zero-photon detection. This work serves as a companion article to the recent experiment [E. A. Cryer-Jenkins, K. D. Major, et al., arXiv:2408.01734 (2024)], which demonstrated enhanced laser cooling of a mechanical oscillator via zero-photon detection on the anti-Stokes signal. The framework developed here provides new approaches for cooling mechanical resonators that can be applied to a wide range of areas including nonclassical state preparation, quantum thermodynamics, and avoiding the often unwanted heating effects of parametric amplification.
△ Less
Submitted 6 May, 2025; v1 submitted 3 August, 2024;
originally announced August 2024.
-
Enhanced Laser Cooling of a Mechanical Resonator via Zero-Photon Detection
Authors:
Evan A. Cryer-Jenkins,
Kyle D. Major,
Jack Clarke,
Georg Enzian,
Magdalena Szczykulska,
Jinglei Zhang,
Arjun Gupta,
Anthony C. Leung,
Harsh Rathee,
Andreas Ø. Svela,
Anthony K. C. Tan,
Almut Beige,
Klaus Mølmer,
Michael R. Vanner
Abstract:
Throughout quantum science and technology, measurement is used as a powerful resource for nonlinear operations and quantum state engineering. In particular, single-photon detection is commonly employed for quantum-information applications and tests of fundamental physics. By contrast, and perhaps counter-intuitively, measurement of the absence of photons also provides useful information, and offer…
▽ More
Throughout quantum science and technology, measurement is used as a powerful resource for nonlinear operations and quantum state engineering. In particular, single-photon detection is commonly employed for quantum-information applications and tests of fundamental physics. By contrast, and perhaps counter-intuitively, measurement of the absence of photons also provides useful information, and offers significant potential for a wide range of new experimental directions. Here, we propose and experimentally demonstrate cooling of a mechanical resonator below its laser-cooled mechanical occupation via zero-photon detection on the anti-Stokes scattered optical field and verify this cooling through heterodyne measurements. Our measurements are well captured by a stochastic master equation and the techniques introduced here open new avenues for cooling, quantum thermodynamics, quantum state engineering, and quantum measurement and control.
△ Less
Submitted 6 May, 2025; v1 submitted 3 August, 2024;
originally announced August 2024.
-
Off-axis Hartmann wavefront sensing for the GMT-Consortium Large Earth Finder (G-CLEF) red camera optics
Authors:
Matthew C. H. Leung,
Colby A. Jurgenson,
Andrew Szentgyorgyi,
Brian McLeod,
Cem Onyuksel,
Joseph Zajac,
David Charbonneau,
William Podgorski,
Abigail Unger,
Mark Mueller,
Matthew Smith,
Daniel Baldwin,
V. Ashley Villar
Abstract:
The Hartmann test is a method used to measure the wavefront error in a focal optical system, wherein a mask with a pattern of small holes is placed at the system's aperture stop. By taking an image at a defocused plane, the differences between the ideal and real positions of the reimaged holes (called the transverse ray aberrations) can be measured, which can then be used to estimate the wavefront…
▽ More
The Hartmann test is a method used to measure the wavefront error in a focal optical system, wherein a mask with a pattern of small holes is placed at the system's aperture stop. By taking an image at a defocused plane, the differences between the ideal and real positions of the reimaged holes (called the transverse ray aberrations) can be measured, which can then be used to estimate the wavefront error. However, the Hartmann test is usually used with an on-axis field. In this paper, we present a wavefront sensing method which generalizes the classical Hartmann test for off-axis field angles and arbitrary reference wavefronts. Our method involves taking images at two defocused planes, and then using the real reimaged hole positions on both planes to estimate the trajectories of rays from the system's exit pupil, at which the reference wavefront is situated. We then propagate the rays forward from the reference wavefront to one of the two defocused planes, in order to find the ideal reimaged hole positions, from which we can compute transverse ray aberrations. We derive and solve a pair of nonlinear partial differential equations relating transverse ray aberrations to wavefront error, using Zernike decomposition and nonlinear least squares. Our method has been verified on simulated data from the 7-lens f/2.25 red camera system of the GMT-Consortium Large Earth Finder (G-CLEF), a high resolution optical echelle spectrograph which will be a first light instrument for the Giant Magellan Telescope (GMT).
△ Less
Submitted 29 July, 2024;
originally announced July 2024.
-
SMiCRM: A Benchmark Dataset of Mechanistic Molecular Images
Authors:
Ching Ting Leung,
Yufan Chen,
Hanyu Gao
Abstract:
Optical chemical structure recognition (OCSR) systems aim to extract the molecular structure information, usually in the form of molecular graph or SMILES, from images of chemical molecules. While many tools have been developed for this purpose, challenges still exist due to different types of noises that might exist in the images. Specifically, we focus on the 'arrow-pushing' diagrams, a typical…
▽ More
Optical chemical structure recognition (OCSR) systems aim to extract the molecular structure information, usually in the form of molecular graph or SMILES, from images of chemical molecules. While many tools have been developed for this purpose, challenges still exist due to different types of noises that might exist in the images. Specifically, we focus on the 'arrow-pushing' diagrams, a typical type of chemical images to demonstrate electron flow in mechanistic steps. We present Structural molecular identifier of Molecular images in Chemical Reaction Mechanisms (SMiCRM), a dataset designed to benchmark machine recognition capabilities of chemical molecules with arrow-pushing annotations. Comprising 453 images, it spans a broad array of organic chemical reactions, each illustrated with molecular structures and mechanistic arrows. SMiCRM offers a rich collection of annotated molecule images for enhancing the benchmarking process for OCSR methods. This dataset includes a machine-readable molecular identity for each image as well as mechanistic arrows showing electron flow during chemical reactions. It presents a more authentic and challenging task for testing molecular recognition technologies, and achieving this task can greatly enrich the mechanisitic information in computer-extracted chemical reaction data.
△ Less
Submitted 25 July, 2024;
originally announced July 2024.
-
The BoRG-JWST Survey: Program Overview and First Confirmations of Luminous Reionization-Era Galaxies from Pure-Parallel Observations
Authors:
Guido Roberts-Borsani,
Micaela Bagley,
Sofía Rojas-Ruiz,
Tommaso Treu,
Takahiro Morishita,
Steven L. Finkelstein,
Michele Trenti,
Pablo Arrabal Haro,
Eduardo Bañados,
Óscar A. Chávez Ortiz,
Katherine Chworowsky,
Taylor A. Hutchison,
Rebecca L. Larson,
Nicha Leethochawalit,
Gene C. K. Leung,
Charlotte Mason,
Rachel S. Somerville,
Massimo Stiavelli,
L. Y. Aaron Yung,
Susan A. Kassin,
Christian Soto
Abstract:
We present the BoRG-JWST survey, a combination of two JWST Cycle 1 programs aimed at obtaining NIRSpec spectroscopy of representative, UV-bright $7<z<10$ galaxy candidates across 22 independent sight lines selected from Hubble/WFC3 pure-parallel observations. We confirm the high-$z$ nature of 10 out of 19 observed primary targets through low-resolution prism observations, with the rest revealing t…
▽ More
We present the BoRG-JWST survey, a combination of two JWST Cycle 1 programs aimed at obtaining NIRSpec spectroscopy of representative, UV-bright $7<z<10$ galaxy candidates across 22 independent sight lines selected from Hubble/WFC3 pure-parallel observations. We confirm the high-$z$ nature of 10 out of 19 observed primary targets through low-resolution prism observations, with the rest revealing themselves unsurprisingly to be $z\sim1-3$ interlopers, brown dwarfs, or yielding inconclusive results. From the MSA observations, we confirm an additional 9 filler sources at $z>5$, highlighting the large abundance of high-redshift galaxies even in individual WFC3 pointings. The primary sample span an absolute magnitude range $-20.4<M_{\rm UV}<-22.4$ mag and harbour UV continuum slopes of $β\simeq-2.5$ to $-2.0$, representing some of the most luminous $z>7$ sources currently known and comparable to the brightest sources at $z>10$. Prominent [O III]+H$β$ lines are found across the full sample, while a stack of sources reveals a plethora of other rest-optical lines and additional rest-UV C III]1909 Å emission. Despite their luminosities, none of the low-resolution spectra display evidence for Type 1 AGN activity based on a search for broad-line emission. Lastly, we present a spectroscopic data release of 188 confirmed $0.5\lesssim z\lesssim5.0$ sources from filler MSA observations, highlighting the legacy value of the survey and a representative benchmark for comparisons to deep field observations.
△ Less
Submitted 24 July, 2024;
originally announced July 2024.
-
Non-abelian arboreal Galois groups associated to PCF rational maps
Authors:
Chifan Leung,
Clayton Petsche
Abstract:
We prove that arboreal Galois extensions of number fields are never abelian for post-critically finite rational maps and non-preperiodic base points. For polynomials, this establishes a new class of known cases of a conjecture of Andrews-Petsche. Together with a result of Ferraguti-Ostafe-Zannier, this result implies that counterexamples to the conjecture, if they exist, are sparse. We also prove…
▽ More
We prove that arboreal Galois extensions of number fields are never abelian for post-critically finite rational maps and non-preperiodic base points. For polynomials, this establishes a new class of known cases of a conjecture of Andrews-Petsche. Together with a result of Ferraguti-Ostafe-Zannier, this result implies that counterexamples to the conjecture, if they exist, are sparse. We also prove an auxiliary result on places of periodic reduction for rational maps, which may be of independent interest.
△ Less
Submitted 24 July, 2024;
originally announced July 2024.
-
Entanglement and operator correlation signatures of many-body quantum Zeno phases in inefficiently monitored noisy systems
Authors:
Chun Y. Leung,
Alessandro Romito
Abstract:
The interplay between information-scrambling Hamiltonians and local continuous measurements hosts platforms for exotic measurement-induced phase transition in out-of-equilibrium steady states. Here, we consider such transitions under the addition of local random white noise and measurement inefficiency in a XX spin chain. We identify a non-monotonic dependence on the local noise strength in both t…
▽ More
The interplay between information-scrambling Hamiltonians and local continuous measurements hosts platforms for exotic measurement-induced phase transition in out-of-equilibrium steady states. Here, we consider such transitions under the addition of local random white noise and measurement inefficiency in a XX spin chain. We identify a non-monotonic dependence on the local noise strength in both the averaged entanglement and operator correlations, specifically the subsystem parity variance. While the non-monotonicity persists at any finite efficiency for the operator correlations, it disappears at finite inefficiency for the entanglement. The analysis of scaling with the system size in a finite length chain indicates that, at finite efficiency, this effect leads to distinct MiPTs for operator correlations and entanglement. Our result hints at a difference between area-law entanglement scaling and Zeno-localized phases for inefficient monitoring.
△ Less
Submitted 16 July, 2024;
originally announced July 2024.
-
Simulating FRB Morphologies and Coherent Phase Correlation Signatures from Multi-Plane Astrophysical Lensing
Authors:
Zarif Kader,
Matt Dobbs,
Calvin Leung,
Kiyoshi W. Masui,
Mawson W. Sammons
Abstract:
Fast Radio Bursts (FRBs), like pulsars, display radio emission from compact regions such that they can be treated as point sources. As this radiation propagates through space, they encounter sources of lensing such as a gravitational field of massive objects or inhomogeneous changes in the electron density of cold plasma. We have developed a simulation tool to generate these lensing morphologies t…
▽ More
Fast Radio Bursts (FRBs), like pulsars, display radio emission from compact regions such that they can be treated as point sources. As this radiation propagates through space, they encounter sources of lensing such as a gravitational field of massive objects or inhomogeneous changes in the electron density of cold plasma. We have developed a simulation tool to generate these lensing morphologies through coherent propagation transfer functions generated by phase coherent geometric optics on a spatial grid. In the limit an FRB can be treated as a point source, the ray paths from the FRB to the observer are phase coherent. Each image will have a time delay and magnification that will alter the emitted frequency-temporal morphology of the FRB to that which is observed. The interference of these images could also decohere the observed phase properties of the images, affecting any phase related searches such as searching for the auto-correlation of the observed FRB voltage with other images in time. We present analytic test cases to demonstrate that the simulation can model qualitative properties. We provide example multi-plane lensing systems to show the capabilities of the simulation in modeling the lensed morphology of an FRB and observed phase coherence.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
QoE Maximization for Multiple-UAV-Assisted Multi-Access Edge Computing: An Online Joint Optimization Approach
Authors:
Long He,
Geng Sun,
Zemin Sun,
Qingqing Wu,
Jiawen Kang,
Dusit Niyato,
Zhu Han,
Victor C. M. Leung
Abstract:
In disaster scenarios, conventional terrestrial multi-access edge computing (MEC) paradigms, which rely on fixed infrastructure, may become unavailable due to infrastructure damage. With high-probability line-of-sight (LoS) communication, flexible mobility, and low cost, unmanned aerial vehicle (UAV)-assisted MEC is emerging as a new promising paradigm to provide edge computing services for ground…
▽ More
In disaster scenarios, conventional terrestrial multi-access edge computing (MEC) paradigms, which rely on fixed infrastructure, may become unavailable due to infrastructure damage. With high-probability line-of-sight (LoS) communication, flexible mobility, and low cost, unmanned aerial vehicle (UAV)-assisted MEC is emerging as a new promising paradigm to provide edge computing services for ground user devices (UDs) in disaster-stricken areas. However, the limited battery capacity, computing resources, and spectrum resources also pose serious challenges for UAV-assisted MEC, which can potentially shorten the service time of UAVs and degrade the quality of experience (QoE) of UDs without an effective control approach. To this end, in this work, we first present a hierarchical architecture of multiple-UAV-assisted MEC networks that enables the coordinated provision of edge computing services by multiple UAVs. Then, we formulate a joint task offloading, resource allocation, and UAV trajectory planning optimization problem (JTRTOP) to maximize the QoE of UDs while considering the energy consumption constraints of UAVs. Since the problem is proven to be a future-dependent and NP-hard problem, we propose a novel online joint task offloading, resource allocation, and UAV trajectory planning approach (OJTRTA) to solve the problem. Specifically, the JTRTOP is first transformed into a per-slot real-time optimization problem (PROP) using the Lyapunov optimization framework. Then, a two-stage optimization method based on game theory and convex optimization is proposed to solve the PROP. Simulation results provide empirical evidence supporting the superior system performance of the proposed OJTRTA in comparison to alternative approaches.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Measurement of $J/ψ$ and $ψ\left(2S\right)$ production in $p+p$ and $p+d$ interactions at 120 GeV
Authors:
C. H. Leung,
K. Nagai,
K. Nakano,
D. Nawarathne,
J. Dove,
S. Prasad,
N. Wuerfel,
C. A. Aidala,
J. Arrington,
C. Ayuso,
C. L. Barker,
C. N. Brown,
W. C. Chang,
A. Chen,
D. C. Christian,
B. P. Dannowitz,
M. Daugherity,
L. El Fassi,
D. F. Geesaman,
R. Gilman,
Y. Goto,
R. Guo,
T. J. Hague,
R. J. Holt,
M. F. Hossain
, et al. (36 additional authors not shown)
Abstract:
We report the $p+p$ and $p+d$ differential cross sections measured in the SeaQuest experiment for $J/ψ$ and $ψ\left(2S\right)$ production at 120 GeV beam energy covering the forward $x$-Feynman ($x_F$) range of $0.5 < x_F <0.9$. The measured cross sections are in good agreement with theoretical calculations based on the nonrelativistic QCD (NRQCD) using the long-distance matrix elements deduced fr…
▽ More
We report the $p+p$ and $p+d$ differential cross sections measured in the SeaQuest experiment for $J/ψ$ and $ψ\left(2S\right)$ production at 120 GeV beam energy covering the forward $x$-Feynman ($x_F$) range of $0.5 < x_F <0.9$. The measured cross sections are in good agreement with theoretical calculations based on the nonrelativistic QCD (NRQCD) using the long-distance matrix elements deduced from a recent global analysis of proton- and pion-induced charmonium production data. The $σ_{ψ\left(2S\right)} / σ_{J/ψ}$ cross section ratios are found to increase as $x_F$ increases, indicating that the $q \bar{q}$ annihilation process has larger contributions in the $ψ\left(2S\right)$ production than the $J/ψ$ production. The $σ_{pd}/2σ_{pp}$ cross section ratios are observed to be significantly different for the Drell-Yan process and $J/ψ$ production, reflecting their different production mechanisms. We find that the $σ_{pd}/2σ_{pp}$ ratios for $J/ψ$ production at the forward $x_F$ region are sensitive to the $\bar{d}/ \bar{u}$ flavor asymmetry of the proton sea, analogous to the Drell-Yan process. The transverse momentum ($p_T$) distributions for $J/ψ$ and $ψ\left(2S\right)$ production are also presented and compared with data collected at higher center-of-mass energies.
△ Less
Submitted 22 September, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Magnetospheric origin of a fast radio burst constrained using scintillation
Authors:
Kenzie Nimmo,
Ziggy Pleunis,
Paz Beniamini,
Pawan Kumar,
Adam E. Lanman,
D. Z. Li,
Robert Main,
Mawson W. Sammons,
Shion Andrew,
Mohit Bhardwaj,
Shami Chatterjee,
Alice P. Curtin,
Emmanuel Fonseca,
B. M. Gaensler,
Ronniy C. Joseph,
Zarif Kader,
Victoria M. Kaspi,
Mattias Lazda,
Calvin Leung,
Kiyoshi W. Masui,
Ryan Mckinven,
Daniele Michilli,
Ayush Pandhi,
Aaron B. Pearlman,
Masoud Rafiei-Ravandi
, et al. (4 additional authors not shown)
Abstract:
Fast radio bursts (FRBs) are micro-to-millisecond duration radio transients that originate mostly from extragalactic distances. The emission mechanism responsible for these high luminosity, short duration transients remains debated. The models are broadly grouped into two classes: physical processes that occur within close proximity to a central engine; and central engines that release energy whic…
▽ More
Fast radio bursts (FRBs) are micro-to-millisecond duration radio transients that originate mostly from extragalactic distances. The emission mechanism responsible for these high luminosity, short duration transients remains debated. The models are broadly grouped into two classes: physical processes that occur within close proximity to a central engine; and central engines that release energy which moves to large radial distances and subsequently interacts with surrounding media producing radio waves. The expected emission region sizes are notably different between these two types of models. FRB emission size constraints can therefore be used to distinguish between these competing models and inform on the physics responsible. Here we present the measurement of two mutually coherent scintillation scales in the frequency spectrum of FRB 20221022A: one originating from a scattering screen located within the Milky Way, and the second originating from a scattering screen located within its host galaxy or local environment. We use the scattering media as an astrophysical lens to constrain the size of the lateral emission region, $R_{\star\mathrm{obs}} \lesssim 3\times10^{4}$ km. We find that this is inconsistent with the expected emission sizes for the large radial distance models, and is more naturally explained with an emission process that operates within or just beyond the magnetosphere of a central compact object. Recently, FRB 20221022A was found to exhibit an S-shaped polarisation angle swing, supporting a magnetospheric emission process. The scintillation results presented in this work independently support this conclusion, while highlighting scintillation as a useful tool in our understanding of FRB emission physics and progenitors.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Online Identification of Time-Varying Systems Using Excitation Sets and Change Point Detection
Authors:
Chi Ho Leung,
Ashish R. Hota,
Philip E. Paré
Abstract:
In this work, we first show that the problem of parameter identification is often ill-conditioned and lacks the persistence of excitation required for the convergence of online learning schemes. To tackle these challenges, we introduce the notion of optimal and greedy excitation sets which contain data points with sufficient richness to aid in the identification task. We then present the greedy ex…
▽ More
In this work, we first show that the problem of parameter identification is often ill-conditioned and lacks the persistence of excitation required for the convergence of online learning schemes. To tackle these challenges, we introduce the notion of optimal and greedy excitation sets which contain data points with sufficient richness to aid in the identification task. We then present the greedy excitation set-based recursive least squares algorithm to alleviate the problem of the lack of persistent excitation, and prove that the iterates generated by the proposed algorithm minimize an auxiliary weighted least squares cost function. When data points are generated from time-varying parameters, online estimators tend to underfit the true parameter trajectory, and their predictability deteriorates. To tackle this problem, we propose a memory resetting scheme leveraging change point detection techniques. Finally, we illustrate the performance of the proposed algorithms via several numerical case studies to learn the (time-varying) parameters of networked epidemic dynamics, and compare it with results obtained using conventional approaches.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey
Authors:
Feng Liang,
Zhen Zhang,
Haifeng Lu,
Chengming Li,
Victor C. M. Leung,
Yanyi Guo,
Xiping Hu
Abstract:
With rapidly increasing distributed deep learning workloads in large-scale data centers, efficient distributed deep learning framework strategies for resource allocation and workload scheduling have become the key to high-performance deep learning. The large-scale environment with large volumes of datasets, models, and computational and communication resources raises various unique challenges for…
▽ More
With rapidly increasing distributed deep learning workloads in large-scale data centers, efficient distributed deep learning framework strategies for resource allocation and workload scheduling have become the key to high-performance deep learning. The large-scale environment with large volumes of datasets, models, and computational and communication resources raises various unique challenges for resource allocation and workload scheduling in distributed deep learning, such as scheduling complexity, resource and workload heterogeneity, and fault tolerance. To uncover these challenges and corresponding solutions, this survey reviews the literature, mainly from 2019 to 2024, on efficient resource allocation and workload scheduling strategies for large-scale distributed DL. We explore these strategies by focusing on various resource types, scheduling granularity levels, and performance goals during distributed training and inference processes. We highlight critical challenges for each topic and discuss key insights of existing technologies. To illustrate practical large-scale resource allocation and workload scheduling in real distributed deep learning scenarios, we use a case study of training large language models. This survey aims to encourage computer science, artificial intelligence, and communications researchers to understand recent advances and explore future research directions for efficient framework strategies for large-scale distributed deep learning.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay
Authors:
Daya Bay collaboration,
F. P. An,
W. D. Bai,
A. B. Balantekin,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
H. Y. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
Z. Y. Chen,
J. Cheng,
J. Cheng,
Y. -C. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng
, et al. (177 additional authors not shown)
Abstract:
This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive…
▽ More
This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive region, the relative $\overlineν_{e}$ rates and energy spectra variation among the near and far detectors gives $\mathrm{sin}^22θ_{13} = 0.0759_{-0.0049}^{+0.0050}$ and $Δm^2_{32} = (2.72^{+0.14}_{-0.15})\times10^{-3}$ eV$^2$ assuming the normal neutrino mass ordering, and $Δm^2_{32} = (-2.83^{+0.15}_{-0.14})\times10^{-3}$ eV$^2$ for the inverted neutrino mass ordering. This estimate of $\sin^2 2θ_{13}$ is consistent with and essentially independent from the one obtained using the capture-on-gadolinium sample at Daya Bay. The combination of these two results yields $\mathrm{sin}^22θ_{13}= 0.0833\pm0.0022$, which represents an 8% relative improvement in precision regarding the Daya Bay full 3158-day capture-on-gadolinium result.
△ Less
Submitted 10 October, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Risk-Neutral Generative Networks
Authors:
Zhonghao Xian,
Xing Yan,
Cheuk Hang Leung,
Qi Wu
Abstract:
We present a functional generative approach to extract risk-neutral densities from market prices of options. Specifically, we model the log-returns on the time-to-maturity continuum as a stochastic curve driven by standard normal. We then use neural nets to represent the term structures of the location, the scale, and the higher-order moments, and impose stringent conditions on the learning proces…
▽ More
We present a functional generative approach to extract risk-neutral densities from market prices of options. Specifically, we model the log-returns on the time-to-maturity continuum as a stochastic curve driven by standard normal. We then use neural nets to represent the term structures of the location, the scale, and the higher-order moments, and impose stringent conditions on the learning process to ensure the neural net-based curve representation is free of static arbitrage. This specification is structurally clear in that it separates the modeling of randomness from the modeling of the term structures of the parameters. It is data adaptive in that we use neural nets to represent the shape of the stochastic curve. It is also generative in that the functional form of the stochastic curve, although parameterized by neural nets, is an explicit and deterministic function of the standard normal. This explicitness allows for the efficient generation of samples to price options across strikes and maturities, without compromising data adaptability. We have validated the effectiveness of this approach by benchmarking it against a comprehensive set of baseline models. Experiments show that the extracted risk-neutral densities accommodate a diverse range of shapes. Its accuracy significantly outperforms the extensive set of baseline models--including three parametric models and nine stochastic process models--in terms of accuracy and stability. The success of this approach is attributed to its capacity to offer flexible term structures for risk-neutral skewness and kurtosis.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Networking Systems for Video Anomaly Detection: A Tutorial and Survey
Authors:
Jing Liu,
Yang Liu,
Jieyu Lin,
Jielin Li,
Liang Cao,
Peng Sun,
Bo Hu,
Liang Song,
Azzedine Boukerche,
Victor C. M. Leung
Abstract:
The increasing utilization of surveillance cameras in smart cities, coupled with the surge of online video applications, has heightened concerns regarding public security and privacy protection, which propelled automated Video Anomaly Detection (VAD) into a fundamental research task within the Artificial Intelligence (AI) community. With the advancements in deep learning and edge computing, VAD ha…
▽ More
The increasing utilization of surveillance cameras in smart cities, coupled with the surge of online video applications, has heightened concerns regarding public security and privacy protection, which propelled automated Video Anomaly Detection (VAD) into a fundamental research task within the Artificial Intelligence (AI) community. With the advancements in deep learning and edge computing, VAD has made significant progress and advances synergized with emerging applications in smart cities and video internet, which has moved beyond the conventional research scope of algorithm engineering to deployable Networking Systems for VAD (NSVAD), a practical hotspot for intersection exploration in the AI, IoVT, and computing fields. In this article, we delineate the foundational assumptions, learning frameworks, and applicable scenarios of various deep learning-driven VAD routes, offering an exhaustive tutorial for novices in NSVAD. In addition, this article elucidates core concepts by reviewing recent advances and typical solutions and aggregating available research resources accessible at https://github.com/fdjingliu/NSVAD. Lastly, this article projects future development trends and discusses how the integration of AI and computing technologies can address existing research challenges and promote open opportunities, serving as an insightful guide for prospective researchers and engineers.
△ Less
Submitted 3 April, 2025; v1 submitted 15 May, 2024;
originally announced May 2024.
-
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts
Authors:
Raghu Prabhakar,
Ram Sivaramakrishnan,
Darshan Gandhi,
Yun Du,
Mingran Wang,
Xiangyu Song,
Kejie Zhang,
Tianren Gao,
Angela Wang,
Karen Li,
Yongning Sheng,
Joshua Brot,
Denis Sokolov,
Apurv Vivek,
Calvin Leung,
Arjun Sabnis,
Jiayu Bai,
Tuowen Zhao,
Mark Gottscho,
David Jackson,
Mark Luttrell,
Manish K. Shah,
Edison Chen,
Kaizhao Liang,
Swayambhoo Jain
, et al. (5 additional authors not shown)
Abstract:
Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert…
▽ More
Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Experts (CoE) is an alternative modular approach that lowers the cost and complexity of training and serving. However, this approach presents two key challenges when using conventional hardware: (1) without fused operations, smaller models have lower operational intensity, which makes high utilization more challenging to achieve; and (2) hosting a large number of models can be either prohibitively expensive or slow when dynamically switching between them.
In this paper, we describe how combining CoE, streaming dataflow, and a three-tier memory system scales the AI memory wall. We describe Samba-CoE, a CoE system with 150 experts and a trillion total parameters. We deploy Samba-CoE on the SambaNova SN40L Reconfigurable Dataflow Unit (RDU) - a commercial dataflow accelerator architecture that has been co-designed for enterprise inference and training applications. The chip introduces a new three-tier memory system with on-chip distributed SRAM, on-package HBM, and off-package DDR DRAM. A dedicated inter-RDU network enables scaling up and out over multiple sockets. We demonstrate speedups ranging from 2$\times$ to 13$\times$ on various benchmarks running on eight RDU sockets compared with an unfused baseline. We show that for CoE inference deployments, the 8-socket RDU Node reduces machine footprint by up to 19$\times$, speeds up model switching time by 15$\times$ to 31$\times$, and achieves an overall speedup of 3.7$\times$ over a DGX H100 and 6.6$\times$ over a DGX A100.
△ Less
Submitted 4 November, 2024; v1 submitted 13 May, 2024;
originally announced May 2024.
-
Update Rate, Accuracy, and Age of Information in a Wireless Sensor Network
Authors:
Xinlu Dai,
Cyril Leung
Abstract:
Age of Information (AoI), namely the time that has elapsed since the most recently delivered packet was generated, is receiving increasing attention with the emergence of many real-time applications that rely on the exchange of time-sensitive information. AoI captures the freshness of the information from the perspective of the destination. The term "accuracy of information" is used to assess how…
▽ More
Age of Information (AoI), namely the time that has elapsed since the most recently delivered packet was generated, is receiving increasing attention with the emergence of many real-time applications that rely on the exchange of time-sensitive information. AoI captures the freshness of the information from the perspective of the destination. The term "accuracy of information" is used to assess how close the estimate at the destination is to the parameter value measured by the sensor. In this paper, the mean square error (MSE) is used to evaluate the accuracy of information. We focus on a single sensor that monitors a time-sensitive physical process, which is modelled as a random walk. Whenever the state of the random walk changes by more than a specified threshold, the sensor generates a status update packet and transmits it to the destination. When no update packet is received, the destination assumes that the state of the process has not changed. We study the problem of finding the minimum update rate under AoI and accuracy of information constraints. More specifically, we derive analytical expressions for the update rate, the AoI, and the MSE.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Multi-objective Optimization for Multi-UAV-assisted Mobile Edge Computing
Authors:
Geng Sun,
Yixian Wang,
Zemin Sun,
Qingqing Wu,
Jiawen Kang,
Dusit Niyato,
Victor C. M. Leung
Abstract:
Recent developments in unmanned aerial vehicles (UAVs) and mobile edge computing (MEC) have provided users with flexible and resilient computing services. However, meeting the computing-intensive and latency-sensitive demands of users poses a significant challenge due to the limited resources of UAVs. To address this challenge, we present a multi-objective optimization approach for multi-UAV-assis…
▽ More
Recent developments in unmanned aerial vehicles (UAVs) and mobile edge computing (MEC) have provided users with flexible and resilient computing services. However, meeting the computing-intensive and latency-sensitive demands of users poses a significant challenge due to the limited resources of UAVs. To address this challenge, we present a multi-objective optimization approach for multi-UAV-assisted MEC systems. First, we formulate a multi-objective optimization problem \textcolor{b2}{aiming} at minimizing the total task completion delay, reducing the total UAV energy consumption, and maximizing the total amount of offloaded tasks by jointly optimizing task offloading, computation resource allocation, and UAV trajectory control. Since the problem is a mixed-integer non-linear programming (MINLP) and NP-hard problem which is challenging, we propose a joint task offloading, computation resource allocation, and UAV trajectory control (JTORATC) approach to solve the problem. \textcolor{b3}{However, since the decision variables of task offloading, computation resource allocation, and UAV trajectory control are coupled with each other, the original problem is split into three sub-problems, i.e., task offloading, computation resource allocation, and UAV trajectory control, which are solved individually to obtain the corresponding decisions.} \textcolor{b2}{Moreover, the sub-problem of task offloading is solved by using distributed splitting and threshold rounding methods, the sub-problem of computation resource allocation is solved by adopting the Karush-Kuhn-Tucker (KKT) method, and the sub-problem of UAV trajectory control is solved by employing the successive convex approximation (SCA) method.} Simulation results show that the proposed JTORATC has superior performance compared to the other benchmark methods.
△ Less
Submitted 23 March, 2024;
originally announced April 2024.
-
Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems
Authors:
Xiaofei Wang,
Yunfeng Zhao,
Chao Qiu,
Qinghua Hu,
Victor C. M. Leung
Abstract:
Amidst the robust impetus from artificial intelligence (AI) and big data, edge intelligence (EI) has emerged as a nascent computing paradigm, synthesizing AI with edge computing (EC) to become an exemplary solution for unleashing the full potential of AI services. Nonetheless, challenges in communication costs, resource allocation, privacy, and security continue to constrain its proficiency in sup…
▽ More
Amidst the robust impetus from artificial intelligence (AI) and big data, edge intelligence (EI) has emerged as a nascent computing paradigm, synthesizing AI with edge computing (EC) to become an exemplary solution for unleashing the full potential of AI services. Nonetheless, challenges in communication costs, resource allocation, privacy, and security continue to constrain its proficiency in supporting services with diverse requirements. In response to these issues, this paper introduces socialized learning (SL) as a promising solution, further propelling the advancement of EI. SL is a learning paradigm predicated on social principles and behaviors, aimed at amplifying the collaborative capacity and collective intelligence of agents within the EI system. SL not only enhances the system's adaptability but also optimizes communication, and networking processes, essential for distributed intelligence across diverse devices and platforms. Therefore, a combination of SL and EI may greatly facilitate the development of collaborative intelligence in the future network. This paper presents the findings of a literature review on the integration of EI and SL, summarizing the latest achievements in existing research on EI and SL. Subsequently, we delve comprehensively into the limitations of EI and how it could benefit from SL. Special emphasis is placed on the communication challenges and networking strategies and other aspects within these systems, underlining the role of optimized network solutions in improving system efficiency. Based on these discussions, we elaborate in detail on three integrated components: socialized architecture, socialized training, and socialized inference, analyzing their strengths and weaknesses. Finally, we identify some possible future applications of combining SL and EI, discuss open problems and suggest some future research.
△ Less
Submitted 3 November, 2024; v1 submitted 20 April, 2024;
originally announced April 2024.
-
Collaborative Ground-Space Communications via Evolutionary Multi-objective Deep Reinforcement Learning
Authors:
Jiahui Li,
Geng Sun,
Qingqing Wu,
Dusit Niyato,
Jiawen Kang,
Abbas Jamalipour,
Victor C. M. Leung
Abstract:
In this paper, we propose a distributed collaborative beamforming (DCB)-based uplink communication paradigm for enabling ground-space direct communications. Specifically, DCB treats the terminals that are unable to establish efficient direct connections with the low Earth orbit (LEO) satellites as distributed antennas, forming a virtual antenna array to enhance the terminal-to-satellite uplink ach…
▽ More
In this paper, we propose a distributed collaborative beamforming (DCB)-based uplink communication paradigm for enabling ground-space direct communications. Specifically, DCB treats the terminals that are unable to establish efficient direct connections with the low Earth orbit (LEO) satellites as distributed antennas, forming a virtual antenna array to enhance the terminal-to-satellite uplink achievable rates and durations. However, such systems need multiple trade-off policies that variously balance the terminal-satellite uplink achievable rate, energy consumption of terminals, and satellite switching frequency to satisfy the scenario requirement changes. Thus, we perform a multi-objective optimization analysis and formulate a long-term optimization problem. To address availability in different terminal cluster scales, we reformulate this problem into an action space-reduced and universal multi-objective Markov decision process. Then, we propose an evolutionary multi-objective deep reinforcement learning algorithm to obtain the desirable policies, in which the low-value actions are masked to speed up the training process. As such, the applicability of a one-time trained model can cover more changing terminal-satellite uplink scenarios. Simulation results show that the proposed algorithm outmatches various baselines, and draw some useful insights. Specifically, it is found that DCB enables terminals that cannot reach the uplink achievable threshold to achieve efficient direct uplink transmission, which thus reveals that DCB is an effective solution for enabling direct ground-space communications. Moreover, it reveals that the proposed algorithm achieves multiple policies favoring different objectives and achieving near-optimal uplink achievable rates with low switching frequency.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey
Authors:
Feng Liang,
Zhen Zhang,
Haifeng Lu,
Victor C. M. Leung,
Yanyi Guo,
Xiping Hu
Abstract:
With the rapid growth in the volume of data sets, models, and devices in the domain of deep learning, there is increasing attention on large-scale distributed deep learning. In contrast to traditional distributed deep learning, the large-scale scenario poses new challenges that include fault tolerance, scalability of algorithms and infrastructures, and heterogeneity in data sets, models, and resou…
▽ More
With the rapid growth in the volume of data sets, models, and devices in the domain of deep learning, there is increasing attention on large-scale distributed deep learning. In contrast to traditional distributed deep learning, the large-scale scenario poses new challenges that include fault tolerance, scalability of algorithms and infrastructures, and heterogeneity in data sets, models, and resources. Due to intensive synchronization of models and sharing of data across GPUs and computing nodes during distributed training and inference processes, communication efficiency becomes the bottleneck for achieving high performance at a large scale. This article surveys the literature over the period of 2018-2023 on algorithms and technologies aimed at achieving efficient communication in large-scale distributed deep learning at various levels, including algorithms, frameworks, and infrastructures. Specifically, we first introduce efficient algorithms for model synchronization and communication data compression in the context of large-scale distributed training. Next, we introduce efficient strategies related to resource allocation and task scheduling for use in distributed training and inference. After that, we present the latest technologies pertaining to modern communication infrastructures used in distributed deep learning with a focus on examining the impact of the communication overhead in a large-scale and heterogeneous setting. Finally, we conduct a case study on the distributed training of large language models at a large scale to illustrate how to apply these technologies in real cases. This article aims to offer researchers a comprehensive understanding of the current landscape of large-scale distributed deep learning and to reveal promising future research directions toward communication-efficient solutions in this scope.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
The Rise of Faint, Red AGN at $z>4$: A Sample of Little Red Dots in the JWST Extragalactic Legacy Fields
Authors:
Dale D. Kocevski,
Steven L. Finkelstein,
Guillermo Barro,
Anthony J. Taylor,
Antonello Calabrò,
Brivael Laloux,
Johannes Buchner,
Jonathan R. Trump,
Gene C. K. Leung,
Guang Yang,
Mark Dickinson,
Pablo G. Pérez-González,
Fabio Pacucci,
Kohei Inayoshi,
Rachel S. Somerville,
Elizabeth J. McGrath,
Hollis B. Akins,
Micaela B. Bagley,
Laura Bisigello,
Rebecca A. A. Bowler,
Adam Carnall,
Caitlin M. Casey,
Yingjie Cheng,
Nikko J. Cleri,
Luca Costantin
, et al. (32 additional authors not shown)
Abstract:
We present a sample of 341 "little red dots" (LRDs) spanning the redshift range $z\sim2-11$ using data from the CEERS, PRIMER, JADES, UNCOVER and NGDEEP surveys. Unlike past use of color indices to identify LRDs, we employ continuum slope fitting using shifting bandpasses to sample the same rest-frame emission blueward and redward of the Balmer break. This enables the detection of LRDs over a wide…
▽ More
We present a sample of 341 "little red dots" (LRDs) spanning the redshift range $z\sim2-11$ using data from the CEERS, PRIMER, JADES, UNCOVER and NGDEEP surveys. Unlike past use of color indices to identify LRDs, we employ continuum slope fitting using shifting bandpasses to sample the same rest-frame emission blueward and redward of the Balmer break. This enables the detection of LRDs over a wider redshift range and with less contamination from galaxies with strong breaks that otherwise lack a rising red continuum. The redshift distribution of our sample increases at $z<8$ and then undergoes a rapid decline at $z\sim4.5$, which may tie the emergence of these sources to the inside-out growth that galaxies experience during this epoch. We find that LRDs are $\sim1$ dex more numerous than X-ray and UV selected AGN at z~5-7. Within our sample, we have identified the first two X-ray detected LRDs. An X-ray spectral analysis confirms that these AGN are moderately obscured with $\log\,(N_{\rm H}/{\rm cm}^{2}$) of $23.3^{+0.4}_{-1.3}$ and $22.72^{+0.13}_{-0.16}$. Our analysis reveals that reddened AGN emission dominates their rest-optical light, while the rest-UV originates from their host galaxies. We also present NIRSpec observations from the RUBIES survey of 17 LRDs that show broad emission lines consistent with AGN activity. The confirmed AGN fraction of our sample is 71\% for sources with F444W<26.5. In addition, we find three LRDs with blue-shifted Balmer absorption features in their spectra, suggesting an outflow of high-density, low-ionization gas from near the central engine of these faint, red AGN.
△ Less
Submitted 20 January, 2025; v1 submitted 4 April, 2024;
originally announced April 2024.
-
Search for a sub-eV sterile neutrino using Daya Bay's full dataset
Authors:
F. P. An,
W. D. Bai,
A. B. Balantekin,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
H. Y. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
Z. Y. Chen,
J. Cheng,
Y. C. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng,
X. Y. Ding,
Y. Y. Ding
, et al. (176 additional authors not shown)
Abstract:
This Letter presents results of a search for the mixing of a sub-eV sterile neutrino with three active neutrinos based on the full data sample of the Daya Bay Reactor Neutrino Experiment, collected during 3158 days of detector operation, which contains $5.55 \times 10^{6}$ reactor \anue candidates identified as inverse beta-decay interactions followed by neutron-capture on gadolinium. The analysis…
▽ More
This Letter presents results of a search for the mixing of a sub-eV sterile neutrino with three active neutrinos based on the full data sample of the Daya Bay Reactor Neutrino Experiment, collected during 3158 days of detector operation, which contains $5.55 \times 10^{6}$ reactor \anue candidates identified as inverse beta-decay interactions followed by neutron-capture on gadolinium. The analysis benefits from a doubling of the statistics of our previous result and from improvements of several important systematic uncertainties.
No significant oscillation due to mixing of a sub-eV sterile neutrino with active neutrinos was found. Exclusion limits are set by both Feldman-Cousins and CLs methods.
Light sterile neutrino mixing with $\sin^2 2θ_{14} \gtrsim 0.01$ can be excluded at 95\% confidence level in the region of $0.01$ eV$^2 \lesssim |Δm^{2}_{41}| \lesssim 0.1 $ eV$^2$. This result represents the world-leading constraints in the region of $2 \times 10^{-4}$ eV$^2 \lesssim |Δm^{2}_{41}| \lesssim 0.2 $ eV$^2$.
△ Less
Submitted 20 August, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Mining Sequential Patterns in Uncertain Databases Using Hierarchical Index Structure
Authors:
Kashob Kumar Roy,
Md Hasibul Haque Moon,
Md Mahmudur Rahman,
Chowdhury Farhan Ahmed,
Carson K. Leung
Abstract:
In this uncertain world, data uncertainty is inherent in many applications and its importance is growing drastically due to the rapid development of modern technologies. Nowadays, researchers have paid more attention to mine patterns in uncertain databases. A few recent works attempt to mine frequent uncertain sequential patterns. Despite their success, they are incompetent to reduce the number of…
▽ More
In this uncertain world, data uncertainty is inherent in many applications and its importance is growing drastically due to the rapid development of modern technologies. Nowadays, researchers have paid more attention to mine patterns in uncertain databases. A few recent works attempt to mine frequent uncertain sequential patterns. Despite their success, they are incompetent to reduce the number of false-positive pattern generation in their mining process and maintain the patterns efficiently. In this paper, we propose multiple theoretically tightened pruning upper bounds that remarkably reduce the mining space. A novel hierarchical structure is introduced to maintain the patterns in a space-efficient way. Afterward, we develop a versatile framework for mining uncertain sequential patterns that can effectively handle weight constraints as well. Besides, with the advent of incremental uncertain databases, existing works are not scalable. There exist several incremental sequential pattern mining algorithms, but they are limited to mine in precise databases. Therefore, we propose a new technique to adapt our framework to mine patterns when the database is incremental. Finally, we conduct extensive experiments on several real-life datasets and show the efficacy of our framework in different applications.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Mining Weighted Sequential Patterns in Incremental Uncertain Databases
Authors:
Kashob Kumar Roy,
Md Hasibul Haque Moon,
Md Mahmudur Rahman,
Chowdhury Farhan Ahmed,
Carson Kai-Sang Leung
Abstract:
Due to the rapid development of science and technology, the importance of imprecise, noisy, and uncertain data is increasing at an exponential rate. Thus, mining patterns in uncertain databases have drawn the attention of researchers. Moreover, frequent sequences of items from these databases need to be discovered for meaningful knowledge with great impact. In many real cases, weights of items and…
▽ More
Due to the rapid development of science and technology, the importance of imprecise, noisy, and uncertain data is increasing at an exponential rate. Thus, mining patterns in uncertain databases have drawn the attention of researchers. Moreover, frequent sequences of items from these databases need to be discovered for meaningful knowledge with great impact. In many real cases, weights of items and patterns are introduced to find interesting sequences as a measure of importance. Hence, a constraint of weight needs to be handled while mining sequential patterns. Besides, due to the dynamic nature of databases, mining important information has become more challenging. Instead of mining patterns from scratch after each increment, incremental mining algorithms utilize previously mined information to update the result immediately. Several algorithms exist to mine frequent patterns and weighted sequences from incremental databases. However, these algorithms are confined to mine the precise ones. Therefore, we have developed an algorithm to mine frequent sequences in an uncertain database in this work. Furthermore, we have proposed two new techniques for mining when the database is incremental. Extensive experiments have been conducted for performance evaluation. The analysis showed the efficiency of our proposed framework.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
Authors:
Qingping Sun,
Yanjun Wang,
Ailing Zeng,
Wanqi Yin,
Chen Wei,
Wenjia Wang,
Haiyi Mei,
Chi Sing Leung,
Ziwei Liu,
Lei Yang,
Zhongang Cai
Abstract:
Expressive human pose and shape estimation (a.k.a. 3D whole-body mesh recovery) involves the human body, hand, and expression estimation. Most existing methods have tackled this task in a two-stage manner, first detecting the human body part with an off-the-shelf detection model and inferring the different human body parts individually. Despite the impressive results achieved, these methods suffer…
▽ More
Expressive human pose and shape estimation (a.k.a. 3D whole-body mesh recovery) involves the human body, hand, and expression estimation. Most existing methods have tackled this task in a two-stage manner, first detecting the human body part with an off-the-shelf detection model and inferring the different human body parts individually. Despite the impressive results achieved, these methods suffer from 1) loss of valuable contextual information via cropping, 2) introducing distractions, and 3) lacking inter-association among different persons and body parts, inevitably causing performance degradation, especially for crowded scenes. To address these issues, we introduce a novel all-in-one-stage framework, AiOS, for multiple expressive human pose and shape recovery without an additional human detection step. Specifically, our method is built upon DETR, which treats multi-person whole-body mesh recovery task as a progressive set prediction problem with various sequential detection. We devise the decoder tokens and extend them to our task. Specifically, we first employ a human token to probe a human location in the image and encode global features for each instance, which provides a coarse location for the later transformer block. Then, we introduce a joint-related token to probe the human joint in the image and encoder a fine-grained local feature, which collaborates with the global feature to regress the whole-body mesh. This straightforward but effective model outperforms previous state-of-the-art methods by a 9% reduction in NMVE on AGORA, a 30% reduction in PVE on EHF, a 10% reduction in PVE on ARCTIC, and a 3% reduction in PVE on EgoBody.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
TJCCT: A Two-timescale Approach for UAV-assisted Mobile Edge Computing
Authors:
Zemin Sun,
Geng Sun,
Qingqing Wu,
Long He,
Shuang Liang,
Hongyang Pan,
Dusit Niyato,
Chau Yuen,
Victor C. M. Leung
Abstract:
Unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) is emerging as a promising paradigm to provide aerial-terrestrial computing services in close proximity to mobile devices (MDs). However, meeting the demands of computation-intensive and delay-sensitive tasks for MDs poses several challenges, including the demand-supply contradiction between MDs and MEC servers, the demand-supply h…
▽ More
Unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) is emerging as a promising paradigm to provide aerial-terrestrial computing services in close proximity to mobile devices (MDs). However, meeting the demands of computation-intensive and delay-sensitive tasks for MDs poses several challenges, including the demand-supply contradiction between MDs and MEC servers, the demand-supply heterogeneity between MDs and MEC servers, the trajectory control requirements on energy efficiency and timeliness, and the different time-scale dynamics of the network. To address these issues, we first present a hierarchical architecture by incorporating terrestrial-aerial computing capabilities and leveraging UAV flexibility. Furthermore, we formulate a joint computing resource allocation, computation offloading, and trajectory control problem to maximize the system utility. Since the problem is a non-convex and NP-hard mixed integer nonlinear programming (MINLP), we propose a two-timescale joint computing resource allocation, computation offloading, and trajectory control (TJCCT) approach for solving the problem. In the short timescale, we propose a price-incentive model for on-demand computing resource allocation and a matching mechanism-based method for computation offloading. In the long timescale, we propose a convex optimization-based method for UAV trajectory control. Besides, we theoretically prove the stability, optimality, and polynomial complexity of TJCCT. Extended simulation results demonstrate that the proposed TJCCT outperforms the comparative algorithms in terms of the system utility, average processing rate, average completion delay, and average completion ratio.
△ Less
Submitted 23 March, 2024;
originally announced March 2024.
-
A VLBI Software Correlator for Fast Radio Transients
Authors:
Calvin Leung,
Shion Andrew,
Kiyoshi W. Masui,
Charanjot Brar,
Tomas Cassanelli,
Shami Chatterjee,
Victoria Kaspi,
Kholoud Khairy,
Adam E. Lanman,
Mattias Lazda,
Juan Mena-Parra,
Gavin Noble,
Aaron B. Pearlman,
Mubdi Rahman,
Pranav Sanghavi,
Vishwangi Shah
Abstract:
One major goal in fast radio burst science is to detect fast radio bursts (FRBs) over a wide field of view without sacrificing the angular resolution required to pinpoint them to their host galaxies. Wide-field detection and localization capabilities have already been demonstrated using connected-element interferometry; the CHIME/FRB Outriggers project will push this further using widefield cylind…
▽ More
One major goal in fast radio burst science is to detect fast radio bursts (FRBs) over a wide field of view without sacrificing the angular resolution required to pinpoint them to their host galaxies. Wide-field detection and localization capabilities have already been demonstrated using connected-element interferometry; the CHIME/FRB Outriggers project will push this further using widefield cylindrical telescopes as widefield outriggers for very long baseline interferometry (VLBI). This paper describes an offline VLBI software correlator written in Python for the CHIME/FRB Outriggers project. It includes features well-suited to modern widefield instruments like multibeaming/multiple phase center correlation, pulse gating including coherent dedispersion, and a novel correlation algorithm based on the quadratic estimator formalism. This algorithm mitigates sensitivity loss which arises in instruments where the windowing and channelization is done outside the VLBI correlator at each station, which accounts for a 30 percent sensitivity drop away from the phase center. Our correlation algorithm recovers this sensitivity on both simulated and real data. As an end to end check of our software, we have written a preliminary pipeline for VLBI calibration and single-pulse localization, which we use in Lanman et al. (2024) to verify the astrometric accuracy of the CHIME/FRB Outriggers array.
△ Less
Submitted 26 March, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
MolNexTR: A Generalized Deep Learning Model for Molecular Image Recognition
Authors:
Yufan Chen,
Ching Ting Leung,
Yong Huang,
Jianwei Sun,
Hao Chen,
Hanyu Gao
Abstract:
In the field of chemical structure recognition, the task of converting molecular images into machine-readable data formats such as SMILES string stands as a significant challenge, primarily due to the varied drawing styles and conventions prevalent in chemical literature. To bridge this gap, we proposed MolNexTR, a novel image-to-graph deep learning model that collaborates to fuse the strengths of…
▽ More
In the field of chemical structure recognition, the task of converting molecular images into machine-readable data formats such as SMILES string stands as a significant challenge, primarily due to the varied drawing styles and conventions prevalent in chemical literature. To bridge this gap, we proposed MolNexTR, a novel image-to-graph deep learning model that collaborates to fuse the strengths of ConvNext, a powerful Convolutional Neural Network variant, and Vision-TRansformer. This integration facilitates a more detailed extraction of both local and global features from molecular images. MolNexTR can predict atoms and bonds simultaneously and understand their layout rules. It also excels at flexibly integrating symbolic chemistry principles to discern chirality and decipher abbreviated structures. We further incorporate a series of advanced algorithms, including an improved data augmentation module, an image contamination module, and a post-processing module for getting the final SMILES output. These modules cooperate to enhance the model's robustness to diverse styles of molecular images found in real literature. In our test sets, MolNexTR has demonstrated superior performance, achieving an accuracy rate of 81-97%, marking a significant advancement in the domain of molecular structure recognition.
△ Less
Submitted 27 August, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
Computer-Controlled 3D Freeform Surface Weaving
Authors:
Xiangjia Chen,
Lip M. Lai,
Zishun Liu,
Chengkai Dai,
Isaac C. W. Leung,
Charlie C. L. Wang,
Yeung Yam
Abstract:
In this paper, we present a new computer-controlled weaving technology that enables the fabrication of woven structures in the shape of given 3D surfaces by using threads in non-traditional materials with high bending-stiffness, allowing for multiple applications with the resultant woven fabrics. A new weaving machine and a new manufacturing process are developed to realize the function of 3D surf…
▽ More
In this paper, we present a new computer-controlled weaving technology that enables the fabrication of woven structures in the shape of given 3D surfaces by using threads in non-traditional materials with high bending-stiffness, allowing for multiple applications with the resultant woven fabrics. A new weaving machine and a new manufacturing process are developed to realize the function of 3D surface weaving by the principle of short-row shaping. A computational solution is investigated to convert input 3D freeform surfaces into the corresponding weaving operations (indicated as W-code) to guide the operation of this system. A variety of examples using cotton threads, conductive threads and optical fibres are fabricated by our prototype system to demonstrate its functionality.
△ Less
Submitted 8 May, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Questioning whether seasonal advance of intense tropical cyclones since the 1980s truly exists
Authors:
Jimin Liu,
Jeremy Cheuk-Hin Leung,
Wenshou Tian,
Hong Huang,
Daosheng Xu,
Weijing Li,
Weihong Qian,
Banglin Zhang
Abstract:
Shan et al. recently reported significant seasonal advances of intense tropical cyclones (TCs) in both the Northern Hemisphere (NH) and Southern Hemisphere (SH) since the 1980s and emphasized the data insensitivity of this conclusion, based on the advanced Dvorak Technique-Hurricane Satellite (ADT-HURSAT) and the International Best Track Archive for Climate Stewardship (IBTrACS) datasets. However,…
▽ More
Shan et al. recently reported significant seasonal advances of intense tropical cyclones (TCs) in both the Northern Hemisphere (NH) and Southern Hemisphere (SH) since the 1980s and emphasized the data insensitivity of this conclusion, based on the advanced Dvorak Technique-Hurricane Satellite (ADT-HURSAT) and the International Best Track Archive for Climate Stewardship (IBTrACS) datasets. However, this conclusion contradicts with previous research and our recent findings. Our analyses reveal that both the magnitudes and statistical significance of seasonal advancing trends of intense TCs are sensitive to the choice of datasets. These inconsistencies primarily arise from the differences in intense TC cases identified from the two datasets, which is attributed to the uncertainties of ADT-HURSAT in estimating TCs' lifetime maximum intensities (LMIs). According to the IBTrACS records, we find that no significant seasonal advancing trends of intense TCs were observed in both hemispheres since the 1980s. These findings raise doubts about the validity of Shan et al.'s conclusions regarding the seasonal advance of intense TCs. We argue that the reported seasonal advance of intense TCs since the 1980s is inconclusive and further investigations with alternative datasets and approaches are needed.
△ Less
Submitted 7 January, 2025; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Unveiling the Potential of Robustness in Selecting Conditional Average Treatment Effect Estimators
Authors:
Yiyan Huang,
Cheuk Hang Leung,
Siyi Wang,
Yijun Li,
Qi Wu
Abstract:
The growing demand for personalized decision-making has led to a surge of interest in estimating the Conditional Average Treatment Effect (CATE). Various types of CATE estimators have been developed with advancements in machine learning and causal inference. However, selecting the desirable CATE estimator through a conventional model validation procedure remains impractical due to the absence of c…
▽ More
The growing demand for personalized decision-making has led to a surge of interest in estimating the Conditional Average Treatment Effect (CATE). Various types of CATE estimators have been developed with advancements in machine learning and causal inference. However, selecting the desirable CATE estimator through a conventional model validation procedure remains impractical due to the absence of counterfactual outcomes in observational data. Existing approaches for CATE estimator selection, such as plug-in and pseudo-outcome metrics, face two challenges. First, they must determine the metric form and the underlying machine learning models for fitting nuisance parameters (e.g., outcome function, propensity function, and plug-in learner). Second, they lack a specific focus on selecting a robust CATE estimator. To address these challenges, this paper introduces a Distributionally Robust Metric (DRM) for CATE estimator selection. The proposed DRM is nuisance-free, eliminating the need to fit models for nuisance parameters, and it effectively prioritizes the selection of a distributionally robust CATE estimator. The experimental results validate the effectiveness of the DRM method in selecting CATE estimators that are robust to the distribution shift incurred by covariate shift and hidden confounders.
△ Less
Submitted 31 October, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
A pulsar-like swing in the polarisation position angle of a nearby fast radio burst
Authors:
Ryan Mckinven,
Mohit Bhardwaj,
Tarraneh Eftekhari,
Charles D. Kilpatrick,
Aida Kirichenko,
Arpan Pal,
Amanda M. Cook,
B. M. Gaensler,
Utkarsh Giri,
Victoria M. Kaspi,
Daniele Michilli,
Kenzie Nimmo,
Aaron B. Pearlman,
Ziggy Pleunis,
Ketan R. Sand,
Ingrid Stairs,
Bridget C. Andersen,
Shion Andrew,
Kevin Bandura,
Charanjot Brar,
Tomas Cassanelli,
Shami Chatterjee,
Alice P. Curtin,
Fengqiu Adam Dong,
Gwendolyn Eadie
, et al. (19 additional authors not shown)
Abstract:
Fast radio bursts (FRBs) last for milliseconds and arrive at Earth from cosmological distances. While their origin(s) and emission mechanism(s) are presently unknown, their signals bear similarities with the much less luminous radio emission generated by pulsars within our Galaxy and several lines of evidence point toward neutron star origins. For pulsars, the linear polarisation position angle (P…
▽ More
Fast radio bursts (FRBs) last for milliseconds and arrive at Earth from cosmological distances. While their origin(s) and emission mechanism(s) are presently unknown, their signals bear similarities with the much less luminous radio emission generated by pulsars within our Galaxy and several lines of evidence point toward neutron star origins. For pulsars, the linear polarisation position angle (PA) often exhibits evolution over the pulse phase that is interpreted within a geometric framework known as the rotating vector model (RVM). Here, we report on a fast radio burst, FRB 20221022A, detected by the Canadian Hydrogen Intensity Mapping Experiment (CHIME) and localized to a nearby host galaxy ($\sim 65\; \rm{Mpc}$), MCG+14-02-011. This one-off FRB displays a $\sim 130$ degree rotation of its PA over its $\sim 2.5\; \rm{ms}$ burst duration, closely resembling the "S"-shaped PA evolution commonly seen from pulsars and some radio magnetars. The PA evolution disfavours emission models involving shocks far from the source and instead suggests magnetospheric origins for this source which places the emission region close to the FRB central engine, echoing similar conclusions drawn from tempo-polarimetric studies of some repeating sources. This FRB's PA evolution is remarkably well-described by the RVM and, although we cannot determine the inclination and magnetic obliquity due to the unknown period/duty cycle of the source, we can dismiss extremely short-period pulsars (e.g., recycled millisecond pulsars) as potential progenitors. RVM-fitting appears to favour a source occupying a unique position in the period/duty cycle phase space that implies tight opening angles for the beamed emission, significantly reducing burst energy requirements of the source.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
CHIME/FRB Outriggers: KKO Station System and Commissioning Results
Authors:
Adam E. Lanman,
Shion Andrew,
Mattias Lazda,
Vishwangi Shah,
Mandana Amiri,
Arvind Balasubramanian,
Kevin Bandura,
P. J. Boyle,
Charanjot Brar,
Mark Carlson,
Jean-François Cliche,
Nina Gusinskaia,
Ian T. Hendricksen,
J. F. Kaczmarek,
Tom Landecker,
Calvin Leung,
Ryan Mckinven,
Juan Mena-Parra,
Nikola Milutinovic,
Kenzie Nimmo,
Aaron B. Pearlman,
Andre Renard,
Mubdi Rahman,
J. Richard Shaw,
Seth R. Siegel
, et al. (21 additional authors not shown)
Abstract:
Localizing fast radio bursts (FRBs) to their host galaxies is an essential step to better understanding their origins and using them as cosmic probes. The CHIME/FRB Outrigger program aims to add VLBI-localization capabilities to CHIME, such that FRBs may be localized to tens of milliarcsecond precision at the time of their discovery, more than sufficient for host galaxy identification. The first-b…
▽ More
Localizing fast radio bursts (FRBs) to their host galaxies is an essential step to better understanding their origins and using them as cosmic probes. The CHIME/FRB Outrigger program aims to add VLBI-localization capabilities to CHIME, such that FRBs may be localized to tens of milliarcsecond precision at the time of their discovery, more than sufficient for host galaxy identification. The first-built outrigger telescope is KKO, located 66 kilometers west of CHIME. Cross-correlating KKO with CHIME can achieve arcsecond-scale localization in right ascension while avoiding the worst effects of the ionosphere. This paper presents measurements of KKO's performance throughout its commissioning phase, as well as a summary of its design and function. We demonstrate KKO's capabilities as a standalone instrument by producing full-sky images, mapping the angular and frequency structure of the primary beam, and measuring feed positions. To demonstrate the localization capabilities of the CHIME -- KKO baseline, we collected five separate observations each for a set of twenty bright pulsars, and aimed to measure their positions to within 5~arcseconds. All of these pulses were successfully localized to within this specification. The next two outriggers are expected to be commissioned in 2024, and will enable subarcsecond localizations for approximately hundreds of FRBs each year.
△ Less
Submitted 29 May, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
TASER: Temporal Adaptive Sampling for Fast and Accurate Dynamic Graph Representation Learning
Authors:
Gangda Deng,
Hongkuan Zhou,
Hanqing Zeng,
Yinglong Xia,
Christopher Leung,
Jianbo Li,
Rajgopal Kannan,
Viktor Prasanna
Abstract:
Recently, Temporal Graph Neural Networks (TGNNs) have demonstrated state-of-the-art performance in various high-impact applications, including fraud detection and content recommendation. Despite the success of TGNNs, they are prone to the prevalent noise found in real-world dynamic graphs like time-deprecated links and skewed interaction distribution. The noise causes two critical issues that sign…
▽ More
Recently, Temporal Graph Neural Networks (TGNNs) have demonstrated state-of-the-art performance in various high-impact applications, including fraud detection and content recommendation. Despite the success of TGNNs, they are prone to the prevalent noise found in real-world dynamic graphs like time-deprecated links and skewed interaction distribution. The noise causes two critical issues that significantly compromise the accuracy of TGNNs: (1) models are supervised by inferior interactions, and (2) noisy input induces high variance in the aggregated messages. However, current TGNN denoising techniques do not consider the diverse and dynamic noise pattern of each node. In addition, they also suffer from the excessive mini-batch generation overheads caused by traversing more neighbors. We believe the remedy for fast and accurate TGNNs lies in temporal adaptive sampling. In this work, we propose TASER, the first adaptive sampling method for TGNNs optimized for accuracy, efficiency, and scalability. TASER adapts its mini-batch selection based on training dynamics and temporal neighbor selection based on the contextual, structural, and temporal properties of past interactions. To alleviate the bottleneck in mini-batch generation, TASER implements a pure GPU-based temporal neighbor finder and a dedicated GPU feature cache. We evaluate the performance of TASER using two state-of-the-art backbone TGNNs. On five popular datasets, TASER outperforms the corresponding baselines by an average of 2.3% in Mean Reciprocal Rank (MRR) while achieving an average of 5.1x speedup in training time.
△ Less
Submitted 23 November, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.