-
The Simons Observatory: Science Goals and Forecasts for the Enhanced Large Aperture Telescope
Authors:
The Simons Observatory Collaboration,
M. Abitbol,
I. Abril-Cabezas,
S. Adachi,
P. Ade,
A. E. Adler,
P. Agrawal,
J. Aguirre,
Z. Ahmed,
S. Aiola,
T. Alford,
A. Ali,
D. Alonso,
M. A. Alvarez,
R. An,
K. Arnold,
P. Ashton,
Z. Atkins,
J. Austermann,
S. Azzoni,
C. Baccigalupi,
A. Baleato Lizancos,
D. Barron,
P. Barry,
J. Bartlett,
et al. (397 additional authors not shown)
Abstract:
We describe updated scientific goals for the wide-field, millimeter-wave survey that will be produced by the Simons Observatory (SO). Significant upgrades to the 6-meter SO Large Aperture Telescope (LAT) are expected to be complete by 2028, and will include a doubled mapping speed with 30,000 new detectors and an automated data reduction pipeline. In addition, a new photovoltaic array will supply most of the observatory's power. The LAT survey will cover about 60% of the sky at a regular observing cadence, with five times the angular resolution and ten times the map depth of Planck. The science goals are to: (1) determine the physical conditions in the early universe and constrain the existence of new light particles; (2) measure the integrated distribution of mass, electron pressure, and electron momentum in the late-time universe, and, in combination with optical surveys, determine the neutrino mass and the effects of dark energy via tomographic measurements of the growth of structure at $z < 3$; (3) measure the distribution of electron density and pressure around galaxy groups and clusters, and calibrate the effects of energy input from galaxy formation on the surrounding environment; (4) produce a sample of more than 30,000 galaxy clusters, and more than 100,000 extragalactic millimeter sources, including regularly sampled AGN light-curves, to study these sources and their emission physics; (5) measure the polarized emission from magnetically aligned dust grains in our Galaxy, to study the properties of dust and the role of magnetic fields in star formation; (6) constrain asteroid regoliths, search for Trans-Neptunian Objects, and either detect or eliminate large portions of the phase space in the search for Planet 9; and (7) provide a powerful new window into the transient universe on time scales of minutes to years, concurrent with observations from Rubin of overlapping sky.
Submitted 15 March, 2025; v1 submitted 1 March, 2025;
originally announced March 2025.
-
A Novel Bispectrum Estimator of the Kinematic Sunyaev-Zel'dovich Effect using Projected Fields
Authors:
Raagini Patki,
Nicholas Battaglia,
J. Colin Hill
Abstract:
With the advent of current and future high-resolution CMB experiments, the kinematic Sunyaev-Zel'dovich (kSZ) effect has become a unique observational probe of the distribution of baryons and velocities in the Universe. In this work, we propose a novel binned bispectrum of the form temperature-temperature-density to extract the late-time kSZ effect from cleaned CMB maps. Unlike 'kSZ tomography' methods, this estimator can use any tracer of the large-scale structure density field projected along the line-of-sight and does not require individual redshifts. With our method, we forecast signal-to-noise ratios (SNR) of $\sim$100-200 for the upcoming Simons Observatory (SO) and CMB-S4 correlated with a galaxy sample from WISE that is restricted to the linear regime. We also extend galaxy modes into the non-linear regime and explore this harmonic space to show that the SNR peaks for squeezed triangles that have a short (linear) density mode and long temperature modes in harmonic space. The existing kSZ$^{2}$-density projected-fields estimator compresses the rich information contained in this bispectrum across various scales and triangle shapes. Moreover, we find that the lensing correction to our estimator's signal is relatively small. We study the dependence of this kSZ signal on $Λ$CDM parameters for SO and CMB-S4 and forecast initial constraints on the sum of neutrino masses while restricting to the linear galaxy bias regime. Our work illustrates the potential of the projected-fields kSZ bispectrum as a novel probe of baryonic abundance and beyond-$Λ$CDM cosmology with upcoming precision measurements.
Submitted 18 November, 2024;
originally announced November 2024.
-
AmazonQAC: A Large-Scale, Naturalistic Query Autocomplete Dataset
Authors:
Dante Everaert,
Rohit Patki,
Tianqi Zheng,
Christopher Potts
Abstract:
Query Autocomplete (QAC) is a critical feature in modern search engines, facilitating user interaction by predicting search queries based on input prefixes. Despite its widespread adoption, the absence of large-scale, realistic datasets has hindered advancements in QAC system development. This paper addresses this gap by introducing AmazonQAC, a new QAC dataset sourced from Amazon Search logs, comprising 395M samples. The dataset includes actual sequences of user-typed prefixes leading to final search terms, as well as session IDs and timestamps that support modeling the context-dependent aspects of QAC. We assess Prefix Trees, semantic retrieval, and Large Language Models (LLMs) with and without finetuning. We find that finetuned LLMs perform best, particularly when incorporating contextual information. However, even our best system achieves only half of what we calculate is theoretically possible on our test data, which implies QAC is a challenging problem that is far from solved with existing systems. This contribution aims to stimulate further research on QAC systems to better serve user needs in diverse environments. We open-source this data on Hugging Face at https://huggingface.co/datasets/amazon/AmazonQAC.
Submitted 22 October, 2024;
originally announced November 2024.
-
Retrieval Augmented Spelling Correction for E-Commerce Applications
Authors:
Xuan Guo,
Rohit Patki,
Dante Everaert,
Christopher Potts
Abstract:
The rapid introduction of new brand names into everyday language poses a unique challenge for e-commerce spelling correction services, which must distinguish genuine misspellings from novel brand names that use unconventional spelling. We seek to address this challenge via Retrieval Augmented Generation (RAG). On this approach, product names are retrieved from a catalog and incorporated into the context used by a large language model (LLM) that has been fine-tuned to do contextual spelling correction. Through quantitative evaluation and qualitative error analyses, we find improvements in spelling correction utilizing the RAG framework beyond a stand-alone LLM. We also demonstrate the value of additional finetuning of the LLM to incorporate retrieved context.
Submitted 15 October, 2024;
originally announced October 2024.
-
Improved Modeling of the Kinematic Sunyaev-Zel'dovich Projected-Fields signal and its Cosmological Dependence
Authors:
Raagini Patki,
Nicholas Battaglia,
Simone Ferraro
Abstract:
Over the past decade, the kinematic Sunyaev-Zel'dovich (kSZ) effect has emerged as an observational probe of the distribution of baryons and velocity fields in the late Universe. Of the many ways to detect the kSZ, the 'projected-fields kSZ estimator' has the promising feature of not being limited to galaxy samples with accurate redshifts. The current theoretical modeling of this estimator involves an approximate treatment only applicable at small scales. As the measurement fidelity rapidly improves, we find it necessary to move beyond the original treatment and hence derive an improved theoretical model for this estimator without these previous approximations. We show that the differences between the predicted signal from the two models are scale-dependent and will be significant for future measurements from the Simons Observatory and CMB-S4 in combination with galaxy data from WISE or the Rubin Observatory, which have high forecasted signal-to-noise ratios ($>100$). Thus, adopting our improved model in future analyses will be important to avoid biases. Equipped with our model, we explore the cosmological dependence of this kSZ signal for future measurements. With a Planck prior, residual uncertainty on $Λ$CDM parameters leads to $\sim7\%$ marginalized uncertainties on the signal amplitude, compared to a sub-percent level forecasted with a fixed cosmology. To illustrate the potential of this kSZ estimator as a cosmological probe, we forecast initial constraints on $Λ$CDM parameters and the sum of neutrino masses, paving the way for jointly fitting both baryonic astrophysics and cosmology in future analyses.
Submitted 14 June, 2023; v1 submitted 5 June, 2023;
originally announced June 2023.