-
ODIN: The LAE Lyα Luminosity Function over Cosmic Time and Environmental Density
Authors:
Gautam Nagaraj,
Robin Ciardullo,
Caryl Gronwall,
Vandana Ramakrishnan,
Kyoung-Soo Lee,
Eric Gawiser,
Nicole M. Firestone,
Govind Ramgopal,
J. Aguilar,
Steven Ahlen,
Davide Bianchi,
David Brooks,
Francisco Javier Castander,
Todd Claybaugh,
Andrei Cuceu,
Axel de la Macorra,
Arjun Dey,
Biprateep Dey,
Peter Doel,
Jaime Forero-Romero,
Enrique Gaztanaga,
Satya Gontcho A Gontcho,
Gaston Gutierrez,
Hiram K. Herrera-Alcantar,
Klaus Honscheid
, et al. (31 additional authors not shown)
Abstract:
The ubiquity and relative ease of discovery make $2\lesssim z\lesssim 5$ Ly$α$ emitting galaxies (LAEs) ideal tracers for cosmology. In addition, because Ly$α$ is a resonance line, but frequently observed at large equivalent width, it is potentially a probe of galaxy evolution. The LAE Ly$α$ luminosity function (LF) is an essential measurement for making progress on both of these aspects. Although several studies have computed the LAE LF, very few have delved into how the function varies with environment. The large area and depth of the One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) survey make such measurements possible at the cosmic noon redshifts of z~2.4, ~3.1, and ~4.5. In this initial work, we present algorithms to rigorously compute the LAE LF and test our methods on the ~16,000 ODIN LAEs found in the extended COSMOS field. Using these limited samples, we find slight evidence that protocluster environments either suppress the numbers of very faint and very bright LAEs or enhance medium-bright LAEs in comparison to the field. We also find that the LF decreases in number density and evolves towards a steeper faint-end slope over cosmic time from z~4.5 to z~2.4.
Submitted 17 June, 2025;
originally announced June 2025.
-
AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation
Authors:
Prashanth Vijayaraghavan,
Luyao Shi,
Ehsan Degan,
Vandana Mukherjee,
Xin Zhang
Abstract:
Analog circuit topology synthesis is integral to Electronic Design Automation (EDA), enabling the automated creation of circuit structures tailored to specific design requirements. However, the vast design search space and strict constraint adherence make efficient synthesis challenging. Leveraging the versatility of Large Language Models (LLMs), we propose AUTOCIRCUIT-RL, a novel reinforcement learning (RL)-based framework for automated analog circuit synthesis. The framework operates in two phases: instruction tuning, where an LLM learns to generate circuit topologies from structured prompts encoding design constraints, and RL refinement, which further improves the instruction-tuned model using reward models that evaluate validity, efficiency, and output voltage. The refined model is then used directly to generate topologies that satisfy the design constraints. Empirical results show that AUTOCIRCUIT-RL generates ~12% more valid circuits and improves efficiency by ~14% compared to the best baselines, while reducing duplicate generation rates by ~38%. It achieves over 60% success in synthesizing valid circuits with limited training data, demonstrating strong generalization. These findings highlight the framework's effectiveness in scaling to complex circuits while maintaining efficiency and constraint adherence, marking a significant advancement in AI-driven circuit design.
Submitted 3 June, 2025;
originally announced June 2025.
-
Towards Machine Unlearning for Paralinguistic Speech Processing
Authors:
Orchid Chetia Phukan,
Girish,
Mohd Mujtaba Akhtar,
Shubham Singh,
Swarup Ranjan Behera,
Vandana Rajan,
Muskaan Singh,
Arun Balaji Buduru,
Rajesh Sharma
Abstract:
In this work, we pioneer the study of Machine Unlearning (MU) for Paralinguistic Speech Processing (PSP). We focus on two key PSP tasks: Speech Emotion Recognition (SER) and Depression Detection (DD). To this end, we propose SISA++, a novel extension of the previous state-of-the-art (SOTA) MU method, SISA, that merges models trained on different shards by weight averaging. With this modification, we show that SISA++ preserves performance better than SISA after unlearning on the benchmark SER (CREMA-D) and DD (E-DAIC) datasets. Also, to guide future research and ease the adoption of MU for PSP, we present "cookbook recipes": actionable recommendations for selecting optimal feature representations and downstream architectures that can mitigate performance degradation after the unlearning process.
Submitted 2 June, 2025;
originally announced June 2025.
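Illustrative note on the entry above: the abstract describes merging models trained on different shards by weight averaging. A minimal sketch of that general idea follows, assuming a uniform (unweighted) average and models represented as dictionaries of NumPy arrays. The function name `average_shard_weights` and the toy parameters are hypothetical and are not taken from the paper, which may use a different averaging scheme.

```python
import numpy as np


def average_shard_weights(shard_weights):
    """Uniformly average parameter dicts from models trained on different shards.

    Assumption: every model has the same parameter names and shapes.
    """
    if not shard_weights:
        raise ValueError("need at least one shard model")
    keys = shard_weights[0].keys()
    for weights in shard_weights:
        if weights.keys() != keys:
            raise ValueError("all shard models must share the same parameter names")
    # Element-wise mean over the shard axis for each named parameter.
    return {k: np.mean([w[k] for w in shard_weights], axis=0) for k in keys}


# Toy example with two hypothetical shard models.
shard_a = {"w": np.array([1.0, 3.0]), "b": np.array([0.0])}
shard_b = {"w": np.array([3.0, 5.0]), "b": np.array([2.0])}
merged = average_shard_weights([shard_a, shard_b])
```

In a SISA-style setup, a deletion request would retrain only the shard that held the removed data, after which the average would be recomputed. That retraining step is not shown here.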
-
DeepTopoNet: A Framework for Subglacial Topography Estimation on the Greenland Ice Sheets
Authors:
Bayu Adhi Tama,
Mansa Krishna,
Homayra Alam,
Mostafa Cham,
Omar Faruque,
Gong Cheng,
Jianwu Wang,
Mathieu Morlighem,
Vandana Janeja
Abstract:
Understanding Greenland's subglacial topography is critical for projecting the future mass loss of the ice sheet and its contribution to global sea-level rise. However, the complex and sparse nature of observational data, particularly information about the bed topography under the ice sheet, significantly increases the uncertainty in model projections. Bed topography is traditionally measured by airborne ice-penetrating radar that measures the ice thickness directly underneath the aircraft, leaving data gaps of tens of kilometers between flight lines. This study introduces a deep learning framework, which we call DeepTopoNet, that integrates radar-derived ice thickness observations and BedMachine Greenland data through a novel dynamic loss-balancing mechanism. Among all efforts to reconstruct bed topography, BedMachine has emerged as one of the most widely used datasets, combining mass conservation principles and ice thickness measurements to generate high-resolution bed elevation estimates. The proposed loss function adaptively adjusts the weighting between radar and BedMachine data, ensuring robustness in areas with limited radar coverage while leveraging the high spatial resolution of BedMachine predictions (i.e., bed estimates). Our approach incorporates gradient-based and trend surface features to enhance model performance and utilizes a CNN architecture designed for subgrid-scale predictions. By systematically testing on the Upernavik Isstrøm region, the model achieves high accuracy, outperforming baseline methods in reconstructing subglacial terrain. This work demonstrates the potential of deep learning in bridging observational gaps, providing a scalable and efficient solution to inferring subglacial topography.
Submitted 29 May, 2025;
originally announced May 2025.
-
DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis
Authors:
Prashanth Vijayaraghavan,
Soroush Vosoughi,
Lamogha Chiazor,
Raya Horesh,
Rogerio Abreu de Paula,
Ehsan Degan,
Vandana Mukherjee
Abstract:
Recent advancements in large language models (LLMs) have revolutionized natural language processing (NLP) and expanded their applications across diverse domains. However, despite their impressive capabilities, LLMs have been shown to reflect and perpetuate harmful societal biases, including those based on ethnicity, gender, and religion. A critical and underexplored issue is the reinforcement of caste-based biases, particularly towards India's marginalized caste groups such as Dalits and Shudras. In this paper, we address this gap by proposing DECASTE, a novel, multi-dimensional framework designed to detect and assess both implicit and explicit caste biases in LLMs. Our approach evaluates caste fairness across four dimensions: socio-cultural, economic, educational, and political, using a range of customized prompting strategies. By benchmarking several state-of-the-art LLMs, we reveal that these models systematically reinforce caste biases, with significant disparities observed in the treatment of oppressed versus dominant caste groups. For example, bias scores are notably elevated when comparing Dalits and Shudras with dominant caste groups, reflecting societal prejudices that persist in model outputs. These results expose the subtle yet pervasive caste biases in LLMs and emphasize the need for more comprehensive and inclusive bias evaluation methodologies that assess the potential risks of deploying such models in real-world contexts.
Submitted 4 June, 2025; v1 submitted 20 May, 2025;
originally announced May 2025.
-
Virgo Filaments V: Disrupting the Baryon Cycle in the NGC 5364 Galaxy Group
Authors:
Rose A. Finn,
Gregory Rudnick,
Pascale Jablonka,
Mpati Ramatsoku,
Gautam Nagaraj,
Benedetta Vulcani,
Rebecca A. Koopmann,
Matteo Fossati,
James Agostino,
Yannick Bahe,
Santiago Garcia-Burillo,
Gianluca Castignani,
Francoise Combes,
Kim Conger,
Gabriella De Lucia,
Vandana Desai,
John Moustakas,
Dara Norman,
Damien Sperone-Longin,
Melinda Townsend,
Lizhi Xie,
Daria Zakharova,
Dennis Zaritsky
Abstract:
The Virgo Filament Survey (VFS) is a comprehensive study of galaxies that reside in the extended filamentary structures surrounding the Virgo Cluster, out to 12 virial radii. The primary goal is to characterize all of the dominant baryonic components within galaxies and to understand whether and how they are affected by the filament environment. A key constituent of VFS is a narrowband H$α$ imaging survey of over 600 galaxies, VFS-H$α$. The H$α$ images reveal detailed, resolved maps of the ionized gas and massive star-formation. This imaging is particularly powerful as a probe of environmentally-induced quenching because different physical processes affect the spatial distribution of star formation in different ways. In this paper, we present the first results from the VFS-H$α$ for the NGC~5364 group, a low-mass ($\log_{10}(M_{dyn}/M_\odot) < 13$) system located at the western edge of the Virgo~III filament. We combine H$α$ imaging with resolved H~I observations from MeerKAT for eight group members. These galaxies exhibit peculiar morphologies, including strong distortions in the stars and the gas, truncated H~I and H$α$ disks, H~I tails, extraplanar H$α$ emission, and off-center H$α$ emission. These signatures are suggestive of environmental processing such as tidal interactions, ram pressure stripping, and starvation. We quantify the role of ram pressure stripping expected in this group, and find that it can explain the cases of H~I tails and truncated H$α$ for all but one of the disk-dominated galaxies. Our observations indicate that multiple physical mechanisms are disrupting the baryon cycle in these group galaxies.
Submitted 14 May, 2025;
originally announced May 2025.
-
Workshop on Combating Biases and Improving Institutional Culture
Authors:
Deepa Chari,
Vandana Nanal
Abstract:
The imposter syndrome, implicit biases, and microaggressions are some of the problems that adversely impact gender minorities in general. This workshop, conducted as a satellite event of the International Conference on Women in Physics 2023, is an attempt to create a platform for elaborating these concepts and sharing mutual experiences within the global Physics community. The larger goal is to develop a better understanding of the environments in Physics institutions worldwide. Through a series of activities and discussions, various exemplar scenarios were presented to participants and subsequent in-depth discussion focused on strategies to combat these issues as well as avenues for professional and academic support.
Submitted 12 May, 2025;
originally announced May 2025.
-
Enhanced hot electron generation from liquid jets in moderate intensity laser-plasma interactions
Authors:
Ratul Sabui,
S. V. Rahul,
Angana Mondal,
Archit Bhardwaj,
Ram Gopal,
Vandana Sharma,
M. Krishnamurthy
Abstract:
We report the generation of MeV temperature electrons using sub-terawatt laser systems with a liquid methanol jet as a target. Remarkably, even at laser intensities of $10^{16}$ W/cm$^2$, liquid cylindrical (2D) 15 micron methanol jets produce electrons with temperatures of 1 MeV. Hot electron emission characteristics are strikingly similar to those observed in spherical microdroplet (3D) targets. These results validate that modeling such experiments using 2D PIC simulation is not a compromising approximation. This work further simplifies the experimental complexities towards a multi-kHz highly regenerative source of directed multi-MeV electron (and associated x-ray and ion) generation, demanding laser intensities 100x lower than conventional laser plasma sources. Increased source energy and pointing stability are crucial for imaging or radiographic applications from such sources.
Submitted 4 April, 2025;
originally announced April 2025.
-
Unique Decay Modes and Signatures of pNGB scalars in $SU(5)/SO(5)$ Composite Higgs Model
Authors:
Nilanjana Kumar,
Vandana Sahdev
Abstract:
The nature of the Higgs boson, whether it is elementary or composite, will be investigated through precision measurements in ongoing experiments. In composite Higgs scenarios, the Higgs may manifest as a pseudo Nambu-Goldstone boson (pNGB) arising from a strongly interacting sector. The $SU(5)/SO(5)$ Composite Higgs Model features a rich scalar sector, with the decay patterns of the scalars being heavily influenced by how fermions are embedded in various representations of $SU(5)$. We discuss how the mass of the pNGB scalars and their couplings depend functionally on the compositeness scale and parameters of the strong sector. Unique decay modes of the scalars emerge from the model when the mixing between the mass and the gauge eigenstates is non-negligible. We present a comprehensive analysis of the fermiophilic and fermiophobic decay modes of the pNGB scalars. One of our main findings is that the decay patterns of the two singly charged scalars differ significantly. Additionally, one pNGB scalar decays to another on-shell pNGB scalar when the masses are more than $\sim 1$ TeV. Both these factors play a significant role in creating highly distinctive signatures in collider experiments. A muon collider presents a promising avenue for detecting pNGB scalars with masses greater than 1 TeV, particularly in final states involving multiple jets.
Submitted 31 March, 2025;
originally announced March 2025.
-
ODIN: Clustering Analysis of 14,000 Lyα Emitting Galaxies at z=2.4, 3.1, and 4.5
Authors:
Danisbel Herrera,
Eric Gawiser,
Barbara Benda,
Nicole Firestone,
Vandana Ramakrishnan,
Byeongha Moon,
Kyoung-Soo Lee,
Changbom Park,
Francisco Valdes,
Yujin Yang,
M. Celeste Artale,
Robin Ciardullo,
Caryl Gronwall,
Lucia Guaita,
Ho Seong Hwang,
Jacob Kennedy,
Ankit Kumar,
Ann Zabludoff
Abstract:
Lyman Alpha Emitters (LAEs) are star-forming galaxies that efficiently probe the spatial distribution of galaxies in the high redshift universe. The spatial clustering of LAEs reflects the properties of their individual host dark matter halos, allowing us to study the evolution of the galaxy-halo connection. We analyze the clustering of 5233, 5220, and 3706 LAEs at z = 2.4, 3.1, and 4.5, respectively, in the 9 deg$^2$ COSMOS field from the One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) survey. After correcting for redshift space distortions, LAE contamination rates, and the integral constraint, the observed angular correlation functions imply linear galaxy bias factors of b = $1.64^{+0.26}_{-0.23}, 1.84^{+0.24}_{-0.22}$, and $2.93^{+0.41}_{-0.36}$, for z = 2.4, 3.1, and 4.5, respectively. The median dark matter halo masses inferred from these measurements are $\log(M_h/M_{\odot}) = 11.34^{+0.30}_{-0.31}, 10.94^{+0.26}_{-0.28}$, and $10.83^{+0.26}_{-0.24}$ for the three samples, respectively. The analysis also reveals that LAEs occupy roughly 3-5% of the halos whose clustering strength matches that of the LAEs. The large contrast between this low halo occupation fraction and the high fraction of continuum-selected star-forming galaxies that exhibit Ly$α$ in emission implies that LAEs are unusually luminous for their dark matter masses.
Submitted 22 March, 2025;
originally announced March 2025.
-
Building Machine Learning Challenges for Anomaly Detection in Science
Authors:
Elizabeth G. Campolongo,
Yuan-Tang Chou,
Ekaterina Govorkova,
Wahid Bhimji,
Wei-Lun Chao,
Chris Harris,
Shih-Chieh Hsu,
Hilmar Lapp,
Mark S. Neubauer,
Josephine Namayanja,
Aneesh Subramanian,
Philip Harris,
Advaith Anand,
David E. Carlyn,
Subhankar Ghosh,
Christopher Lawrence,
Eric Moreno,
Ryan Raikman,
Jiaman Wu,
Ziheng Zhang,
Bayu Adhi,
Mohammad Ahmadi Gharehtoragh,
Saúl Alonso Monsalve,
Marta Babicz,
Furqan Baig
, et al. (125 additional authors not shown)
Abstract:
Scientific discoveries are often made by finding a pattern or object that was not predicted by the known rules of science. Oftentimes, these anomalous events or objects that do not conform to the norms are an indication that the rules of science governing the data are incomplete, and something new needs to be present to explain these unexpected outliers. The challenge of finding anomalies can be confounding since it requires codifying a complete knowledge of the known scientific behaviors and then projecting these known behaviors on the data to look for deviations. When utilizing machine learning, this presents a particular challenge since we require that the model not only understands scientific data perfectly but also recognizes when the data is inconsistent and out of the scope of its trained behavior. In this paper, we present three datasets aimed at developing machine learning-based anomaly detection for disparate scientific domains covering astrophysics, genomics, and polar science. We present the different datasets along with a scheme to make machine learning challenges around the three datasets findable, accessible, interoperable, and reusable (FAIR). Furthermore, we present an approach that generalizes to future machine learning challenges, enabling the possibility of large, more compute-intensive challenges that can ultimately lead to scientific discovery.
Submitted 29 March, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
Exploring $β$ decay and $β$-delayed neutron emission in exotic $^{46,47}$Cl isotopes
Authors:
Vandana Tripathi,
B. Longfellow,
A. Volya,
E. Rubino,
C. Benetti,
J. F. Perello,
S. L. Tabor,
S. N. Liddick,
P. C. Bender,
M. P. Carpenter,
J. J. Carroll,
A. Chester,
C. J. Chiara,
K. Childers,
B. R. Clark,
B. P. Crider,
J. T. Harke,
R. Jain,
S. Luitel,
M. J. Mogannam,
T. H. Ogunbeku,
A. L. Richard,
S. Saha,
O. A. Shehu,
R. Unz
, et al. (2 additional authors not shown)
Abstract:
In this paper, $β^-$ and $β$-delayed neutron decays of $^{46,47}$Cl are reported from an experiment carried out at the National Superconducting Cyclotron Laboratory using the Beta Counting System. The half-lives of both $^{46}$Cl and $^{47}$Cl were extracted. Based on the delayed $γ$-ray transitions observed, the level structure of $N = 28$ $^{46}$Ar was determined. Completely different sets of excited states above the first $2^+$ state in $^{46}$Ar were populated in the $^{46}$Cl $\beta0n$ and $^{47}$Cl $\beta1n$ decay channels. Two new $γ$-ray transitions in $^{47}$Ar were identified from the very weak $^{47}$Cl $\beta0n$ decay. Furthermore, $^{46}$Cl $\beta1n$ and $^{47}$Cl $\beta2n$ were also observed to yield different population patterns for levels in $^{45}$Ar, including states of different parities. The experimental results allow us to address some of the open questions related to the delayed neutron emission process. For isotopes with large neutron excess and high $Q_β$ values, delayed neutron emission remains an important decay mode and can be utilized as a powerful spectroscopic tool. Experimental results were compared with shell-model calculations using the FSU and $V_{MU}$ effective interactions.
Submitted 26 February, 2025;
originally announced February 2025.
-
Advancing climate model interpretability: Feature attribution for Arctic melt anomalies
Authors:
Tolulope Ale,
Nicole-Jeanne Schlegel,
Vandana P. Janeja
Abstract:
The focus of our work is improving the interpretability of anomalies in climate models and advancing our understanding of Arctic melt dynamics. The Arctic and Antarctic ice sheets are experiencing rapid surface melting and increased freshwater runoff, contributing significantly to global sea level rise. Understanding the mechanisms driving snowmelt in these regions is crucial. ERA5, a widely used reanalysis dataset in polar climate studies, offers extensive climate variables and global data assimilation. However, its snowmelt model employs an energy imbalance approach that may oversimplify the complexity of surface melt. In contrast, the Glacier Energy and Mass Balance (GEMB) model incorporates additional physical processes, such as snow accumulation, firn densification, and meltwater percolation/refreezing, providing a more detailed representation of surface melt dynamics. In this research, we focus on analyzing surface snowmelt dynamics of the Greenland Ice Sheet using feature attribution for anomalous melt events in the ERA5 and GEMB models. We present a novel unsupervised attribution method that leverages a counterfactual explanation method to analyze detected anomalies in ERA5 and GEMB. Our anomaly detection results are validated using MEaSUREs ground-truth data, and the attributions are evaluated against established feature ranking methods, including XGBoost, Shapley values, and Random Forest. Our attribution framework identifies the physics behind each model and the climate features driving melt anomalies. These findings demonstrate the utility of our attribution method in enhancing the interpretability of anomalies in climate models and advancing our understanding of Arctic melt dynamics.
Submitted 11 February, 2025;
originally announced February 2025.
-
MorphoITH: A Framework for Deconvolving Intra-Tumor Heterogeneity Using Tissue Morphology
Authors:
Aleksandra Weronika Nielsen,
Hafez Eslami Manoochehri,
Hua Zhong,
Vandana Panwar,
Vipul Jarmale,
Jay Jasti,
Mehrdad Nourani,
Dinesh Rakheja,
James Brugarolas,
Payal Kapur,
Satwik Rajaram
Abstract:
The ability of tumors to evolve and adapt by developing subclones in different genetic and epigenetic states is a major challenge in oncology. Traditional tools like multi-regional sequencing used to study tumor evolution and the resultant intra-tumor heterogeneity (ITH) are often impractical because of their resource-intensiveness and limited scalability. Here, we present MorphoITH, a novel framework that leverages histopathology slides to deconvolve molecular ITH through tissue morphology. MorphoITH integrates a self-supervised deep learning similarity measure to capture phenotypic variation across multiple dimensions (cytology, architecture, and microenvironment) with rigorous methods to eliminate spurious sources of variation. Using a prototype of ITH, clear cell renal cell carcinoma (ccRCC), we show that MorphoITH captures clinically-significant biological features, such as vascular architecture and nuclear grades. Furthermore, we find that MorphoITH recognizes differential biological states corresponding to subclonal changes in key driver genes (BAP1/PBRM1/SETD2). Finally, by applying MorphoITH to a multi-regional sequencing experiment, we postulate evolutionary trajectories that largely recapitulate genetic evolution. In summary, MorphoITH provides a scalable phenotypic lens that bridges the gap between histopathology and genomics, advancing precision oncology.
Submitted 2 February, 2025;
originally announced February 2025.
-
Modeling submillimeter galaxies in cosmological simulations: Contribution to the cosmic star formation density and predictions for future surveys
Authors:
Ankit Kumar,
M. Celeste Artale,
Antonio D. Montero-Dorta,
Lucia Guaita,
Kyoung-Soo Lee,
Alexandra Pope,
Joop Schaye,
Matthieu Schaller,
Eric Gawiser,
Ho Seong Hwang,
Woong-Seob Jeong,
Jaehyun Lee,
Nelson Padilla,
Changbom Park,
Vandana Ramakrishnan,
Akriti Singh,
Yujin Yang
Abstract:
Submillimeter galaxies (SMGs) constitute a key population of bright star-forming galaxies at high redshift. These galaxies challenge galaxy formation models, particularly in reproducing their observed number counts and redshift distributions. Furthermore, although SMGs contribute significantly to the cosmic star formation rate density (SFRD), their precise role remains uncertain. Upcoming surveys, such as the Ultra Deep Survey with the TolTEC camera, are expected to offer valuable insights into SMG properties and their broader impact. Robust modeling of SMGs in a cosmologically representative volume is necessary to investigate their nature in preparation for next-generation submillimeter surveys. We implement and test parametric relations derived from radiative transfer calculations across three cosmological simulations: EAGLE, IllustrisTNG, and FLAMINGO. Particular emphasis is placed on the FLAMINGO simulations due to their large volume and robust statistical sampling of SMGs. Based on the model that best reproduces observations, we forecast submillimeter fluxes within the simulations, analyze the properties of SMGs, and evaluate their evolution over cosmic time. Our results show that FLAMINGO reproduces the observed redshift distribution and source number counts of SMGs without requiring a top-heavy initial mass function. On the other hand, EAGLE and IllustrisTNG show a deficit of bright SMGs. We find that SMGs with $S_{850}$ > 1 mJy contribute up to 27% of the SFRD at z=2.6 in FLAMINGO, consistent with recent observations. Flux density functions reveal a rise in SMG abundance from z = 6 to 2.5, followed by a sharp decline in the number of brighter SMGs from z = 2.5 to 0. Leveraging the SMG population in FLAMINGO, we forecast that the TolTEC UDS will detect 80,000 sources over 0.8 deg$^2$ at 1.1 mm (at the 4σ detection limit), capturing about 50% of the cosmic SFRD at z=2.5.
Submitted 31 January, 2025;
originally announced January 2025.
-
ODIN: Star Formation Histories Reveal Formative Starbursts Experienced by Lyman Alpha Emitting Galaxies at Cosmic Noon
Authors:
Nicole M. Firestone,
Eric Gawiser,
Kartheik G. Iyer,
Kyoung-Soo Lee,
Vandana Ramakrishnan,
Francisco Valdes,
Changbom Park,
Yujin Yang,
Anahita Alavi,
Robin Ciardullo,
Norman Grogin,
Caryl Gronwall,
Lucia Guaita,
Sungryong Hong,
Ho Seong Hwang,
Sang Hyeok Im,
Woong-Seob Jeong,
Seongjae Kim,
Anton M. Koekemoer,
Ankit Kumar,
Jaehyun Lee,
Vihang Mehta,
Gautam Nagaraj,
Julie Nantais,
Laura Prichard
, et al. (5 additional authors not shown)
Abstract:
In this work, we test the frequent assumption that Lyman Alpha Emitting galaxies (LAEs) are experiencing their first major burst of star formation at the time of observation. To this end, we identify 74 LAEs from the ODIN Survey with rest-UV-through-NIR photometry from UVCANDELS. For each LAE, we perform non-parametric star formation history (SFH) reconstruction using the Dense Basis Gaussian process-based method of spectral energy distribution fitting. We find that a strong majority (67%) of our LAE SFHs align with the frequently assumed archetype of a first major star formation burst, with at most modest star formation rates (SFRs) in the past. However, the rest of our LAE SFHs have significant amounts of star formation in the past, with 28% exhibiting earlier bursts of star formation with the ongoing burst having the highest SFR (dominant bursts), and the final 5% having experienced their highest SFR in the past (non-dominant bursts). Combining the SFHs indicating first and dominant bursts, ~95% of LAEs are experiencing their largest burst yet -- a formative burst. We also find that the fraction of total stellar mass created in the last 200 Myr is ~1.3 times higher in LAEs than in mass-matched Lyman Break Galaxy (LBG) samples, and that a majority of LBGs are experiencing dominant bursts, reaffirming that LAEs differ from other star forming galaxies. Overall, our results suggest that multiple evolutionary paths can produce galaxies with strong observed Ly$α$ emission.
Submitted 2 June, 2025; v1 submitted 14 January, 2025;
originally announced January 2025.
-
Matrix-Free Parallel Scalable Multilevel Deflation Preconditioning for Heterogeneous Time-Harmonic Wave Problems
Authors:
Jinqiang Chen,
Vandana Dwarka,
Cornelis Vuik
Abstract:
We present a matrix-free parallel scalable multilevel deflation preconditioned method for heterogeneous time-harmonic wave problems. Building on the higher-order deflation preconditioning proposed by Dwarka and Vuik (SIAM J. Sci. Comput. 42(2):A901-A928, 2020; J. Comput. Phys. 469:111327, 2022) for highly indefinite time-harmonic waves, we adapt these techniques for parallel implementation in the context of solving large-scale heterogeneous problems with minimal pollution error. Our proposed method integrates the Complex Shifted Laplacian preconditioner with deflation approaches. We employ higher-order deflation vectors and re-discretization schemes derived from the Galerkin coarsening approach for a matrix-free parallel implementation. We suggest a robust and efficient configuration of the matrix-free multilevel deflation method, which yields a close to wavenumber-independent convergence and good time efficiency. Numerical experiments demonstrate the effectiveness of our approach for increasingly complex model problems. The matrix-free implementation of the preconditioned Krylov subspace methods reduces memory consumption, and the parallel framework exhibits satisfactory parallel performance and weak parallel scalability. This work represents a significant step towards developing efficient, scalable, and parallel multilevel deflation preconditioning methods for large-scale real-world applications in wave propagation.
Submitted 6 December, 2024;
originally announced December 2024.
-
Listening for Expert Identified Linguistic Features: Assessment of Audio Deepfake Discernment among Undergraduate Students
Authors:
Noshaba N. Bhalli,
Nehal Naqvi,
Chloe Evered,
Christine Mallinson,
Vandana P. Janeja
Abstract:
This paper evaluates the impact of training undergraduate students to improve their audio deepfake discernment ability by listening for expert-defined linguistic features. Such features have been shown to improve performance of AI algorithms; here, we ascertain whether this improvement in AI algorithms also translates to improvement of the perceptual awareness and discernment ability of listeners. With humans as the weakest link in any cybersecurity solution, we propose that listener discernment is a key factor for improving trustworthiness of audio content. In this study we determine whether training that familiarizes listeners with English language variation can improve their abilities to discern audio deepfakes. We focus on undergraduate students, as this demographic group is constantly exposed to social media and the potential for deception and misinformation online. To the best of our knowledge, our work is the first study to uniquely address English audio deepfake discernment through such techniques. Our research goes beyond informational training by introducing targeted linguistic cues to listeners as a deepfake discernment mechanism, via a training module. In a pre-/post- experimental design, we evaluated the impact of the training across 264 students as a representative cross section of all students at the University of Maryland, Baltimore County, and across experimental and control sections. Findings show that the experimental group showed a statistically significant decrease in their unsurety when evaluating audio clips and an improvement in their ability to correctly identify clips they were initially unsure about. While results are promising, future research will explore more robust and comprehensive trainings for greater impact.
Submitted 21 November, 2024;
originally announced November 2024.
-
Quenching of Galaxies at Cosmic Noon: Understanding the Effect of Environment
Authors:
Akriti Singh,
Lucia Guaita,
Pascale Hibon,
Boris Häussler,
Kyoung-Soo Lee,
Vandana Ramakrishnan,
Ankit Kumar,
Nelson Padilla,
Nicole M. Firestone,
Hyunmi Song,
Maria Celeste Artale,
Ho Seong Hwang,
Paulina Troncoso Iribarren,
Caryl Gronwall,
Eric Gawiser,
Julie Nantais,
Francisco Valdes,
Changbom Park,
Yujin Yang
Abstract:
The aim of this study is to identify quiescent galaxies in the 2-deg$^2$ COSMOS field at $z \sim 3.1$ and analyze their environment. Using data from the ODIN survey and COSMOS2020 catalog, we identify 24 massive quiescent galaxies (MQGs) with stellar masses $\geq 10^{10.6}$ and derive their star formation histories and quenching timescales using SED fitting with BAGPIPES. Voronoi-based density maps trace local and large-scale environments using Lyman-$α$ Emitters and photometric galaxies. Results indicate uniformly short quenching timescales ($<$500 Myr) independent of environmental density, suggesting rapid internal mechanisms such as AGN feedback dominate over environmental factors. MQGs do not correlate with protoclusters or filaments, although some are near gas-rich filaments but show no rejuvenation. These findings suggest quenching at high redshift is driven primarily by internal processes rather than environmental interactions.
Submitted 27 May, 2025; v1 submitted 19 November, 2024;
originally announced November 2024.
-
Massive neutrinos, Anomalous magnetic moments and Dark matter
Authors:
Vandana Sahdev
Abstract:
Amongst the issues plaguing the Standard Model (SM) are questions pertaining to neutrino masses and mixings, the anomalous magnetic moments of the electron and muon, and the problem of a suitable dark matter (DM) candidate. All three issues can be addressed at once by extending the SM with two generations of vector-like fermions and an inert scalar doublet, all odd under a Z2 symmetry. The light neutrino masses and mixings are generated radiatively while maintaining consistency with bounds on lepton flavor violation. Loop diagrams with the very same fields also serve to explain the anomalous magnetic moments. Similarly, the correct dark matter relic abundance is reproduced without coming into conflict with direct detection constraints, or those from big bang nucleosynthesis or the cosmic microwave observations. Finally, prospective signatures at the LHC are discussed.
Submitted 27 November, 2024; v1 submitted 18 November, 2024;
originally announced November 2024.
-
Enhanced heat dissipation and lowered power consumption in electronics using two-dimensional hexagonal boron nitride coatings
Authors:
Karthik R,
Ashutosh Srivastava,
Soumen Midya,
Akbar Shanu,
Surbhi Slathia,
Sajith Vandana,
Punathil Raman Sreeram,
Swastik Kar,
Nicholas R. Glavin,
Ajit K Roy,
Abhishek Kumar Singh,
Chandra Sekhar Tiwary
Abstract:
Miniaturization of electronic components has led to overheating, increasing power consumption and causing early circuit failures. Conventional heat dissipation methods are becoming inadequate due to limited surface area and higher short-circuit risks. This study presents a fast, low-cost, and scalable technique using 2D hexagonal boron nitride (hBN) coatings to enhance heat dissipation in commercial electronics. Inexpensive hBN layers, applied by drop casting or spray coating, boost thermal conductivity at IC surfaces from below 0.3 W/m-K to 260 W/m-K, resulting in over double the heat flux and convective heat transfer. This significantly reduces operating temperatures and power consumption, as demonstrated by a 17.4% reduction in a coated audio amplifier circuit board. Density functional theory indicates enhanced interaction between 2D hBN and packaging materials as a key factor. This approach promises substantial energy and cost savings for large-scale electronics without altering existing manufacturing processes.
Submitted 15 November, 2024;
originally announced November 2024.
-
Toward Transdisciplinary Approaches to Audio Deepfake Discernment
Authors:
Vandana P. Janeja,
Christine Mallinson
Abstract:
This perspective calls for scholars across disciplines to address the challenge of audio deepfake detection and discernment through an interdisciplinary lens across Artificial Intelligence methods and linguistics. With an avalanche of tools for the generation of realistic-sounding fake speech on one side, the detection of deepfakes is lagging on the other. Particularly hindering audio deepfake detection is the fact that current AI models lack a full understanding of the inherent variability of language and the complexities and uniqueness of human speech. We see the promising potential in recent transdisciplinary work that incorporates linguistic knowledge into AI approaches to provide pathways for expert-in-the-loop and to move beyond expert agnostic AI-based methods for more robust and comprehensive deepfake detection.
Submitted 8 November, 2024;
originally announced November 2024.
-
Virgo Filaments IV: Using WISE to Measure the Modification of Star-Forming Disks in the Extended Regions Around the Virgo Cluster
Authors:
Kim Conger,
Gregory Rudnick,
Rose A. Finn,
Gianluca Castignani,
John Moustakas,
Benedetta Vulcani,
Daria Zakharova,
Lizhi Xie,
Francoise Combes,
Pascale Jablonka,
Yannick Bahé,
Gabriella De Lucia,
Vandana Desai,
Rebecca A. Koopmann,
Dara Norman,
Melinda Townsend,
Dennis Zaritsky
Abstract:
Recent theoretical work and targeted observational studies suggest that filaments are sites of galaxy preprocessing. The aim of the WISESize project is to directly probe galaxies over the full range of environments to quantify and characterize extrinsic galaxy quenching in the local Universe. In this paper, we use GALFIT to measure the infrared 12$μ$m ($R_{12}$) and 3.4$μ$m ($R_{3.4}$) effective radii of 603 late-type galaxies in and surrounding the Virgo cluster. We find that Virgo cluster galaxies show smaller star-forming disks relative to their field counterparts at the $2.5σ$ level, while filament galaxies show smaller star-forming disks at almost the $1.5σ$ level. Our data, therefore, show that cluster galaxies experience significant effects on their star-forming disks prior to their final quenching period. There is also tentative support for the hypothesis that galaxies are preprocessed in filamentary regions surrounding clusters. On the other hand, galaxies belonging to rich groups and poor groups do not differ significantly from those in the field. We additionally find hints of a positive correlation between stellar mass and size ratio for both rich group and filament galaxies, though the uncertainties on these data are consistent with no correlation. We compare our size measurements with the predictions from two variants of a state-of-the-art semi-analytic model (SAM), one that includes starvation and another that incorporates both starvation and ram-pressure stripping (RPS). Our data appear to disfavor the SAM variant that includes RPS for the rich group, filament, and cluster samples, a result that contributes to improved constraints on general models of galaxy quenching.
Submitted 17 January, 2025; v1 submitted 4 November, 2024;
originally announced November 2024.
-
ODIN: Strong Clustering of Protoclusters at Cosmic Noon
Authors:
Vandana Ramakrishnan,
Kyoung-Soo Lee,
Nicole Firestone,
Eric Gawiser,
Maria Celeste Artale,
Caryl Gronwall,
Lucia Guaita,
Sang Hyeok Im,
Woong-Seob Jeong,
Seongjae Kim,
Ankit Kumar,
Jaehyun Lee,
Byeongha Moon,
Nelson Padilla,
Changbom Park,
Hyunmi Song,
Paulina Troncoso,
Yujin Yang
Abstract:
The One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) survey is carrying out a systematic search for protoclusters during Cosmic Noon, using Ly$α$-emitting galaxies (LAEs) as tracers. Once completed, ODIN aims to identify hundreds of protoclusters at redshifts of 2.4, 3.1, and 4.5 across seven extragalactic fields, covering a total area of up to 91~deg$^2$. In this work, we report strong clustering of high-redshift protoclusters through the protocluster-LAE cross-correlation function measurements of 150 protocluster candidates at $z~=~2.4$ and 3.1, identified in two ODIN fields with a total area of 13.9 deg$^2$. At $z~=~2.4$ and 3.1, respectively, the inferred protocluster biases are $6.6^{+1.3}_{-1.1}$ and $6.1^{+1.3}_{-1.1}$, corresponding to mean halo masses of $\log \langle M /M_\odot\rangle = 13.53^{+0.21}_{-0.24}$ and $12.96^{+0.28}_{-0.33}$. By the present day, these protoclusters are expected to evolve into virialized galaxy clusters with a mean mass of $\sim$ $10^{14.5}~M_\odot$. By comparing the observed number density of protoclusters to that of halos with the measured clustering strength, we find that our sample is highly complete. Finally, the similar descendant masses derived for our samples at $z=2.4$ and 3.1 assuming that the halo number density remains constant suggest that they represent similar structures observed at different cosmic epochs. As a consequence, any observed differences between the two samples can be understood as redshift evolution. The ODIN protocluster samples will thus provide valuable insights into the cosmic evolution of cluster galaxies.
Submitted 23 October, 2024;
originally announced October 2024.
-
ALDAS: Audio-Linguistic Data Augmentation for Spoofed Audio Detection
Authors:
Zahra Khanjani,
Christine Mallinson,
James Foulds,
Vandana P Janeja
Abstract:
Spoofed audio, i.e. audio that is manipulated or AI-generated deepfake audio, is difficult to detect when only using acoustic features. Some recent innovative work involving AI-spoofed audio detection models augmented with phonetic and phonological features of spoken English, manually annotated by experts, led to improved model performance. While this augmented model produced substantial improvements over traditional acoustic features based models, a scalability challenge motivates inquiry into auto labeling of features. In this paper we propose an AI framework, Audio-Linguistic Data Augmentation for Spoofed audio detection (ALDAS), for auto labeling linguistic features. ALDAS is trained on linguistic features selected and extracted by sociolinguistics experts; these auto labeled features are used to evaluate the quality of ALDAS predictions. Findings indicate that while the detection enhancement is not as substantial as when involving the pure ground truth linguistic features, there is improvement in performance while achieving auto labeling. Labels generated by ALDAS are also validated by the sociolinguistics experts.
Submitted 20 October, 2024;
originally announced October 2024.
-
Galaxy populations in protoclusters at cosmic noon
Authors:
Moira Andrews,
M. Celeste Artale,
Ankit Kumar,
Kyoung-Soo Lee,
Tess Florek,
Kaustub Anand,
Candela Cerdosino,
Robin Ciardullo,
Nicole Firestone,
Eric Gawiser,
Caryl Gronwall,
Lucia Guaita,
Sungryong Hong,
Ho Seong Hwang,
Jaehyun Lee,
Seong-Kook Lee,
Nelson Padilla,
Jaehong Park,
Roxana Popescu,
Vandana Ramakrishnan,
Hyunmi Song,
F. Vivanco Cádiz,
Mark Vogelsberger
Abstract:
We investigate the physical properties and redshift evolution of simulated galaxies residing in protoclusters at cosmic noon, to understand the influence of the environment on galaxy formation. This work aims to build clear expectations for the ongoing ODIN survey, devoted to mapping large-scale structures at z=2.4, 3.1, and 4.5 using Ly$α$-emitting galaxies (LAEs) as tracers. From the IllustrisTNG simulations, we define subregions centered on the most massive clusters ranked by total stellar mass at z=0 and study the properties of galaxies within, including LAEs. To model the LAE population, we take a semi-analytical approach that assigns Ly$α$ luminosity and equivalent width based on the UV luminosities to galaxies in a probabilistic manner. We investigate stellar mass, star formation rate, major mergers, and specific star formation rate of the population of star-forming galaxies and LAEs in the field and protocluster environment and trace their evolution. We find that the overall shape of the UV luminosity function (LF) in simulated protocluster environments is characterized by a shallower faint-end slope and an excess on the bright end, signaling different formation histories for galaxies therein. The difference is milder for the Ly$α$ LF. While protocluster galaxies follow the same SFR-$M_{\star}$ scaling relation as average field galaxies, a larger fraction appears to have experienced major mergers in the last 200 Myr and as a result shows enhanced star formation at a ~60% level, leading to a flatter distribution in both SFR and $M_{\star}$ relative to galaxies in the average field. We find that protocluster galaxies, including LAEs, begin to quench much earlier (z~0.8-1.6) than field galaxies (z~0.5-0.9); our result is in agreement with recent observational results and highlights the importance of large-scale environment on the overall formation history of galaxies.
Submitted 13 May, 2025; v1 submitted 10 October, 2024;
originally announced October 2024.
-
Discovering Large-Scale Structure at $2<z<5$ in the C3VO Survey
Authors:
Denise Hung,
Brian C. Lemaux,
Olga Cucciati,
Ben Forrest,
Ekta A. Shah,
Roy R. Gal,
Finn Giddings,
Derek Sikorski,
Emmet Golden-Marx,
Lori M. Lubin,
Nimish Hathi,
Giovanni Zamorani,
Lu Shen,
Sandro Bardelli,
Letizia P. Cassara,
Gabriella De Lucia,
Fabio Fontanot,
Bianca Garilli,
Lucia Guaita,
Michaela Monika Hirschmann,
Kyoung-Soo Lee,
Andrew B. Newman,
Vandana Ramakrishnan,
Daniela Vergani,
Lizhi Xie
, et al. (1 additional author not shown)
Abstract:
The Charting Cluster Construction with VUDS and ORELSE (C3VO) survey is an ongoing imaging and spectroscopic campaign aiming to map out the growth of structure up to $z\sim5$ and was born from the combination of the Visible Multi-Object Spectrograph Ultra Deep Survey and the Observations of Redshift Evolution in Large-Scale Environments (ORELSE) survey. As we previously accomplished with the ORELSE survey, we apply our technique known as Voronoi tessellation Monte Carlo (VMC) mapping to search for serendipitous galaxy overdensities at $2<z<5$ in the three C3VO fields. We also apply the same technique to mock observations of simulated galaxies with properties derived from the GAlaxy Evolution and Assembly semianalytic model in order to judge the effectiveness of our search algorithm as a function of redshift, total mass, and fraction of spectroscopic redshifts. We find completeness and purity values of the order of 30-50\% for $\log (M_{z=0}/M_{\odot}) > 14$ and $2<z<4$, with a strong dependence on mass and redshift, with values as high as $\sim$80\% and $\sim$70\%, respectively, in the best-case scenario for $\log (M_{z=0}/M_{\odot}) > 14.5$. In the C3VO fields, we were able to recover many of the previously known structures in the literature as well as find hundreds of new overdensity candidates, once again demonstrating the powerful capabilities of VMC mapping when applied to wide-field optical and infrared galaxy evolution surveys at ever higher redshifts.
Submitted 20 February, 2025; v1 submitted 30 September, 2024;
originally announced October 2024.
-
Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection
Authors:
Orchid Chetia Phukan,
Swarup Ranjan Behera,
Shubham Singh,
Muskaan Singh,
Vandana Rajan,
Arun Balaji Buduru,
Rajesh Sharma,
S. R. Mahadeva Prasanna
Abstract:
In this study, we address the challenge of depression detection from speech, focusing on the potential of non-semantic features (NSFs) to capture subtle markers of depression. While prior research has leveraged various features for this task, NSFs extracted from pre-trained models (PTMs) designed for non-semantic tasks, such as paralinguistic speech processing (TRILLsson), speaker recognition (x-vector), and emotion recognition (emoHuBERT), have shown significant promise. However, the potential of combining these diverse features has not been fully explored. In this work, we demonstrate that the amalgamation of NSFs results in complementary behavior, leading to enhanced depression detection performance. To this end, we introduce a simple novel framework, FuSeR, designed to effectively combine these features. Our results show that FuSeR outperforms models utilizing individual NSFs as well as baseline fusion techniques and obtains state-of-the-art (SOTA) performance on the E-DAIC benchmark with an RMSE of 5.51 and an MAE of 4.48, establishing it as a robust approach for depression detection.
Submitted 22 September, 2024;
originally announced September 2024.
-
Hybrid Ensemble Deep Graph Temporal Clustering for Spatiotemporal Data
Authors:
Francis Ndikum Nji,
Omar Faruque,
Mostafa Cham,
Janeja Vandana,
Jianwu Wang
Abstract:
Classifying subsets based on spatial and temporal features is crucial to the analysis of spatiotemporal data given the inherent spatial and temporal variability. Since no single clustering algorithm ensures optimal results, researchers have increasingly explored the effectiveness of ensemble approaches. Ensemble clustering has attracted much attention due to increased diversity, better generalization, and overall improved clustering performance. While ensemble clustering may yield promising results on simple datasets, it has not been fully explored on complex multivariate spatiotemporal data. As our contribution to this field, we propose a novel hybrid ensemble deep graph temporal clustering (HEDGTC) method for multivariate spatiotemporal data. HEDGTC integrates homogeneous and heterogeneous ensemble methods and adopts a dual consensus approach to address noise and misclassification from traditional clustering. It further applies a graph attention autoencoder network to improve clustering performance and stability. When evaluated on three real-world multivariate spatiotemporal datasets, HEDGTC outperforms state-of-the-art ensemble clustering models by showing improved performance and stability with consistent results. This indicates that HEDGTC can effectively capture implicit temporal patterns in complex spatiotemporal data.
Submitted 19 September, 2024;
originally announced September 2024.
-
Investigating Causal Cues: Strengthening Spoofed Audio Detection with Human-Discernible Linguistic Features
Authors:
Zahra Khanjani,
Tolulope Ale,
Jianwu Wang,
Lavon Davis,
Christine Mallinson,
Vandana P. Janeja
Abstract:
Several types of spoofed audio, such as mimicry, replay attacks, and deepfakes, have created societal challenges to information integrity. Recently, researchers have worked with sociolinguistics experts to label spoofed audio samples with Expert Defined Linguistic Features (EDLFs) that can be discerned by the human ear: pitch, pause, word-initial and word-final release bursts of consonant stops, audible intake or outtake of breath, and overall audio quality. Several deepfake detection algorithms have been shown to improve when the traditional and common features of audio data are augmented with these EDLFs. In this paper, using a hybrid dataset comprised of multiple types of spoofed audio augmented with sociolinguistic annotations, we investigate causal discovery and inferences between the discernible linguistic features and the label in the audio clips, comparing the findings of the causal models with the expert ground truth validation labeling process. Our findings suggest that the causal models indicate the utility of incorporating linguistic features to help discern spoofed audio, as well as the overall need and opportunity to incorporate human knowledge into models and techniques for strengthening AI models. The causal discovery and inference can be used as a foundation for training humans to discern spoofed audio as well as for automating EDLF labeling to improve the performance of common AI-based spoofed audio detectors.
Submitted 9 September, 2024;
originally announced September 2024.
-
Coarse Spaces Based on Higher-Order Interpolation for Schwarz Preconditioners for Helmholtz Problems
Authors:
Erik Sieburgh,
Alexander Heinlein,
Vandana Dwarka,
Cornelis Vuik
Abstract:
The development of scalable and wavenumber-robust iterative solvers for Helmholtz problems is challenging but also relevant for various application fields. In this work, two-level Schwarz domain decomposition preconditioners are enhanced by a coarse space constructed using higher-order Bézier interpolation. The numerical results indicate numerical scalability and robustness with respect to the wavenumber, as long as the wavenumber times the element size of the coarse mesh is sufficiently low.
Submitted 7 August, 2024;
originally announced August 2024.
-
Impact of Transmission Dynamics and Treatment Uptake, Frequency and Timing on the Cost-effectiveness of Directly Acting Antivirals for Hepatitis C Virus Infection
Authors:
Soham Das,
Ajit Sood,
Vandana Midha,
Arshdeep Singh,
Pranjl Sharma,
Varun Ramamohan
Abstract:
Cost-effectiveness analyses, based on decision-analytic models of disease progression and treatment, are routinely used to assess the economic value of a new intervention and consequently inform reimbursement decisions for the intervention. Many decision-analytic models developed to assess the economic value of highly effective directly acting antiviral (DAA) treatments for the hepatitis C virus (HCV) infection do not incorporate the transmission dynamics of HCV, accounting for which is required to estimate the number of downstream infections prevented by curing an infection. In this study, we develop and validate a comprehensive agent-based simulation (ABS) model of HCV transmission dynamics in the Indian context and use it to: (a) quantify the extent to which the cost-effectiveness of a DAA is underestimated - as a function of its uptake rate - if disease transmission dynamics are not considered in a cost-effectiveness analysis model; and (b) quantify the impact of the frequency and timing of treatment with DAAs, also as a function of their uptake rate, within a disease surveillance period on its cost-effectiveness. The process of accomplishing the above research objectives also motivated the development of a novel random sampling and allocation based approach, along with associated theoretical grounding, to estimate individual-level outcomes within an ABS that incurs substantially lower computational expense than the benchmark incremental accumulation approach.
Submitted 17 September, 2024; v1 submitted 27 July, 2024;
originally announced July 2024.
-
Testing Lyman Alpha Emitters and Lyman-Break Galaxies as Tracers of Large-Scale Structures at High Redshifts
Authors:
Sang Hyeok Im,
Ho Seong Hwang,
Jaehong Park,
Jaehyun Lee,
Hyunmi Song,
Stephen Appleby,
Yohan Dubois,
C. Gareth Few,
Brad K. Gibson,
Juhan Kim,
Yonghwi Kim,
Changbom Park,
Christophe Pichon,
Jihye Shin,
Owain N. Snaith,
Maria Celeste Artale,
Eric Gawiser,
Lucia Guaita,
Woong-Seob Jeong,
Kyoung-Soo Lee,
Nelson Padilla,
Vandana Ramakrishnan,
Paulina Troncoso,
Yujin Yang
Abstract:
We test whether Lyman alpha emitters (LAEs) and Lyman-break galaxies (LBGs) can be good tracers of high-z large-scale structures, using the Horizon Run 5 cosmological hydrodynamical simulation. We identify LAEs using the Lyα emission line luminosity and its equivalent width, and LBGs using the broad-band magnitudes at z~2.4, 3.1, and 4.5. We first compare the spatial distributions of LAEs, LBGs, all galaxies, and dark matter around the filamentary structures defined by dark matter. The comparison shows that both LAEs and LBGs are more concentrated toward the dark matter filaments than dark matter. We also find an empirical fitting formula for the vertical density profile of filaments as a binomial power-law relation of the distance to the filaments. We then compare the spatial distributions of the samples around the filaments defined by themselves. LAEs and LBGs are again more concentrated toward their filaments than dark matter. We also find overall consistency between filamentary structures defined by LAEs, LBGs, and dark matter, with median spatial offsets that are smaller than the mean separation of the sample. These results support the idea that LAEs and LBGs could be good tracers of large-scale structures of dark matter at high redshifts.
Submitted 7 January, 2025; v1 submitted 26 July, 2024;
originally announced July 2024.
-
Harnessing Feature Clustering For Enhanced Anomaly Detection With Variational Autoencoder And Dynamic Threshold
Authors:
Tolulope Ale,
Nicole-Jeanne Schlegel,
Vandana P. Janeja
Abstract:
We introduce an anomaly detection method for multivariate time series data with the aim of identifying critical periods and features influencing extreme climate events like snowmelt in the Arctic. This method leverages the Variational Autoencoder (VAE) integrated with dynamic thresholding and correlation-based feature clustering. This framework enhances the VAE's ability to identify localized dependencies and learn the temporal relationships in climate data, thereby improving the detection of anomalies as demonstrated by its higher F1-score on benchmark datasets. The study's main contributions include the development of a robust anomaly detection method, improving feature representation within VAEs through clustering, and creating a dynamic threshold algorithm for localized anomaly detection. This method offers explainability of climate anomalies across different regions.
Submitted 13 July, 2024;
originally announced July 2024.
-
Assessing Annotation Accuracy in Ice Sheets Using Quantitative Metrics
Authors:
Bayu Adhi Tama,
Vandana Janeja,
Sanjay Purushotham
Abstract:
The increasing threat of sea level rise due to climate change necessitates a deeper understanding of ice sheet structures. This study addresses the need for accurate ice sheet data interpretation by introducing a suite of quantitative metrics designed to validate ice sheet annotation techniques. Focusing on both manual and automated methods, including ARESELP and its modified version, MARESELP, we assess their accuracy against expert annotations. Our methodology incorporates several computer vision metrics, traditionally underutilized in glaciological research, to evaluate the continuity and connectivity of ice layer annotations. The results demonstrate that while manual annotations provide invaluable expert insights, automated methods, particularly MARESELP, improve layer continuity and alignment with expert labels.
Submitted 26 June, 2024;
originally announced July 2024.
-
ODIN: Identifying Protoclusters and Cosmic Filaments Traced by Ly$α$-emitting Galaxies
Authors:
Vandana Ramakrishnan,
Kyoung-Soo Lee,
Maria Celeste Artale,
Eric Gawiser,
Yujin Yang,
Changbom Park,
Robin Ciardullo,
Arjun Dey,
Caryl Gronwall,
Lucia Guaita,
Ho Seong Hwang,
Sang Hyeok Im,
Woong-Seob Jeong,
Seongjae Kim,
Ankit Kumar,
Jaehyun Lee,
Seong-Kook Lee,
Byeongha Moon,
Nelson Padilla,
Alexandra Pope,
Roxana Popescu,
Akriti Singh,
Hyunmi Song,
Paulina Troncoso,
Francisco Valdes,
Ann Zabludoff
Abstract:
To understand the formation and evolution of massive cosmic structures, studying them at high redshift, in the epoch when they formed the majority of their mass, is essential. The One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) survey is undertaking the widest-area narrowband program to date, to use Ly$α$-emitting galaxies (LAEs) to trace the large-scale structure (LSS) of the Universe on the scale of 10 - 100 cMpc at three cosmic epochs. In this work, we present results at $z$ = 3.1 based on early ODIN data in the COSMOS field. We identify and characterize protoclusters and cosmic filaments using multiple methods and discuss their strengths and weaknesses. We then compare our observations against the IllustrisTNG suite of cosmological hydrodynamical simulations. The two are in excellent agreement, with a similar number and angular size of structures identified above a specified density threshold. We are able to recover the simulated protoclusters with $\log$(M$_{z=0}$/$M_\odot$) $\gtrsim$ 14.4 in $\sim$ 60% of the cases. With these objects we show that the descendant masses of the protoclusters in our sample can be estimated purely based on our 2D measurements, finding a median $z$ = 0 mass of $\sim10^{14.5}$M$_\odot$. The lack of information on the radial extent of each protocluster introduces a $\sim$0.4 dex uncertainty in its descendant mass. Finally, we show that the recovery of the cosmic web in the vicinity of protoclusters is both efficient and accurate. The similarity of our observations and the simulations implies that our structure selection is likewise robust and efficient, demonstrating that LAEs are reliable tracers of the LSS.
Submitted 21 November, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
Histopathology Based AI Model Predicts Anti-Angiogenic Therapy Response in Renal Cancer Clinical Trial
Authors:
Jay Jasti,
Hua Zhong,
Vandana Panwar,
Vipul Jarmale,
Jeffrey Miyata,
Deyssy Carrillo,
Alana Christie,
Dinesh Rakheja,
Zora Modrusan,
Edward Ernest Kadel III,
Niha Beig,
Mahrukh Huseni,
James Brugarolas,
Payal Kapur,
Satwik Rajaram
Abstract:
Predictive biomarkers of treatment response are lacking for metastatic clear cell renal cell carcinoma (ccRCC), a tumor type that is treated with angiogenesis inhibitors, immune checkpoint inhibitors, mTOR inhibitors and a HIF2 inhibitor. The Angioscore, an RNA-based quantification of angiogenesis, is arguably the best candidate to predict anti-angiogenic (AA) response. However, the clinical adoption of transcriptomic assays faces several challenges including standardization, time delay, and high cost. Further, ccRCC tumors are highly heterogeneous, and sampling multiple areas for sequencing is impractical. Here we present a novel deep learning (DL) approach to predict the Angioscore from ubiquitous histopathology slides. To overcome the lack of interpretability, one of the biggest limitations of typical DL models, our model produces a visual vascular network which is the basis of the model's prediction. To test its reliability, we applied this model to multiple cohorts including a clinical trial dataset. Our model accurately predicts the RNA-based Angioscore on multiple independent cohorts (Spearman correlations of 0.77 and 0.73). Further, the predictions help unravel meaningful biology such as association of angiogenesis with grade, stage, and driver mutation status. Finally, we find our model can predict response to AA therapy, in both a real-world cohort and the IMmotion150 clinical trial. The predictive power of our model vastly exceeds that of CD31, a marker of vasculature, and nearly rivals the performance (c-index 0.66 vs 0.67) of the ground truth RNA-based Angioscore at a fraction of the cost. By providing a robust yet interpretable prediction of the Angioscore from histopathology slides alone, our approach offers insights into angiogenesis biology and AA treatment response.
Submitted 28 May, 2024;
originally announced May 2024.
-
Creating Geospatial Trajectories from Human Trafficking Text Corpora
Authors:
Saydeh N. Karabatis,
Vandana P. Janeja
Abstract:
Human trafficking is a crime that affects the lives of millions of people across the globe. Traffickers exploit the victims through forced labor, involuntary sex, or organ harvesting. Migrant smuggling could also be seen as a form of human trafficking when the migrant fails to pay the smuggler and is forced into coerced activities. Several news agencies and anti-trafficking organizations have reported trafficking survivor stories that include the names of locations visited along the trafficking route. Identifying such routes can provide knowledge that is essential to preventing such heinous crimes. In this paper we propose a Narrative to Trajectory (N2T) information extraction system that analyzes reported narratives, extracts relevant information through the use of Natural Language Processing (NLP) techniques, and applies geospatial augmentation in order to automatically plot trajectories of human trafficking routes. We evaluate N2T on human trafficking text corpora and demonstrate that our approach of utilizing data preprocessing and augmenting database techniques with NLP libraries outperforms existing geolocation detection methods.
Submitted 9 May, 2024;
originally announced May 2024.
-
Narrative to Trajectory (N2T+): Extracting Routes of Life or Death from Human Trafficking Text Corpora
Authors:
Saydeh N. Karabatis,
Vandana P. Janeja
Abstract:
Climate change and political unrest in certain regions of the world are imposing extreme hardship on many communities and are forcing millions of vulnerable populations to abandon their homelands and seek refuge in safer lands. As international laws are not fully set to deal with the migration crisis, people are relying on networks of exploiting smugglers to escape the devastation in order to live in stability. During the smuggling journey, migrants can become victims of human trafficking if they fail to pay the smuggler and may be forced into coerced labor. Government agencies and anti-trafficking organizations try to identify the trafficking routes based on stories of survivors in order to gain knowledge and help prevent such crimes. In this paper, we propose a system called Narrative to Trajectory (N2T+), which extracts trajectories of trafficking routes. N2T+ uses Data Science and Natural Language Processing techniques to analyze trafficking narratives, automatically extract relevant location names, disambiguate possible name ambiguities, and plot the trafficking route on a map. In a comparative evaluation we show that the proposed multi-dimensional approach offers significantly higher geolocation detection than other state of the art techniques.
Submitted 9 May, 2024;
originally announced May 2024.
-
Small-Scale Testbed for Evaluating C-V2X Applications on 5G Cellular Networks
Authors:
Kaj Munhoz Arfvidsson,
Kleio Fragkedaki,
Frank J. Jiang,
Vandana Narri,
Hans-Cristian Lindh,
Karl H. Johansson,
Jonas Mårtensson
Abstract:
In this work, we present a small-scale testbed for evaluating the real-life performance of cellular V2X (C-V2X) applications on 5G cellular networks. Despite the growing interest and rapid technology development for V2X applications, researchers still struggle to prototype V2X applications with real wireless networks, hardware, and software in the loop in a controlled environment. To help alleviate this challenge, we present a testbed designed to accelerate development and evaluation of C-V2X applications on 5G cellular networks. By including a small-scale vehicle platform in the testbed design, we significantly reduce the time and effort required to test new C-V2X applications on 5G cellular networks. With a focus on the integration of small-scale vehicle platforms, we detail the design decisions behind the full software and hardware setup of commonly needed intelligent transport system agents (e.g. sensors, servers, vehicles). Moreover, to showcase the testbed's capability to produce industrially relevant, real-world performance evaluations, we present an evaluation of a simple test case inspired by shared situational awareness. Finally, we discuss the upcoming use of the testbed for evaluating 5G cellular network-based shared situational awareness and other C-V2X applications.
Submitted 9 May, 2024;
originally announced May 2024.
-
TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation
Authors:
Thomas Monninger,
Vandana Dokkadi,
Md Zafar Anwar,
Steffen Staab
Abstract:
Autonomous driving requires an accurate representation of the environment. A strategy toward high accuracy is to fuse data from several sensors. Learned Bird's-Eye View (BEV) encoders can achieve this by mapping data from individual sensors into one joint latent space. For cost-efficient camera-only systems, this provides an effective mechanism to fuse data from multiple cameras with different views. Accuracy can further be improved by aggregating sensor information over time. This is especially important in monocular camera systems to account for the lack of explicit depth and velocity measurements. Thereby, the effectiveness of developed BEV encoders crucially depends on the operators used to aggregate temporal information and on the used latent representation spaces. We analyze BEV encoders proposed in the literature and compare their effectiveness, quantifying the effects of aggregation operators and latent representations. While most existing approaches aggregate temporal information either in image or in BEV latent space, our analyses and performance comparisons suggest that these latent representations exhibit complementary strengths. Therefore, we develop a novel temporal BEV encoder, TempBEV, which integrates aggregated temporal information from both latent spaces. We consider subsequent image frames as stereo through time and leverage methods from optical flow estimation for temporal stereo encoding. Empirical evaluation on the NuScenes dataset shows a significant improvement by TempBEV over the baseline for 3D object detection and BEV segmentation. The ablation uncovers a strong synergy of joint temporal aggregation in the image and BEV latent space. These results indicate the overall effectiveness of our approach and make a strong case for aggregating temporal information in both image and BEV latent spaces.
Submitted 18 September, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm
Authors:
Amir Mohammad Naderi,
Jennifer G. Casey,
Mao-Hsiang Huang,
Rachelle Victorio,
David Y. Chiang,
Calum MacRae,
Hung Cao,
Vandana A. Gupta
Abstract:
Quantifying cardiovascular parameters like ejection fraction in zebrafish as a host of biological investigations has been extensively studied. Since current manual monitoring techniques are time-consuming and fallible, several image processing frameworks have been proposed to automate the process. Most of these works rely on supervised deep-learning architectures. However, supervised methods tend to be overfitted on their training dataset. This means that applying the same framework to new data with different imaging setups and mutant types can severely decrease performance. We have developed a Zebrafish Automatic Cardiovascular Assessment Framework (ZACAF) to quantify the cardiac function in zebrafish. In this work, we further applied data augmentation, Transfer Learning (TL), and Test Time Augmentation (TTA) to ZACAF to improve the quantification of cardiovascular function in zebrafish. This strategy can be integrated with the available frameworks to aid other researchers. We demonstrate that using TL, even with a constrained dataset, the model can be refined to accommodate a novel microscope setup, encompassing diverse mutant types and accommodating various video recording protocols. Additionally, as users engage in successive rounds of TL, the model is anticipated to undergo substantial enhancements in both generalizability and accuracy. Finally, we applied this approach to assess the cardiovascular function in nrap mutant zebrafish, a model of cardiomyopathy.
Submitted 14 February, 2024;
originally announced February 2024.
-
Gravitational lensing stereoscopy
Authors:
Ira Rai,
Vandana Vinayak,
Richard Gordon
Abstract:
A galaxy cluster, such as RX J2129, sometimes produces two or more gravitationally lensed images of more distant galaxies. We attempt to regard pairs of these images as stereo pairs. While not successful due to the small disparity angles involved, we suggest that with the $10^{11}$ light amplification anticipated from the Solar Gravitational Lens (SGL), individual stars of the distant galaxy might be resolved, resulting in 3D images.
Submitted 9 February, 2024;
originally announced February 2024.
-
Consistency Based Unsupervised Self-training For ASR Personalisation
Authors:
Jisi Zhang,
Vandana Rajan,
Haaris Mehmood,
David Tuckey,
Pablo Peso Parada,
Md Asif Jalal,
Karthikeyan Saravanan,
Gil Ho Lee,
Jungin Lee,
Seokyeong Jung
Abstract:
On-device Automatic Speech Recognition (ASR) models trained on speech data of a large population might underperform for individuals unseen during training. This is due to a domain shift between user data and the original training data, which differ in the user's speaking characteristics and environmental acoustic conditions. ASR personalisation is a solution that aims to exploit user data to improve model robustness. The majority of ASR personalisation methods assume labelled user data for supervision. Personalisation without any labelled data is challenging due to limited data size and poor quality of recorded audio samples. This work addresses unsupervised personalisation by developing a novel consistency based training method via pseudo-labelling. Our method achieves a relative Word Error Rate Reduction (WERR) of 17.3% on unlabelled training data and 8.1% on held-out data compared to a pre-trained model, and outperforms the current state-of-the-art methods.
Submitted 22 January, 2024;
originally announced January 2024.
-
Cross-shell excited configurations in the structure of $^{34}$Si
Authors:
R. S. Lubna,
A. B. Garnsworthy,
Vandana Tripathi,
G. C. Ball,
C. R. Natzke,
M. Rocchini,
C. Andreoiu,
S. S. Bhattacharjee,
I. Dillmann,
F. H. Garcia,
S. A. Gillespie,
G. Hackman,
C. J. Griffin,
G. Leckenby,
T. Miyagi,
B. Olaizola,
C. Porzio,
M. M. Rajabali,
Y. Saito,
P. Spagnoletti,
S. L. Tabor,
R. Umashankar,
V. Vedia,
A. Volya,
J. Williams
, et al. (1 additional author not shown)
Abstract:
The cross-shell excited states of $^{34}$Si have been investigated via $β$-decays of the $4^-$ ground state and the $1^+$ isomeric state of $^{34}$Al. Since the valence protons and valence neutrons occupy different major shells in the ground state as well as the intruder $1^+$ isomeric state of $^{34}$Al, intruder levels of $^{34}$Si are populated via allowed $β$ decays. Spin assignments to such intruder levels of $^{34}$Si were established through $γ$-$γ$ angular correlation analysis for the negative parity states with dominant configurations $(νd_{3/2})^{-1} \otimes (νf_{7/2})^{1}$ as well as the positive parity states with dominant configurations $(νsd)^{-2} \otimes (νf_{7/2}p_{3/2})^2$. The configurations of such intruder states play crucial roles in our understanding of the $N=20$ shell gap evolution. A configuration interaction model derived from the FSU Hamiltonian was utilized in order to interpret the intruder states in $^{34}$Si. Shell model interaction derived from a more fundamental theory with the Valence Space In Medium Similarity Renormalization Group (VS-IMSRG) method was also employed to interpret the structure of $^{34}$Si.
Submitted 8 January, 2024;
originally announced January 2024.
-
ODIN: Improved Narrowband Ly$α$ Emitter Selection Techniques for $z$ = 2.4, 3.1, and 4.5
Authors:
Nicole M. Firestone,
Eric Gawiser,
Vandana Ramakrishnan,
Kyoung-Soo Lee,
Francisco Valdes,
Changbom Park,
Yujin Yang,
Robin Ciardullo,
María Celeste Artale,
Barbara Benda,
Adam Broussard,
Lana Eid,
Rameen Farooq,
Caryl Gronwall,
Lucia Guaita,
Stephen Gwyn,
Ho Seong Hwang,
Sang Hyeok Im,
Woong-Seob Jeong,
Shreya Karthikeyan,
Dustin Lang,
Byeongha Moon,
Nelson Padilla,
Marcin Sawicki,
Eunsuk Seo
, et al. (3 additional authors not shown)
Abstract:
Lyman-Alpha Emitting galaxies (LAEs) are typically young, low-mass, star-forming galaxies with little extinction from interstellar dust. Their low dust attenuation allows their Ly$α$ emission to shine brightly in spectroscopic and photometric observations, providing an observational window into the high-redshift universe. Narrowband surveys reveal large, uniform samples of LAEs at specific redshifts that probe large scale structure and the temporal evolution of galaxy properties. The One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) utilizes three custom-made narrowband filters on the Dark Energy Camera (DECam) to discover LAEs at three equally spaced periods in cosmological history. In this paper, we introduce the hybrid-weighted double-broadband continuum estimation technique, which yields improved estimation of Ly$α$ equivalent widths. Using this method, we discover 6032, 5691, and 4066 LAE candidates at $z =$ 2.4, 3.1, and 4.5 in the extended COSMOS field ($\sim$9 deg$^2$). We find that [O II] emitters are a minimal contaminant in our LAE samples, but that interloping Green Pea-like [O III] emitters are important for our redshift 4.5 sample. We introduce an innovative method for identifying [O II] and [O III] emitters via a combination of narrowband excess and galaxy colors, enabling their study as separate classes of objects. We present scaled median stacked SEDs for each galaxy sample, revealing the overall success of our selection methods. We also calculate rest-frame Ly$α$ equivalent widths for our LAE samples and find that the EW distributions are best fit by exponential functions with scale lengths of $w_0$ = 53 $\pm$ 1, 65 $\pm$ 1, and 59 $\pm$ 1 Angstroms, respectively.
Submitted 1 October, 2024; v1 submitted 26 December, 2023;
originally announced December 2023.
-
Low spin spectroscopy of neutron-rich 43,44,45Cl via β and βn decay
Authors:
V. Tripathi,
S. Bhattacharya,
E. Rubino,
C. Benetti,
J. F. Perello,
S. L. Tabor,
S. N. Liddick,
P. C. Bender,
M. P. Carpenter,
J. J. Carroll,
A. Chester,
C. J. Chiara,
K. Childers,
B. R. Clark,
B. P. Crider,
J. T. Harke,
R. Jain,
B. Longfellow,
S. Luitel,
M. Mogannam,
T. H. Ogunbeku,
A. L. Richard,
S. Saha,
N. Shimizu,
O. A. Shehu
, et al. (5 additional authors not shown)
Abstract:
β decay of the neutron-rich isotopes 43,45S, studied at the National Superconducting Cyclotron Laboratory, is reported here. β-delayed γ transitions were detected by an array of 16 clover detectors surrounding the Beta Counting Station, which consists of a 40x40 Double Sided Silicon Strip Detector followed by a Single Sided Silicon Strip Detector. β-decay half-lives have been extracted for 43,45S by correlating implants and decays in the pixelated implant detector, with further coincidence with γ transitions in the daughter nucleus. The level structure of 43,45Cl is expanded by the addition of 20 new γ transitions in 43Cl and 8 in 45Cl, with the observation of core-excited negative-parity states for the first time. For 45S decay, a large fraction of the β-decay strength goes to delayed neutron emission, populating states in 44Cl, which are also presented. Experimental observations are compared with detailed shell-model calculations using the SDPFSDG-MU interaction to highlight the role of the diminished N = 28 neutron shell gap and the near degeneracy of the proton s1/2 and d3/2 orbitals in the structure of the neutron-rich Cl isotopes. The current work also provides further support for a ground-state spin-parity assignment of 3/2+ in 45Cl.
Submitted 19 November, 2023;
originally announced November 2023.
-
The Future of Astronomical Data Infrastructure: Meeting Report
Authors:
Michael R. Blanton,
Janet D. Evans,
Dara Norman,
William O'Mullane,
Adrian Price-Whelan,
Luca Rizzi,
Alberto Accomazzi,
Megan Ansdell,
Stephen Bailey,
Paul Barrett,
Steven Berukoff,
Adam Bolton,
Julian Borrill,
Kelle Cruz,
Julianne Dalcanton,
Vandana Desai,
Gregory P. Dubois-Felsmann,
Frossie Economou,
Henry Ferguson,
Bryan Field,
Dan Foreman-Mackey,
Jaime Forero-Romero,
Niall Gaffney,
Kim Gillies,
Matthew J. Graham
, et al. (47 additional authors not shown)
Abstract:
The astronomical community is grappling with the increasing volume and complexity of data produced by modern telescopes, due to difficulties in reducing, accessing, analyzing, and combining archives of data. To address this challenge, we propose the establishment of a coordinating body, an "entity," with the specific mission of enhancing the interoperability, archiving, distribution, and production of both astronomical data and software. This report is the culmination of a workshop held in February 2023 on the Future of Astronomical Data Infrastructure. Attended by 70 scientists and software professionals from ground-based and space-based missions and archives spanning the entire spectrum of astronomical research, the group deliberated on the prevailing state of software and data infrastructure in astronomy, identified pressing issues, and explored potential solutions. In this report, we describe the ecosystem of astronomical data, its existing flaws, and the many gaps, duplication, inconsistencies, barriers to access, drags on productivity, missed opportunities, and risks to the long-term integrity of essential data sets. We also highlight the successes and failures in a set of deep dives into several different illustrative components of the ecosystem, included as an appendix.
Submitted 7 November, 2023;
originally announced November 2023.
-
The One-hundred-deg^2 DECam Imaging in Narrowbands (ODIN): Survey Design and Science Goals
Authors:
Kyoung-Soo Lee,
Eric Gawiser,
Changbom Park,
Yujin Yang,
Francisco Valdes,
Dustin Lang,
Vandana Ramakrishnan,
Byeongha Moon,
Nicole Firestone,
Stephen Appleby,
Maria Celeste Artale,
Moira Andrews,
Franz E. Bauer,
Barbara Benda,
Adam Broussard,
Yi-Kuan Chiang,
Robin Ciardullo,
Arjun Dey,
Rameen Farooq,
Caryl Gronwall,
Lucia Guaita,
Yun Huang,
Ho Seong Hwang,
Sanghyeok Im,
Woong-Seob Jeong
, et al. (17 additional authors not shown)
Abstract:
We describe the survey design and science goals for ODIN (One-hundred-deg^2 DECam Imaging in Narrowbands), a NOIRLab survey using the Dark Energy Camera (DECam) to obtain deep (AB~25.7) narrow-band images over an unprecedented area of sky. The three custom-built narrow-band filters, N419, N501, and N673, have central wavelengths of 419, 501, and 673 nm and respective full-width-at-half-maxima of 7.2, 7.4, and 9.8 nm, corresponding to Lya at z=2.4, 3.1, and 4.5 and cosmic times of 2.8, 2.1, and 1.4 Gyr, respectively. When combined with even deeper, public broad-band data from Hyper Suprime-Cam, DECam, and in the future, LSST, the ODIN narrow-band images will enable the selection of over 100,000 Lya-emitting (LAE) galaxies at these epochs. ODIN-selected LAEs will identify protoclusters as galaxy overdensities, and the deep narrow-band images enable detection of highly extended Lya blobs (LABs). Primary science goals include measuring the clustering strength and dark matter halo connection of LAEs, LABs, and protoclusters, and their respective relationship to filaments in the cosmic web. The three epochs allow the redshift evolution of these properties to be determined during the period known as Cosmic Noon, when star formation was at its peak. The three narrow-band filter wavelengths are designed to enable interloper rejection and further scientific studies by revealing [O II] and [O III] at z=0.34, Lya and He II 1640 at z=3.1, and Lyman continuum plus Lya at z=4.5. Ancillary science includes similar studies of the lower-redshift emission-line galaxy samples and investigations of nearby star-forming galaxies resolved into numerous [O III] and [S II] emitting regions.
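The correspondence between filter central wavelength and Lya redshift follows from z = λ_obs / λ_rest − 1. The short check below is a sketch under stated assumptions: it takes the Lya rest wavelength as 121.567 nm (a standard value, not quoted in the abstract), and the names `LYA_REST_NM` and `lya_redshift` are illustrative only.

```python
# Rest-frame wavelength of Lyman-alpha in nm (standard value; an assumption
# here, since the abstract does not quote it).
LYA_REST_NM = 121.567


def lya_redshift(observed_nm):
    """Redshift at which Lya falls at the given observed wavelength (nm)."""
    return observed_nm / LYA_REST_NM - 1.0


# Central wavelengths of the three ODIN filters, as given in the abstract.
filters = {"N419": 419.0, "N501": 501.0, "N673": 673.0}
redshifts = {name: lya_redshift(wl) for name, wl in filters.items()}

for name, z in redshifts.items():
    print(f"{name}: z = {z:.2f}")
```

To one decimal place the three results agree with the z=2.4, 3.1, and 4.5 stated in the abstract.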
Submitted 18 September, 2023;
originally announced September 2023.
-
Towards Robust Solvers for Nuclear Fusion Simulations Using JOREK: A Numerical Analysis Perspective
Authors:
Alex Quinlan,
Vandana Dwarka,
Ihor Holod,
Matthias Hoelzl
Abstract:
One of the most well-established codes for modeling non-linear Magnetohydrodynamics (MHD) for tokamak reactors is JOREK, which solves these equations with a Bézier-surface-based finite element method. This code produces a highly sparse but also very large linear system. The main solver behind the code uses the Generalized Minimum Residual Method (GMRES) with a physics-based preconditioner, but even with the preconditioner there are issues with memory and computation costs, and the solver does not always converge well. This work contains the first thorough study of the mathematical properties of the underlying linear system. It enables us to diagnose and pinpoint the cause of hampered convergence. In particular, analyzing the spectral properties of the matrix and the preconditioned system with numerical linear algebra techniques will open the door to investigating more performant solver strategies, such as projection methods.
Submitted 30 August, 2023;
originally announced August 2023.