-
The Quantum Cryptography Approach: Unleashing the Potential of Quantum Key Reconciliation Protocol for Secure Communication
Authors:
Neha Sharma,
Vikas Saxena
Abstract:
Quantum cryptography is the study of delivering secret communications across a quantum channel. Recently, Quantum Key Distribution (QKD) has been recognized as the most important breakthrough in quantum cryptography. This process facilitates two distant parties to share secure communications based on physical laws. The BB84 protocol was developed in 1984 and remains the most widely used among BB92…
▽ More
Quantum cryptography is the study of delivering secret communications across a quantum channel. Recently, Quantum Key Distribution (QKD) has been recognized as the most important breakthrough in quantum cryptography. This process facilitates two distant parties to share secure communications based on physical laws. The BB84 protocol was developed in 1984 and remains the most widely used among BB92, Ekert91, COW, and SARG04 protocols. However the practical security of QKD with imperfect devices have been widely discussed, and there are many ways to guarantee that generated key by QKD still provides unconditional security. This paper proposed a novel method that allows users to communicate while generating the secure keys as well as securing the transmission without any leakage of the data. In this approach sender will never reveal her basis, hence neither the receiver nor the intruder will get knowledge of the fundamental basis.Further to detect Eve, polynomial interpolation is also used as a key verification technique. In order to fully utilize the quantum computing capabilities provided by IBM quantum computers, the protocol is executed using the Qiskit backend for 45 qubits. This article discusses a plot of % error against alpha (strength of eavesdropping). As a result, different types of noise have been included, and the success probability of the desired key bits has been determined. Furthermore, the success probability under depolarizing noise is explained for different qubit counts.Last but not least, even when the applied noise is increased to maximum capacity, a 50% probability of successful key generation is still observed in an experiment.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Minimal presentation, finite quotients and lower central series of cactus groups
Authors:
Hugo Chemin,
Neha Nanda
Abstract:
This article deals with the study of cactus groups from a combinatorial point of view. These groups have been gaining prominence lately in various domains of mathematics, amongst which are their relations with well-known groups such as braid groups, diagram groups, to name a few. We compute a minimal presentation for cactus groups in terms of generators and non-redundant relations. We also constru…
▽ More
This article deals with the study of cactus groups from a combinatorial point of view. These groups have been gaining prominence lately in various domains of mathematics, amongst which are their relations with well-known groups such as braid groups, diagram groups, to name a few. We compute a minimal presentation for cactus groups in terms of generators and non-redundant relations. We also construct homomorphisms of these groups onto certain finite groups, which leads to results about finite quotients of cactus groups. More precisely, we prove that all (infinite) dihedral groups appear as quotients of cactus groups. We also investigate the lower central series and its consecutive quotients. While there are already known established similarities with braid groups, we deduce a considerable disparity between the two groups.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
ExTraCT -- Explainable Trajectory Corrections from language inputs using Textual description of features
Authors:
J-Anne Yow,
Neha Priyadarshini Garg,
Manoj Ramanathan,
Wei Tech Ang
Abstract:
Natural language provides an intuitive and expressive way of conveying human intent to robots. Prior works employed end-to-end methods for learning trajectory deformations from language corrections. However, such methods do not generalize to new initial trajectories or object configurations. This work presents ExTraCT, a modular framework for trajectory corrections using natural language that comb…
▽ More
Natural language provides an intuitive and expressive way of conveying human intent to robots. Prior works employed end-to-end methods for learning trajectory deformations from language corrections. However, such methods do not generalize to new initial trajectories or object configurations. This work presents ExTraCT, a modular framework for trajectory corrections using natural language that combines Large Language Models (LLMs) for natural language understanding and trajectory deformation functions. Given a scene, ExTraCT generates the trajectory modification features (scene-specific and scene-independent) and their corresponding natural language textual descriptions for the objects in the scene online based on a template. We use LLMs for semantic matching of user utterances to the textual descriptions of features. Based on the feature matched, a trajectory modification function is applied to the initial trajectory, allowing generalization to unseen trajectories and object configurations. Through user studies conducted both in simulation and with a physical robot arm, we demonstrate that trajectories deformed using our method were more accurate and were preferred in about 80\% of cases, outperforming the baseline. We also showcase the versatility of our system in a manipulation task and an assistive feeding task.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
On a Subclass of Starlike Functions Associated with a Strip Domain
Authors:
S. Sivaprasad Kumar,
Neha Verma
Abstract:
In the present investigation, we introduce a new subclass of starlike functions defined by $\mathcal{S}^{*}_τ:=\{f\in \mathcal{A}:zf'(z)/f(z) \prec 1+\arctan z=:τ(z)\}$, where $τ(z)$ maps the unit disk $\mathbb {D}:= \{z\in \mathbb{C}:|z|<1\}$ onto a strip domain. We derive structural formulae, growth, and distortion theorems for $\mathcal{S}^{*}_τ$. Also, inclusion relations with some well-known…
▽ More
In the present investigation, we introduce a new subclass of starlike functions defined by $\mathcal{S}^{*}_τ:=\{f\in \mathcal{A}:zf'(z)/f(z) \prec 1+\arctan z=:τ(z)\}$, where $τ(z)$ maps the unit disk $\mathbb {D}:= \{z\in \mathbb{C}:|z|<1\}$ onto a strip domain. We derive structural formulae, growth, and distortion theorems for $\mathcal{S}^{*}_τ$. Also, inclusion relations with some well-known subclasses of $\mathcal{S}$ are established and obtain sharp radius estimates, as well as sharp coefficient bounds for the initial five coefficients and the second and third-order Hankel determinants of $\mathcal{S}^{*}_τ$.
△ Less
Submitted 23 December, 2023;
originally announced December 2023.
-
PoseViNet: Distracted Driver Action Recognition Framework Using Multi-View Pose Estimation and Vision Transformer
Authors:
Neha Sengar,
Indra Kumari,
Jihui Lee,
Dongsoo Har
Abstract:
Driver distraction is a principal cause of traffic accidents. In a study conducted by the National Highway Traffic Safety Administration, engaging in activities such as interacting with in-car menus, consuming food or beverages, or engaging in telephonic conversations while operating a vehicle can be significant sources of driver distraction. From this viewpoint, this paper introduces a novel meth…
▽ More
Driver distraction is a principal cause of traffic accidents. In a study conducted by the National Highway Traffic Safety Administration, engaging in activities such as interacting with in-car menus, consuming food or beverages, or engaging in telephonic conversations while operating a vehicle can be significant sources of driver distraction. From this viewpoint, this paper introduces a novel method for detection of driver distraction using multi-view driver action images. The proposed method is a vision transformer-based framework with pose estimation and action inference, namely PoseViNet. The motivation for adding posture information is to enable the transformer to focus more on key features. As a result, the framework is more adept at identifying critical actions. The proposed framework is compared with various state-of-the-art models using SFD3 dataset representing 10 behaviors of drivers. It is found from the comparison that the PoseViNet outperforms these models. The proposed framework is also evaluated with the SynDD1 dataset representing 16 behaviors of driver. As a result, the PoseViNet achieves 97.55% validation accuracy and 90.92% testing accuracy with the challenging dataset.
△ Less
Submitted 22 December, 2023;
originally announced December 2023.
-
A Novel Criterion for Interpreting Acoustic Emission Damage Signals Based on Cluster Onset Distribution
Authors:
Emmanuel Ramasso,
Martin Mbarga Nkogo,
Neha Chandarana,
Gilles Bourbon,
Patrice Le Moal,
Quentin Lefebvre,
Martial Personeni,
Constantinos Soutis,
Matthieu Gresil,
Sébastien Thibaud
Abstract:
Structural health monitoring (SHM) relies on non-destructive techniques such as acoustic emission (AE) that generate large amounts of data over the lifespan of systems. Clustering methods are used to interpret these data and gain insights into damage progression and mechanisms. Conventional methods for evaluating clustering results utilise clustering validity indices (CVI) that prioritise compact…
▽ More
Structural health monitoring (SHM) relies on non-destructive techniques such as acoustic emission (AE) that generate large amounts of data over the lifespan of systems. Clustering methods are used to interpret these data and gain insights into damage progression and mechanisms. Conventional methods for evaluating clustering results utilise clustering validity indices (CVI) that prioritise compact and separable clusters. This paper introduces a novel approach based on the temporal sequence of cluster onsets, indicating the initial appearance of potential damage and allowing for early detection of defect initiation. The proposed CVI is based on the Kullback-Leibler divergence and can incorporate prior information about damage onsets when available. Three experiments on real-world datasets validate the effectiveness of the proposed method. The first benchmark focuses on detecting the loosening of bolted plates under vibration, where the onset-based CVI outperforms the conventional approach in both cluster quality and the accuracy of bolt loosening detection. The results demonstrate not only superior cluster quality but also unmatched precision in identifying cluster onsets, whether during uniform or accelerated damage growth. The two additional applications stem from industrial contexts. The first focuses on micro-drilling of hard materials using electrical discharge machining, demonstrating, for the first time, that the proposed criterion can effectively retrieve electrode progression to the reference depth, thus validating the setting of the machine to ensure structural integrity. The final application involves damage understanding in a composite/metal hybrid joint structure, where the cluster timeline is used to establish a scenario leading to critical failure due to slippage.
△ Less
Submitted 6 June, 2025; v1 submitted 20 December, 2023;
originally announced December 2023.
-
On near orthogonality of certain $k$-vectors involving generalized Ramanujan sums
Authors:
Neha Elizabeth Thomas,
K Vishnu Namboothiri
Abstract:
The near orthgonality of certain $k$-vectors involving the Ramanujan sums were studied by E. Alkan in [J. Number Theory, 140:147--168 (2014)]. Here we undertake the study of similar vectors involving a generalization of the Ramanujan sums defined by E. Cohen in [Duke Math. J., 16(2):85--90 (1949)]. We also prove that the weighted average…
▽ More
The near orthgonality of certain $k$-vectors involving the Ramanujan sums were studied by E. Alkan in [J. Number Theory, 140:147--168 (2014)]. Here we undertake the study of similar vectors involving a generalization of the Ramanujan sums defined by E. Cohen in [Duke Math. J., 16(2):85--90 (1949)]. We also prove that the weighted average $\frac{1}{k^{s(r+1)}}\sum \limits_{j=1}^{k^s}j^rc_k^{(s)}(j)$ remains positve for all $r\geq 1$. Further, we give a lower bound for $\max\limits_{N}\left|\sum \limits_{j=1}^{N^s}c_k^{(s)}(j) \right|$.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Orthogonality of a new family of $q$-Sobolev type polynomials
Authors:
Neha,
A. Swaminathan
Abstract:
In this work, we introduce and construct specific $q$-polynomials that are desired from the well-established families of $q$-orthogonal polynomials, namely little $q$-Jacobi polynomials and $q$-Laguerre polynomials, respectively. We examine these newly constructed $q$-polynomials and observe that they possess integral representations of little $q$-Jacobi polynomials and $q$-Laguerre polynomials. T…
▽ More
In this work, we introduce and construct specific $q$-polynomials that are desired from the well-established families of $q$-orthogonal polynomials, namely little $q$-Jacobi polynomials and $q$-Laguerre polynomials, respectively. We examine these newly constructed $q$-polynomials and observe that they possess integral representations of little $q$-Jacobi polynomials and $q$-Laguerre polynomials. These polynomials solve a third-order $q$-difference equation and display an unconventional four-term recurrence relation. This unique recurrence relation makes us categorize them as $q$-Sobolev-type orthogonal polynomials. This motivation leads to defining the general Sobolev-type orthogonality for $q$-polynomials. Special cases of these polynomials are also explored and discussed. Furthermore, we delve into the behavior of these $q$-orthogonal polynomials of Sobolev type as the parameters approach $1$. We also examine their zeros and interlacing properties.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Dissecting the Stochastic Gravitational Wave Background with Astrometry
Authors:
Mesut Çalışkan,
Yifan Chen,
Liang Dai,
Neha Anil Kumar,
Isak Stomberg,
Xiao Xue
Abstract:
Astrometry, the precise measurement of star motions, offers an alternative avenue to investigate low-frequency gravitational waves through the spatial deflection of photons, complementing pulsar timing arrays reliant on timing residuals. Upcoming data from Gaia and Roman can not only cross-check pulsar timing array findings but also explore the uncharted frequency range bridging pulsar timing arra…
▽ More
Astrometry, the precise measurement of star motions, offers an alternative avenue to investigate low-frequency gravitational waves through the spatial deflection of photons, complementing pulsar timing arrays reliant on timing residuals. Upcoming data from Gaia and Roman can not only cross-check pulsar timing array findings but also explore the uncharted frequency range bridging pulsar timing arrays and LISA. We present an analytical framework to evaluate the feasibility of detecting a gravitational wave background, considering measurement noise and the intrinsic variability of the stochastic background. Furthermore, we highlight astrometry's crucial role in uncovering key properties of the gravitational wave background, such as spectral index and chirality, employing information-matrix analysis. Finally, we simulate the emergence of quadrupolar correlations, commonly referred to as the generalized Hellings-Downs curves.
△ Less
Submitted 3 May, 2024; v1 submitted 5 December, 2023;
originally announced December 2023.
-
Linear polarization of the stochastic gravitational-wave background with pulsar timing arrays
Authors:
Neha Anil Kumar,
Mesut Çalışkan,
Gabriela Sato-Polito,
Marc Kamionkowski,
Lingyuan Ji
Abstract:
Pulsar-timing collaborations have recently reported evidence for the detection of an isotropic stochastic gravitational-wave background consistent with one sourced by a population of inspiralling supermassive black hole binaries. However, a certain degree of anisotropy and polarization may be present. Thus, the characterization of the energy density and polarization of the background at different…
▽ More
Pulsar-timing collaborations have recently reported evidence for the detection of an isotropic stochastic gravitational-wave background consistent with one sourced by a population of inspiralling supermassive black hole binaries. However, a certain degree of anisotropy and polarization may be present. Thus, the characterization of the energy density and polarization of the background at different angular scales is important. In this paper, we describe the signatures of linear polarization in the stochastic gravitational-wave background on the timing residuals obtained with pulsar-timing arrays. We expand the linear polarization map in terms of spin-weighted spherical harmonics and recast it into the $E$-mode (parity even) and $B$-mode (parity odd) basis. We provide expressions for the minimum-variance estimators for the coefficients of that expansion and evaluate the smallest detectable signal as a function of the signal-to-noise ratio with which the isotropic GW signal is detected and the number of pulsars in the survey. We evaluate the covariance between the estimators for the spherical-harmonic coefficients of the linear polarization $E$-modes and those for the intensity anisotropy. We also show that there is no covariance between the spherical-harmonic coefficients for the $B$-modes of the linear polarization and those for the circular polarization, even though both have the same parity. Our approach results in simple, elegant, and easily evaluated expressions for the overlap reduction functions for linear polarization.
△ Less
Submitted 5 September, 2024; v1 submitted 5 December, 2023;
originally announced December 2023.
-
Disentangling the Effects of Data Augmentation and Format Transform in Self-Supervised Learning of Image Representations
Authors:
Neha Kalibhat,
Warren Morningstar,
Alex Bijamov,
Luyang Liu,
Karan Singhal,
Philip Mansfield
Abstract:
Self-Supervised Learning (SSL) enables training performant models using limited labeled data. One of the pillars underlying vision SSL is the use of data augmentations/perturbations of the input which do not significantly alter its semantic content. For audio and other temporal signals, augmentations are commonly used alongside format transforms such as Fourier transforms or wavelet transforms. Un…
▽ More
Self-Supervised Learning (SSL) enables training performant models using limited labeled data. One of the pillars underlying vision SSL is the use of data augmentations/perturbations of the input which do not significantly alter its semantic content. For audio and other temporal signals, augmentations are commonly used alongside format transforms such as Fourier transforms or wavelet transforms. Unlike augmentations, format transforms do not change the information contained in the data; rather, they express the same information in different coordinates. In this paper, we study the effects of format transforms and augmentations both separately and together on vision SSL. We define augmentations in frequency space called Fourier Domain Augmentations (FDA) and show that training SSL models on a combination of these and image augmentations can improve the downstream classification accuracy by up to 1.3% on ImageNet-1K. We also show improvements against SSL baselines in few-shot and transfer learning setups using FDA. Surprisingly, we also observe that format transforms can improve the quality of learned representations even without augmentations; however, the combination of the two techniques yields better quality.
△ Less
Submitted 2 December, 2023;
originally announced December 2023.
-
Integration of Swin UNETR and statistical shape modeling for a semi-automated segmentation of the knee and biomechanical modeling of articular cartilage
Authors:
Reza Kakavand,
Mehrdad Palizi,
Peyman Tahghighi,
Reza Ahmadi,
Neha Gianchandani,
Samer Adeeb,
Roberto Souza,
W. Brent Edwards,
Amin Komeili
Abstract:
Simulation studies like finite element (FE) modeling provide insight into knee joint mechanics without patient experimentation. Generic FE models represent biomechanical behavior of the tissue by overlooking variations in geometry, loading, and material properties of a population. On the other hand, subject-specific models include these specifics, resulting in enhanced predictive precision. Howeve…
▽ More
Simulation studies like finite element (FE) modeling provide insight into knee joint mechanics without patient experimentation. Generic FE models represent biomechanical behavior of the tissue by overlooking variations in geometry, loading, and material properties of a population. On the other hand, subject-specific models include these specifics, resulting in enhanced predictive precision. However, creating such models is laborious and time-intensive. The present study aimed to enhance subject-specific knee joint FE modeling by incorporating a semi-automated segmentation algorithm. This segmentation was a 3D Swin UNETR for an initial segmentation of the femur and tibia, followed by a statistical shape model (SSM) adjustment to improve surface roughness and continuity. Five hundred and seven magnetic resonance images (MRIs) from the Osteoarthritis Initiative (OAI) database were used to build and validate the segmentation model. A semi-automated FE model was developed using this semi-automated segmentation. On the other hand, a manual FE model was developed through manual segmentation (i.e., the gold standard approach). Both FE models were subjected to gait loading. The predicted mechanical response of manual and semi-automated FE models were compared. In the result, our semi-automated segmentation achieved Dice similarity coefficient (DSC) over 98% for both femur and tibia. The mechanical results (max principal stress, max principal strain, fluid pressure, fibril strain, and contact area) showed no significant differences between the manual and semi-automated FE models, indicating the effectiveness of the proposed semi-automated segmentation in creating accurate knee joint FE models. ( https://data.mendeley.com/datasets/k5hdc9cz7w/1 ).
△ Less
Submitted 18 September, 2023;
originally announced December 2023.
-
Reconstructing patchy helium reionization using the cosmic microwave background and large-scale structure
Authors:
Mesut Çalışkan,
Neha Anil Kumar,
Selim C. Hotinli,
Marc Kamionkowski
Abstract:
The intergalactic helium became fully ionized by the end of cosmic noon ($z\sim2$). Similarly to the reionization of hydrogen, helium reionization is expected to be patchy, driven by luminous quasars that ionize the intergalactic gas in their surrounding environment. Probing the morphology of ionized electrons during this epoch can provide crucial information about early structure formation, inclu…
▽ More
The intergalactic helium became fully ionized by the end of cosmic noon ($z\sim2$). Similarly to the reionization of hydrogen, helium reionization is expected to be patchy, driven by luminous quasars that ionize the intergalactic gas in their surrounding environment. Probing the morphology of ionized electrons during this epoch can provide crucial information about early structure formation, including the clustering and luminosities of quasars, the accretion rates, variability, and lifetimes of active galactic nuclei, as well as the growth and evolution of supermassive black holes. In this study, we present how measurements of the cosmic microwave background (CMB) can be used to reconstruct the optical-depth fluctuations resulting from patchy helium reionization. As helium reionization occurred at lower redshifts, upcoming probes of large-scale structure surveys will present a significant opportunity to enhance the prospects of probing this epoch by their combined analysis with the CMB. Using a joint information-matrix analysis of hydrogen and helium reionization, we show that near-future galaxy and CMB surveys will have enough statistical power to detect optical-depth fluctuations due to doubly-ionized helium, providing a way of measuring the redshift and duration of helium reionization to high significance. We also show that modeling uncertainties in helium reionization can impact the measurement precision of parameters characterizing hydrogen reionization.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
Exploring light nuclei production at RHIC and LHC energies with A Multi-Phase Transport model and a coalescence afterburner
Authors:
Yoshini Bailung,
Neha Shah,
Ankhi Roy
Abstract:
In heavy-ion collisions, understanding how light nuclei species are produced can provide insight into the nature of hadronic interactions in extreme conditions. It can also shed light on understanding the matter-antimatter asymmetry and dark matter searches in astrophysical processes. To investigate the production mechanism of light nuclei such as deuteron, triton, and helium-3, we use a naive coa…
▽ More
In heavy-ion collisions, understanding how light nuclei species are produced can provide insight into the nature of hadronic interactions in extreme conditions. It can also shed light on understanding the matter-antimatter asymmetry and dark matter searches in astrophysical processes. To investigate the production mechanism of light nuclei such as deuteron, triton, and helium-3, we use a naive coalescence afterburner coupled to the well-known $``$A Multi-Phase Transport model" (AMPT). We focus on studying the production of light nuclei in central Au+Au collisions at different center of mass energies ($\sqrt{s_{_{\rm{NN}}}}$ = 19.6, 39, and 200 GeV) and in Pb+Pb collisions at $\sqrt{s_{_{\rm{NN}}}}$ = 2.76 TeV, at mid-rapidity. We generate events with the string melting version of AMPT, and feed the information of the nucleons with spatial and momentum conditions into the coalescence afterburner. Our study reports differential and integrated yields in transverse momentum ($p_{\rm{T}}$) of the light nuclei in different center of mass energies. We also estimate the coalescence parameters ($B_A$) as a function of $p_{\rm{T}}$ and collision energy for (anti-)deuterons, tritons and helium-3s for Au+Au and Pb+Pb collisions, which are compared to other light nuclei production studies. All results are compared with measurements from the STAR and ALICE experiments.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Uncertainty in Additive Feature Attribution methods
Authors:
Abhishek Madaan,
Tanya Chowdhury,
Neha Rana,
James Allan,
Tanmoy Chakraborty
Abstract:
In this work, we explore various topics that fall under the umbrella of Uncertainty in post-hoc Explainable AI (XAI) methods. We in particular focus on the class of additive feature attribution explanation methods. We first describe our specifications of uncertainty and compare various statistical and recent methods to quantify the same. Next, for a particular instance, we study the relationship b…
▽ More
In this work, we explore various topics that fall under the umbrella of Uncertainty in post-hoc Explainable AI (XAI) methods. We in particular focus on the class of additive feature attribution explanation methods. We first describe our specifications of uncertainty and compare various statistical and recent methods to quantify the same. Next, for a particular instance, we study the relationship between a feature's attribution and its uncertainty and observe little correlation. As a result, we propose a modification in the distribution from which perturbations are sampled in LIME-based algorithms such that the important features have minimal uncertainty without an increase in computational cost. Next, while studying how the uncertainty in explanations varies across the feature space of a classifier, we observe that a fraction of instances show near-zero uncertainty. We coin the term "stable instances" for such instances and diagnose factors that make an instance stable. Next, we study how an XAI algorithm's uncertainty varies with the size and complexity of the underlying model. We observe that the more complex the model, the more inherent uncertainty is exhibited by it. As a result, we propose a measure to quantify the relative complexity of a blackbox classifier. This could be incorporated, for example, in LIME-based algorithms' sampling densities, to help different explanation algorithms achieve tighter confidence levels. Together, the above measures would have a strong impact on making XAI models relatively trustworthy for the end-user as well as aiding scientific discovery.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
How Strong a Kick Should be to Topple Northeastern's Tumbling Robot?
Authors:
Adarsh Salagame,
Neha Bhattachan,
Andre Caetano,
Ian McCarthy,
Henry Noyes,
Brandon Petersen,
Alexander Qiu,
Matthew Schroeter,
Nolan Smithwick,
Konrad Sroka,
Jason Widjaja,
Yash Bohra,
Kaushik Venkatesh,
Kruthika Gangaraju,
Paul Ghanem,
Ioannis Mandralis,
Eric Sihite,
Arash Kalantari,
Alireza Ramezani
Abstract:
Rough terrain locomotion has remained one of the most challenging mobility questions. In 2022, NASA's Innovative Advanced Concepts (NIAC) Program invited US academic institutions to participate NASA's Breakthrough, Innovative \& Game-changing (BIG) Idea competition by proposing novel mobility systems that can negotiate extremely rough terrain, lunar bumpy craters. In this competition, Northeastern…
▽ More
Rough terrain locomotion has remained one of the most challenging mobility questions. In 2022, NASA's Innovative Advanced Concepts (NIAC) Program invited US academic institutions to participate NASA's Breakthrough, Innovative \& Game-changing (BIG) Idea competition by proposing novel mobility systems that can negotiate extremely rough terrain, lunar bumpy craters. In this competition, Northeastern University won NASA's top Artemis Award award by proposing an articulated robot tumbler called COBRA (Crater Observing Bio-inspired Rolling Articulator). This report briefly explains the underlying principles that made COBRA successful in competing with other concepts ranging from cable-driven to multi-legged designs from six other participating US institutions.
△ Less
Submitted 24 November, 2023;
originally announced November 2023.
-
Benefits and Harms of Large Language Models in Digital Mental Health
Authors:
Munmun De Choudhury,
Sachin R. Pendse,
Neha Kumar
Abstract:
The past decade has been transformative for mental health research and practice. The ability to harness large repositories of data, whether from electronic health records (EHR), mobile devices, or social media, has revealed a potential for valuable insights into patient experiences, promising early, proactive interventions, as well as personalized treatment plans. Recent developments in generative…
▽ More
The past decade has been transformative for mental health research and practice. The ability to harness large repositories of data, whether from electronic health records (EHR), mobile devices, or social media, has revealed a potential for valuable insights into patient experiences, promising early, proactive interventions, as well as personalized treatment plans. Recent developments in generative artificial intelligence, particularly large language models (LLMs), show promise in leading digital mental health to uncharted territory. Patients are arriving at doctors' appointments with information sourced from chatbots, state-of-the-art LLMs are being incorporated in medical software and EHR systems, and chatbots from an ever-increasing number of startups promise to serve as AI companions, friends, and partners. This article presents contemporary perspectives on the opportunities and risks posed by LLMs in the design, development, and implementation of digital mental health tools. We adopt an ecological framework and draw on the affordances offered by LLMs to discuss four application areas -- care-seeking behaviors from individuals in need of care, community care provision, institutional and medical care provision, and larger care ecologies at the societal level. We engage in a thoughtful consideration of whether and how LLM-based technologies could or should be employed for enhancing mental health. The benefits and harms our article surfaces could serve to help shape future research, advocacy, and regulatory efforts focused on creating more responsible, user-friendly, equitable, and secure LLM-based tools for mental health treatment and intervention.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Efficient Computation of Overlap Reduction Functions for Pulsar Timing Arrays
Authors:
Neha Anil Kumar,
Marc Kamionkowski
Abstract:
Pulsar timing arrays seek and study gravitational waves (GWs) through the angular two-point correlation function of timing residuals they induce in pulsars. The two-point correlation function induced by the standard transverse-traceless GWs is the famous Hellings-Downs curve, a function only of the angle between the two pulsars. Additional polarization modes (vector/scalar) that may arise in alter…
▽ More
Pulsar timing arrays seek and study gravitational waves (GWs) through the angular two-point correlation function of timing residuals they induce in pulsars. The two-point correlation function induced by the standard transverse-traceless GWs is the famous Hellings-Downs curve, a function only of the angle between the two pulsars. Additional polarization modes (vector/scalar) that may arise in alternative-gravity theories have different angular correlation functions. Furthermore, anisotropy, linear, or circular polarization in the stochastic GW background gives rise to additional structure in the two-point correlation function that cannot be written simply in terms of the angular separation of the two pulsars. In this paper, we provide a simple formula for the most general two-point correlation function--or overlap reduction function (ORF)--for a gravitational-wave background with an arbitrary polarization state, possibly containing anisotropies in its intensity and polarization (linear or circular). We provide specific expressions for the ORFs sourced by the general-relativistic transverse-traceless GW modes as well as vector (or spin-1) modes that may arise in alternative-gravity theories.
△ Less
Submitted 22 October, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
32x100 GHz WDM filter based on ultra-compact silicon rings with a high thermal tuning efficiency of 5.85 mW/pi
Authors:
Qingzhong Deng,
Ahmed H. El-Saeed,
Alaa Elshazly,
Guy Lepage,
Chiara Marchese,
Hakim Kobbi,
Rafal Magdziak,
Jeroen De Coster,
Neha Singh,
Marko Ersek Filipcic,
Kristof Croes,
Dimitrios Velenis,
Maumita Chakrabarti,
Peter De Heyn,
Peter Verheyen,
Philippe Absil,
Filippo Ferraro,
Yoojin Ban,
Joris Van Campenhout
Abstract:
To the best of our knowledge, this paper has achieved the lowest thermal tuning power (5.85 mW/pi) for silicon rings with FSR>=3.2 THz, and the first silicon ring-based WDM-32x100 GHz filter.
To the best of our knowledge, this paper has achieved the lowest thermal tuning power (5.85 mW/pi) for silicon rings with FSR>=3.2 THz, and the first silicon ring-based WDM-32x100 GHz filter.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Decoding the Molecular Universe -- Workshop Report
Authors:
Thomas O. Metz,
Joshua N. Adkins,
Peter B. Armentrout,
Patrick Chain,
Fanny Chu,
Courtney D Corley,
John R. Cort,
Elizabeth Denis,
Daniel Drell,
Katherine R. Duncan,
Robert G. Ewing,
Facundo M. Fernandez,
Oliver Fiehn,
Neha Garg,
Stefan Grimme,
Christopher Henry,
Robert L. Hettich,
Tobias Kind,
Roger G. Linington,
Gary W. Miller,
Trent Northen,
Kirsten Overdahl,
Ari Patrinos,
Daniel Raftery,
Paul Rigor
, et al. (8 additional authors not shown)
Abstract:
On August 9-10, 2023, a workshop was convened at the Pacific Northwest National Laboratory (PNNL) in Richland, WA that brought together a group of internationally recognized experts in metabolomics, natural products discovery, chemical ecology, chemical and biological threat assessment, cheminformatics, computational chemistry, cloud computing, artificial intelligence, and novel technology develop…
▽ More
On August 9-10, 2023, a workshop was convened at the Pacific Northwest National Laboratory (PNNL) in Richland, WA that brought together a group of internationally recognized experts in metabolomics, natural products discovery, chemical ecology, chemical and biological threat assessment, cheminformatics, computational chemistry, cloud computing, artificial intelligence, and novel technology development. These experts were invited to assess the value and feasibility of a grand-scale project to create new technologies that would allow the identification and quantification of all small molecules, or to decode the molecular universe. The Decoding the Molecular Universe project would extend and complement the success of the Human Genome Project by developing new capabilities and technologies to measure small molecules (defined as non-protein, non-polymer molecules less than 1500 Daltons) of any origin and generated in biological systems or produced abiotically. Workshop attendees 1) explored what new understanding of biological and environmental systems could be revealed through the lens of small molecules; 2) characterized the similarities in current needs and technical challenges between each science or mission area for unambiguous and comprehensive determination of the composition and quantities of small molecules of any sample; 3) determined the extent to which technologies or methods currently exist for unambiguously and comprehensively determining the small molecule composition of any sample and in a reasonable time; and 4) identified the attributes of the ideal technology or approach for universal small molecule measurement and identification. The workshop concluded with a discussion of how a project of this scale could be undertaken, possible thrusts for the project, early proof-of-principle applications, and similar efforts upon which the project could be modeled.
△ Less
Submitted 19 November, 2023;
originally announced November 2023.
-
TAPA-CS: Enabling Scalable Accelerator Design on Distributed HBM-FPGAs
Authors:
Neha Prakriya,
Yuze Chi,
Suhail Basalama,
Linghao Song,
Jason Cong
Abstract:
Despite the increasing adoption of Field-Programmable Gate Arrays (FPGAs) in compute clouds, there remains a significant gap in programming tools and abstractions which can leverage network-connected, cloud-scale, multi-die FPGAs to generate accelerators with high frequency and throughput. To this end, we propose TAPA-CS, a task-parallel dataflow programming framework which automatically partition…
▽ More
Despite the increasing adoption of Field-Programmable Gate Arrays (FPGAs) in compute clouds, there remains a significant gap in programming tools and abstractions which can leverage network-connected, cloud-scale, multi-die FPGAs to generate accelerators with high frequency and throughput. To this end, we propose TAPA-CS, a task-parallel dataflow programming framework which automatically partitions and compiles a large design across a cluster of FPGAs with no additional user effort while achieving high frequency and throughput. TAPA-CS has three main contributions. First, it is an open-source framework which allows users to leverage virtually "unlimited" accelerator fabric, high-bandwidth memory (HBM), and on-chip memory, by abstracting away the underlying hardware. This reduces the user's programming burden to a logical one, enabling software developers and researchers with limited FPGA domain knowledge to deploy larger designs than possible earlier. Second, given as input a large design, TAPA-CS automatically partitions the design to map to multiple FPGAs, while ensuring congestion control, resource balancing, and overlapping of communication and computation. Third, TAPA-CS couples coarse-grained floorplanning with automated interconnect pipelining at the inter- and intra-FPGA levels to ensure high frequency. We have tested TAPA-CS on our multi-FPGA testbed where the FPGAs communicate through a high-speed 100Gbps Ethernet infrastructure. We have evaluated the performance and scalability of our tool on designs, including systolic-array based convolutional neural networks (CNNs), graph processing workloads such as page rank, stencil applications like the Dilate kernel, and K-nearest neighbors (KNN). TAPA-CS has the potential to accelerate development of increasingly complex and large designs on the low power and reconfigurable FPGAs.
△ Less
Submitted 1 February, 2024; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering
Authors:
Neha Srikanth,
Rupak Sarkar,
Heran Mane,
Elizabeth M. Aparicio,
Quynh C. Nguyen,
Rachel Rudinger,
Jordan Boyd-Graber
Abstract:
Questions posed by information-seeking users often contain implicit false or potentially harmful assumptions. In a high-risk domain such as maternal and infant health, a question-answering system must recognize these pragmatic constraints and go beyond simply answering user questions, examining them in context to respond helpfully. To achieve this, we study assumptions and implications, or pragmat…
▽ More
Questions posed by information-seeking users often contain implicit false or potentially harmful assumptions. In a high-risk domain such as maternal and infant health, a question-answering system must recognize these pragmatic constraints and go beyond simply answering user questions, examining them in context to respond helpfully. To achieve this, we study assumptions and implications, or pragmatic inferences, made when mothers ask questions about pregnancy and infant care by collecting a dataset of 2,727 inferences from 500 questions across three diverse sources. We study how health experts naturally address these inferences when writing answers, and illustrate that informing existing QA pipelines with pragmatic inferences produces responses that are more complete, mitigating the propagation of harmful beliefs.
△ Less
Submitted 2 April, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Special least squares solutions of the reduced biquaternion matrix equation with applications
Authors:
Sk. Safique Ahmad,
Neha Bhadala
Abstract:
This paper presents an efficient method for obtaining the least squares Hermitian solutions of the reduced biquaternion matrix equation $(AXB, CXD) = (E, F )$. The method leverages the real representation of reduced biquaternion matrices. Furthermore, we establish the necessary and sufficient conditions for the existence and uniqueness of the Hermitian solution, along with a general expression for…
▽ More
This paper presents an efficient method for obtaining the least squares Hermitian solutions of the reduced biquaternion matrix equation $(AXB, CXD) = (E, F )$. The method leverages the real representation of reduced biquaternion matrices. Furthermore, we establish the necessary and sufficient conditions for the existence and uniqueness of the Hermitian solution, along with a general expression for it. Notably, this approach differs from the one previously developed by Yuan et al. $(2020)$, which relied on the complex representation of reduced biquaternion matrices. In contrast, our method exclusively employs real matrices and utilizes real arithmetic operations, resulting in enhanced efficiency. We also apply our developed framework to find the Hermitian solutions for the complex matrix equation $(AXB, CXD) = (E, F )$, expanding its utility in addressing inverse problems. Specifically, we investigate its effectiveness in addressing partially described inverse eigenvalue problems. Finally, we provide numerical examples to demonstrate the effectiveness of our method and its superiority over the existing approach.
△ Less
Submitted 10 November, 2023;
originally announced November 2023.
-
Algebraic technique for mixed least squares and total least squares problem in the reduced biquaternion algebra
Authors:
Sk. Safique Ahmad,
Neha Bhadala
Abstract:
This paper presents the reduced biquaternion mixed least squares and total least squares (RBMTLS) method for solving an overdetermined system $AX \approx B$ in the reduced biquaternion algebra. The RBMTLS method is suitable when matrix $B$ and a few columns of matrix $A$ contain errors. By examining real representations of reduced biquaternion matrices, we investigate the conditions for the existe…
▽ More
This paper presents the reduced biquaternion mixed least squares and total least squares (RBMTLS) method for solving an overdetermined system $AX \approx B$ in the reduced biquaternion algebra. The RBMTLS method is suitable when matrix $B$ and a few columns of matrix $A$ contain errors. By examining real representations of reduced biquaternion matrices, we investigate the conditions for the existence and uniqueness of the real RBMTLS solution and derive an explicit expression for the real RBMTLS solution. The proposed technique covers two special cases: the reduced biquaternion total least squares (RBTLS) method and the reduced biquaternion least squares (RBLS) method. Furthermore, the developed method is also used to find the best approximate solution to $AX \approx B$ over a complex field. Lastly, a numerical example is presented to support our findings.
△ Less
Submitted 10 November, 2023;
originally announced November 2023.
-
L-structure least squares solutions of reduced biquaternion matrix equations with applications
Authors:
Sk. Safique Ahmad,
Neha Bhadala
Abstract:
This paper presents a framework for computing the structure-constrained least squares solutions to the generalized reduced biquaternion matrix equations (RBMEs). The investigation focuses on three different matrix equations: a linear matrix equation with multiple unknown L-structures, a linear matrix equation with one unknown L-structure, and the general coupled linear matrix equations with one un…
▽ More
This paper presents a framework for computing the structure-constrained least squares solutions to the generalized reduced biquaternion matrix equations (RBMEs). The investigation focuses on three different matrix equations: a linear matrix equation with multiple unknown L-structures, a linear matrix equation with one unknown L-structure, and the general coupled linear matrix equations with one unknown L-structure. Our approach leverages the complex representation of reduced biquaternion matrices. To showcase the versatility of the developed framework, we utilize it to find structure-constrained solutions for complex and real matrix equations, broadening its applicability to various inverse problems. Specifically, we explore its utility in addressing partially described inverse eigenvalue problems (PDIEPs) and generalized PDIEPs. Our study concludes with numerical examples.
△ Less
Submitted 10 November, 2023;
originally announced November 2023.
-
A Framework for Programmability in Digital Currency
Authors:
Nikhil George,
Thaddeus Dryja,
Neha Narula
Abstract:
Programmable money, enabled by digital currencies, facilitates outcomes beyond simple payments by allowing users to attach conditions to the movement of funds through code. However, there is a lack of clarity on defining programmable money, where programmability can be implemented, and the resulting tradeoffs. This paper provides a definition of programmable money with four key components: a forma…
▽ More
Programmable money, enabled by digital currencies, facilitates outcomes beyond simple payments by allowing users to attach conditions to the movement of funds through code. However, there is a lack of clarity on defining programmable money, where programmability can be implemented, and the resulting tradeoffs. This paper provides a definition of programmable money with four key components: a format for representing value, a set of programmable instructions, an execution environment providing a coherence guarantee, and rules around permissioning. We discuss programmability primitives, categorizing them into levels based on expressiveness. We outline four locations programmability could be offered - hardcoded into system rules, via client-supplied programs/smart contracts, in client code, or via intermediaries - analyzing benefits and risks of each. For policymakers evaluating central bank digital currencies, we recommend considering these aspects holistically and their interplay with regulation in system design. Our framework and vocabulary enable more nuanced analysis of implementing programmability.
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
Understanding the Role and Design Space of Demand Sinks in Low-carbon Power Systems
Authors:
Sam van der Jagt,
Neha Patankar,
Jesse Jenkins
Abstract:
As the availability of weather-dependent, zero marginal cost resources such as wind and solar power increases, a variety of flexible electricity loads, or `demand sinks', could be deployed to use intermittently available low-cost electricity to produce valuable outputs. This study provides a general framework to evaluate any potential demand sink technology and understand its viability to be deplo…
▽ More
As the availability of weather-dependent, zero marginal cost resources such as wind and solar power increases, a variety of flexible electricity loads, or `demand sinks', could be deployed to use intermittently available low-cost electricity to produce valuable outputs. This study provides a general framework to evaluate any potential demand sink technology and understand its viability to be deployed cost-effectively in low-carbon power systems. We use an electricity system optimization model to assess 98 discrete combinations of capital costs and output values that collectively span the range of feasible characteristics of potential demand sink technologies. We find that candidates like hydrogen electrolysis, direct air capture, and flexible electric heating can all achieve significant installed capacity (>10% of system peak load) if lower capital costs are reached in the future. Demand sink technologies significantly increase installed wind and solar capacity while not significantly affecting battery storage, firm generating capacity, or the average cost of electricity.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
A central limit theorem for Hilbert modular forms
Authors:
Jishu Das,
Neha Prabhu
Abstract:
For a prime ideal $\mathfrak{p}$ in a totally real number field $L$ with the adele ring $\mathbb{A}$, we study the distribution of angles $θ_π(\mathfrak{p})$ coming from Satake parameters corresponding to unramified $π_\mathfrak{p}$ where $π_\mathfrak{p}$ comes from a global $π$ ranging over a certain finite set $Π_{\underline{k}}(\mathfrak{n})$ of cuspidal automorphic representations of GL…
▽ More
For a prime ideal $\mathfrak{p}$ in a totally real number field $L$ with the adele ring $\mathbb{A}$, we study the distribution of angles $θ_π(\mathfrak{p})$ coming from Satake parameters corresponding to unramified $π_\mathfrak{p}$ where $π_\mathfrak{p}$ comes from a global $π$ ranging over a certain finite set $Π_{\underline{k}}(\mathfrak{n})$ of cuspidal automorphic representations of GL$_2(\mathbb{A})$ with trivial central character. For such a representation $π$, it is known that the angles $θ_π(\mathfrak{p})$ follow the Sato-Tate distribution. Fixing an interval $I\subseteq [0,π]$, we prove a central limit theorem for the number of angles $θ_π(\mathfrak{p})$ that lie in $I$, as $\mathrm{N}(\mathfrak{p})\to\infty$. The result assumes $\mathfrak{n}$ to be a squarefree integral ideal, and that the components in the weight vector $\underline{k}$ grow suitably fast as a function of $x$.
△ Less
Submitted 29 October, 2023;
originally announced October 2023.
-
Probing rotational decoherence with a trapped-ion planar rotor
Authors:
Neil Glikin,
Benjamin A. Stickler,
Ryan Tollefsen,
Sara Mouradian,
Neha Yadav,
Erik Urban,
Klaus Hornberger,
Hartmut Haeffner
Abstract:
The quantum rotor is one of the simplest model systems in quantum mechanics, but only in recent years has theoretical work revealed general fundamental scaling laws for its decoherence. For example, a superposition of orientations decoheres at a rate proportional to the sine squared of the angle between them. Here we observe scaling laws for rotational decoherence dynamics for the first time, usin…
▽ More
The quantum rotor is one of the simplest model systems in quantum mechanics, but only in recent years has theoretical work revealed general fundamental scaling laws for its decoherence. For example, a superposition of orientations decoheres at a rate proportional to the sine squared of the angle between them. Here we observe scaling laws for rotational decoherence dynamics for the first time, using a 4-micrometer diameter planar rotor composed of two Paul-trapped ions. We prepare the rotational motion of the ion crystal into superpositions of angular momentum with well-defined differences ranging from 1-3 $\hbar$, and measure the rate of decoherence. We also tune the system-environment interaction strength by introducing resonant electric field noise. The observed scaling relationships for decoherence are in excellent agreement with recent theoretical work, and are directly relevant to the growing development of rotor-based quantum applications.
△ Less
Submitted 24 January, 2025; v1 submitted 20 October, 2023;
originally announced October 2023.
-
Studying the Effects of Sex-related Differences on Brain Age Prediction using brain MR Imaging
Authors:
Mahsa Dibaji,
Neha Gianchandani,
Akhil Nair,
Mansi Singhal,
Roberto Souza,
Mariana Bento
Abstract:
While utilizing machine learning models, one of the most crucial aspects is how bias and fairness affect model outcomes for diverse demographics. This becomes especially relevant in the context of machine learning for medical imaging applications as these models are increasingly being used for diagnosis and treatment planning. In this paper, we study biases related to sex when developing a machine…
▽ More
While utilizing machine learning models, one of the most crucial aspects is how bias and fairness affect model outcomes for diverse demographics. This becomes especially relevant in the context of machine learning for medical imaging applications as these models are increasingly being used for diagnosis and treatment planning. In this paper, we study biases related to sex when developing a machine learning model based on brain magnetic resonance images (MRI). We investigate the effects of sex by performing brain age prediction considering different experimental designs: model trained using only female subjects, only male subjects and a balanced dataset. We also perform evaluation on multiple MRI datasets (Calgary-Campinas(CC359) and CamCAN) to assess the generalization capability of the proposed models. We found disparities in the performance of brain age prediction models when trained on distinct sex subgroups and datasets, in both final predictions and decision making (assessed using interpretability models). Our results demonstrated variations in model generalizability across sex-specific subgroups, suggesting potential biases in models trained on unbalanced datasets. This underlines the critical role of careful experimental design in generating fair and reliable outcomes.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
A voxel-level approach to brain age prediction: A method to assess regional brain aging
Authors:
Neha Gianchandani,
Mahsa Dibaji,
Johanna Ospel,
Fernando Vega,
Mariana Bento,
M. Ethan MacDonald,
Roberto Souza
Abstract:
Brain aging is a regional phenomenon, a facet that remains relatively under-explored within the realm of brain age prediction research using machine learning methods. Voxel-level predictions can provide localized brain age estimates that can provide granular insights into the regional aging processes. This is essential to understand the differences in aging trajectories in healthy versus diseased…
▽ More
Brain aging is a regional phenomenon, a facet that remains relatively under-explored within the realm of brain age prediction research using machine learning methods. Voxel-level predictions can provide localized brain age estimates that can provide granular insights into the regional aging processes. This is essential to understand the differences in aging trajectories in healthy versus diseased subjects. In this work, a deep learning-based multitask model is proposed for voxel-level brain age prediction from T1-weighted magnetic resonance images. The proposed model outperforms the models existing in the literature and yields valuable clinical insights when applied to both healthy and diseased populations. Regional analysis is performed on the voxel-level brain age predictions to understand aging trajectories of known anatomical regions in the brain and show that there exist disparities in regional aging trajectories of healthy subjects compared to ones with underlying neurological disorders such as Dementia and more specifically, Alzheimer's disease. Our code is available at https://github.com/nehagianchandani/Voxel-level-brain-age-prediction.
△ Less
Submitted 24 April, 2024; v1 submitted 17 October, 2023;
originally announced October 2023.
-
Neural ring homomorphism preserves mandatory sets required for open convexity
Authors:
Neha Gupta,
Suhith K N
Abstract:
It has been studied by Curto et al. (SIAM J. on App. Alg. and Geom., 1(1) : 222 $\unicode{x2013}$ 238, 2017) that a neural code that has an open convex realization does not have any local obstruction relative to the neural code. Further, a neural code $ \mathcal{C} $ has no local obstructions if and only if it contains the set of mandatory codewords, $ \mathcal{C}_{\min}(Δ),$ which depends only on…
▽ More
It has been studied by Curto et al. (SIAM J. on App. Alg. and Geom., 1(1) : 222 $\unicode{x2013}$ 238, 2017) that a neural code that has an open convex realization does not have any local obstruction relative to the neural code. Further, a neural code $ \mathcal{C} $ has no local obstructions if and only if it contains the set of mandatory codewords, $ \mathcal{C}_{\min}(Δ),$ which depends only on the simplicial complex $Δ=Δ(\mathcal{C})$. Thus if $\mathcal{C} \not \supseteq \mathcal{C}_{\min}(Δ)$, then $\mathcal{C}$ cannot be open convex. However, the problem of constructing $ \mathcal{C}_{\min}(Δ) $ for any given code $ \mathcal{C} $ is undecidable. There is yet another way to capture the local obstructions via the homological mandatory set, $ \mathcal{M}_H(Δ). $ The significance of $ \mathcal{M}_H(Δ) $ for a given code $ \mathcal{C} $ is that $ \mathcal{M}_H(Δ) \subseteq \mathcal{C}_{\min}(Δ)$ and so $ \mathcal{C} $ will have local obstructions if $ \mathcal{C}\not\supseteq\mathcal{M}_H(Δ). $ In this paper we study the affect on the sets $\mathcal{C}_{\min}(Δ) $ and $\mathcal{M}_H(Δ)$ under the action of various surjective elementary code maps. Further, we study the relationship between Stanley-Reisner rings of the simplicial complexes associated with neural codes of the elementary code maps. Moreover, using this relationship, we give an alternative proof to show that $ \mathcal{M}_H(Δ) $ is preserved under the elementary code maps.
△ Less
Submitted 10 October, 2023;
originally announced October 2023.
-
Enabling multi-messenger astronomy with continuous gravitational waves: early warning and sky localization of binary neutron stars in Einstein Telescope
Authors:
Andrew L. Miller,
Neha Singh,
Cristiano Palomba
Abstract:
Next-generation gravitational-wave detectors will provide unprecedented sensitivity to inspiraling binary neutron stars and black holes, enabling detections at the peak of star formation and beyond. However, the signals from these systems will last much longer than those in current detectors, and overlap in both time and frequency, leading to increased computational cost to search for them with st…
▽ More
Next-generation gravitational-wave detectors will provide unprecedented sensitivity to inspiraling binary neutron stars and black holes, enabling detections at the peak of star formation and beyond. However, the signals from these systems will last much longer than those in current detectors, and overlap in both time and frequency, leading to increased computational cost to search for them with standard matched filtering analyses, and a higher probability that they are observed in the presence of non-Gaussian noise. We therefore present a method to search for gravitational waves from compact binary inspirals in next-generation detectors that is computationally efficient and robust against gaps in data collection and noise non-stationarities. Our method finds tracks in the time/frequency plane of the detector that uniquely describe specific inspiraling systems. We find that we could detect $\sim 5$ overlapping, intermediate-strength signals (matched-filter signal-to-noise ratio $ρ\approx 58$) without a sensitivity loss. Additionally, we demonstrate that our method can enable multi-messenger astronomy: using only low frequencies ($2-20$ Hz), we could warn astronomers $\sim 2.5$ hours before a GW170817-like merger at 40 Mpc and provide a sky localization of $\sim 20$ deg$^2$ using only one ``L'' of Einstein Telescope. Additionally, assuming that primordial black holes exist, we derive projected constraints on the fraction of dark matter they could compose, $f_{\rm PBH}\sim 10^{-6}-10^{-4}$, for $\sim 1-0.1M_\odot$ equal-mass systems, respectively, using a rate suppression factor $f_{\rm sup}=2.5\times 10^{-3}$. Comparing matched filtering searches to our proposed method at a fixed sensitivity, we find a factor of $\sim10-50$ speed-up when we begin an analysis at a frequency of 5 Hz up to 12 Hz for a system with a chirp mass between $\sim[1,2]M_\odot$.
△ Less
Submitted 21 February, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Byzantine Multiple Access Channels -- Part II: Communication With Adversary Identification
Authors:
Neha Sangwan,
Mayank Bakshi,
Bikash Kumar Dey,
Vinod M. Prabhakaran
Abstract:
We introduce the problem of determining the identity of a byzantine user (internal adversary) in a communication system. We consider a two-user discrete memoryless multiple access channel where either user may deviate from the prescribed behaviour. Since small deviations may be indistinguishable from the effects of channel noise, it might be overly restrictive to attempt to detect all deviations.…
▽ More
We introduce the problem of determining the identity of a byzantine user (internal adversary) in a communication system. We consider a two-user discrete memoryless multiple access channel where either user may deviate from the prescribed behaviour. Since small deviations may be indistinguishable from the effects of channel noise, it might be overly restrictive to attempt to detect all deviations. When neither user deviates, correct decoding is required. When one user deviates, the decoder must either output a pair of messages of which the message of the non-deviating user is correct or identify the deviating user. The users and the receiver do not share any randomness. The results include a characterization of the set of channels where communication is feasible, and an inner and outer bound on the capacity region. We also show that whenever the rate region has non-empty interior, the capacity region is same as the capacity region under randomized encoding, where each user shares independent randomness with the receiver. We also give an outer bound for this randomized coding capacity region.
△ Less
Submitted 24 September, 2024; v1 submitted 20 September, 2023;
originally announced September 2023.
-
A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning
Authors:
Neha S. Wadia,
Yatin Dandi,
Michael I. Jordan
Abstract:
The rapid progress in machine learning in recent years has been based on a highly productive connection to gradient-based optimization. Further progress hinges in part on a shift in focus from pattern recognition to decision-making and multi-agent problems. In these broader settings, new mathematical challenges emerge that involve equilibria and game theory instead of optima. Gradient-based method…
▽ More
The rapid progress in machine learning in recent years has been based on a highly productive connection to gradient-based optimization. Further progress hinges in part on a shift in focus from pattern recognition to decision-making and multi-agent problems. In these broader settings, new mathematical challenges emerge that involve equilibria and game theory instead of optima. Gradient-based methods remain essential -- given the high dimensionality and large scale of machine-learning problems -- but simple gradient descent is no longer the point of departure for algorithm design. We provide a gentle introduction to a broader framework for gradient-based algorithms in machine learning, beginning with saddle points and monotone games, and proceeding to general variational inequalities. While we provide convergence proofs for several of the algorithms that we present, our main focus is that of providing motivation and intuition.
△ Less
Submitted 26 February, 2024; v1 submitted 9 September, 2023;
originally announced September 2023.
-
Adapting Self-Supervised Representations to Multi-Domain Setups
Authors:
Neha Kalibhat,
Sam Sharpe,
Jeremy Goodsitt,
Bayan Bruss,
Soheil Feizi
Abstract:
Current state-of-the-art self-supervised approaches, are effective when trained on individual domains but show limited generalization on unseen domains. We observe that these models poorly generalize even when trained on a mixture of domains, making them unsuitable to be deployed under diverse real-world setups. We therefore propose a general-purpose, lightweight Domain Disentanglement Module (DDM…
▽ More
Current state-of-the-art self-supervised approaches, are effective when trained on individual domains but show limited generalization on unseen domains. We observe that these models poorly generalize even when trained on a mixture of domains, making them unsuitable to be deployed under diverse real-world setups. We therefore propose a general-purpose, lightweight Domain Disentanglement Module (DDM) that can be plugged into any self-supervised encoder to effectively perform representation learning on multiple, diverse domains with or without shared classes. During pre-training according to a self-supervised loss, DDM enforces a disentanglement in the representation space by splitting it into a domain-variant and a domain-invariant portion. When domain labels are not available, DDM uses a robust clustering approach to discover pseudo-domains. We show that pre-training with DDM can show up to 3.5% improvement in linear probing accuracy on state-of-the-art self-supervised models including SimCLR, MoCo, BYOL, DINO, SimSiam and Barlow Twins on multi-domain benchmarks including PACS, DomainNet and WILDS. Models trained with DDM show significantly improved generalization (7.4%) to unseen domains compared to baselines. Therefore, DDM can efficiently adapt self-supervised encoders to provide high-quality, generalizable representations for diverse multi-domain data.
△ Less
Submitted 12 December, 2023; v1 submitted 7 September, 2023;
originally announced September 2023.
-
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Authors:
Neha Sengupta,
Sunil Kumar Sahu,
Bokang Jia,
Satheesh Katipomu,
Haonan Li,
Fajri Koto,
William Marshall,
Gurpreet Gosal,
Cynthia Liu,
Zhiming Chen,
Osama Mohammed Afzal,
Samta Kamboj,
Onkar Pandit,
Rahul Pal,
Lalit Pradhan,
Zain Muhammad Mujahid,
Massa Baali,
Xudong Han,
Sondos Mahmoud Bsharat,
Alham Fikri Aji,
Zhiqiang Shen,
Zhengzhong Liu,
Natalia Vassilieva,
Joel Hestness,
Andy Hock
, et al. (7 additional authors not shown)
Abstract:
We introduce Jais and Jais-chat, new state-of-the-art Arabic-centric foundation and instruction-tuned open generative large language models (LLMs). The models are based on the GPT-3 decoder-only architecture and are pretrained on a mixture of Arabic and English texts, including source code in various programming languages. With 13 billion parameters, they demonstrate better knowledge and reasoning…
▽ More
We introduce Jais and Jais-chat, new state-of-the-art Arabic-centric foundation and instruction-tuned open generative large language models (LLMs). The models are based on the GPT-3 decoder-only architecture and are pretrained on a mixture of Arabic and English texts, including source code in various programming languages. With 13 billion parameters, they demonstrate better knowledge and reasoning capabilities in Arabic than any existing open Arabic and multilingual models by a sizable margin, based on extensive evaluation. Moreover, the models are competitive in English compared to English-centric open models of similar size, despite being trained on much less English data. We provide a detailed description of the training, the tuning, the safety alignment, and the evaluation of the models. We release two open versions of the model -- the foundation Jais model, and an instruction-tuned Jais-chat variant -- with the aim of promoting research on Arabic LLMs. Available at https://huggingface.co/inception-mbzuai/jais-13b-chat
△ Less
Submitted 29 September, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Sensitivity of IceCube-Gen2 to measure flavor composition of Astrophysical neutrinos
Authors:
Neha Lad
Abstract:
The observation of an astrophysical neutrino flux in IceCube and its detection capability to separate between the different neutrino flavors has led IceCube to constraint the flavor content of this flux. IceCube-Gen2 is the planned extension of the current IceCube detector, which will be about 8 times larger than the current instrumented volume. In this work, we study the sensitivity of IceCube-Ge…
▽ More
The observation of an astrophysical neutrino flux in IceCube and its detection capability to separate between the different neutrino flavors has led IceCube to constraint the flavor content of this flux. IceCube-Gen2 is the planned extension of the current IceCube detector, which will be about 8 times larger than the current instrumented volume. In this work, we study the sensitivity of IceCube-Gen2 to the astrophysical neutrino flavor composition and investigate its tau neutrino identification capabilities. We apply the IceCube analysis on a simulated IceCube-Gen2 dataset that mimics the High Energy Starting Event (HESE) classification. Reconstructions are performed using sensors that have 3 times higher quantum efficiency and isotropic angular acceptance compared to the current IceCube optical modules. We present the projected sensitivity for 10 years of data on constraining the flavor ratio of the astrophysical neutrino flux at Earth by IceCube-Gen2.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Summary of IceCube Tau Neutrino Searches and Flavor Composition Measurements of the Diffuse Astrophysical Neutrino Flux
Authors:
Neha Lad,
D. F. Cowen
Abstract:
We present a summary of the flavor composition measurements for the diffuse astrophysical neutrino flux using data from the IceCube Neutrino Observatory at the South Pole. IceCube has identified candidate astrophysical tau neutrinos through two different approaches. One approach used a dedicated particle identification algorithm for the classification and reconstruction of the 'Double Cascade' eve…
▽ More
We present a summary of the flavor composition measurements for the diffuse astrophysical neutrino flux using data from the IceCube Neutrino Observatory at the South Pole. IceCube has identified candidate astrophysical tau neutrinos through two different approaches. One approach used a dedicated particle identification algorithm for the classification and reconstruction of the 'Double Cascade' event topology, a signature of tau neutrino charged current interactions. This first approach is applied to the High Energy Starting Events (HESE) sample, an all-sky, all-flavor set of neutrino events with energy above 60~TeV encompassing 12 years of IceCube livetime. We show that the addition of more years of data and updated ice properties on the HESE sample delivers tighter constraints on the flavor composition of the astrophysical neutrino flux than previous IceCube analyses, in particular when it is fit in combination with high statistics samples of through-going tracks and cascades. A second approach uses a sensitive machine-learning-based selection technique that finds seven candidate events in 9.7 years of IceCube data. This approach excludes the zero astrophysical tau neutrino hypothesis at the highest statistical significance to date.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Laser-assisted (e,2e) study with twisted electron beam on H-atom
Authors:
Neha,
Nikita Dhankhar,
Raul Sheldon Pinto,
Rakesh Choubisa
Abstract:
We study the laser-assisted twisted electron beam impact ionization of the hydrogen atom in coplanar asymmetric geometry. We develope the theoretical model in the first Born approximation. In the presence of the laser field, we treat the incident and scattered electrons as Volkov waves, the ejected electron, moving in the combined field of the laser and residual ion H + , is described by a Coulomb…
▽ More
We study the laser-assisted twisted electron beam impact ionization of the hydrogen atom in coplanar asymmetric geometry. We develope the theoretical model in the first Born approximation. In the presence of the laser field, we treat the incident and scattered electrons as Volkov waves, the ejected electron, moving in the combined field of the laser and residual ion H + , is described by a Coulomb-Volkov wave function . In this communication, we compare the angular profile of triple differential cross-section (TDCS) for laser-assisted twisted electron beam incidence with the plane-wave, laser-assisted plane wave, and field-free twisted electron beam for (e,2e) processes, for different orbital angular momentum (OAM) number (m l ) values. We analyze the influence of the laser parameters (photon exchanged, intensity ) on the angular distribution of the TDCS. We study the (T DCS) av for macroscopic target to examine the effect of opening angle θ p of the twisted electron beam on the angular profile of TDCS. Our results clearly show the impact of laser parameters (electric field ? and number of photon exchanged (l)) and twisted electron beam parameters (OAM number (m l ) and opening angle (θ p )) on the angular distribution of TDCS.
△ Less
Submitted 26 August, 2023;
originally announced August 2023.
-
Discovering Dichotomies for Problems in Database Theory
Authors:
Neha Makhija
Abstract:
Dichotomy theorems, which characterize the conditions under which a problem can be solved efficiently, have helped identify important tractability borders for as probabilistic query evaluation, view maintenance, query containment (among many more problems). However, dichotomy theorems for many such problems remain elusive under key settings such as bag semantics or for queries with self-joins. Thi…
▽ More
Dichotomy theorems, which characterize the conditions under which a problem can be solved efficiently, have helped identify important tractability borders for as probabilistic query evaluation, view maintenance, query containment (among many more problems). However, dichotomy theorems for many such problems remain elusive under key settings such as bag semantics or for queries with self-joins. This work aims to unearth dichotomies for fundamental problems in reverse data management and knowledge representation. We use a novel approach to discovering dichotomies: instead of creating dedicated algorithms for easy (PTIME) and hard cases (NP-complete), we devise unified algorithms that are guaranteed to terminate in PTIME for easy cases. Using this approach, we discovered new tractable cases for the problem of minimal factorization of provenance formulas as well as dichotomies under bag semantics for the problems of resilience and causal responsibility
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Reframing the Brain Age Prediction Problem to a More Interpretable and Quantitative Approach
Authors:
Neha Gianchandani,
Mahsa Dibaji,
Mariana Bento,
Ethan MacDonald,
Roberto Souza
Abstract:
Deep learning models have achieved state-of-the-art results in estimating brain age, which is an important brain health biomarker, from magnetic resonance (MR) images. However, most of these models only provide a global age prediction, and rely on techniques, such as saliency maps to interpret their results. These saliency maps highlight regions in the input image that were significant for the mod…
▽ More
Deep learning models have achieved state-of-the-art results in estimating brain age, which is an important brain health biomarker, from magnetic resonance (MR) images. However, most of these models only provide a global age prediction, and rely on techniques, such as saliency maps to interpret their results. These saliency maps highlight regions in the input image that were significant for the model's predictions, but they are hard to be interpreted, and saliency map values are not directly comparable across different samples. In this work, we reframe the age prediction problem from MR images to an image-to-image regression problem where we estimate the brain age for each brain voxel in MR images. We compare voxel-wise age prediction models against global age prediction models and their corresponding saliency maps. The results indicate that voxel-wise age prediction models are more interpretable, since they provide spatial information about the brain aging process, and they benefit from being quantitative.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
Stiefel-Whitney Classes Of Representations Of Dihedral Groups
Authors:
Sujeet Bhalerao,
Rohit Joshi,
Neha Malik
Abstract:
We compute the Stiefel-Whitney Classes for representations of dihedral groups $D_m$ in terms of character values of order two elements. We also provide criteria to identify representations V which lift to the double covers of the orthogonal group O(V ) and those with non-trivial mod 2 Euler class.
We compute the Stiefel-Whitney Classes for representations of dihedral groups $D_m$ in terms of character values of order two elements. We also provide criteria to identify representations V which lift to the double covers of the orthogonal group O(V ) and those with non-trivial mod 2 Euler class.
△ Less
Submitted 4 October, 2023; v1 submitted 27 July, 2023;
originally announced July 2023.
-
Fractional Generalizations of the Compound Poisson Process
Authors:
Neha Gupta,
Aditya Maheshwari
Abstract:
This paper introduces the Generalized Fractional Compound Poisson Process (GFCPP), which claims to be a unified fractional version of the compound Poisson process (CPP) that encompasses existing variations as special cases. We derive its distributional properties, generalized fractional differential equations, and martingale properties. Some results related to the governing differential equation a…
▽ More
This paper introduces the Generalized Fractional Compound Poisson Process (GFCPP), which claims to be a unified fractional version of the compound Poisson process (CPP) that encompasses existing variations as special cases. We derive its distributional properties, generalized fractional differential equations, and martingale properties. Some results related to the governing differential equation about the special cases of jump distributions, including exponential, Mittag-Leffler, Bernstéin, discrete uniform, truncated geometric, and discrete logarithm. Some of processes in the literature such as the fractional Poisson process of order $k$, Pólya-Aeppli process of order $k$, and fractional negative binomial process becomes the special case of the GFCPP. Classification based on arrivals by time-changing the compound Poisson process by the inverse tempered and the inverse of inverse Gaussian subordinators are studied. Finally, we present the simulation of the sample paths of the above-mentioned processes.
△ Less
Submitted 23 July, 2023;
originally announced July 2023.
-
Ascent and Descent of Composition Operators on Orlicz-lorentz space
Authors:
Neha Bhatia,
Anuradha Gupta
Abstract:
The aim of this paper is to discuss the characterizations of the composition operators on Orlicz-Lorentz space to have finite ascent (or descent).
The aim of this paper is to discuss the characterizations of the composition operators on Orlicz-Lorentz space to have finite ascent (or descent).
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
Identifying Interpretable Subspaces in Image Representations
Authors:
Neha Kalibhat,
Shweta Bhardwaj,
Bayan Bruss,
Hamed Firooz,
Maziar Sanjabi,
Soheil Feizi
Abstract:
We propose Automatic Feature Explanation using Contrasting Concepts (FALCON), an interpretability framework to explain features of image representations. For a target feature, FALCON captions its highly activating cropped images using a large captioning dataset (like LAION-400m) and a pre-trained vision-language model like CLIP. Each word among the captions is scored and ranked leading to a small…
▽ More
We propose Automatic Feature Explanation using Contrasting Concepts (FALCON), an interpretability framework to explain features of image representations. For a target feature, FALCON captions its highly activating cropped images using a large captioning dataset (like LAION-400m) and a pre-trained vision-language model like CLIP. Each word among the captions is scored and ranked leading to a small number of shared, human-understandable concepts that closely describe the target feature. FALCON also applies contrastive interpretation using lowly activating (counterfactual) images, to eliminate spurious concepts. Although many existing approaches interpret features independently, we observe in state-of-the-art self-supervised and supervised models, that less than 20% of the representation space can be explained by individual features. We show that features in larger spaces become more interpretable when studied in groups and can be explained with high-order scoring concepts through FALCON. We discuss how extracted concepts can be used to explain and debug failures in downstream tasks. Finally, we present a technique to transfer concepts from one (explainable) representation space to another unseen representation space by learning a simple linear transformation. Code available at https://github.com/NehaKalibhat/falcon-explain.
△ Less
Submitted 7 September, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
Demonstrating a long-coherence dual-rail erasure qubit using tunable transmons
Authors:
Harry Levine,
Arbel Haim,
Jimmy S. C. Hung,
Nasser Alidoust,
Mahmoud Kalaee,
Laura DeLorenzo,
E. Alex Wollack,
Patricio Arrangoiz-Arriola,
Amirhossein Khalajhedayati,
Rohan Sanil,
Hesam Moradinejad,
Yotam Vaknin,
Aleksander Kubica,
David Hover,
Shahriar Aghaeimeibodi,
Joshua Ari Alcid,
Christopher Baek,
James Barnett,
Kaustubh Bawdekar,
Przemyslaw Bienias,
Hugh Carson,
Cliff Chen,
Li Chen,
Harut Chinkezian,
Eric M. Chisholm
, et al. (88 additional authors not shown)
Abstract:
Quantum error correction with erasure qubits promises significant advantages over standard error correction due to favorable thresholds for erasure errors. To realize this advantage in practice requires a qubit for which nearly all errors are such erasure errors, and the ability to check for erasure errors without dephasing the qubit. We demonstrate that a "dual-rail qubit" consisting of a pair of…
▽ More
Quantum error correction with erasure qubits promises significant advantages over standard error correction due to favorable thresholds for erasure errors. To realize this advantage in practice requires a qubit for which nearly all errors are such erasure errors, and the ability to check for erasure errors without dephasing the qubit. We demonstrate that a "dual-rail qubit" consisting of a pair of resonantly coupled transmons can form a highly coherent erasure qubit, where transmon $T_1$ errors are converted into erasure errors and residual dephasing is strongly suppressed, leading to millisecond-scale coherence within the qubit subspace. We show that single-qubit gates are limited primarily by erasure errors, with erasure probability $p_\text{erasure} = 2.19(2)\times 10^{-3}$ per gate while the residual errors are $\sim 40$ times lower. We further demonstrate mid-circuit detection of erasure errors while introducing $< 0.1\%$ dephasing error per check. Finally, we show that the suppression of transmon noise allows this dual-rail qubit to preserve high coherence over a broad tunable operating range, offering an improved capacity to avoid frequency collisions. This work establishes transmon-based dual-rail qubits as an attractive building block for hardware-efficient quantum error correction.
△ Less
Submitted 20 March, 2024; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Probing wave-optics effects and low-mass dark matter halos with lensing of gravitational waves from massive black holes
Authors:
Mesut Çalışkan,
Neha Anil Kumar,
Lingyuan Ji,
Jose M. Ezquiaga,
Roberto Cotesta,
Emanuele Berti,
Marc Kamionkowski
Abstract:
The Laser Interferometer Space Antenna (LISA) will detect gravitational waves (GWs) emitted by massive black hole binaries (MBHBs) in the low-frequency ($\sim$mHz) band. Low-mass lenses, such as low-mass dark matter halos or subhalos, have sizes comparable to the wavelength of these GWs. Encounters with these lenses produce wave-optics (WO) effects that alter waveform phase and amplitude. Thus, a…
▽ More
The Laser Interferometer Space Antenna (LISA) will detect gravitational waves (GWs) emitted by massive black hole binaries (MBHBs) in the low-frequency ($\sim$mHz) band. Low-mass lenses, such as low-mass dark matter halos or subhalos, have sizes comparable to the wavelength of these GWs. Encounters with these lenses produce wave-optics (WO) effects that alter waveform phase and amplitude. Thus, a single event with observable WO effects can be used to probe the lens properties. In this paper, we first compute the probability of observing WO effects in a model-agnostic way. We perform information-matrix analyses over approximately 1000 MBHBs with total mass, mass ratio, and redshift spanning the ranges relevant to LISA. We then calculate lensing rates using three semi-analytical models of MBHB populations. In both cases, we use a waveform model that includes merger, ringdown, and higher-order modes. We use two lens population models: the theory-based Press-Schechter halo mass function and an observation-based model derived from Sloan Digital Sky Survey. We find that the probability of detecting WO effects can be as large as $\sim 3\%$, $\sim1.5\%$, and $\sim 1 \%$ at $1σ$, $3σ$, and $5σ$ confidence levels, respectively. The most optimistic MBHB population model yields $\sim 8$, $\sim 4$, and $\sim 3$ events with detectable WO effects at the same confidence levels, while the rates drop to $\sim 0.01$ in the more pessimistic scenarios. The most likely lens masses probed by LISA are in the range $(10^3, 10^8)\, M_{\odot}$, and the most probable redshifts are in the range $(0.3, 1.7)$. Therefore, LISA observations of WO effects can probe low-mass DM halos, complementing strong lensing and other observations.
△ Less
Submitted 12 May, 2024; v1 submitted 13 July, 2023;
originally announced July 2023.
-
When Does Confidence-Based Cascade Deferral Suffice?
Authors:
Wittawat Jitkrittum,
Neha Gupta,
Aditya Krishna Menon,
Harikrishna Narasimhan,
Ankit Singh Rawat,
Sanjiv Kumar
Abstract:
Cascades are a classical strategy to enable inference cost to vary adaptively across samples, wherein a sequence of classifiers are invoked in turn. A deferral rule determines whether to invoke the next classifier in the sequence, or to terminate prediction. One simple deferral rule employs the confidence of the current classifier, e.g., based on the maximum predicted softmax probability. Despite…
▽ More
Cascades are a classical strategy to enable inference cost to vary adaptively across samples, wherein a sequence of classifiers are invoked in turn. A deferral rule determines whether to invoke the next classifier in the sequence, or to terminate prediction. One simple deferral rule employs the confidence of the current classifier, e.g., based on the maximum predicted softmax probability. Despite being oblivious to the structure of the cascade -- e.g., not modelling the errors of downstream models -- such confidence-based deferral often works remarkably well in practice. In this paper, we seek to better understand the conditions under which confidence-based deferral may fail, and when alternate deferral strategies can perform better. We first present a theoretical characterisation of the optimal deferral rule, which precisely characterises settings under which confidence-based deferral may suffer. We then study post-hoc deferral mechanisms, and demonstrate they can significantly improve upon confidence-based deferral in settings where (i) downstream models are specialists that only work well on a subset of inputs, (ii) samples are subject to label noise, and (iii) there is distribution shift between the train and test set.
△ Less
Submitted 23 January, 2024; v1 submitted 6 July, 2023;
originally announced July 2023.
-
A computable formula for evaluating the mean square sum of $L$-functions
Authors:
Neha Elizabeth Thomas,
K Vishnu Namboothiri
Abstract:
For Dirichlet characters $χ$ mod $k$ where $k\geq 3$, we here give a computable formula for evaluating the mean square sums $\sum\limits_{\substack{χ\text{ mod }k\\χ(-1)=(-1)^r}}|L(r,χ)|^2$ for any positive integer $r\geq 3$. We also give an inductive formula for computing the sum $\sum\limits_{\substack{1\leq m\leq k \\ (m, k)=1}}\frac{1}{\left(\sin\left(\frac{πm}{k}\right)\right)^{2n}}$ where…
▽ More
For Dirichlet characters $χ$ mod $k$ where $k\geq 3$, we here give a computable formula for evaluating the mean square sums $\sum\limits_{\substack{χ\text{ mod }k\\χ(-1)=(-1)^r}}|L(r,χ)|^2$ for any positive integer $r\geq 3$. We also give an inductive formula for computing the sum $\sum\limits_{\substack{1\leq m\leq k \\ (m, k)=1}}\frac{1}{\left(\sin\left(\frac{πm}{k}\right)\right)^{2n}}$ where $n$ is a positive integer in terms of Bernoulli numbers and binomial coefficients.
△ Less
Submitted 12 December, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.