Search | arXiv e-print repository

Cost-aware simulation-based inference

Authors: Ayush Bharti, Daolang Huang, Samuel Kaski, François-Xavier Briol

Abstract: Simulation-based inference (SBI) is the preferred framework for estimating parameters of intractable models in science and engineering. A significant challenge in this context is the large computational cost of simulating data from complex models, and the fact that this cost often depends on parameter values. We therefore propose \textit{cost-aware SBI methods} which can significantly reduce the c… ▽ More Simulation-based inference (SBI) is the preferred framework for estimating parameters of intractable models in science and engineering. A significant challenge in this context is the large computational cost of simulating data from complex models, and the fact that this cost often depends on parameter values. We therefore propose \textit{cost-aware SBI methods} which can significantly reduce the cost of existing sampling-based SBI methods, such as neural SBI and approximate Bayesian computation. This is achieved through a combination of rejection and self-normalised importance sampling, which significantly reduces the number of expensive simulations needed. Our approach is studied extensively on models from epidemiology to telecommunications engineering, where we obtain significant reductions in the overall cost of inference. △ Less

Submitted 17 February, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

arXiv:2410.07839 [pdf, other]

Semantic Self-Consistency: Enhancing Language Model Reasoning via Semantic Weighting

Authors: Tim Knappe, Ryan Li, Ayush Chauhan, Kaylee Chhua, Kevin Zhu, Sean O'Brien

Abstract: While large language models (LLMs) have rapidly improved their performance on a broad number of tasks, they still often fall short on reasoning tasks. As LLMs become more integrated in diverse real-world tasks, advancing their reasoning capabilities is crucial to their effectiveness in nuanced, complex problems. Wang et al.'s self-consistency framework reveals that sampling multiple rationales bef… ▽ More While large language models (LLMs) have rapidly improved their performance on a broad number of tasks, they still often fall short on reasoning tasks. As LLMs become more integrated in diverse real-world tasks, advancing their reasoning capabilities is crucial to their effectiveness in nuanced, complex problems. Wang et al.'s self-consistency framework reveals that sampling multiple rationales before taking a majority vote reliably improves model performance across various closed-answer reasoning tasks. Standard methods based on this framework aggregate the final decisions of these rationales but fail to utilize the semantic information detailed in the step-by-step reasoning paths. Our work introduces semantic self-consistency, enhancing this approach by incorporating and analyzing both the reasoning paths of these rationales in addition to their final decisions before taking a majority vote. These methods not only improve the reliability of reasoning paths but also cause more robust performance on complex reasoning tasks. △ Less

Submitted 28 January, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

Comments: Accepted to MATH-AI at NeurIPS 2024

arXiv:2410.07307 [pdf, other]

Investigating the sightline of a highly scattered FRB through a filamentary structure in the local Universe

Authors: Kaitlyn Shin, Calvin Leung, Sunil Simha, Bridget C. Andersen, Emmanuel Fonseca, Kenzie Nimmo, Mohit Bhardwaj, Charanjot Brar, Shami Chatterjee, Amanda M. Cook, B. M. Gaensler, Ronniy C. Joseph, Dylan Jow, Jane Kaczmarek, Lordrick Kahinga, Victoria M. Kaspi, Bikash Kharel, Adam E. Lanman, Mattias Lazda, Robert A. Main, Lluis Mas-Ribas, Kiyoshi W. Masui, Juan Mena-Parra, Daniele Michilli, Ayush Pandhi , et al. (9 additional authors not shown)

Abstract: Fast radio bursts (FRBs) are unique probes of extragalactic ionized baryonic structure as each signal, through its burst properties, holds information about the ionized matter it encounters along its sightline. FRB 20200723B is a burst with a scattering timescale of $τ_\mathrm{400\,MHz} >$1 second at 400 MHz and a dispersion measure of DM $\sim$ 244 pc cm$^{-3}$. Observed across the entire CHIME/F… ▽ More Fast radio bursts (FRBs) are unique probes of extragalactic ionized baryonic structure as each signal, through its burst properties, holds information about the ionized matter it encounters along its sightline. FRB 20200723B is a burst with a scattering timescale of $τ_\mathrm{400\,MHz} >$1 second at 400 MHz and a dispersion measure of DM $\sim$ 244 pc cm$^{-3}$. Observed across the entire CHIME/FRB frequency band, it is the single-component burst with the largest scattering timescale yet observed by CHIME/FRB. The combination of its high scattering timescale and relatively low dispersion measure present an uncommon opportunity to use FRB 20200723B to explore the properties of the cosmic web it traversed. With an $\sim$arcminute-scale localization region, we find the most likely host galaxy is NGC 4602 (with PATH probability $P(O|x)=0.985$), which resides $\sim$30 Mpc away within a sheet filamentary structure on the outskirts of the Virgo Cluster. We place an upper limit on the average free electron density of this filamentary structure of $\langle n_e \rangle < 4.6^{+9.6}_{-2.0} \times 10^{-5}$ cm$^{-3}$, broadly consistent with expectations from cosmological simulations. We investigate whether the source of scattering lies within the same galaxy as the FRB, or at a farther distance from an intervening structure along the line of sight. Comparing with Milky Way pulsar observations, we suggest the scattering may originate from within the host galaxy of FRB 20200723B. △ Less

Submitted 9 October, 2024; originally announced October 2024.

Comments: 20 pages, 6 figures, submitted. Comments welcome!

arXiv:2410.06576 [pdf, other]

On The Relationship between Visual Anomaly-free and Anomalous Representations

Authors: Riya Sadrani, Hrishikesh Sharma, Ayush Bachan

Abstract: Anomaly Detection is an important problem within computer vision, having variety of real-life applications. Yet, the current set of solutions to this problem entail known, systematic shortcomings. Specifically, contemporary surface Anomaly Detection task assumes the presence of multiple specific anomaly classes e.g. cracks, rusting etc., unlike one-class classification model of past. However, buil… ▽ More Anomaly Detection is an important problem within computer vision, having variety of real-life applications. Yet, the current set of solutions to this problem entail known, systematic shortcomings. Specifically, contemporary surface Anomaly Detection task assumes the presence of multiple specific anomaly classes e.g. cracks, rusting etc., unlike one-class classification model of past. However, building a deep learning model in such setup remains a challenge because anomalies arise rarely, and hence anomaly samples are quite scarce. Transfer learning has been a preferred paradigm in such situations. But the typical source domains with large dataset sizes e.g. ImageNet, JFT-300M, LAION-2B do not correlate well with the domain of surfaces and materials, an important premise of transfer learning. In this paper, we make an important hypothesis and show, by exhaustive experimentation, that the space of anomaly-free visual patterns of the normal samples correlates well with each of the various spaces of anomalous patterns of the class-specific anomaly samples. The first results of using this hypothesis in transfer learning have indeed been quite encouraging. We expect that finding such a simple closeby domain that readily entails large number of samples, and which also oftentimes shows interclass separability though with narrow margins, will be a useful discovery. Especially, it is expected to improve domain adaptation for anomaly detection, and few-shot learning for anomaly detection, making in-the-wild anomaly detection realistically possible in future. △ Less

Submitted 9 October, 2024; originally announced October 2024.

arXiv:2410.06385 [pdf, other]

Skin Cancer Machine Learning Model Tone Bias

Authors: James Pope, Md Hassanuzzaman, William Chapman, Huw Day, Mingmar Sherpa, Omar Emara, Nirmala Adhikari, Ayush Joshi

Abstract: Background: Many open-source skin cancer image datasets are the result of clinical trials conducted in countries with lighter skin tones. Due to this tone imbalance, machine learning models derived from these datasets can perform well at detecting skin cancer for lighter skin tones. Any tone bias in these models could introduce fairness concerns and reduce public trust in the artificial intelligen… ▽ More Background: Many open-source skin cancer image datasets are the result of clinical trials conducted in countries with lighter skin tones. Due to this tone imbalance, machine learning models derived from these datasets can perform well at detecting skin cancer for lighter skin tones. Any tone bias in these models could introduce fairness concerns and reduce public trust in the artificial intelligence health field. Methods: We examine a subset of images from the International Skin Imaging Collaboration (ISIC) archive that provide tone information. The subset has a significant tone imbalance. These imbalances could explain a model's tone bias. To address this, we train models using the imbalanced dataset and a balanced dataset to compare against. The datasets are used to train a deep convolutional neural network model to classify the images as malignant or benign. We then evaluate the models' disparate impact, based on selection rate, relative to dark or light skin tone. Results: Using the imbalanced dataset, we found that the model is significantly better at detecting malignant images in lighter tone resulting in a disparate impact of 0.577. Using the balanced dataset, we found that the model is also significantly better at detecting malignant images in lighter versus darker tones with a disparate impact of 0.684. Using the imbalanced or balanced dataset to train the model still results in a disparate impact well below the standard threshold of 0.80 which suggests the model is biased with respect to skin tone. Conclusion: The results show that typical skin cancer machine learning models can be tone biased. These results provide evidence that diagnosis or tone imbalance is not the cause of the bias. Other techniques will be necessary to identify and address the bias in these models, an area of future investigation. △ Less

Submitted 19 March, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

arXiv:2410.06344 [pdf, other]

Electric field driven spin textures in heavy fermion van der Waals magnets

Authors: Aayush Vijayvargia, Hao Zhang, Kipton Barros, Shi-Zeng Lin, Onur Erten

Abstract: The recently discovered van der Waals material CeSiI exhibits both heavy fermion behavior and spiral order with strong magnetic anisotropy which makes it a potential host for topological spin textures such as skyrmions through electrical gating. A monolayer of CeSiI consists of two layers of Ce atoms on triangular lattices that sandwich a silicene layer. Motivated by the experiments, we explore ma… ▽ More The recently discovered van der Waals material CeSiI exhibits both heavy fermion behavior and spiral order with strong magnetic anisotropy which makes it a potential host for topological spin textures such as skyrmions through electrical gating. A monolayer of CeSiI consists of two layers of Ce atoms on triangular lattices that sandwich a silicene layer. Motivated by the experiments, we explore magnetic phase diagram in van der Waals heavy fermion materials as a function of anisotropy and applied magnetic field using an effective spin model. We demonstrate that application of an external electric field can tune the Kondo coupling on each Ce layer differently, in turn allowing for controlling the intra- and interlayer magnetic couplings. Our analysis indicates that this fine-tuning leads to the coexistence of different magnetic orders in a single monolayer. In particular, we show that a novel vortex phase can be stabilized only in the presence of an external electric field. Our results highlight the unique advantages and the tunability of van der Waals heavy fermion materials for manipulation of chiral magnetic phases. △ Less

Submitted 8 October, 2024; originally announced October 2024.

Comments: 7 pages, 5 figures

arXiv:2410.05928 [pdf, other]

Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning

Authors: Ayush Singh, Mansi Gupta, Shivank Garg, Abhinav Kumar, Vansh Agrawal

Abstract: Vision-Language Models (VLMs) have transformed tasks requiring visual and reasoning abilities, such as image retrieval and Visual Question Answering (VQA). Despite their success, VLMs face significant challenges with tasks involving geometric reasoning, algebraic problem-solving, and counting. These limitations stem from difficulties effectively integrating multiple modalities and accurately inter… ▽ More Vision-Language Models (VLMs) have transformed tasks requiring visual and reasoning abilities, such as image retrieval and Visual Question Answering (VQA). Despite their success, VLMs face significant challenges with tasks involving geometric reasoning, algebraic problem-solving, and counting. These limitations stem from difficulties effectively integrating multiple modalities and accurately interpreting geometry-related tasks. Various works claim that introducing a captioning pipeline before VQA tasks enhances performance. We incorporated this pipeline for tasks involving geometry, algebra, and counting. We found that captioning results are not generalizable, specifically with larger VLMs primarily trained on downstream QnA tasks showing random performance on math-related challenges. However, we present a promising alternative: task-based prompting, enriching the prompt with task-specific guidance. This approach shows promise and proves more effective than direct captioning methods for math-heavy problems. △ Less

Submitted 8 October, 2024; originally announced October 2024.

arXiv:2410.05915 [pdf, other]

Give me a hint: Can LLMs take a hint to solve math problems?

Authors: Vansh Agrawal, Pratham Singla, Amitoj Singh Miglani, Shivank Garg, Ayush Mangal

Abstract: While state-of-the-art LLMs have shown poor logical and basic mathematical reasoning, recent works try to improve their problem-solving abilities using prompting techniques. We propose giving "hints" to improve the language model's performance on advanced mathematical problems, taking inspiration from how humans approach math pedagogically. We also test robustness to adversarial hints and demonstr… ▽ More While state-of-the-art LLMs have shown poor logical and basic mathematical reasoning, recent works try to improve their problem-solving abilities using prompting techniques. We propose giving "hints" to improve the language model's performance on advanced mathematical problems, taking inspiration from how humans approach math pedagogically. We also test robustness to adversarial hints and demonstrate their sensitivity to them. We demonstrate the effectiveness of our approach by evaluating various diverse LLMs, presenting them with a broad set of problems of different difficulties and topics from the MATH dataset and comparing against techniques such as one-shot, few-shot, and chain of thought prompting. △ Less

Submitted 9 November, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

arXiv:2410.05326 [pdf, other]

Early-Cycle Internal Impedance Enables ML-Based Battery Cycle Life Predictions Across Manufacturers

Authors: Tyler Sours, Shivang Agarwal, Marc Cormier, Jordan Crivelli-Decker, Steffen Ridderbusch, Stephen L. Glazier, Connor P. Aiken, Aayush R. Singh, Ang Xiao, Omar Allam

Abstract: Predicting the end-of-life (EOL) of lithium-ion batteries across different manufacturers presents significant challenges due to variations in electrode materials, manufacturing processes, cell formats, and a lack of generally available data. Methods that construct features solely on voltage-capacity profile data typically fail to generalize across cell chemistries. This study introduces a methodol… ▽ More Predicting the end-of-life (EOL) of lithium-ion batteries across different manufacturers presents significant challenges due to variations in electrode materials, manufacturing processes, cell formats, and a lack of generally available data. Methods that construct features solely on voltage-capacity profile data typically fail to generalize across cell chemistries. This study introduces a methodology that combines traditional voltage-capacity features with Direct Current Internal Resistance (DCIR) measurements, enabling more accurate and generalizable EOL predictions. The use of early-cycle DCIR data captures critical degradation mechanisms related to internal resistance growth, enhancing model robustness. Models are shown to successfully predict the number of cycles to EOL for unseen manufacturers of varied electrode composition with a mean absolute error (MAE) of 150 cycles. This cross-manufacturer generalizability reduces the need for extensive new data collection and retraining, enabling manufacturers to optimize new battery designs using existing datasets. Additionally, a novel DCIR-compatible dataset is released as part of ongoing efforts to enrich the growing ecosystem of cycling data and accelerate battery materials development. △ Less

Submitted 13 May, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

Comments: 17 pages, 7 figures

arXiv:2410.02160 [pdf, other]

RiskSEA : A Scalable Graph Embedding for Detecting On-chain Fraudulent Activities on the Ethereum Blockchain

Authors: Ayush Agarwal, Lv Lu, Arjun Maheswaran, Varsha Mahadevan, Bhaskar Krishnamachari

Abstract: Like any other useful technology, cryptocurrencies are sometimes used for criminal activities. While transactions are recorded on the blockchain, there exists a need for a more rapid and scalable method to detect addresses associated with fraudulent activities. We present RiskSEA, a scalable risk scoring system capable of effectively handling the dynamic nature of large-scale blockchain transactio… ▽ More Like any other useful technology, cryptocurrencies are sometimes used for criminal activities. While transactions are recorded on the blockchain, there exists a need for a more rapid and scalable method to detect addresses associated with fraudulent activities. We present RiskSEA, a scalable risk scoring system capable of effectively handling the dynamic nature of large-scale blockchain transaction graphs. The risk scoring system, which we implement for Ethereum, consists of 1. a scalable approach to generating node2vec embedding for entire set of addresses to capture the graph topology 2. transaction-based features to capture the transactional behavioral pattern of an address 3. a classifier model to generate risk score for addresses that combines the node2vec embedding and behavioral features. Efficiently generating node2vec embedding for large scale and dynamically evolving blockchain transaction graphs is challenging, we present two novel approaches for generating node2vec embeddings and effectively scaling it to the entire set of blockchain addresses: 1. node2vec embedding propagation and 2. dynamic node2vec embedding. We present a comprehensive analysis of the proposed approaches. Our experiments show that combining both behavioral and node2vec features boosts the classification performance significantly, and that the dynamic node2vec embeddings perform better than the node2vec propagated embeddings. △ Less

Submitted 2 October, 2024; originally announced October 2024.

Comments: arXiv admin note: text overlap with arXiv:2203.12363 by other authors

arXiv:2409.20533 [pdf, other]

doi 10.1051/0004-6361/202452451

The eventful life of a luminous galaxy at z = 14: metal enrichment, feedback, and low gas fraction?

Authors: Stefano Carniani, Francesco D'Eugenio, Xihan Ji, Eleonora Parlanti, Jan Scholtz, Fengwu Sun, Giacomo Venturi, Tom J. L. C. Bakx, Mirko Curti, Roberto Maiolino, Sandro Tacchella, Jorge A. Zavala, Kevin Hainline, Joris Witstok, Benjamin D. Johnson, Stacey Alberts, Andrew J. Bunker, Stéphane Charlot, Daniel J. Eisenstein, Jakob M. Helton, Peter Jakobsen, Nimisha Kumari, Brant Robertson, Aayush Saxena, Hannah Übler , et al. (3 additional authors not shown)

Abstract: JADES-GS-z14-0 is the most distant spectroscopically confirmed galaxy yet, at $z>14$. With a UV magnitude of -20.81, it is one of the most luminous galaxies at cosmic dawn and its half-light radius of 260 pc means that stars dominate the observed UV emission. We report ALMA detection of [OIII]88$μ$m line emission with a significance of 6.67$σ$ and at a frequency of 223.524~GHz, corresponding to a… ▽ More JADES-GS-z14-0 is the most distant spectroscopically confirmed galaxy yet, at $z>14$. With a UV magnitude of -20.81, it is one of the most luminous galaxies at cosmic dawn and its half-light radius of 260 pc means that stars dominate the observed UV emission. We report ALMA detection of [OIII]88$μ$m line emission with a significance of 6.67$σ$ and at a frequency of 223.524~GHz, corresponding to a redshift of $14.1796\pm0.0007$, which is consistent with the candidate CIII] line detected in the NIRSpec spectrum. At this spectroscopic redshift, the Lyman-$α$ break identified with NIRSpec requires a damped Lyman-$α$ absorber with a column density of $\log(N_{\rm HI}/\mathrm{cm}^{-2})=21.96$. The total [O\,{\sc iii}]88$μ$m luminosity (log$(L_{\rm [OIII]}/L_\odot) = 8.3\pm 0.1$) is fully consistent with the local $L_{\rm [OIII]}-SFR$ relation and indicating a gas-phase metallicity $>0.1~{\rm Z_{\rm \odot}}$. Using \texttt{prospector} SED modeling and combining the ALMA data with JWST observations, we find $Z=0.17~{\rm Z_{\rm \odot}}$ and a non-zero escape fraction of ionizing photons ($\sim11\%$), which is necessary by the code to reproduce the UV spectrum. We measure an ${\rm [O III]}5007$Å/[O III]88$μ$m line flux ratio between 1 and 20, resulting in an upper limit to the electron density of roughly 700 cm$^{-3}$ assuming a single-cloud photoionization model. The [OIIII]88$μ$m emission line is spectrally resolved, with a FWHM of 100 km/s, resulting in a dynamical mass of log($M_{\rm dyn}/M_\odot$) = 9.0$\pm0.2$. When compared to the stellar mass, this value represents a conservative upper limit on the gas mass fraction, which ranges from 50\% to 80\%, depending on the assumed star formation history. Past radiation-driven outflows may have cleared the galaxy from the gas, reducing the gas fraction and thus increasing the escape fraction of ionizing photons. △ Less

Submitted 6 March, 2025; v1 submitted 30 September, 2024; originally announced September 2024.

Comments: 14 pages, 9 figure

Journal ref: A&A 696, A87 (2025)

arXiv:2409.18642 [pdf]

Enhanced Convolution Neural Network with Optimized Pooling and Hyperparameter Tuning for Network Intrusion Detection

Authors: Ayush Kumar Sharma, Sourav Patel, Supriya Bharat Wakchaure, Abirami S

Abstract: Network Intrusion Detection Systems (NIDS) are essential for protecting computer networks from malicious activities, including Denial of Service (DoS), Probing, User-to-Root (U2R), and Remote-to-Local (R2L) attacks. Without effective NIDS, networks are vulnerable to significant security breaches and data loss. Machine learning techniques provide a promising approach to enhance NIDS by automating t… ▽ More Network Intrusion Detection Systems (NIDS) are essential for protecting computer networks from malicious activities, including Denial of Service (DoS), Probing, User-to-Root (U2R), and Remote-to-Local (R2L) attacks. Without effective NIDS, networks are vulnerable to significant security breaches and data loss. Machine learning techniques provide a promising approach to enhance NIDS by automating threat detection and improving accuracy. In this research, we propose an Enhanced Convolutional Neural Network (EnCNN) for NIDS and evaluate its performance using the KDDCUP'99 dataset. Our methodology includes comprehensive data preprocessing, exploratory data analysis (EDA), and feature engineering. We compare EnCNN with various machine learning algorithms, including Logistic Regression, Decision Trees, Support Vector Machines (SVM), and ensemble methods like Random Forest, AdaBoost, and Voting Ensemble. The results show that EnCNN significantly improves detection accuracy, with a notable 10% increase over state-of-art approaches. This demonstrates the effectiveness of EnCNN in real-time network intrusion detection, offering a robust solution for identifying and mitigating security threats, and enhancing overall network resilience. △ Less

Submitted 27 September, 2024; originally announced September 2024.

Comments: 7 Pages , 2 figures , 4 Tables , Conference paper

arXiv:2409.16472 [pdf, other]

Sub-Nyquist USF Spectral Estimation: $K$ Frequencies with $6K + 4$ Modulo Samples

Authors: Ruiming Guo, Yuliang Zhu, Ayush Bhandari

Abstract: Digital acquisition of high bandwidth signals is particularly challenging when Nyquist rate sampling is impractical. This has led to extensive research in sub-Nyquist sampling methods, primarily for spectral and sinusoidal frequency estimation. However, these methods struggle with high-dynamic-range (HDR) signals that can saturate analog-to-digital converters (ADCs). Addressing this, we introduce… ▽ More Digital acquisition of high bandwidth signals is particularly challenging when Nyquist rate sampling is impractical. This has led to extensive research in sub-Nyquist sampling methods, primarily for spectral and sinusoidal frequency estimation. However, these methods struggle with high-dynamic-range (HDR) signals that can saturate analog-to-digital converters (ADCs). Addressing this, we introduce a novel sub-Nyquist spectral estimation method, driven by the Unlimited Sensing Framework (USF), utilizing a multi-channel system. The sub-Nyquist USF method aliases samples in both amplitude and frequency domains, rendering the inverse problem particularly challenging. Towards this goal, our exact recovery theorem establishes that $K$ sinusoids of arbitrary amplitudes and frequencies can be recovered from $6K + 4$ modulo samples, remarkably, independent of the sampling rate or folding threshold. In the true spirit of sub-Nyquist sampling, via modulo ADC hardware experiments, we demonstrate successful spectrum estimation of HDR signals in the kHz range using Hz range sampling rates (0.078\% Nyquist rate). Our experiments also reveal up to a 33-fold improvement in frequency estimation accuracy using one less bit compared to conventional ADCs. These findings open new avenues in spectral estimation applications, e.g., radars, direction-of-arrival (DoA) estimation, and cognitive radio, showcasing the potential of USF. △ Less

Submitted 24 September, 2024; originally announced September 2024.

Comments: 18 pages, 8 figures, accepted to IEEE Trans. on Signal Processing

arXiv:2409.16301 [pdf, other]

Gait Switching and Enhanced Stabilization of Walking Robots with Deep Learning-based Reachability: A Case Study on Two-link Walker

Authors: Xingpeng Xia, Jason J. Choi, Ayush Agrawal, Koushil Sreenath, Claire J. Tomlin, Somil Bansal

Abstract: Learning-based approaches have recently shown notable success in legged locomotion. However, these approaches often lack accountability, necessitating empirical tests to determine their effectiveness. In this work, we are interested in designing a learning-based locomotion controller whose stability can be examined and guaranteed. This can be achieved by verifying regions of attraction (RoAs) of l… ▽ More Learning-based approaches have recently shown notable success in legged locomotion. However, these approaches often lack accountability, necessitating empirical tests to determine their effectiveness. In this work, we are interested in designing a learning-based locomotion controller whose stability can be examined and guaranteed. This can be achieved by verifying regions of attraction (RoAs) of legged robots to their stable walking gaits. This is a non-trivial problem for legged robots due to their hybrid dynamics. Although previous work has shown the utility of Hamilton-Jacobi (HJ) reachability to solve this problem, its practicality was limited by its poor scalability. The core contribution of our work is the employment of a deep learning-based HJ reachability solution to the hybrid legged robot dynamics, which overcomes the previous work's limitation. With the learned reachability solution, first, we can estimate a library of RoAs for various gaits. Second, we can design a one-step predictive controller that effectively stabilizes to an individual gait within the verified RoA. Finally, we can devise a strategy that switches gaits, in response to external perturbations, whose feasibility is guided by the RoA analysis. We demonstrate our method in a two-link walker simulation, whose mathematical model is well established. Our method achieves improved stability than previous model-based methods, while ensuring transparency that was not present in the existing learning-based approaches. △ Less

Submitted 10 September, 2024; originally announced September 2024.

Comments: The first two authors contributed equally. This work is supported in part by the NSF Grant CMMI-1944722, the NSF CAREER Program under award 2240163, the NASA ULI on Safe Aviation Autonomy, and the DARPA Assured Autonomy and Assured Neuro Symbolic Learning and Reasoning (ANSR) programs. The work of Jason J. Choi received the support of a fellowship from Kwanjeong Educational Foundation, Korea

arXiv:2409.16288 [pdf, other]

Self-Supervised Any-Point Tracking by Contrastive Random Walks

Authors: Ayush Shrivastava, Andrew Owens

Abstract: We present a simple, self-supervised approach to the Tracking Any Point (TAP) problem. We train a global matching transformer to find cycle consistent tracks through video via contrastive random walks, using the transformer's attention-based global matching to define the transition matrices for a random walk on a space-time graph. The ability to perform "all pairs" comparisons between points allow… ▽ More We present a simple, self-supervised approach to the Tracking Any Point (TAP) problem. We train a global matching transformer to find cycle consistent tracks through video via contrastive random walks, using the transformer's attention-based global matching to define the transition matrices for a random walk on a space-time graph. The ability to perform "all pairs" comparisons between points allows the model to obtain high spatial precision and to obtain a strong contrastive learning signal, while avoiding many of the complexities of recent approaches (such as coarse-to-fine matching). To do this, we propose a number of design decisions that allow global matching architectures to be trained through self-supervision using cycle consistency. For example, we identify that transformer-based methods are sensitive to shortcut solutions, and propose a data augmentation scheme to address them. Our method achieves strong performance on the TapVid benchmarks, outperforming previous self-supervised tracking methods, such as DIFT, and is competitive with several supervised methods. △ Less

Submitted 24 September, 2024; originally announced September 2024.

Comments: ECCV 2024. Project link: https://ayshrv.com/gmrw . Code: https://github.com/ayshrv/gmrw/

arXiv:2409.14609 [pdf]

doi 10.1109/Confluence52989.2022.9734222

Nirjas: An open source framework for extracting metadata from the source code

Authors: Ayush Bhardwaj, Sahil, Kaushlendra Pratap, Gaurav Mishra

Abstract: Metadata and comments are critical elements of any software development process. In this paper, we explain how metadata and comments in source code can play an essential role in comprehending software. We introduce a Python-based open-source framework, Nirjas, which helps in extracting this metadata in a structured manner. Various syntaxes, types, and widely accepted conventions exist for adding c… ▽ More Metadata and comments are critical elements of any software development process. In this paper, we explain how metadata and comments in source code can play an essential role in comprehending software. We introduce a Python-based open-source framework, Nirjas, which helps in extracting this metadata in a structured manner. Various syntaxes, types, and widely accepted conventions exist for adding comments in source files of different programming languages. Edge cases can create noise in extraction, for which we use Regex to accurately retrieve metadata. Non-Regex methods can give results but often miss accuracy and noise separation. Nirjas also separates different types of comments, source code, and provides details about those comments, such as line number, file name, language used, total SLOC, etc. Nirjas is a standalone Python framework/library and can be easily installed via source or pip (the Python package installer). Nirjas was initially created as part of a Google Summer of Code project and is currently developed and maintained under the FOSSology organization. △ Less

Submitted 22 September, 2024; originally announced September 2024.

Comments: 2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence)

arXiv:2409.13229 [pdf, other]

Multiscale Encoder and Omni-Dimensional Dynamic Convolution Enrichment in nnU-Net for Brain Tumor Segmentation

Authors: Sahaj K. Mistry, Sourav Saini, Aashray Gupta, Aayush Gupta, Sunny Rai, Vinit Jakhetiya, Ujjwal Baid, Sharath Chandra Guntuku

Abstract: Brain tumor segmentation plays a crucial role in computer-aided diagnosis. This study introduces a novel segmentation algorithm utilizing a modified nnU-Net architecture. Within the nnU-Net architecture's encoder section, we enhance conventional convolution layers by incorporating omni-dimensional dynamic convolution layers, resulting in improved feature representation. Simultaneously, we propose… ▽ More Brain tumor segmentation plays a crucial role in computer-aided diagnosis. This study introduces a novel segmentation algorithm utilizing a modified nnU-Net architecture. Within the nnU-Net architecture's encoder section, we enhance conventional convolution layers by incorporating omni-dimensional dynamic convolution layers, resulting in improved feature representation. Simultaneously, we propose a multi-scale attention strategy that harnesses contemporary insights from various scales. Our model's efficacy is demonstrated on diverse datasets from the BraTS-2023 challenge. Integrating omni-dimensional dynamic convolution (ODConv) layers and multi-scale features yields substantial improvement in the nnU-Net architecture's performance across multiple tumor segmentation datasets. Remarkably, our proposed model attains good accuracy during validation for the BraTS Africa dataset. The ODconv source code along with full training code is available on GitHub. △ Less

Submitted 20 September, 2024; originally announced September 2024.

Comments: 9 pages, 3 figures. Accepted at MICCAI 2023, to be published in Springer LNCS. GitHub: https://github.com/i-sahajmistry/nnUNet_BraTS2023

arXiv:2409.12947 [pdf, other]

Unrolled denoising networks provably learn optimal Bayesian inference

Authors: Aayush Karan, Kulin Shah, Sitan Chen, Yonina C. Eldar

Abstract: Much of Bayesian inference centers around the design of estimators for inverse problems which are optimal assuming the data comes from a known prior. But what do these optimality guarantees mean if the prior is unknown? In recent years, algorithm unrolling has emerged as deep learning's answer to this age-old question: design a neural network whose layers can in principle simulate iterations of in… ▽ More Much of Bayesian inference centers around the design of estimators for inverse problems which are optimal assuming the data comes from a known prior. But what do these optimality guarantees mean if the prior is unknown? In recent years, algorithm unrolling has emerged as deep learning's answer to this age-old question: design a neural network whose layers can in principle simulate iterations of inference algorithms and train on data generated by the unknown prior. Despite its empirical success, however, it has remained unclear whether this method can provably recover the performance of its optimal, prior-aware counterparts. In this work, we prove the first rigorous learning guarantees for neural networks based on unrolling approximate message passing (AMP). For compressed sensing, we prove that when trained on data drawn from a product prior, the layers of the network approximately converge to the same denoisers used in Bayes AMP. We also provide extensive numerical experiments for compressed sensing and rank-one matrix estimation demonstrating the advantages of our unrolled architecture - in addition to being able to obliviously adapt to general priors, it exhibits improvements over Bayes AMP in more general settings of low dimensions, non-Gaussian designs, and non-product priors. △ Less

Submitted 19 September, 2024; originally announced September 2024.

Comments: 32 pages

arXiv:2409.12661 [pdf, other]

doi 10.1145/3680528.3687655

Manifold Sampling for Differentiable Uncertainty in Radiance Fields

Authors: Linjie Lyu, Ayush Tewari, Marc Habermann, Shunsuke Saito, Michael Zollhöfer, Thomas Leimkühler, Christian Theobalt

Abstract: Radiance fields are powerful and, hence, popular models for representing the appearance of complex scenes. Yet, constructing them based on image observations gives rise to ambiguities and uncertainties. We propose a versatile approach for learning Gaussian radiance fields with explicit and fine-grained uncertainty estimates that impose only little additional cost compared to uncertainty-agnostic t… ▽ More Radiance fields are powerful and, hence, popular models for representing the appearance of complex scenes. Yet, constructing them based on image observations gives rise to ambiguities and uncertainties. We propose a versatile approach for learning Gaussian radiance fields with explicit and fine-grained uncertainty estimates that impose only little additional cost compared to uncertainty-agnostic training. Our key observation is that uncertainties can be modeled as a low-dimensional manifold in the space of radiance field parameters that is highly amenable to Monte Carlo sampling. Importantly, our uncertainties are differentiable and, thus, allow for gradient-based optimization of subsequent captures that optimally reduce ambiguities. We demonstrate state-of-the-art performance on next-best-view planning tasks, including high-dimensional illumination planning for optimal radiance field relighting quality. △ Less

Submitted 19 September, 2024; originally announced September 2024.

Comments: Siggraph Asia 2024 conference

arXiv:2409.12378 [pdf, other]

Star cluster formation from turbulent clumps. IV. Protoplanetary disc evolution

Authors: Aayush Gautam, Juan P. Farias, Jonathan C. Tan

Abstract: Most stars are born in the crowded environments of gradually forming star clusters. Dynamical interactions between close-passing stars and the evolving UV radiation fields from proximate massive stars are expected to sculpt the protoplanetary discs in these clusters, potentially contributing to the diversity of planetary systems that we observe. Here, we investigate the impact of cluster environme… ▽ More Most stars are born in the crowded environments of gradually forming star clusters. Dynamical interactions between close-passing stars and the evolving UV radiation fields from proximate massive stars are expected to sculpt the protoplanetary discs in these clusters, potentially contributing to the diversity of planetary systems that we observe. Here, we investigate the impact of cluster environment on disc demographics by implementing simple protoplanetary disc evolution models within $N$-body simulations of gradual star cluster formation. We consider a range of star formation efficiency per free-fall time, $ε_{\rm ff}$, and mass surface density of the natal cloud environment, $Σ_{\rm cl}$, both of which affect the overall duration of cluster formation. We track the interaction history of all stars to estimate the dynamical truncation of the discs around stars involved in close encounters. We also track external photoevaporation of the discs due to the ionizing radiation field of the nearby high- and intermediate-mass ($> 5 M_\odot$) stars. We find that $ε_{\rm ff}$, $Σ_{\rm cl}$, and the degree of primordial binarity have major influences on the masses and radii of the disc population. In particular, external photo-evaporation has a greater impact than dynamical interactions in determining the fate of discs in our clusters. △ Less

Submitted 18 September, 2024; originally announced September 2024.

Comments: Submitted to MNRAS. 16 pages, 9 figures. Comments welcome

arXiv:2409.11586 [pdf, other]

Distributed Deep Koopman Learning for Nonlinear Dynamics

Authors: Wenjian Hao, Lili Wang, Ayush Rai, Shaoshuai Mou

Abstract: Koopman operator theory has proven to be highly significant in system identification, even for challenging scenarios involving nonlinear time-varying systems (NTVS). In this context, we examine a network of connected agents, each with limited observation capabilities, aiming to estimate the dynamics of an NTVS collaboratively. Drawing inspiration from Koopman operator theory, deep neural networks,… ▽ More Koopman operator theory has proven to be highly significant in system identification, even for challenging scenarios involving nonlinear time-varying systems (NTVS). In this context, we examine a network of connected agents, each with limited observation capabilities, aiming to estimate the dynamics of an NTVS collaboratively. Drawing inspiration from Koopman operator theory, deep neural networks, and distributed consensus, we introduce a distributed algorithm for deep Koopman learning of the dynamics of an NTVS. This approach enables individual agents to approximate the entire dynamics despite having access to only partial state observations. We guarantee consensus not only on the estimated dynamics but also on its structure, i.e., the matrices encountered in the linear equation of the lifted Koopman system. We provide theoretical insights into the convergence of the learning process and accompanying numerical simulations. △ Less

Submitted 17 September, 2024; originally announced September 2024.

arXiv:2409.11533 [pdf, other]

A search for persistent radio sources toward repeating fast radio bursts discovered by CHIME/FRB

Authors: Adaeze L. Ibik, Maria R. Drout, Bryan M. Gaensler, Paul Scholz, Navin Sridhar, Ben Margalit, Casey J. Law, Tracy E. Clarke, Shriharsh P. Tendulkar, Daniele Michilli, Tarraneh Eftekhari, Mohit Bhardwaj, Sarah Burke-Spolaor, Shami Chatterjee, Amanda M. Cook, Jason W. T. Hessels, Franz Kirsten, Ronniy C. Joseph, Victoria M. Kaspi, Mattias Lazda, Kiyoshi W. Masui, Kenzie Nimmo, Ayush Pandhi, Aaron B. Pearlman, Ziggy Pleunis , et al. (3 additional authors not shown)

Abstract: The identification of persistent radio sources (PRSs) coincident with two repeating fast radio bursts (FRBs) supports FRB theories requiring a compact central engine. However, deep non-detections in other cases highlight the diversity of repeating FRBs and their local environments. Here, we perform a systematic search for radio sources towards 37 CHIME/FRB repeaters using their arcminute localizat… ▽ More The identification of persistent radio sources (PRSs) coincident with two repeating fast radio bursts (FRBs) supports FRB theories requiring a compact central engine. However, deep non-detections in other cases highlight the diversity of repeating FRBs and their local environments. Here, we perform a systematic search for radio sources towards 37 CHIME/FRB repeaters using their arcminute localizations and a combination of archival surveys and targeted observations. Through multi-wavelength analysis of individual radio sources, we identify two (20181030A-S1 and 20190417A-S1) for which we disfavor an origin of either star formation or an active galactic nucleus in their host galaxies and thus consider them candidate PRSs. We do not find any associated PRSs for the majority of the repeating FRBs in our sample. For 8 FRB fields with Very Large Array imaging, we provide deep limits on the presence of PRSs that are 2--4 orders of magnitude fainter than the PRS associated with FRB\,20121102A. Using Very Large Array Sky Survey imaging of all 37 fields, we constrain the rate of luminous ($\gtrsim$10$^{40}$ erg s$^{-1}$) PRSs associated with repeating FRBs to be low. Within the context of FRB-PRS models, we find that 20181030A-S1 and 20190417A-S1 can be reasonably explained within the context of magnetar, hypernebulae, gamma-ray burst afterglow, or supernova ejecta models -- although we note that both sources follow the radio luminosity versus rotation measure relationship predicted in the nebula model framework. Future observations will be required to both further characterize and confirm the association of these PRS candidates with the FRBs. △ Less

Submitted 7 November, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

Comments: 37 pages, 12 figures

arXiv:2409.11410 [pdf, other]

Multilevel Verification on a Single Digital Decentralized Distributed (DDD) Ledger

Authors: Ayush Thada, Aanchal Kandpal, Dipanwita Sinha Mukharjee

Abstract: This paper presents an approach to using decentralized distributed digital (DDD) ledgers like blockchain with multi-level verification. In regular DDD ledgers like Blockchain, only a single level of verification is available, which makes it not useful for those systems where there is a hierarchy and verification is required on each level. In systems where hierarchy emerges naturally, the inclusion… ▽ More This paper presents an approach to using decentralized distributed digital (DDD) ledgers like blockchain with multi-level verification. In regular DDD ledgers like Blockchain, only a single level of verification is available, which makes it not useful for those systems where there is a hierarchy and verification is required on each level. In systems where hierarchy emerges naturally, the inclusion of hierarchy in the solution for the problem of the system enables us to come up with a better solution. Introduction to hierarchy means there could be several verification within a level in the hierarchy and more than one level of verification, which implies other challenges induced by an interaction between the various levels of hierarchies that also need to be addressed, like verification of the work of the previous level of hierarchy by given level in the hierarchy. The paper will address all these issues, and provide a road map to trace the state of the system at any given time and probability of failure of the system. △ Less

Submitted 27 September, 2024; v1 submitted 3 September, 2024; originally announced September 2024.

Comments: Submitted to IEEE Transactions on Information Forensics and Security Journal

arXiv:2409.10576 [pdf]

Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports

Authors: Mohamed Sobhi Jabal, Pranav Warman, Jikai Zhang, Kartikeye Gupta, Ayush Jain, Maciej Mazurowski, Walter Wiggins, Kirti Magudia, Evan Calabrese

Abstract: Purpose: To develop and evaluate an automated system for extracting structured clinical information from unstructured radiology and pathology reports using open-weights large language models (LMs) and retrieval augmented generation (RAG), and to assess the effects of model configuration variables on extraction performance. Methods and Materials: The study utilized two datasets: 7,294 radiology rep… ▽ More Purpose: To develop and evaluate an automated system for extracting structured clinical information from unstructured radiology and pathology reports using open-weights large language models (LMs) and retrieval augmented generation (RAG), and to assess the effects of model configuration variables on extraction performance. Methods and Materials: The study utilized two datasets: 7,294 radiology reports annotated for Brain Tumor Reporting and Data System (BT-RADS) scores and 2,154 pathology reports annotated for isocitrate dehydrogenase (IDH) mutation status. An automated pipeline was developed to benchmark the performance of various LMs and RAG configurations. The impact of model size, quantization, prompting strategies, output formatting, and inference parameters was systematically evaluated. Results: The best performing models achieved over 98% accuracy in extracting BT-RADS scores from radiology reports and over 90% for IDH mutation status extraction from pathology reports. The top model being medical fine-tuned llama3. Larger, newer, and domain fine-tuned models consistently outperformed older and smaller models. Model quantization had minimal impact on performance. Few-shot prompting significantly improved accuracy. RAG improved performance for complex pathology reports but not for shorter radiology reports. Conclusions: Open LMs demonstrate significant potential for automated extraction of structured clinical data from unstructured clinical reports with local privacy-preserving application. Careful model selection, prompt engineering, and semi-automated optimization using annotated data are critical for optimal performance. These approaches could be reliable enough for practical use in research workflows, highlighting the potential for human-machine collaboration in healthcare data extraction. △ Less

Submitted 18 September, 2024; v1 submitted 15 September, 2024; originally announced September 2024.

ACM Class: J.3; I.2; I.2.7

arXiv:2409.10568 [pdf, other]

On the limits of agency in agent-based models

Authors: Ayush Chopra, Shashank Kumar, Nurullah Giray-Kuru, Ramesh Raskar, Arnau Quera-Bofarull

Abstract: Agent-based modeling (ABM) offers powerful insights into complex systems, but its practical utility has been limited by computational constraints and simplistic agent behaviors, especially when simulating large populations. Recent advancements in large language models (LLMs) could enhance ABMs with adaptive agents, but their integration into large-scale simulations remains challenging. This work i… ▽ More Agent-based modeling (ABM) offers powerful insights into complex systems, but its practical utility has been limited by computational constraints and simplistic agent behaviors, especially when simulating large populations. Recent advancements in large language models (LLMs) could enhance ABMs with adaptive agents, but their integration into large-scale simulations remains challenging. This work introduces a novel methodology that bridges this gap by efficiently integrating LLMs into ABMs, enabling the simulation of millions of adaptive agents. We present LLM archetypes, a technique that balances behavioral complexity with computational efficiency, allowing for nuanced agent behavior in large-scale simulations. Our analysis explores the crucial trade-off between simulation scale and individual agent expressiveness, comparing different agent architectures ranging from simple heuristic-based agents to fully adaptive LLM-powered agents. We demonstrate the real-world applicability of our approach through a case study of the COVID-19 pandemic, simulating 8.4 million agents representing New York City and capturing the intricate interplay between health behaviors and economic outcomes. Our method significantly enhances ABM capabilities for predictive and counterfactual analyses, addressing limitations of historical data in policy design. By implementing these advances in an open-source framework, we facilitate the adoption of LLM archetypes across diverse ABM applications. Our results show that LLM archetypes can markedly improve the realism and utility of large-scale ABMs while maintaining computational feasibility, opening new avenues for modeling complex societal challenges and informing data-driven policy decisions. △ Less

Submitted 10 November, 2024; v1 submitted 14 September, 2024; originally announced September 2024.

arXiv:2409.10040 [pdf, other]

Enhancing HAP Networks with Reconfigurable Intelligent Surfaces

Authors: Islam M. Tanash, Ayush Kumar Dwivedi, Fatemeh Rafiei Maleki, Taneli Riihonen

Abstract: This paper presents and analyzes a reconfigurable intelligent surface (RIS)-based high-altitude platform (HAP) network. Stochastic geometry is used to model the arbitrary locations of the HAPs and RISs as a homogenous Poisson point process. Considering that the links between the HAPs, RISs, and users are $κ$--$μ$ faded, the coverage and ergodic capacity of the proposed system are expressed. The an… ▽ More This paper presents and analyzes a reconfigurable intelligent surface (RIS)-based high-altitude platform (HAP) network. Stochastic geometry is used to model the arbitrary locations of the HAPs and RISs as a homogenous Poisson point process. Considering that the links between the HAPs, RISs, and users are $κ$--$μ$ faded, the coverage and ergodic capacity of the proposed system are expressed. The analytically derived performance measures are verified through Monte Carlo simulations. Significant improvements in system performance and the impact of system parameters are demonstrated in the results. Thus, the proposed system concept can improve connectivity and data offloading in smart cities and dense urban environments. △ Less

Submitted 16 September, 2024; originally announced September 2024.

arXiv:2409.09802 [pdf, other]

doi 10.1103/PhysRevD.111.044063

Photon Ring Dimming as a Signature of Photon-Axion Conversion in Janis-Newman-Winicour Naked Singularity

Authors: Ayush Hazarika, Premachand Mahapatra, Subhadip Sau

Abstract: The possible existence of axions in the universe introduces the intriguing possibility of photon-axion conversion in strong magnetic fields, particularly near compact objects like supermassive black holes or even naked singularity. In this study, we investigate the conversion of photons into axions in the vicinity of a Janis-Newman-Winicour (JNW) spacetime, a well-known naked singularity solution.… ▽ More The possible existence of axions in the universe introduces the intriguing possibility of photon-axion conversion in strong magnetic fields, particularly near compact objects like supermassive black holes or even naked singularity. In this study, we investigate the conversion of photons into axions in the vicinity of a Janis-Newman-Winicour (JNW) spacetime, a well-known naked singularity solution. Our analysis reveals that photons can efficiently convert into axions with masses less than $100 \rm \ neV$. We calculate the conversion probability and find that it is significantly influenced by the characteristic parameter of the JNW spacetime. The potential observational signatures of this conversion, would be the dimming of photon ring in the X-ray and gamma-ray spectrum. Our findings suggest that compact objects like M87* could be prime candidates for detecting photon-axion conversion effects, provided future advances in high-resolution observations. △ Less

Submitted 20 February, 2025; v1 submitted 15 September, 2024; originally announced September 2024.

Comments: Published in Physical Review D, 17 figures, 24 pages

arXiv:2409.08092 [pdf, other]

Modeling Snow on Sea Ice using Physics Guided Machine Learning

Authors: Ayush Prasad, Ioanna Merkouriadi, Aleksi Nummelin

Abstract: Snow is a crucial element of the sea ice system, affecting sea ice growth and decay due to its low thermal conductivity and high albedo. Despite its importance, present-day climate models have an idealized representation of snow, often including only single-layer thermodynamics and omitting several processes that shape its properties. Although advanced snow process models like SnowModel exist, the… ▽ More Snow is a crucial element of the sea ice system, affecting sea ice growth and decay due to its low thermal conductivity and high albedo. Despite its importance, present-day climate models have an idealized representation of snow, often including only single-layer thermodynamics and omitting several processes that shape its properties. Although advanced snow process models like SnowModel exist, they are often excluded from climate modeling due to their high computational costs. SnowModel simulates snow depth, density, blowing-snow redistribution, sublimation, grain size, and thermal conductivity in a multi-layer snowpack. It operates with high spatial (1 meter) and temporal (1 hour) resolution. However, for large regions like the Arctic Ocean, these high-resolution simulations face challenges such as slow processing and large resource requirements. Data-driven emulators are used to address these issues, but they often lack generalizability and consistency with physical laws. In our study, we address these challenges by developing a physics-guided emulator that incorporates physical laws governing changes in snow density due to compaction. We evaluated three machine learning models: Long Short-Term Memory (LSTM), Physics-Guided LSTM, and Random Forest across five Arctic regions. All models achieved high accuracy, with the Physics-Guided LSTM showing the best performance in accuracy and generalizability. Our approach offers a faster way to emulate SnowModel with a speedup of over 9000 times, maintaining high fidelity. △ Less

Submitted 12 September, 2024; originally announced September 2024.

Comments: Accepted for publication in Cambridge Environmental Data Science

arXiv:2409.06405 [pdf, other]

JADES: Measuring reionization properties using Lyman-alpha emission

Authors: Gareth C. Jones, Andrew J. Bunker, Aayush Saxena, Santiago Arribas, Rachana Bhatawdekar, Kristan Boyett, Alex. J. Cameron, Stefano Carniani, Stephane Charlot, Emma Curtis-Lake, Kevin Hainline, Benjamin D. Johnson, Nimisha Kumari, Michael V. Maseda, Hans-Walter Rix, Brant E. Robertson, Sandro Tacchella, Hannah Übler, Christina C. Williams, Chris Willott, Joris Witstok, Yongda Zhu

Abstract: Ly$α$ is the transition to the ground state from the first excited state of hydrogen (the most common element). Resonant scattering of this line by neutral hydrogen greatly impedes its emergence from galaxies, so the fraction of galaxies emitting Ly$α$ is a tracer of the neutral fraction of the intergalactic medium (IGM), and thus the history of reionisation. In previous works, we used early JWST/… ▽ More Ly$α$ is the transition to the ground state from the first excited state of hydrogen (the most common element). Resonant scattering of this line by neutral hydrogen greatly impedes its emergence from galaxies, so the fraction of galaxies emitting Ly$α$ is a tracer of the neutral fraction of the intergalactic medium (IGM), and thus the history of reionisation. In previous works, we used early JWST/NIRSpec data from the JWST Advanced Deep Extragalactic Survey (JADES) to classify and characterise Ly$α$ emitting galaxies (LAEs). This survey is approaching completion, and the current sample is nearly an order of magnitude larger. From a sample of 795 galaxies in JADES at $4.0<z<14.3$, we find evidence for Ly$α$ emission in 150sources. We reproduce the previously found correlation between Ly$α$ escape fraction ($f_{esc}^{Lyα}$) - Ly$α$ rest-frame equivalent width ($REW_{Lyα}$) and the negative correlation between Ly$α$ velocity offset - $f_{esc}^{Lyα}$. Both $f_{esc}^{Lyα}$ and $REW_{Lyα}$ decrease with redshift ($z\gtrsim5.5$), indicating the progression of reionisation on a population scale. Our data are used to demonstrate an increasing IGM transmission of Ly$α$ from $z\sim14-6$. We measure the completeness-corrected fraction of LAEs (\xlya) from $z=4-9.5$. An application of these \xlya values to the results of previously utilised semi-analytical models suggests a high neutral fraction at $z=7$ ($X_{HI}\sim0.8-0.9$). Using an updated fit to the intrinsic distribution of $REW_{Lyα}$ results in a lower value in agreement with current works ($X_{HI}=0.64_{-0.21}^{+0.13}$). This sample of LAEs will be paramount for unbiased population studies of galaxies in the EoR. △ Less

Submitted 28 November, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

Comments: 26 pages, 22 figures, accepted for publication in MNRAS

arXiv:2409.05240 [pdf, other]

doi 10.1038/s41524-025-01532-6

A Physics-Enforced Neural Network to Predict Polymer Melt Viscosity

Authors: Ayush Jain, Rishi Gurnani, Arunkumar Rajan, H. Jerry Qi, Rampi Ramprasad

Abstract: Achieving superior polymeric components through additive manufacturing (AM) relies on precise control of rheology. One key rheological property particularly relevant to AM is melt viscosity ($η$). Melt viscosity is influenced by polymer chemistry, molecular weight ($M_w$), polydispersity, induced shear rate ($\dotγ$), and processing temperature ($T$). The relationship of $η$ with $M_w$, $\dotγ$, a… ▽ More Achieving superior polymeric components through additive manufacturing (AM) relies on precise control of rheology. One key rheological property particularly relevant to AM is melt viscosity ($η$). Melt viscosity is influenced by polymer chemistry, molecular weight ($M_w$), polydispersity, induced shear rate ($\dotγ$), and processing temperature ($T$). The relationship of $η$ with $M_w$, $\dotγ$, and $T$ may be captured by parameterized equations. Several physical experiments are required to fit the parameters, so predicting $η$ of a new polymer material in unexplored physical domains is a laborious process. Here, we develop a Physics-Enforced Neural Network (PENN) model that predicts the empirical parameters and encodes the parametrized equations to calculate $η$ as a function of polymer chemistry, $M_w$, polydispersity, $\dotγ$, and $T$. We benchmark our PENN against physics-unaware Artificial Neural Network (ANN) and Gaussian Process Regression (GPR) models. Finally, we demonstrate that the PENN offers superior values of $η$ when extrapolating to unseen values of $M_w$, $\dotγ$, and $T$ for sparsely seen polymers. △ Less

Submitted 8 September, 2024; originally announced September 2024.

arXiv:2409.04505 [pdf, other]

Cavity-mediated superthermal phonon correlations in the ultrastrong coupling regime

Authors: Dasom Kim, Jin Hou, Geon Lee, Ayush Agrawal, Sunghwan Kim, Hao Zhang, Di Bao, Andrey Baydin, Wenjing Wu, Fuyang Tay, Shengxi Huang, Elbert E. M. Chia, Dai-Sik Kim, Minah Seo, Aditya D. Mohite, David Hagenmüller, Junichiro Kono

Abstract: Phonons, or vibrational quanta, are behind some of the most fundamental physical phenomena in solids, including superconductivity, Raman processes, and broken-symmetry phases. It is therefore of fundamental importance to find ways to harness phonons for controlling these phenomena and developing novel quantum technologies. However, the majority of current phonon control techniques rely on the use… ▽ More Phonons, or vibrational quanta, are behind some of the most fundamental physical phenomena in solids, including superconductivity, Raman processes, and broken-symmetry phases. It is therefore of fundamental importance to find ways to harness phonons for controlling these phenomena and developing novel quantum technologies. However, the majority of current phonon control techniques rely on the use of intense external driving fields or strong anharmonicities, which restricts their range of applications. Here, we present a scheme for controlling the intensity fluctuations in phonon emission at room temperature based on multimode ultrastrong light--matter coupling. The multimode ultrastrong coupling regime is achieved by coupling two optical phonon modes in lead halide perovskites to an array of nanoslots, which operates as a single-mode cavity. The extremely small mode volume of the nanoslots enables unprecedented coupling strengths in a cavity phonon-polariton system. In the far-detuned, low-cavity-frequency regime, we demonstrate that the nanoslot resonator mediates an effective coupling between the phonon modes, resulting in superthermal phonon bunching in thermal equilibrium, both within the same mode and between different modes. Experimental results are in good agreement with a multimode Hopfield model. Our work paves the way for the tailoring of phonons to modify charge and energy transport in perovskite materials, with potential applications in light-collecting or emitting devices. △ Less

Submitted 6 September, 2024; originally announced September 2024.

arXiv:2409.03795 [pdf, other]

Security Implications and Mitigation Strategies in MPLS Networks

Authors: Ayush Thakur

Abstract: Multiprotocol Label Switching (MPLS) is a high-performance telecommunications technology that directs data from one network node to another based on short path labels rather than long network addresses. Its efficiency and scalability have made it a popular choice for large-scale and enterprise networks. However, as MPLS networks grow and evolve, they encounter various security challenges. This pap… ▽ More Multiprotocol Label Switching (MPLS) is a high-performance telecommunications technology that directs data from one network node to another based on short path labels rather than long network addresses. Its efficiency and scalability have made it a popular choice for large-scale and enterprise networks. However, as MPLS networks grow and evolve, they encounter various security challenges. This paper explores the security implications associated with MPLS networks, including risks such as label spoofing, traffic interception, and denial of service attacks. Additionally, it evaluates advanced mitigation strategies to address these vulnerabilities, leveraging mathematical models and security protocols to enhance MPLS network resilience. By integrating theoretical analysis with practical solutions, this paper aims to provide a comprehensive understanding of MPLS security and propose effective methods for safeguarding network infrastructure. △ Less

Submitted 4 September, 2024; originally announced September 2024.

arXiv:2409.00629 [pdf, other]

Assessing the Impact of Upselling in Online Fantasy Sports

Authors: Aayush Chaudhary

Abstract: This study explores the impact of upselling on user engagement. We model users' deposit behaviour on the fantasy sports platform Dream11. Subsequently, we develop an experimental framework to evaluate the effect of upselling using an intensity parameter. Our live experiments on user deposit behaviour reveal decreased user recall with heightened upselling intensity. Our findings indicate that incre… ▽ More This study explores the impact of upselling on user engagement. We model users' deposit behaviour on the fantasy sports platform Dream11. Subsequently, we develop an experimental framework to evaluate the effect of upselling using an intensity parameter. Our live experiments on user deposit behaviour reveal decreased user recall with heightened upselling intensity. Our findings indicate that increased upselling intensity improves user deposit metrics and concurrently diminishes user satisfaction and conversion rates. We conduct robust counterfactual analysis and train causal meta-learners to personalise users' upselling intensity levels to reach an optimal trade-off point. △ Less

Submitted 9 September, 2024; v1 submitted 1 September, 2024; originally announced September 2024.

arXiv:2409.00553 [pdf, other]

Multi-Output Distributional Fairness via Post-Processing

Authors: Gang Li, Qihang Lin, Ayush Ghosh, Tianbao Yang

Abstract: The post-processing approaches are becoming prominent techniques to enhance machine learning models' fairness because of their intuitiveness, low computational cost, and excellent scalability. However, most existing post-processing methods are designed for task-specific fairness measures and are limited to single-output models. In this paper, we introduce a post-processing method for multi-output… ▽ More The post-processing approaches are becoming prominent techniques to enhance machine learning models' fairness because of their intuitiveness, low computational cost, and excellent scalability. However, most existing post-processing methods are designed for task-specific fairness measures and are limited to single-output models. In this paper, we introduce a post-processing method for multi-output models, such as the ones used for multi-task/multi-class classification and representation learning, to enhance a model's distributional parity, a task-agnostic fairness measure. Existing methods for achieving distributional parity rely on the (inverse) cumulative density function of a model's output, restricting their applicability to single-output models. Extending previous works, we propose to employ optimal transport mappings to move a model's outputs across different groups towards their empirical Wasserstein barycenter. An approximation technique is applied to reduce the complexity of computing the exact barycenter and a kernel regression method is proposed to extend this process to out-of-sample data. Our empirical studies evaluate the proposed approach against various baselines on multi-task/multi-class classification and representation learning tasks, demonstrating the effectiveness of the proposed approach. △ Less

Submitted 20 March, 2025; v1 submitted 31 August, 2024; originally announced September 2024.

Comments: 21 pages, 4 figures

arXiv:2408.16608 [pdf, other]

doi 10.1038/s41586-025-08779-5

Witnessing the onset of reionisation via Lyman-$α$ emission at redshift 13

Authors: Joris Witstok, Peter Jakobsen, Roberto Maiolino, Jakob M. Helton, Benjamin D. Johnson, Brant E. Robertson, Sandro Tacchella, Alex J. Cameron, Renske Smit, Andrew J. Bunker, Aayush Saxena, Fengwu Sun, Stacey Alberts, Santiago Arribas, William M. Baker, Rachana Bhatawdekar, Kristan Boyett, Phillip A. Cargile, Stefano Carniani, Stéphane Charlot, Jacopo Chevallard, Mirko Curti, Emma Curtis-Lake, Francesco D'Eugenio, Daniel J. Eisenstein , et al. (12 additional authors not shown)

Abstract: $\require{mediawiki-texvc}$Cosmic Reionisation commenced when ultraviolet (UV) radiation produced in the first galaxies began illuminating the cold, neutral gas that filled the primordial Universe. Recent James Webb Space Telescope (JWST) observations have shown that surprisingly UV-bright galaxies were in place beyond redshift $z = 14$, when the Universe was less than $300 \, \mathrm{Myr}… ▽ More $\require{mediawiki-texvc}$Cosmic Reionisation commenced when ultraviolet (UV) radiation produced in the first galaxies began illuminating the cold, neutral gas that filled the primordial Universe. Recent James Webb Space Telescope (JWST) observations have shown that surprisingly UV-bright galaxies were in place beyond redshift $z = 14$, when the Universe was less than $300 \, \mathrm{Myr}$ old. Smooth turnovers of their UV continua have been interpreted as damping-wing absorption of Lyman-$α$ (Ly$α$), the principal hydrogen transition. However, spectral signatures encoding crucial properties of these sources, such as their emergent radiation field, largely remain elusive. Here we report spectroscopy from the JWST Advanced Deep Extragalactic Survey (JADES) of a galaxy at redshift $z = 13.0$ that reveal a singular, bright emission line unambiguously identified as Ly$α$, in addition to a smooth turnover. We observe an equivalent width of $\text{EW}_\mathrm{Lyα} > 40 \, Å$ (rest frame), previously only seen at $z < 9$ where the intervening intergalactic medium (IGM) becomes increasingly ionised. Together with an extremely blue UV continuum, the unexpected Ly$α$ emission indicates the galaxy is a prolific producer and leaker of ionising photons. This suggests massive, hot stars or an active galactic nucleus (AGN) have created an early reionised region to prevent complete extinction of Ly$α$, thus shedding new light on the nature of the earliest galaxies and the onset of Reionisation only $330 \, \mathrm{Myr}$ after the Big Bang. △ Less

Submitted 26 March, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

Comments: 22 pages, 12 figures, 4 tables. This version of the article has been accepted for publication, after peer review (when applicable) but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://dx.doi.org/10.1038/s41586-025-08779-5

arXiv:2408.14040 [pdf, other]

Evaluating The Explainability of State-of-the-Art Deep Learning-based Network Intrusion Detection Systems

Authors: Ayush Kumar, Vrizlynn L. L. Thing

Abstract: Network Intrusion Detection Systems (NIDSs) which use deep learning (DL) models achieve high detection performance and accuracy while avoiding dependence on fixed signatures extracted from attack artifacts. However, there is a noticeable hesitance among network security experts and practitioners when it comes to deploying DL-based NIDSs in real-world production environments due to their black-box… ▽ More Network Intrusion Detection Systems (NIDSs) which use deep learning (DL) models achieve high detection performance and accuracy while avoiding dependence on fixed signatures extracted from attack artifacts. However, there is a noticeable hesitance among network security experts and practitioners when it comes to deploying DL-based NIDSs in real-world production environments due to their black-box nature, i.e., how and why the underlying models make their decisions. In this work, we analyze state-of-the-art DL-based NIDS models using explainable AI (xAI) techniques (e.g., TRUSTEE, SHAP) through extensive experiments with two different attack datasets. Using the explanations generated for the models' decisions, the most prominent features used by each NIDS model considered are presented. We compare the explanations generated across xAI methods for a given NIDS model as well as the explanations generated across the NIDS models for a given xAI method. Finally, we evaluate the vulnerability of each NIDS model to inductive bias (artifacts learnt from training data). The results show that: (1) some DL-based NIDS models can be better explained than other models, (2) xAI explanations are in conflict for most of the NIDS models considered in this work and (3) some NIDS models are more vulnerable to inductive bias than other models. △ Less

Submitted 19 February, 2025; v1 submitted 26 August, 2024; originally announced August 2024.

arXiv:2408.14020 [pdf, other]

doi 10.3847/1538-4357/adc918

Implications of Fermionic Dark Matter Interactions on Anisotropic Neutron Stars

Authors: Premachand Mahapatra, Chiranjeeb Singha, Ayush Hazarika, Prasanta Kumar Das

Abstract: The presence of Dark matter (DM) within a neutron star (NS) can substantially influence the macroscopic properties. It is commonly assumed that the pressure inside an NS is isotropic, but in reality, pressure is locally anisotropic. This study explores the properties of anisotropic NS with a subfraction of DM (isotropic) trapped inside. Implementing a two-fluid formalism with three Equations of St… ▽ More The presence of Dark matter (DM) within a neutron star (NS) can substantially influence the macroscopic properties. It is commonly assumed that the pressure inside an NS is isotropic, but in reality, pressure is locally anisotropic. This study explores the properties of anisotropic NS with a subfraction of DM (isotropic) trapped inside. Implementing a two-fluid formalism with three Equations of State (EOS): AP3 (a realistic nucleon-nucleon interaction model), BSk22 (modeling atomic nuclei and neutron-matter), and MPA1 (considering relativistic effects in nuclear interactions). The properties of NS, such as mass ($M$), radius ($R$), and dimensionless tidal deformability ($Λ$), for various DM-anisotropic configurations, have been rigorously tested against observational constraints. These constraints include data from the binary NS merger GW170817, NICER x-ray measurements, and pulsar mass-radius observations. We observe that with increasing DM subfraction, higher anisotropies could also satisfy the observational constraints. Furthermore, increasing the coupling ($g$) between DM and its mediator leads to the formation of a core-halo structure, with a DM halo surrounding the baryonic matter (BM). Specifically, for coupling values of $g = 10^{-4}$, $10^{-3.7}$, and $10^{-3.5}$, we observe that the maximum radius ($R_{max}$) decreases with increasing anisotropy, which contrasts with the behavior at $g = 10^{-5}$ and in scenarios with no DM. Our analysis indicates that binary pulsar systems could potentially constrain the extent of admixed anisotropic NS or, more optimistically, provide evidence for the existence of DM-admixed anisotropic NS. △ Less

Submitted 12 September, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

Comments: 29 pages, 12 figures, 6 tables

arXiv:2408.13627 [pdf, other]

Recent Event Camera Innovations: A Survey

Authors: Bharatesh Chakravarthi, Aayush Atul Verma, Kostas Daniilidis, Cornelia Fermuller, Yezhou Yang

Abstract: Event-based vision, inspired by the human visual system, offers transformative capabilities such as low latency, high dynamic range, and reduced power consumption. This paper presents a comprehensive survey of event cameras, tracing their evolution over time. It introduces the fundamental principles of event cameras, compares them with traditional frame cameras, and highlights their unique charact… ▽ More Event-based vision, inspired by the human visual system, offers transformative capabilities such as low latency, high dynamic range, and reduced power consumption. This paper presents a comprehensive survey of event cameras, tracing their evolution over time. It introduces the fundamental principles of event cameras, compares them with traditional frame cameras, and highlights their unique characteristics and operational differences. The survey covers various event camera models from leading manufacturers, key technological milestones, and influential research contributions. It explores diverse application areas across different domains and discusses essential real-world and synthetic datasets for research advancement. Additionally, the role of event camera simulators in testing and development is discussed. This survey aims to consolidate the current state of event cameras and inspire further innovation in this rapidly evolving field. To support the research community, a GitHub page (https://github.com/chakravarthi589/Event-based-Vision_Resources) categorizes past and future research articles and consolidates valuable resources. △ Less

Submitted 27 August, 2024; v1 submitted 24 August, 2024; originally announced August 2024.

arXiv:2408.13215 [pdf, other]

Morphology of 137 Fast Radio Bursts down to Microseconds Timescales from The First CHIME/FRB Baseband Catalog

Authors: Ketan R. Sand, Alice P. Curtin, Daniele Michilli, Victoria M. Kaspi, Emmanuel Fonseca, Kenzie Nimmo, Ziggy Pleunis, Kaitlyn Shin, Mohit Bhardwaj, Charanjot Brar, Matt Dobbs, Gwendolyn Eadie, B. M. Gaensler, Ronniy C. Joseph, Calvin Leung, Robert Main, Kiyoshi W. Masui, Ryan Mckinven, Ayush Pandhi, Aaron B. Pearlman, Masoud Rafiei-Ravandi, Mawson W. Sammons, Kendrick Smith, Ingrid H. Stairs

Abstract: We present a spectro-temporal analysis of 137 fast radio bursts (FRBs) from the first CHIME/FRB baseband catalog, including 125 one-off bursts and 12 repeat bursts, down to microsecond resolution using the least-squares optimization fitting routine: fitburst. Our measured values are compared with those in the first CHIME/FRB intensity catalog, revealing that nearly one-third of our sample exhibits… ▽ More We present a spectro-temporal analysis of 137 fast radio bursts (FRBs) from the first CHIME/FRB baseband catalog, including 125 one-off bursts and 12 repeat bursts, down to microsecond resolution using the least-squares optimization fitting routine: fitburst. Our measured values are compared with those in the first CHIME/FRB intensity catalog, revealing that nearly one-third of our sample exhibits additional burst components at higher time resolutions. We measure sub-burst components within burst envelopes as narrow as $\sim$23 $μ$s (FWHM), with 20% of the sample displaying sub-structures narrower than 100 $μ$s, offering constraints on emission mechanisms. Scattering timescales in the sample range from 30 $μ$s to 13 ms at 600 MHz. We observe no correlations between scattering time and dispersion measure, rotation measure, or linear polarization fraction, with the latter suggesting that depolarization due to multipath propagation is negligible in our sample. Bursts with narrower envelopes ($\leq$ 1 ms) in our sample exhibit higher flux densities, indicating the potential presence of sub-ms FRBs that are being missed by our real-time system below a brightness threshold. Most multicomponent bursts in our sample exhibit sub-burst separations of $\leq$ 1 ms, with no bursts showing separations $<$41 $μ$s, even at a time resolution of 2.56 $μ$s, but both scattering and low signal-to-noise ratio can hinder detection of additional components. Lastly, given the morphological diversity of our sample, we suggest that one-off and repeating FRBs can come from different classes but have overlapping property distributions. △ Less

Submitted 23 August, 2024; originally announced August 2024.

Comments: 25 pages 14 figures

arXiv:2408.11895 [pdf, other]

doi 10.3847/1538-4357/ad6a13

Contemporaneous X-ray Observations of 30 Bright Radio Bursts from the Prolific Fast Radio Burst Source FRB 20220912A

Authors: Amanda M. Cook, Paul Scholz, Aaron B. Pearlman, Thomas C. Abbott, Marilyn Cruces, B. M. Gaensler, Fengqiu, Dong, Daniele Michilli, Gwendolyn Eadie, Victoria M. Kaspi, Ingrid Stairs, Chia Min Tan, Mohit Bhardwaj, Tomas Cassanelli, Alice P. Curtin, Adaeze L. Ibik, Mattias Lazda, Kiyoshi W. Masui, Ayush Pandhi, Masoud Rafiei-Ravandi, Mawson W. Sammons, Kaitlyn Shin, Kendrick Smith, David C. Stenning

Abstract: We present an extensive contemporaneous X-ray and radio campaign performed on the repeating fast radio burst (FRB) source FRB 20220912A for eight weeks immediately following the source's detection by CHIME/FRB. This includes X-ray data from XMM-Newton, NICER, and Swift, and radio detections of FRB 20220912A from CHIME/Pulsar and Effelsberg. We detect no significant X-ray emission at the time of 30… ▽ More We present an extensive contemporaneous X-ray and radio campaign performed on the repeating fast radio burst (FRB) source FRB 20220912A for eight weeks immediately following the source's detection by CHIME/FRB. This includes X-ray data from XMM-Newton, NICER, and Swift, and radio detections of FRB 20220912A from CHIME/Pulsar and Effelsberg. We detect no significant X-ray emission at the time of 30 radio bursts with upper limits on $0.5-10.0$ keV X-ray fluence of $(1.5-14.5)\times 10^{-10}$ erg cm$^{-2}$ (99.7% credible interval, unabsorbed) on a timescale of 100 ms. Translated into a fluence ratio $η_{\text{ x/r}} = F_{\text{X-ray}}/F_{\text{radio}}$, this corresponds to $η_{\text{ x/r}} < 7\times10^{6}$. For persistent emission from the location of FRB 20220912A, we derive a 99.7% $0.5-10.0$ keV isotropic flux limit of $8.8\times 10^{-15}$ erg cm$^{-2}$ s$^{-1}$ (unabsorbed) or an isotropic luminosity limit of 1.4$\times10^{41}$ erg s$^{-1}$ at a distance of 362.4 Mpc. We derive a hierarchical extension to the standard Bayesian treatment of low-count and background-contaminated X-ray data, which allows the robust combination of multiple observations. This methodology allows us to place the best (lowest) 99.7% credible interval upper limit on an FRB $η_{\text{ x/r}}$ to date, $η_{\text{ x/r}} < 2\times10^6$, assuming that all thirty detected radio bursts are associated with X-ray bursts with the same fluence ratio. If we instead adopt an X-ray spectrum similar to the X-ray burst observed contemporaneously with FRB-like emission from Galactic magnetar SGR 1935+2154 detected on 2020 April 28, we derive a 99.7% credible interval upper limit on $η_{\text{ x/r}}$ of $8\times10^5$, which is only 3 times the observed value of $η_{\text{ x/r}}$ for SGR 1935+2154. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Comments: 23 pages, 3 figures. ApJ in press (accepted after resubmission July 19th, 2024)

arXiv:2408.11374 [pdf, other]

A Unified Framework for Continual Learning and Unlearning

Authors: Romit Chatterjee, Vikram Chundawat, Ayush Tarun, Ankur Mali, Murari Mandal

Abstract: Continual learning and machine unlearning are crucial challenges in machine learning, typically addressed separately. Continual learning focuses on adapting to new knowledge while preserving past information, whereas unlearning involves selectively forgetting specific subsets of data. In this paper, we introduce a new framework that jointly tackles both tasks by leveraging controlled knowledge dis… ▽ More Continual learning and machine unlearning are crucial challenges in machine learning, typically addressed separately. Continual learning focuses on adapting to new knowledge while preserving past information, whereas unlearning involves selectively forgetting specific subsets of data. In this paper, we introduce a new framework that jointly tackles both tasks by leveraging controlled knowledge distillation. Our approach enables efficient learning with minimal forgetting and effective targeted unlearning. By incorporating a fixed memory buffer, the system supports learning new concepts while retaining prior knowledge. The distillation process is carefully managed to ensure a balance between acquiring new information and forgetting specific data as needed. Experimental results on benchmark datasets show that our method matches or exceeds the performance of existing approaches in both continual learning and machine unlearning. This unified framework is the first to address both challenges simultaneously, paving the way for adaptable models capable of dynamic learning and forgetting while maintaining strong overall performance. Source code: \textcolor{blue}{https://respailab.github.io/CLMUL} △ Less

Submitted 25 December, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

arXiv:2408.10816 [pdf, other]

Deep Learning-based Classification of Dementia using Image Representation of Subcortical Signals

Authors: Shivani Ranjan, Ayush Tripathi, Harshal Shende, Robin Badal, Amit Kumar, Pramod Yadav, Deepak Joshi, Lalan Kumar

Abstract: Dementia is a neurological syndrome marked by cognitive decline. Alzheimer's disease (AD) and Frontotemporal dementia (FTD) are the common forms of dementia, each with distinct progression patterns. EEG, a non-invasive tool for recording brain activity, has shown potential in distinguishing AD from FTD and mild cognitive impairment (MCI). Previous studies have utilized various EEG features, such a… ▽ More Dementia is a neurological syndrome marked by cognitive decline. Alzheimer's disease (AD) and Frontotemporal dementia (FTD) are the common forms of dementia, each with distinct progression patterns. EEG, a non-invasive tool for recording brain activity, has shown potential in distinguishing AD from FTD and mild cognitive impairment (MCI). Previous studies have utilized various EEG features, such as subband power and connectivity patterns to differentiate these conditions. However, artifacts in EEG signals can obscure crucial information, necessitating advanced signal processing techniques. This study aims to develop a deep learning-based classification system for dementia by analyzing scout time-series signals from deep brain regions, specifically the hippocampus, amygdala, and thalamus. The study utilizes scout time series extracted via the standardized low-resolution brain electromagnetic tomography (sLORETA) technique. The time series is converted to image representations using continuous wavelet transform (CWT) and fed as input to deep learning models. Two high-density EEG datasets are utilized to check for the efficacy of the proposed method: the online BrainLat dataset (comprising AD, FTD, and healthy controls (HC)) and the in-house IITD-AIIA dataset (including subjects with AD, MCI, and HC). Different classification strategies and classifier combinations have been utilized for the accurate mapping of classes on both datasets. The best results were achieved by using a product of probabilities from classifiers for left and right subcortical regions in conjunction with the DenseNet model architecture. It yields accuracies of 94.17$\%$ and 77.72$\%$ on the BrainLat and IITD-AIIA datasets, respectively. This highlights the potential of this approach for early and accurate differentiation of neurodegenerative disorders. △ Less

Submitted 20 August, 2024; originally announced August 2024.

arXiv:2408.10601 [pdf]

Wavelength-Agnostic Metasurface Design for Next-Generation 2D Photodetectors

Authors: Ayush M. Jamdar, Rituraj, Srini Krishnamurthy, Vidya Praveen Bhallamudi, Sivarama Krishnan

Abstract: We explore a versatile technique for inverse designing 2D photonic crystal metasurfaces. These surfaces, known for their ability to manipulate light-matter interactions, can be precisely controlled to achieve specific functionalities. The key lies in efficiently optimizing the geometric patterns and dimensions of the metasurface. Through a composite method which exploits two well-established parad… ▽ More We explore a versatile technique for inverse designing 2D photonic crystal metasurfaces. These surfaces, known for their ability to manipulate light-matter interactions, can be precisely controlled to achieve specific functionalities. The key lies in efficiently optimizing the geometric patterns and dimensions of the metasurface. Through a composite method which exploits two well-established paradigms - Covariance Matrix Adaptation optimization and Rigorous Coupled Wave Analysis (RCWA), we demonstrate our ability to design and optimize resonances in metaelements to achieve desired optical performance such as near-perfect absorption at chosen wavelengths/optical modes, which otherwise proves to be challenging or even impossible with conventional inverse design implementations. We apply our method to design three-layered structures involving a monolayer absorber, transparent metasubstrate, and a back mirror to get near 100% absorption at one or two chosen wavelengths. For illustration, we choose black phosphorus and silicon metasurface to predict ~100% absorption in a monolayer at 1550 nm. The versatile technique can be applied to tailor reflectance and transmittance for any optical mode and wavelength. This computationally efficient design method paves the way for creating high-performance 2D metasurface-based devices with a variety of applications, including quantum technology components such as single photon sensors and biphoton sources, communication systems, and non-linear light conversion. △ Less

Submitted 27 November, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

arXiv:2408.09117 [pdf, other]

LOID: Lane Occlusion Inpainting and Detection for Enhanced Autonomous Driving Systems

Authors: Aayush Agrawal, Ashmitha Jaysi Sivakumar, Ibrahim Kaif, Chayan Banerjee

Abstract: Accurate lane detection is essential for effective path planning and lane following in autonomous driving, especially in scenarios with significant occlusion from vehicles and pedestrians. Existing models often struggle under such conditions, leading to unreliable navigation and safety risks. We propose two innovative approaches to enhance lane detection in these challenging environments, each sho… ▽ More Accurate lane detection is essential for effective path planning and lane following in autonomous driving, especially in scenarios with significant occlusion from vehicles and pedestrians. Existing models often struggle under such conditions, leading to unreliable navigation and safety risks. We propose two innovative approaches to enhance lane detection in these challenging environments, each showing notable improvements over current methods. The first approach aug-Segment improves conventional lane detection models by augmenting the training dataset of CULanes with simulated occlusions and training a segmentation model. This method achieves a 12% improvement over a number of SOTA models on the CULanes dataset, demonstrating that enriched training data can better handle occlusions, however, since this model lacked robustness to certain settings, our main contribution is the second approach, LOID Lane Occlusion Inpainting and Detection. LOID introduces an advanced lane detection network that uses an image processing pipeline to identify and mask occlusions. It then employs inpainting models to reconstruct the road environment in the occluded areas. The enhanced image is processed by a lane detection algorithm, resulting in a 20% & 24% improvement over several SOTA models on the BDDK100 and CULanes datasets respectively, highlighting the effectiveness of this novel technique. △ Less

Submitted 17 August, 2024; originally announced August 2024.

Comments: 8 pages, 6 figures and 4 tables

arXiv:2408.07050 [pdf, other]

PSM: Learning Probabilistic Embeddings for Multi-scale Zero-Shot Soundscape Mapping

Authors: Subash Khanal, Eric Xing, Srikumar Sastry, Aayush Dhakal, Zhexiao Xiong, Adeel Ahmad, Nathan Jacobs

Abstract: A soundscape is defined by the acoustic environment a person perceives at a location. In this work, we propose a framework for mapping soundscapes across the Earth. Since soundscapes involve sound distributions that span varying spatial scales, we represent locations with multi-scale satellite imagery and learn a joint representation among this imagery, audio, and text. To capture the inherent unc… ▽ More A soundscape is defined by the acoustic environment a person perceives at a location. In this work, we propose a framework for mapping soundscapes across the Earth. Since soundscapes involve sound distributions that span varying spatial scales, we represent locations with multi-scale satellite imagery and learn a joint representation among this imagery, audio, and text. To capture the inherent uncertainty in the soundscape of a location, we design the representation space to be probabilistic. We also fuse ubiquitous metadata (including geolocation, time, and data source) to enable learning of spatially and temporally dynamic representations of soundscapes. We demonstrate the utility of our framework by creating large-scale soundscape maps integrating both audio and text with temporal control. To facilitate future research on this task, we also introduce a large-scale dataset, GeoSound, containing over $300k$ geotagged audio samples paired with both low- and high-resolution satellite imagery. We demonstrate that our method outperforms the existing state-of-the-art on both GeoSound and the existing SoundingEarth dataset. Our dataset and code is available at https://github.com/mvrl/PSM. △ Less

Submitted 13 August, 2024; originally announced August 2024.

Comments: Accepted at ACM MM 2024

arXiv:2408.07009 [pdf, other]

Imagen 3

Authors: Imagen-Team-Google, :, Jason Baldridge, Jakob Bauer, Mukul Bhutani, Nicole Brichtova, Andrew Bunner, Lluis Castrejon, Kelvin Chan, Yichang Chen, Sander Dieleman, Yuqing Du, Zach Eaton-Rosen, Hongliang Fei, Nando de Freitas, Yilin Gao, Evgeny Gladchenko, Sergio Gómez Colmenarejo, Mandy Guo, Alex Haig, Will Hawkins, Hexiang Hu, Huilian Huang, Tobenna Peter Igwe, Christos Kaplanis , et al. (237 additional authors not shown)

Abstract: We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models. We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models. △ Less

Submitted 21 December, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

arXiv:2408.06113 [pdf, other]

IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI

Authors: Yash Rampuria, Deep Boliya, Shreyash Gupta, Gopalan Iyengar, Ayush Rohilla, Mohak Vyas, Chaitanya Langde, Mehul Vijay Chanda, Ronak Gautam Matai, Kothapalli Namitha, Ajinkya Pawar, Bhaskar Biswas, Nakul Agarwal, Rajit Khandelwal, Rohan Kumar, Shubham Agarwal, Vishwam Patel, Abhimanyu Singh Rathore, Amna Rahman, Ayush Mishra, Yash Tangri

Abstract: This work presents the design and development of IIT Bombay Racing's Formula Student style autonomous racecar algorithm capable of running at the racing events of Formula Student-AI, held in the UK. The car employs a cutting-edge sensor suite of the compute unit NVIDIA Jetson Orin AGX, 2 ZED2i stereo cameras, 1 Velodyne Puck VLP16 LiDAR and SBG Systems Ellipse N GNSS/INS IMU. It features deep lear… ▽ More This work presents the design and development of IIT Bombay Racing's Formula Student style autonomous racecar algorithm capable of running at the racing events of Formula Student-AI, held in the UK. The car employs a cutting-edge sensor suite of the compute unit NVIDIA Jetson Orin AGX, 2 ZED2i stereo cameras, 1 Velodyne Puck VLP16 LiDAR and SBG Systems Ellipse N GNSS/INS IMU. It features deep learning algorithms and control systems to navigate complex tracks and execute maneuvers without any human intervention. The design process involved extensive simulations and testing to optimize the vehicle's performance and ensure its safety. The algorithms have been tested on a small scale, in-house manufactured 4-wheeled robot and on simulation software. The results obtained for testing various algorithms in perception, simultaneous localization and mapping, path planning and controls have been detailed. △ Less

Submitted 12 August, 2024; originally announced August 2024.

Comments: 8 pages, 19 figures

arXiv:2408.05089 [pdf, other]

doi 10.1017/jfm.2025.237

Viscoelastic Worthington jets & droplets produced by bursting bubbles

Authors: Ayush K. Dixit, Alexandros Oratis, Konstantinos Zinelis, Detlef Lohse, Vatsal Sanjay

Abstract: Bubble bursting and subsequent collapse of the open cavity at free surfaces of contaminated liquids can generate aerosol droplets, facilitating pathogen transport. After film rupture, capillary waves focus at the cavity base, potentially generating fast Worthington jets that are responsible for ejecting the droplets away from the source. While extensively studied for Newtonian fluids, the influenc… ▽ More Bubble bursting and subsequent collapse of the open cavity at free surfaces of contaminated liquids can generate aerosol droplets, facilitating pathogen transport. After film rupture, capillary waves focus at the cavity base, potentially generating fast Worthington jets that are responsible for ejecting the droplets away from the source. While extensively studied for Newtonian fluids, the influence of non-Newtonian rheology on this process remains poorly understood. Here, we employ direct numerical simulations to investigate the bubble cavity collapse in viscoelastic media, such as polymeric liquids. We find that the jet and drop formation are dictated by two dimensionless parameters: the elastocapillary number $Ec$ (the ratio of the elastic modulus and the Laplace pressure) and the Deborah number $De$ (the ratio of the relaxation time and the inertio-capillary timescale). We show that for low values of $Ec$ and $De$, the viscoelastic liquid adopts a Newtonian-like behavior, where the dynamics are governed by the solvent Ohnesorge number $Oh_s$ (the ratio of visco-capillary and inertio-capillary timescales). In contrast, for large values $Ec$ and $De$, the enhanced elastic stresses completely suppress the formation of the jet. For some cases with intermediate values of $Ec$ and $De$, smaller droplets are produced compared to Newtonian fluids, potentially enhancing aerosol dispersal. By mapping the phase space spanned by $Ec$, $De$, and $Oh_s$, we reveal three distinct flow regimes: (i) jets forming droplets, (ii) jets without droplet formation, and (iii) absence of jet formation. Our results elucidate the mechanisms underlying aerosol suppression versus fine spray formation in polymeric liquids, with implications for pathogen transmission and industrial processes involving viscoelastic fluids. △ Less

Submitted 14 May, 2025; v1 submitted 9 August, 2024; originally announced August 2024.

Journal ref: J. Fluid Mech., 1010, A2 (2025)

arXiv:2408.04870 [pdf, other]

ConfusedPilot: Confused Deputy Risks in RAG-based LLMs

Authors: Ayush RoyChowdhury, Mulong Luo, Prateek Sahu, Sarbartha Banerjee, Mohit Tiwari

Abstract: Retrieval augmented generation (RAG) is a process where a large language model (LLM) retrieves useful information from a database and then generates the responses. It is becoming popular in enterprise settings for daily business operations. For example, Copilot for Microsoft 365 has accumulated millions of businesses. However, the security implications of adopting such RAG-based systems are unclea… ▽ More Retrieval augmented generation (RAG) is a process where a large language model (LLM) retrieves useful information from a database and then generates the responses. It is becoming popular in enterprise settings for daily business operations. For example, Copilot for Microsoft 365 has accumulated millions of businesses. However, the security implications of adopting such RAG-based systems are unclear. In this paper, we introduce ConfusedPilot, a class of security vulnerabilities of RAG systems that confuse Copilot and cause integrity and confidentiality violations in its responses. First, we investigate a vulnerability that embeds malicious text in the modified prompt in RAG, corrupting the responses generated by the LLM. Second, we demonstrate a vulnerability that leaks secret data, which leverages the caching mechanism during retrieval. Third, we investigate how both vulnerabilities can be exploited to propagate misinformation within the enterprise and ultimately impact its operations, such as sales and manufacturing. We also discuss the root cause of these attacks by investigating the architecture of a RAG-based system. This study highlights the security vulnerabilities in today's RAG-based systems and proposes design guidelines to secure future RAG-based systems. △ Less

Submitted 23 October, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

arXiv:2408.04800 [pdf]

A High-Temperature Thermocouple Development by Additive Manufacturing: Tungsten-Nickel (W-Ni) and Molybdenum (Mo) Integration with Ceramic Structures

Authors: Azizul Islam, Aayush Alok, Vamsi Borra, Pedro Cortes

Abstract: Additive manufacturing holds more potential to enable the development of ceramic-based components. Ceramics offer high resistance to heat, high fracture toughness, and are extremely corrosion resistant. Thus, ceramics are widely used in sectors such as the aerospace industry, automotive, microelectronics, and biomedicine. Using various additive manufacturing platforms, ceramics with complex and un… ▽ More Additive manufacturing holds more potential to enable the development of ceramic-based components. Ceramics offer high resistance to heat, high fracture toughness, and are extremely corrosion resistant. Thus, ceramics are widely used in sectors such as the aerospace industry, automotive, microelectronics, and biomedicine. Using various additive manufacturing platforms, ceramics with complex and uniquely designed geometry can be developed to suit specific applications. This project aims at innovating high-temperature thermocouples by embedding conductive metal pastes into a ceramic structure. The paste used includes tungsten, molybdenum, and antimony. The metal pastes are precisely extruded into a T-shaped trench inside the ceramic matrix. Following specific temperature ranges, the ceramic matrix is sintered to improve the properties of the material. The sensors produced can function at extremely high temperatures and are thereby suitable for high-temperature environments. Comparative testing of the 3D sintered sensors with conventional temperature sensors shows high correlation between the two classes of sensors. The resulting R-squared value of 0.9885 is satisfactory which implies the reliability and accuracy of 3D sintering sensors are satisfactory in temperature sensing applications. △ Less

Submitted 18 September, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

Comments: 12 pages, 5 figures

Showing 201–250 of 1,098 results for author: Aayush