Search | arXiv e-print repository

Low-Loss Superconducting Resonators Fabricated from Tantalum Films Grown at Room Temperature

Authors: Guillaume Marcaud, David Perello, Cliff Chen, Esha Umbarkar, Conan Weiland, Jiansong Gao, Sandra Diez, Victor Ly, Neha Mahuli, Nathan D'Souza, Yuan He, Shahriar Aghaeimeibodi, Rachel Resnick, Cherno Jaye, Abdul K. Rumaiz, Daniel A. Fischer, Matthew Hunt, Oskar Painter, Ignace Jarrige

Abstract: The use of $α$-tantalum in superconducting circuits has enabled a considerable improvement of the coherence time of transmon qubits. The standard approach to grow $α$-tantalum thin films on silicon involves heating the substrate, which takes several hours per deposition and prevents the integration of this material with wafers containing temperature-sensitive components. We report a detailed exper… ▽ More The use of $α$-tantalum in superconducting circuits has enabled a considerable improvement of the coherence time of transmon qubits. The standard approach to grow $α$-tantalum thin films on silicon involves heating the substrate, which takes several hours per deposition and prevents the integration of this material with wafers containing temperature-sensitive components. We report a detailed experimental study of an alternative growth method of $α$-tantalum on silicon, which is achieved at room temperature through the use of a niobium seed layer. Despite a substantially higher density of oxygen-rich grain boundaries in the films sputtered at room temperature, resonators made from these films are found to have state-of-the-art quality factors, comparable to resonators fabricated from tantalum grown at high temperature. This finding challenges previous assumptions about correlations between material properties and microwave loss of superconducting thin films, and opens a new avenue for the integration of tantalum into fabrication flows with limited thermal budget. △ Less

Submitted 16 January, 2025; originally announced January 2025.

arXiv:2501.06306 [pdf, other]

On How Traffic Signals Impact the Fundamental Diagrams of Urban Roads

Authors: Chao Zhang, Yechen Li, Neha Arora, Carolina Osorio

Abstract: Being widely adopted by the transportation and planning practitioners, the fundamental diagram (FD) is the primary tool used to relate the key macroscopic traffic variables of speed, flow, and density. We empirically analyze the relation between vehicular space-mean speeds and flows given different signal settings and postulate a parsimonious parametric function form of the traditional FD where it… ▽ More Being widely adopted by the transportation and planning practitioners, the fundamental diagram (FD) is the primary tool used to relate the key macroscopic traffic variables of speed, flow, and density. We empirically analyze the relation between vehicular space-mean speeds and flows given different signal settings and postulate a parsimonious parametric function form of the traditional FD where its function parameters are explicitly modeled as a function of the signal plan factors. We validate the proposed formulation using data from signalized urban road segments in Salt Lake City, Utah, USA. The proposed formulation builds our understanding of how changes to signal settings impact the FDs, and more generally the congestion patterns, of signalized urban segments. △ Less

Submitted 10 January, 2025; originally announced January 2025.

Comments: Published in the 4th Symposium on Management of Future Motorway and Urban Traffic Systems (MFTS)

arXiv:2501.06126 [pdf, other]

Merging Feed-Forward Sublayers for Compressed Transformers

Authors: Neha Verma, Kenton Murray, Kevin Duh

Abstract: With the rise and ubiquity of larger deep learning models, the need for high-quality compression techniques is growing in order to deploy these models widely. The sheer parameter count of these models makes it difficult to fit them into the memory constraints of different hardware. In this work, we present a novel approach to model compression by merging similar parameter groups within a model, ra… ▽ More With the rise and ubiquity of larger deep learning models, the need for high-quality compression techniques is growing in order to deploy these models widely. The sheer parameter count of these models makes it difficult to fit them into the memory constraints of different hardware. In this work, we present a novel approach to model compression by merging similar parameter groups within a model, rather than pruning away less important parameters. Specifically, we select, align, and merge separate feed-forward sublayers in Transformer models, and test our method on language modeling, image classification, and machine translation. With our method, we demonstrate performance comparable to the original models while combining more than a third of model feed-forward sublayers, and demonstrate improved performance over a strong layer-pruning baseline. For instance, we can remove over 21% of total parameters from a Vision Transformer, while maintaining 99% of its original performance. Additionally, we observe that some groups of feed-forward sublayers exhibit high activation similarity, which may help explain their surprising mergeability. △ Less

Submitted 28 March, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

arXiv:2501.04783 [pdf, other]

Traffic Simulations: Multi-City Calibration of Metropolitan Highway Networks

Authors: Chao Zhang, Yechen Li, Neha Arora, Damien Pierce, Carolina Osorio

Abstract: This paper proposes an approach to perform travel demand calibration for high-resolution stochastic traffic simulators. It employs abundant travel times at the path-level, departing from the standard practice of resorting to scarce segment-level sensor counts. The proposed approach is shown to tackle high-dimensional instances in a sample-efficient way. For the first time, case studies on 6 metrop… ▽ More This paper proposes an approach to perform travel demand calibration for high-resolution stochastic traffic simulators. It employs abundant travel times at the path-level, departing from the standard practice of resorting to scarce segment-level sensor counts. The proposed approach is shown to tackle high-dimensional instances in a sample-efficient way. For the first time, case studies on 6 metropolitan highway networks are carried out, considering a total of 54 calibration scenarios. This is the first work to show the ability of a calibration algorithm to systematically scale across networks. Compared to the state-of-the-art simultaneous perturbation stochastic approximation (SPSA) algorithm, the proposed approach enhances fit to field data by an average 43.5% with a maximum improvement of 80.0%, and does so within fewer simulation calls. △ Less

Submitted 8 January, 2025; originally announced January 2025.

Comments: Published on the 27th IEEE International Conference on Intelligent Transportation Systems (ITSC) (2024)

arXiv:2412.20909 [pdf, other]

Stiefel-Whitney Classes for Finite Symplectic Groups

Authors: Neha Malik, Steven Spallone

Abstract: Let $q$ be an odd prime power, and $G=\text{Sp}(2n,q)$ the finite symplectic group. We give an expression for the total Stiefel-Whitney Classes (SWCs) for orthogonal representations $π$ of $G$, in terms of character values of $π$ at elements of order $2$. We give "universal formulas'' for the fourth and eighth SWCs. For $n=2$, we compute the subring of the mod $2$ cohomology generated by the SWCs… ▽ More Let $q$ be an odd prime power, and $G=\text{Sp}(2n,q)$ the finite symplectic group. We give an expression for the total Stiefel-Whitney Classes (SWCs) for orthogonal representations $π$ of $G$, in terms of character values of $π$ at elements of order $2$. We give "universal formulas'' for the fourth and eighth SWCs. For $n=2$, we compute the subring of the mod $2$ cohomology generated by the SWCs $w_k(π)$. △ Less

Submitted 30 December, 2024; originally announced December 2024.

Comments: 22 pages

MSC Class: 20G40; 55R40

arXiv:2412.17899 [pdf, other]

A mixing time bound for Gibbs sampling from log-smooth log-concave distributions

Authors: Neha S. Wadia

Abstract: The Gibbs sampler, also known as the coordinate hit-and-run algorithm, is a Markov chain that is widely used to draw samples from probability distributions in arbitrary dimensions. At each iteration of the algorithm, a randomly selected coordinate is resampled from the distribution that results from conditioning on all the other coordinates. We study the behavior of the Gibbs sampler on the class… ▽ More The Gibbs sampler, also known as the coordinate hit-and-run algorithm, is a Markov chain that is widely used to draw samples from probability distributions in arbitrary dimensions. At each iteration of the algorithm, a randomly selected coordinate is resampled from the distribution that results from conditioning on all the other coordinates. We study the behavior of the Gibbs sampler on the class of log-smooth and strongly log-concave target distributions supported on $\mathbb{R}^n$. Assuming the initial distribution is $M$-warm with respect to the target, we show that the Gibbs sampler requires at most $O^{\star}\left(κ^2 n^{7.5}\left(\max\left\{1,\sqrt{\frac{1}{n}\log \frac{2M}γ}\right\}\right)^2\right)$ steps to produce a sample with error no more than $γ$ in total variation distance from a distribution with condition number $κ$. △ Less

Submitted 23 December, 2024; originally announced December 2024.

Comments: 22 pages, 4 figures

arXiv:2412.14913 [pdf, other]

Dynamics of Quantum Coherence and Non-Classical Correlations in Open Quantum System Coupled to a Squeezed Thermal Bath

Authors: Neha Pathania, Ramniwas Meena, Subhashish Banerjee

Abstract: We investigate the intricate dynamics of quantum coherence and non-classical correlations in a two-qubit open quantum system coupled to a squeezed thermal reservoir. By exploring the correlations between spatially separated qubits, we unravel the complex interplay between quantum correlations and decoherence induced by the reservoir. Our findings demonstrate that non-classical correlations such as… ▽ More We investigate the intricate dynamics of quantum coherence and non-classical correlations in a two-qubit open quantum system coupled to a squeezed thermal reservoir. By exploring the correlations between spatially separated qubits, we unravel the complex interplay between quantum correlations and decoherence induced by the reservoir. Our findings demonstrate that non-classical correlations such as quantum consonance, quantum discord, local quantum uncertainty, and quantum Fisher information are highly sensitive to the collective regime. These insights identify key parameters for optimizing quantum metrology and parameter estimation in systems exposed to environmental interactions. Furthermore, we quantify these quantum correlations in the context of practical applications such as quantum teleportation, using the two metrics viz. maximal teleportation fidelity and fidelity deviation. This work bridges theoretical advancements with real-world applications, offering a comprehensive framework for leveraging quantum resources under the influence of environmental decoherence. △ Less

Submitted 19 December, 2024; originally announced December 2024.

arXiv:2412.14089 [pdf, other]

doi 10.1145/3589132.3625566

On the Use of Abundant Road Speed Data for Travel Demand Calibration of Urban Traffic Simulators

Authors: Suyash Vishnoi, Akhil Shetty, Iveel Tsogsuren, Neha Arora, Carolina Osorio

Abstract: This work develops a compute-efficient algorithm to tackle a fundamental problem in transportation: that of urban travel demand estimation. It focuses on the calibration of origin-destination travel demand input parameters for high-resolution traffic simulation models. It considers the use of abundant traffic road speed data. The travel demand calibration problem is formulated as a continuous, hig… ▽ More This work develops a compute-efficient algorithm to tackle a fundamental problem in transportation: that of urban travel demand estimation. It focuses on the calibration of origin-destination travel demand input parameters for high-resolution traffic simulation models. It considers the use of abundant traffic road speed data. The travel demand calibration problem is formulated as a continuous, high-dimensional, simulation-based optimization (SO) problem with bound constraints. There is a lack of compute efficient algorithms to tackle this problem. We propose the use of an SO algorithm that relies on an efficient, analytical, differentiable, physics-based traffic model, known as a metamodel or surrogate model. We formulate a metamodel that enables the use of road speed data. Tests are performed on a Salt Lake City network. We study how the amount of data, as well as the congestion levels, impact both in-sample and out-of-sample performance. The proposed method outperforms the benchmark for both in-sample and out-of-sample performance by 84.4% and 72.2% in terms of speeds and counts, respectively. Most importantly, the proposed method yields the highest compute efficiency, identifying solutions with good performance within few simulation function evaluations (i.e., with small samples). △ Less

Submitted 18 December, 2024; originally announced December 2024.

Comments: 4 pages

Journal ref: Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, pp. 1-4. 2023

arXiv:2412.11059 [pdf, other]

On the specific solutions of reduced biquaternion equality constrained least squares problem and their relative forward error bound

Authors: Sk. Safique Ahmad, Neha Bhadala

Abstract: This study focuses on addressing the challenge of solving the reduced biquaternion equality constrained least squares (RBLSE) problem. We develop algebraic techniques to derive real and complex solutions for the RBLSE problem by utilizing the real and complex forms of reduced biquaternion matrices. Furthermore, we propose algorithms and provide a detailed analysis of their computational complexity… ▽ More This study focuses on addressing the challenge of solving the reduced biquaternion equality constrained least squares (RBLSE) problem. We develop algebraic techniques to derive real and complex solutions for the RBLSE problem by utilizing the real and complex forms of reduced biquaternion matrices. Furthermore, we propose algorithms and provide a detailed analysis of their computational complexity for finding special solutions to the RBLSE problem. A perturbation analysis is conducted, establishing an upper bound for the relative forward error of these solutions. This analysis ensures the accuracy and stability of the solutions in the presence of data perturbations, which is crucial for practical applications where errors arising from input inaccuracies can cause deviations between computed and true solutions. Numerical examples are presented to validate the proposed algorithms, demonstrate their effectiveness, and verify the accuracy of the established upper bound for the relative forward errors. These findings lay the groundwork for exploring applications in 3D and 4D algebra such as robotics, signal, and image processing, expanding their impact on practical and emerging domains. △ Less

Submitted 2 May, 2025; v1 submitted 15 December, 2024; originally announced December 2024.

arXiv:2412.09411 [pdf, other]

Resilience for Regular Path Queries: Towards a Complexity Classification

Authors: Antoine Amarilli, Wolfgang Gatterbauer, Neha Makhija, Mikaël Monet

Abstract: The resilience problem for a query and an input set or bag database is to compute the minimum number of facts to remove from the database to make the query false. In this paper, we study how to compute the resilience of Regular Path Queries (RPQs) over graph databases. Our goal is to characterize the regular languages $L$ for which it is tractable to compute the resilience of the existentially-qua… ▽ More The resilience problem for a query and an input set or bag database is to compute the minimum number of facts to remove from the database to make the query false. In this paper, we study how to compute the resilience of Regular Path Queries (RPQs) over graph databases. Our goal is to characterize the regular languages $L$ for which it is tractable to compute the resilience of the existentially-quantified RPQ built from $L$. We show that computing the resilience in this sense is tractable (even in combined complexity) for all RPQs defined from so-called local languages. By contrast, we show hardness in data complexity for RPQs defined from the following language classes (after reducing the languages to eliminate redundant words): all finite languages featuring a word containing a repeated letter, and all languages featuring a specific kind of counterexample to being local (which we call four-legged languages). The latter include in particular all languages that are not star-free. Our results also imply hardness for all non-local languages with a so-called neutral letter. We last highlight some remaining obstacles towards a full dichotomy. △ Less

Submitted 23 March, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

Comments: 48 pages, 17 figures. Minor updates relative to version 1. This version includes the appendices with all proofs, and all reviewer feedback: it is identical to the PODS'25 publication up to minor formatting differences

arXiv:2412.09230 [pdf, other]

Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering

Authors: Sai Bhargav Rongali, Mohamad Hassan N C, Ankit Jha, Neha Bhargava, Saurabh Prasad, Biplab Banerjee

Abstract: This paper tackles the intricate challenge of video question-answering (VideoQA). Despite notable progress, current methods fall short of effectively integrating questions with video frames and semantic object-level abstractions to create question-aware video representations. We introduce Local-Global Question Aware Video Embedding (LGQAVE), which incorporates three major innovations to integrate… ▽ More This paper tackles the intricate challenge of video question-answering (VideoQA). Despite notable progress, current methods fall short of effectively integrating questions with video frames and semantic object-level abstractions to create question-aware video representations. We introduce Local-Global Question Aware Video Embedding (LGQAVE), which incorporates three major innovations to integrate multi-modal knowledge better and emphasize semantic visual concepts relevant to specific questions. LGQAVE moves beyond traditional ad-hoc frame sampling by utilizing a cross-attention mechanism that precisely identifies the most relevant frames concerning the questions. It captures the dynamics of objects within these frames using distinct graphs, grounding them in question semantics with the miniGPT model. These graphs are processed by a question-aware dynamic graph transformer (Q-DGT), which refines the outputs to develop nuanced global and local video representations. An additional cross-attention module integrates these local and global embeddings to generate the final video embeddings, which a language model uses to generate answers. Extensive evaluations across multiple benchmarks demonstrate that LGQAVE significantly outperforms existing models in delivering accurate multi-choice and open-ended answers. △ Less

Submitted 12 December, 2024; originally announced December 2024.

Journal ref: WACV2025

arXiv:2412.05724 [pdf, other]

A Tiered GAN Approach for Monet-Style Image Generation

Authors: FNU Neha, Deepshikha Bhati, Deepak Kumar Shukla, Md Amiruzzaman

Abstract: Generative Adversarial Networks (GANs) have proven to be a powerful tool in generating artistic images, capable of mimicking the styles of renowned painters, such as Claude Monet. This paper introduces a tiered GAN model to progressively refine image quality through a multi-stage process, enhancing the generated images at each step. The model transforms random noise into detailed artistic represen… ▽ More Generative Adversarial Networks (GANs) have proven to be a powerful tool in generating artistic images, capable of mimicking the styles of renowned painters, such as Claude Monet. This paper introduces a tiered GAN model to progressively refine image quality through a multi-stage process, enhancing the generated images at each step. The model transforms random noise into detailed artistic representations, addressing common challenges such as instability in training, mode collapse, and output quality. This approach combines downsampling and convolutional techniques, enabling the generation of high-quality Monet-style artwork while optimizing computational efficiency. Experimental results demonstrate the architecture's ability to produce foundational artistic structures, though further refinements are necessary for achieving higher levels of realism and fidelity to Monet's style. Future work focuses on improving training methodologies and model complexity to bridge the gap between generated and true artistic images. Additionally, the limitations of traditional GANs in artistic generation are analyzed, and strategies to overcome these shortcomings are proposed. △ Less

Submitted 7 December, 2024; originally announced December 2024.

arXiv:2412.05686 [pdf, other]

Neural network interpretability with layer-wise relevance propagation: novel techniques for neuron selection and visualization

Authors: Deepshikha Bhati, Fnu Neha, Md Amiruzzaman, Angela Guercio, Deepak Kumar Shukla, Ben Ward

Abstract: Interpreting complex neural networks is crucial for understanding their decision-making processes, particularly in applications where transparency and accountability are essential. This proposed method addresses this need by focusing on layer-wise Relevance Propagation (LRP), a technique used in explainable artificial intelligence (XAI) to attribute neural network outputs to input features through… ▽ More Interpreting complex neural networks is crucial for understanding their decision-making processes, particularly in applications where transparency and accountability are essential. This proposed method addresses this need by focusing on layer-wise Relevance Propagation (LRP), a technique used in explainable artificial intelligence (XAI) to attribute neural network outputs to input features through backpropagated relevance scores. Existing LRP methods often struggle with precision in evaluating individual neuron contributions. To overcome this limitation, we present a novel approach that improves the parsing of selected neurons during LRP backward propagation, using the Visual Geometry Group 16 (VGG16) architecture as a case study. Our method creates neural network graphs to highlight critical paths and visualizes these paths with heatmaps, optimizing neuron selection through accuracy metrics like Mean Squared Error (MSE) and Symmetric Mean Absolute Percentage Error (SMAPE). Additionally, we utilize a deconvolutional visualization technique to reconstruct feature maps, offering a comprehensive view of the network's inner workings. Extensive experiments demonstrate that our approach enhances interpretability and supports the development of more transparent artificial intelligence (AI) systems for computer vision applications. This advancement has the potential to improve the trustworthiness of AI models in real-world machine vision applications, thereby increasing their reliability and effectiveness. △ Less

Submitted 7 December, 2024; originally announced December 2024.

arXiv:2412.05252 [pdf, other]

From classical techniques to convolution-based models: A review of object detection algorithms

Authors: Fnu Neha, Deepshikha Bhati, Deepak Kumar Shukla, Md Amiruzzaman

Abstract: Object detection is a fundamental task in computer vision and image understanding, with the goal of identifying and localizing objects of interest within an image while assigning them corresponding class labels. Traditional methods, which relied on handcrafted features and shallow models, struggled with complex visual data and showed limited performance. These methods combined low-level features w… ▽ More Object detection is a fundamental task in computer vision and image understanding, with the goal of identifying and localizing objects of interest within an image while assigning them corresponding class labels. Traditional methods, which relied on handcrafted features and shallow models, struggled with complex visual data and showed limited performance. These methods combined low-level features with contextual information and lacked the ability to capture high-level semantics. Deep learning, especially Convolutional Neural Networks (CNNs), addressed these limitations by automatically learning rich, hierarchical features directly from data. These features include both semantic and high-level representations essential for accurate object detection. This paper reviews object detection frameworks, starting with classical computer vision methods. We categorize object detection approaches into two groups: (1) classical computer vision techniques and (2) CNN-based detectors. We compare major CNN models, discussing their strengths and limitations. In conclusion, this review highlights the significant advancements in object detection through deep learning and identifies key areas for further research to improve performance. △ Less

Submitted 6 December, 2024; originally announced December 2024.

arXiv:2412.03933 [pdf, other]

Exploring AI Text Generation, Retrieval-Augmented Generation, and Detection Technologies: a Comprehensive Overview

Authors: Fnu Neha, Deepshikha Bhati, Deepak Kumar Shukla, Angela Guercio, Ben Ward

Abstract: The rapid development of Artificial Intelligence (AI) has led to the creation of powerful text generation models, such as large language models (LLMs), which are widely used for diverse applications. However, concerns surrounding AI-generated content, including issues of originality, bias, misinformation, and accountability, have become increasingly prominent. This paper offers a comprehensive ove… ▽ More The rapid development of Artificial Intelligence (AI) has led to the creation of powerful text generation models, such as large language models (LLMs), which are widely used for diverse applications. However, concerns surrounding AI-generated content, including issues of originality, bias, misinformation, and accountability, have become increasingly prominent. This paper offers a comprehensive overview of AI text generators (AITGs), focusing on their evolution, capabilities, and ethical implications. This paper also introduces Retrieval-Augmented Generation (RAG), a recent approach that improves the contextual relevance and accuracy of text generation by integrating dynamic information retrieval. RAG addresses key limitations of traditional models, including their reliance on static knowledge and potential inaccuracies in handling real-world data. Additionally, the paper reviews detection tools that help differentiate AI-generated text from human-written content and discusses the ethical challenges these technologies pose. The paper explores future directions for improving detection accuracy, supporting ethical AI development, and increasing accessibility. The paper contributes to a more responsible and reliable use of AI in content creation through these discussions. △ Less

Submitted 5 December, 2024; originally announced December 2024.

arXiv:2412.02242 [pdf, other]

U-Net in Medical Image Segmentation: A Review of Its Applications Across Modalities

Authors: Fnu Neha, Deepshikha Bhati, Deepak Kumar Shukla, Sonavi Makarand Dalvi, Nikolaos Mantzou, Safa Shubbar

Abstract: Medical imaging is essential in healthcare to provide key insights into patient anatomy and pathology, aiding in diagnosis and treatment. Non-invasive techniques such as X-ray, Magnetic Resonance Imaging (MRI), Computed Tomography (CT), and Ultrasound (US), capture detailed images of organs, tissues, and abnormalities. Effective analysis of these images requires precise segmentation to delineate r… ▽ More Medical imaging is essential in healthcare to provide key insights into patient anatomy and pathology, aiding in diagnosis and treatment. Non-invasive techniques such as X-ray, Magnetic Resonance Imaging (MRI), Computed Tomography (CT), and Ultrasound (US), capture detailed images of organs, tissues, and abnormalities. Effective analysis of these images requires precise segmentation to delineate regions of interest (ROI), such as organs or lesions. Traditional segmentation methods, relying on manual feature-extraction, are labor-intensive and vary across experts. Recent advancements in Artificial Intelligence (AI) and Deep Learning (DL), particularly convolutional models such as U-Net and its variants (U-Net++ and U-Net 3+), have transformed medical image segmentation (MIS) by automating the process and enhancing accuracy. These models enable efficient, precise pixel-wise classification across various imaging modalities, overcoming the limitations of manual segmentation. This review explores various medical imaging techniques, examines the U-Net architectures and their adaptations, and discusses their application across different modalities. It also identifies common challenges in MIS and proposes potential solutions. △ Less

Submitted 3 December, 2024; originally announced December 2024.

arXiv:2412.02166 [pdf, other]

Analyzing the Impact of AI Tools on Student Study Habits and Academic Performance

Authors: Ben Ward, Deepshikha Bhati, Fnu Neha, Angela Guercio

Abstract: This study explores the effectiveness of AI tools in enhancing student learning, specifically in improving study habits, time management, and feedback mechanisms. The research focuses on how AI tools can support personalized learning, adaptive test adjustments, and provide real-time classroom analysis. Student feedback revealed strong support for these features, and the study found a significant r… ▽ More This study explores the effectiveness of AI tools in enhancing student learning, specifically in improving study habits, time management, and feedback mechanisms. The research focuses on how AI tools can support personalized learning, adaptive test adjustments, and provide real-time classroom analysis. Student feedback revealed strong support for these features, and the study found a significant reduction in study hours alongside an increase in GPA, suggesting positive academic outcomes. Despite these benefits, challenges such as over-reliance on AI and difficulties in integrating AI with traditional teaching methods were also identified, emphasizing the need for AI tools to complement conventional educational strategies rather than replace them. Data were collected through a survey with a Likert scale and follow-up interviews, providing both quantitative and qualitative insights. The analysis involved descriptive statistics to summarize demographic data, AI usage patterns, and perceived effectiveness, as well as inferential statistics (T-tests, ANOVA) to examine the impact of demographic factors on AI adoption. Regression analysis identified predictors of AI adoption, and qualitative responses were thematically analyzed to understand students' perspectives on the future of AI in education. This mixed-methods approach provided a comprehensive view of AI's role in education and highlighted the importance of privacy, transparency, and continuous refinement of AI features to maximize their educational benefits. △ Less

Submitted 2 December, 2024; originally announced December 2024.

arXiv:2411.18783 [pdf, other]

Quasitoric representation of generalized braids

Authors: Neha Nanda, Manpreet Singh

Abstract: In this paper, we define generalized braid theories in alignment with the language of Fenn and Bartholomew for knot theories, and compute a generating set for the pure generalized braid theories. Using this, we prove that every oriented normal generalized knot is the closure of a quasitoric normal generalized braid. Further, we prove that the set of quasitoric normal generalized braids forms a sub… ▽ More In this paper, we define generalized braid theories in alignment with the language of Fenn and Bartholomew for knot theories, and compute a generating set for the pure generalized braid theories. Using this, we prove that every oriented normal generalized knot is the closure of a quasitoric normal generalized braid. Further, we prove that the set of quasitoric normal generalized braids forms a subgroup of normal generalized braid group. △ Less

Submitted 27 November, 2024; originally announced November 2024.

Comments: 16 pages, 20 figures

MSC Class: 57K10; 20F36

arXiv:2411.17603 [pdf, ps, other]

Is Integer Linear Programming All You Need for Deletion Propagation? A Unified and Practical Approach for Generalized Deletion Propagation

Authors: Neha Makhija, Wolfgang Gatterbauer

Abstract: Deletion Propagation (DP) refers to a family of database problems rooted in the classical view-update problem: how to propagate intended deletions in a view (query output) back to the source database while satisfying constraints and minimizing side effects. Although studied for over 40 years, DP variants, their complexities, and practical algorithms have been typically explored in isolation. Thi… ▽ More Deletion Propagation (DP) refers to a family of database problems rooted in the classical view-update problem: how to propagate intended deletions in a view (query output) back to the source database while satisfying constraints and minimizing side effects. Although studied for over 40 years, DP variants, their complexities, and practical algorithms have been typically explored in isolation. This work presents a unified and generalized framework for DP with several key benefits: (1) It unifies and generalizes all previously known DP variants, effectively subsuming them within a broader class of problems, including new, well-motivated variants. (2) It comes with a practical and general-purpose algorithm that is ``coarse-grained instance-optimal'': it runs in PTIME for all known PTIME cases and can automatically exploit structural regularities in the data, i.e. it does not rely on hints about such regularities as part of the input. (3) It is complete: our framework handles all known DP variants in all settings (including those involving self-joins, unions, and bag semantics), and allows us to provide new complexity results. (4) It is easy to implement and, in many cases, outperforms prior variant-specific solutions, sometimes by orders of magnitude. We provide the first experimental results for several DP variants previously studied only in theory. △ Less

Submitted 16 June, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

Comments: 19 pages, 12 figures

arXiv:2411.16887 [pdf, other]

doi 10.5281/zenodo.13958746

Modelling to Generate Continuous Alternatives: Enabling Real-Time Feasible Portfolio Generation in Convex Planning Models

Authors: Michael Lau, Xin Wang, Neha Patankar, Jesse D. Jenkins

Abstract: Decarbonization provides new opportunities to plan energy systems for improved health, resilience, equity, and environmental outcomes, but challenges in siting and social acceptance of transition goals and targets threaten progress. Modelling to Generate Alternatives (MGA) provides an optimization method for capturing many near-cost-optimal system configurations, and can provide insights into the… ▽ More Decarbonization provides new opportunities to plan energy systems for improved health, resilience, equity, and environmental outcomes, but challenges in siting and social acceptance of transition goals and targets threaten progress. Modelling to Generate Alternatives (MGA) provides an optimization method for capturing many near-cost-optimal system configurations, and can provide insights into the tradeoffs between objectives and flexibility available in the system. However, MGA is currently limited in interactive applicability to these problems due to a lack of methods for allowing users to explore near-optimal feasible spaces. In this work we describe Modelling to Generate Continuous Alternatives (MGCA), a novel post-processing algorithm for convex planning problems which enables users to rapidly generate new interior solutions, incorporate new constraints, and solve within the space with convex objectives. MGCA begins with a dimensionality reduction to capacity decisions and metric values. We then take advantage of convex combinations to generate interior points by allowing user weight specification and encoding convex combinations in an optimization problem with user-defined additional constraints and objective. Dimensionality reduction enables this problem to solve in tenths of a second, suitable for analysis in interactive settings. We discuss the interpolation of capacity and operational metric values, finding capacity metrics can be perfectly interpolated while operational metrics remain within the feasible range of the points used to create them. We demonstrate interpolated solutions can be exported and re-solved with an economic dispatch model to provide operational metric values consistent with least-cost decision-making and show interpolated metric values are generally within 10% of the optimal value. △ Less

Submitted 25 November, 2024; originally announced November 2024.

arXiv:2411.15025 [pdf, other]

doi 10.1002/lpor.202401357

Low-Loss and Low-Power Silicon Ring Based WDM 32$\times$100 GHz Filter Enabled by a Novel Bend Design

Authors: Qingzhong Deng, Ahmed H. El-Saeed, Alaa Elshazly, Guy Lepage, Chiara Marchese, Pieter Neutens, Hakim Kobbi, Rafal Magdziak, Jeroen De Coster, Javad Rahimi Vaskasi, Minkyu Kim, Yeyu Tong, Neha Singh, Marko Ersek Filipcic, Pol Van Dorpe, Kristof Croes, Maumita Chakrabarti, Dimitrios Velenis, Peter De Heyn, Peter Verheyen, Philippe Absil, Filippo Ferraro, Yoojin Ban, Joris Van Campenhout

Abstract: Ring resonators are crucial in silicon photonics for various applications, but conventional designs face performance trade-offs. Here a third-order polynomial interconnected circular (TOPIC) bend is proposed to revolutionize the ring designs fundamentally. The TOPIC bend has a unique feature of continuous curvature and curvature derivative, which is theoretically derived to be essential for wavegu… ▽ More Ring resonators are crucial in silicon photonics for various applications, but conventional designs face performance trade-offs. Here a third-order polynomial interconnected circular (TOPIC) bend is proposed to revolutionize the ring designs fundamentally. The TOPIC bend has a unique feature of continuous curvature and curvature derivative, which is theoretically derived to be essential for waveguide loss optimization. With the TOPIC bend, the silicon ring resonators demonstrated here have achieved three records to the best of our knowledge: the smallest radius (0.7 $\mathrm{μm}$) for silicon rings resonating with single guided mode, the lowest thermal tuning power (5.85 mW/$π$) for silicon rings with FSR $\geq$3.2 THz, and the first silicon ring-based WDM 32$\times$100 GHz filter. The filter has doubled the channel amount compared to the state of the art, and meanwhile achieved low insertion loss (1.91 $\pm$ 0.28 dB) and low tuning power (283 GHz/mW). Moreover, the TOPIC bend is not limited to ring applications, it can also be used to create bends with an arbitrary angle, with the advantages of ultra-compact radius and heater integration, which are expected to replace all circular bends in integrated photonics, greatly reducing system size and power consumption. △ Less

Submitted 22 November, 2024; originally announced November 2024.

arXiv:2411.14437 [pdf]

Transforming Business with Generative AI: Research, Innovation, Market Deployment and Future Shifts in Business Models

Authors: Narotam Singh, Vaibhav Chaudhary, Nimisha Singh, Neha Soni, Amita Kapoor

Abstract: This paper explores the transformative impact of Generative AI (GenAI) on the business landscape, examining its role in reshaping traditional business models, intensifying market competition, and fostering innovation. By applying the principles of Neo-Schumpeterian economics, the research analyses how GenAI is driving a new wave of "creative destruction," leading to the emergence of novel business… ▽ More This paper explores the transformative impact of Generative AI (GenAI) on the business landscape, examining its role in reshaping traditional business models, intensifying market competition, and fostering innovation. By applying the principles of Neo-Schumpeterian economics, the research analyses how GenAI is driving a new wave of "creative destruction," leading to the emergence of novel business paradigms and value propositions. The findings reveal that GenAI enhances operational efficiency, facilitates product and service innovation, and creates new revenue streams, positioning it as a powerful catalyst for substantial shifts in business structures and strategies. However, the deployment of GenAI also presents significant challenges, including ethical concerns, regulatory demands, and the risk of job displacement. By addressing the multifarious nature of GenAI, this paper provides valuable insights for business leaders, policymakers, and researchers, guiding them towards a balanced and responsible integration of this transformative technology. Ultimately, GenAI is not merely a technological advancement but a driver of profound change, heralding a future where creativity, efficiency, and growth are redefined. △ Less

Submitted 4 November, 2024; originally announced November 2024.

Comments: 30 pages, 12 figures, original submission

MSC Class: 68T05 (Primary); 68T50; 91B84; 91B69; 62H30 (Secondary) ACM Class: I.2.7; I.2.6; K.4.1; K.6.1; J.4; H.1.2

arXiv:2411.10548 [pdf, ps, other]

BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery

Authors: Peter St. John, Dejun Lin, Polina Binder, Malcolm Greaves, Vega Shah, John St. John, Adrian Lange, Patrick Hsu, Rajesh Illango, Arvind Ramanathan, Anima Anandkumar, David H Brookes, Akosua Busia, Abhishaike Mahajan, Stephen Malina, Neha Prasad, Sam Sinai, Lindsay Edwards, Thomas Gaudelet, Cristian Regep, Martin Steinegger, Burkhard Rost, Alexander Brace, Kyle Hippe, Luca Naef , et al. (68 additional authors not shown)

Abstract: Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational bio… ▽ More Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational biology and chemistry AI models across hundreds of GPUs. Its modular design allows the integration of individual components, such as data loaders, into existing workflows and is open to community contributions. We detail technical features of the BioNeMo Framework through use cases such as pLM pre-training and fine-tuning. On 256 NVIDIA A100s, BioNeMo Framework trains a three billion parameter BERT-based pLM on over one trillion tokens in 4.2 days. The BioNeMo Framework is open-source and free for everyone to use. △ Less

Submitted 12 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

arXiv:2411.09795 [pdf]

Comparative Study of InGaAs and GaAsSb Nanowires for Room Temperature Operation of Avalanche Photodiodes at 1.55 μm

Authors: Shrivatch Sankar, Punam Murkute, Micah Meleski, Nathan Gajowski, Neha Nooman, Md. Saiful Islam Sumon, Shamsul Arafin, Ronald M. Reano, Sanjay Krishna

Abstract: III V semiconductor nanowire based photodetectors have significant potential for remote sensing and LiDAR applications, particularly due to their ability to operate at 1.55 μm. Achieving room temperature operation and near unity absorption using these nanowires at 1.55 μm is crucial for single photon detection, which offers a promising solution to the challenges posed by the existing superconducti… ▽ More III V semiconductor nanowire based photodetectors have significant potential for remote sensing and LiDAR applications, particularly due to their ability to operate at 1.55 μm. Achieving room temperature operation and near unity absorption using these nanowires at 1.55 μm is crucial for single photon detection, which offers a promising solution to the challenges posed by the existing superconducting nanowire single photon detectors. Key materials suited for this wavelength include lattice matched In0.53Ga0.47As and Ga0.5As0.5Sb to InP. This study reports a comparison between InGaAs and GaAsSb nanowires to achieve high absorption efficiency at room temperature. Through optimized nanowire arrangement and geometry, we aim to maximize absorption. Our approach features a comparative analysis of patterned InGaAs and GaAsSb nanowires with absorption characteristics modeled using finite difference time domain simulations to enhance absorption at the target wavelength. We also present the complete workflow for nanowire fabrication, modeling, and simulation, encompassing the production of tapered nanowire structures and measurement of their absorption efficiency. Our experimental results show that tapered InGaAs and GaAsSb nanowires exhibit an absorption efficiency of 93% and 92%, respectively, at room temperature around 1.55 μm. △ Less

Submitted 23 November, 2024; v1 submitted 14 November, 2024; originally announced November 2024.

arXiv:2411.06704 [pdf, other]

Accelerating Low-field MRI: Compressed Sensing and AI for fast noise-robust imaging

Authors: Efrat Shimron, Shanshan Shan, James Grover, Neha Koonjoo, Sheng Shen, Thomas Boele, Annabel J. Sorby-Adams, John E. Kirsch, Matthew S. Rosen, David E. J. Waddington

Abstract: Portable, low-field Magnetic Resonance Imaging (MRI) scanners are increasingly being deployed in clinical settings. However, critical barriers to their widespread use include low signal-to-noise ratio (SNR), generally low image quality, and long scan duration. As these systems can operate in unusual environments, the level and spectral characteristics of the environmental electromagnetic inference… ▽ More Portable, low-field Magnetic Resonance Imaging (MRI) scanners are increasingly being deployed in clinical settings. However, critical barriers to their widespread use include low signal-to-noise ratio (SNR), generally low image quality, and long scan duration. As these systems can operate in unusual environments, the level and spectral characteristics of the environmental electromagnetic inference (EMI) noise can change substantially across sites and scans, further reducing image quality. Methods for accelerating acquisition and boosting image quality are of critical importance to enable clinically actionable high-quality imaging in these systems. Despite the role that compressed sensing (CS) and artificial intelligence (AI)-based methods have had in improving image quality for high-field MRI, their adoption for low-field imaging is in its infancy, and it is unclear how robust these methods are in low SNR regimes. Here, we investigate and compare leading CS and AI-based methods for image reconstruction from subsampled data and perform a thorough analysis of their performance across a range of SNR values. We compare classical L1-wavelet CS with leading data-driven and model-driven AI methods. Experiments are performed using publicly available datasets and our own low-field and high-field experimental data. Specifically, we apply an unrolled AI network to low-field MRI, and find it outperforms competing reconstruction methods. We prospectively deploy our undersampling methods to accelerate imaging on a 6.5 mT MRI scanner. This work highlights the potential and pitfalls of advanced reconstruction techniques in low-field MRI, paving the way for broader clinical applications. △ Less

Submitted 10 November, 2024; originally announced November 2024.

arXiv:2410.21307 [pdf]

Geometric Correction and Mosaic Generation of Geo High Resolution Camera Images

Authors: Ankur Garg, Nitesh Thapa, Ghansham Sangar, Neha Gaur, Meenakshi Sarkar, S. Manthira Moorthi, Debajyoti Dhar

Abstract: The Geo High Resolution Camera (GHRC) aboard ISRO GSAT-29 satellite is a state-of-the-art 6-band Visible and Near Infrared (VNIR) imager in geostationary orbit at 55degE longitude. It provides a ground sampling distance of 55 meters at nadir, covering 110x110 km at a time, and can image the entire Earth disk using a scan mirror mechanism. To cover India, GHRC uses a two-dimensional raster scanning… ▽ More The Geo High Resolution Camera (GHRC) aboard ISRO GSAT-29 satellite is a state-of-the-art 6-band Visible and Near Infrared (VNIR) imager in geostationary orbit at 55degE longitude. It provides a ground sampling distance of 55 meters at nadir, covering 110x110 km at a time, and can image the entire Earth disk using a scan mirror mechanism. To cover India, GHRC uses a two-dimensional raster scanning technique, resulting in over 1,000 scenes that must be stitched into a seamless mosaic. This paper presents the geolocation model and examines potential sources of targeting error, with an assessment of location accuracy. Challenges in inter-band registration and inter-frame mosaicing are addressed through algorithms for geometric correction, band-to-band registration, and seamless mosaic generation. In-flight geometric calibration, including adjustments to the instrument interior alignment angles using ground reference images, has improved pointing and location accuracy. A backtracking algorithm has been developed to correct frame-to-frame mosaicing errors for large-scale mosaics, leveraging geometric models, image processing, and space resection techniques. These advancements now enable the operational generation of full India mosaics with 100-meter resolution and high geometric fidelity, enhancing the GHRC capabilities for Earth observation and monitoring applications. △ Less

Submitted 24 October, 2024; originally announced October 2024.

Comments: Preprint

arXiv:2410.20231 [pdf, other]

CAVE-Net: Classifying Abnormalities in Video Capsule Endoscopy

Authors: Ishita Harish, Saurav Mishra, Neha Bhadoria, Rithik Kumar, Madhav Arora, Syed Rameem Zahra, Ankur Gupta

Abstract: Accurate classification of medical images is critical for detecting abnormalities in the gastrointestinal tract, a domain where misclassification can significantly impact patient outcomes. We propose an ensemble-based approach to improve diagnostic accuracy in analyzing complex image datasets. Using a Convolutional Block Attention Module along with a Deep Neural Network, we leverage the unique fea… ▽ More Accurate classification of medical images is critical for detecting abnormalities in the gastrointestinal tract, a domain where misclassification can significantly impact patient outcomes. We propose an ensemble-based approach to improve diagnostic accuracy in analyzing complex image datasets. Using a Convolutional Block Attention Module along with a Deep Neural Network, we leverage the unique feature extraction capabilities of each model to enhance the overall accuracy. The classification models, such as Random Forest, XGBoost, Support Vector Machine and K-Nearest Neighbors are introduced to further diversify the predictive power of proposed ensemble. By using these methods, the proposed framework, CAVE-Net, provides robust feature discrimination and improved classification results. Experimental evaluations demonstrate that the CAVE-Net achieves high accuracy and robustness across challenging and imbalanced classes, showing significant promise for broader applications in computer vision tasks. △ Less

Submitted 30 December, 2024; v1 submitted 26 October, 2024; originally announced October 2024.

arXiv:2410.19656 [pdf, other]

APRICOT: Active Preference Learning and Constraint-Aware Task Planning with LLMs

Authors: Huaxiaoyue Wang, Nathaniel Chin, Gonzalo Gonzalez-Pumariega, Xiangwan Sun, Neha Sunkara, Maximus Adrian Pace, Jeannette Bohg, Sanjiban Choudhury

Abstract: Home robots performing personalized tasks must adeptly balance user preferences with environmental affordances. We focus on organization tasks within constrained spaces, such as arranging items into a refrigerator, where preferences for placement collide with physical limitations. The robot must infer user preferences based on a small set of demonstrations, which is easier for users to provide tha… ▽ More Home robots performing personalized tasks must adeptly balance user preferences with environmental affordances. We focus on organization tasks within constrained spaces, such as arranging items into a refrigerator, where preferences for placement collide with physical limitations. The robot must infer user preferences based on a small set of demonstrations, which is easier for users to provide than extensively defining all their requirements. While recent works use Large Language Models (LLMs) to learn preferences from user demonstrations, they encounter two fundamental challenges. First, there is inherent ambiguity in interpreting user actions, as multiple preferences can often explain a single observed behavior. Second, not all user preferences are practically feasible due to geometric constraints in the environment. To address these challenges, we introduce APRICOT, a novel approach that merges LLM-based Bayesian active preference learning with constraint-aware task planning. APRICOT refines its generated preferences by actively querying the user and dynamically adapts its plan to respect environmental constraints. We evaluate APRICOT on a dataset of diverse organization tasks and demonstrate its effectiveness in real-world scenarios, showing significant improvements in both preference satisfaction and plan feasibility. The project website is at https://portal-cornell.github.io/apricot/ △ Less

Submitted 25 October, 2024; originally announced October 2024.

Comments: Conference on Robot Learning (CoRL) 2024

arXiv:2410.17869 [pdf]

doi 10.1021/acsaem.5c00495

Direct observation of thermal hysteresis in the molecular dynamics of barocaloric neopentyl glycol

Authors: Frederic Rendell-Bhatti, Markus Appel, Connor S. Inglis, Melony Dilshad, Neha Mehta, Jonathan Radcliffe, Xavier Moya, Donald A. MacLaren, David Boldrin

Abstract: Barocalorics (BCs) are emerging as promising alternatives to vapour-phase refrigerants, which are problematic as they exacerbate climate change when they inevitably leak into the atmosphere. However, the commercialisation of BC refrigerants is significantly hindered by hysteresis in the solid-solid phase transition that would be exploited in a refrigeration cycle. Here, we provide new insight into… ▽ More Barocalorics (BCs) are emerging as promising alternatives to vapour-phase refrigerants, which are problematic as they exacerbate climate change when they inevitably leak into the atmosphere. However, the commercialisation of BC refrigerants is significantly hindered by hysteresis in the solid-solid phase transition that would be exploited in a refrigeration cycle. Here, we provide new insight into the hysteresis that is a critical step towards the rational design of viable BCs. By studying the benchmark BC plastic crystal, neopentyl glycol (NPG), we observe directly the liberation of the hydroxyl rotational modes that unlock the hydrogen bond network, distinguishing for the first time the molecular reorientation and hydroxymethyl rotational modes. We showcase the use high-resolution inelastic fixed-window scans in combination with quasielastic neutron scattering (QENS) measurements to build a comprehensive microscopic understanding of the NPG phase transition, directly tracking the molecular dynamics of the phase transition. Hysteresis previously observed in calorimetric studies of NPG is now observed directly as hysteresis in molecular rotational modes, and hence in the formation and disruption of hydrogen bonding. Furthermore, by tracking the thermal activation of three main reorientation modes, we suggest that their fractional excitations may resolve an outstanding discrepancy between measured and calculated entropy change. These results allow for direct study of the molecular dynamics that govern the thermal hysteresis of small molecule energy materials. They will be broadly applicable, as many promising BC material families possess first-order transitions involving molecular reorientations. △ Less

Submitted 28 March, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

arXiv:2410.15472 [pdf]

doi 10.11159/icbes24.158

Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation

Authors: Fnu Neha, Arvind K. Bansal

Abstract: Renal tumors, especially renal cell carcinoma (RCC), show significant heterogeneity, posing challenges for diagnosis using radiology images such as MRI, echocardiograms, and CT scans. U-Net based deep learning techniques are emerging as a promising approach for automated medical image segmentation for minimally invasive diagnosis of renal tumors. However, current techniques need further improvemen… ▽ More Renal tumors, especially renal cell carcinoma (RCC), show significant heterogeneity, posing challenges for diagnosis using radiology images such as MRI, echocardiograms, and CT scans. U-Net based deep learning techniques are emerging as a promising approach for automated medical image segmentation for minimally invasive diagnosis of renal tumors. However, current techniques need further improvements in accuracy to become clinically useful to radiologists. In this study, we present an improved U-Net based model for end-to-end automated semantic segmentation of CT scan images to identify renal tumors. The model uses residual connections across convolution layers, integrates a multi-layer feature fusion (MFF) and cross-channel attention (CCA) within encoder blocks, and incorporates skip connections augmented with additional information derived using MFF and CCA. We evaluated our model on the KiTS19 dataset, which contains data from 210 patients. For kidney segmentation, our model achieves a Dice Similarity Coefficient (DSC) of 0.97 and a Jaccard index (JI) of 0.95. For renal tumor segmentation, our model achieves a DSC of 0.96 and a JI of 0.91. Based on a comparison of available DSC scores, our model outperforms the current leading models. △ Less

Submitted 21 October, 2024; v1 submitted 20 October, 2024; originally announced October 2024.

Comments: 8 pages

Journal ref: Proceedings of the 10th World Congress on Electrical Engineering and Computer Systems and Science, Avestia Publishing, ISSN = 2369-811X, 2024

arXiv:2410.15229 [pdf]

doi 10.1080/19490976.2025.2505115

Deep Learning-based Detection of Bacterial Swarm Motion Using a Single Image

Authors: Yuzhu Li, Hao Li, Weijie Chen, Keelan O'Riordan, Neha Mani, Yuxuan Qi, Tairan Liu, Sridhar Mani, Aydogan Ozcan

Abstract: Distinguishing between swarming and swimming, the two principal forms of bacterial movement, holds significant conceptual and clinical relevance. This is because bacteria that exhibit swarming capabilities often possess unique properties crucial to the pathogenesis of infectious diseases and may also have therapeutic potential. Here, we report a deep learning-based swarming classifier that rapidly… ▽ More Distinguishing between swarming and swimming, the two principal forms of bacterial movement, holds significant conceptual and clinical relevance. This is because bacteria that exhibit swarming capabilities often possess unique properties crucial to the pathogenesis of infectious diseases and may also have therapeutic potential. Here, we report a deep learning-based swarming classifier that rapidly and autonomously predicts swarming probability using a single blurry image. Compared with traditional video-based, manually-processed approaches, our method is particularly suited for high-throughput environments and provides objective, quantitative assessments of swarming probability. The swarming classifier demonstrated in our work was trained on Enterobacter sp. SM3 and showed good performance when blindly tested on new swarming (positive) and swimming (negative) test images of SM3, achieving a sensitivity of 97.44% and a specificity of 100%. Furthermore, this classifier demonstrated robust external generalization capabilities when applied to unseen bacterial species, such as Serratia marcescens DB10 and Citrobacter koseri H6. It blindly achieved a sensitivity of 97.92% and a specificity of 96.77% for DB10, and a sensitivity of 100% and a specificity of 97.22% for H6. This competitive performance indicates the potential to adapt our approach for diagnostic applications through portable devices or even smartphones. This adaptation would facilitate rapid, objective, on-site screening for bacterial swarming motility, potentially enhancing the early detection and treatment assessment of various diseases, including inflammatory bowel diseases (IBD) and urinary tract infections (UTI). △ Less

Submitted 19 October, 2024; originally announced October 2024.

Comments: 17 Pages, 4 Figures

Journal ref: Gut Microbes (2025)

arXiv:2410.12311 [pdf, other]

Open Domain Question Answering with Conflicting Contexts

Authors: Siyi Liu, Qiang Ning, Kishaloy Halder, Wei Xiao, Zheng Qi, Phu Mon Htut, Yi Zhang, Neha Anna John, Bonan Min, Yassine Benajiba, Dan Roth

Abstract: Open domain question answering systems frequently rely on information retrieved from large collections of text (such as the Web) to answer questions. However, such collections of text often contain conflicting information, and indiscriminately depending on this information may result in untruthful and inaccurate answers. To understand the gravity of this problem, we collect a human-annotated datas… ▽ More Open domain question answering systems frequently rely on information retrieved from large collections of text (such as the Web) to answer questions. However, such collections of text often contain conflicting information, and indiscriminately depending on this information may result in untruthful and inaccurate answers. To understand the gravity of this problem, we collect a human-annotated dataset, Question Answering with Conflicting Contexts (QACC), and find that as much as 25% of unambiguous, open domain questions can lead to conflicting contexts when retrieved using Google Search. We evaluate and benchmark three powerful Large Language Models (LLMs) with our dataset QACC and demonstrate their limitations in effectively addressing questions with conflicting information. To explore how humans reason through conflicting contexts, we request our annotators to provide explanations for their selections of correct answers. We demonstrate that by finetuning LLMs to explain their answers, we can introduce richer information into their training that guide them through the process of reasoning with conflicting contexts. △ Less

Submitted 27 April, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

arXiv:2410.11967 [pdf]

Integrating Artificial Intelligence Models and Synthetic Image Data for Enhanced Asset Inspection and Defect Identification

Authors: Reddy Mandati, Vladyslav Anderson, Po-chen Chen, Ankush Agarwal, Tatjana Dokic, David Barnard, Michael Finn, Jesse Cromer, Andrew Mccauley, Clay Tutaj, Neha Dave, Bobby Besharati, Jamie Barnett, Timothy Krall

Abstract: In the past utilities relied on in-field inspections to identify asset defects. Recently, utilities have started using drone-based inspections to enhance the field-inspection process. We consider a vast repository of drone images, providing a wealth of information about asset health and potential issues. However, making the collected imagery data useful for automated defect detection requires sign… ▽ More In the past utilities relied on in-field inspections to identify asset defects. Recently, utilities have started using drone-based inspections to enhance the field-inspection process. We consider a vast repository of drone images, providing a wealth of information about asset health and potential issues. However, making the collected imagery data useful for automated defect detection requires significant manual labeling effort. We propose a novel solution that combines synthetic asset defect images with manually labeled drone images. This solution has several benefits: improves performance of defect detection, reduces the number of hours spent on manual labeling, and enables the capability to generate realistic images of rare defects where not enough real-world data is available. We employ a workflow that combines 3D modeling tools such as Maya and Unreal Engine to create photorealistic 3D models and 2D renderings of defective assets and their surroundings. These synthetic images are then integrated into our training pipeline augmenting the real data. This study implements an end-to-end Artificial Intelligence solution to detect assets and asset defects from the combined imagery repository. The unique contribution of this research lies in the application of advanced computer vision models and the generation of photorealistic 3D renderings of defective assets, aiming to transform the asset inspection process. Our asset detection model has achieved an accuracy of 92 percent, we achieved a performance lift of 67 percent when introducing approximately 2,000 synthetic images of 2k resolution. In our tests, the defect detection model achieved an accuracy of 73 percent across two batches of images. Our analysis demonstrated that synthetic data can be successfully used in place of real-world manually labeled data to train defect detection model. △ Less

Submitted 15 October, 2024; originally announced October 2024.

arXiv:2410.09047 [pdf, other]

Unraveling and Mitigating Safety Alignment Degradation of Vision-Language Models

Authors: Qin Liu, Chao Shang, Ling Liu, Nikolaos Pappas, Jie Ma, Neha Anna John, Srikanth Doss, Lluis Marquez, Miguel Ballesteros, Yassine Benajiba

Abstract: The safety alignment ability of Vision-Language Models (VLMs) is prone to be degraded by the integration of the vision module compared to its LLM backbone. We investigate this phenomenon, dubbed as ''safety alignment degradation'' in this paper, and show that the challenge arises from the representation gap that emerges when introducing vision modality to VLMs. In particular, we show that the repr… ▽ More The safety alignment ability of Vision-Language Models (VLMs) is prone to be degraded by the integration of the vision module compared to its LLM backbone. We investigate this phenomenon, dubbed as ''safety alignment degradation'' in this paper, and show that the challenge arises from the representation gap that emerges when introducing vision modality to VLMs. In particular, we show that the representations of multi-modal inputs shift away from that of text-only inputs which represent the distribution that the LLM backbone is optimized for. At the same time, the safety alignment capabilities, initially developed within the textual embedding space, do not successfully transfer to this new multi-modal representation space. To reduce safety alignment degradation, we introduce Cross-Modality Representation Manipulation (CMRM), an inference time representation intervention method for recovering the safety alignment ability that is inherent in the LLM backbone of VLMs, while simultaneously preserving the functional capabilities of VLMs. The empirical results show that our framework significantly recovers the alignment ability that is inherited from the LLM backbone with minimal impact on the fluency and linguistic capabilities of pre-trained VLMs even without additional training. Specifically, the unsafe rate of LLaVA-7B on multi-modal input can be reduced from 61.53% to as low as 3.15% with only inference-time intervention. WARNING: This paper contains examples of toxic or harmful language. △ Less

Submitted 11 October, 2024; originally announced October 2024.

Comments: Preprint

arXiv:2410.00260 [pdf, other]

DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining

Authors: Vinayak Arannil, Neha Narwal, Sourav Sanjukta Bhabesh, Sai Nikhil Thirandas, Darren Yow-Bang Wang, Graham Horwood, Alex Anto Chirayath, Gouri Pandeshwar

Abstract: Large Language Models (LLMs) have shown remarkable ability to generalize effectively across numerous industry domains while executing a range of tasks. Many of these competencies are obtained from the data utilized during the pre-training phase of the Language Models (LMs). However, these models exhibit limitations when tasked with performing in specialized or low-resource industry domains. More r… ▽ More Large Language Models (LLMs) have shown remarkable ability to generalize effectively across numerous industry domains while executing a range of tasks. Many of these competencies are obtained from the data utilized during the pre-training phase of the Language Models (LMs). However, these models exhibit limitations when tasked with performing in specialized or low-resource industry domains. More recent approaches use LLMs for generating domain-specific synthetic data but most often they lack in truthfulness and complexity. Alternatively, in cases where domain data is available like healthcare and finance most of the LMs are proprietary necessitating the need for a scalable method to curate real world industry specific pre-training data. In this work, we propose an automated and scalable framework - DoPAMine:Domain-specific Pre-training Adaptation from seed-guided data Mining, to mine domain specific training data from a large data corpus for domain adaptation of a LM. The framework leverages the parametric knowledge of a LLM to generate diverse and representative seed data tailored to a specific domain which is then used to mine real world data from a large data corpus like Common Crawl. We evaluated our framework's performance in the continual pre-training (CPT) setting by training two domain specific 7B parameter LMs in healthcare and finance with data mined via DoPAMine. Our experiments show that DoPAMine boosts the performance of pre-trained LLMs on average by 4.9% and 5.1% in zero-shot and 5-shot settings respectively on healthcare tasks from MMLU, MedQA, MedMCQA and PubMedQA datasets, and 2.9% and 6.7% for zero-shot and 5-shot settings respectively on finance tasks from FiQA-SA, FPB and Headlines datasets when compared to the baseline. △ Less

Submitted 9 October, 2024; v1 submitted 30 September, 2024; originally announced October 2024.

arXiv:2409.20406 [pdf]

Lateral diffusion in 2-micron InGaAs/GaAsSb superlattice planar diodes using atomic layer deposition of ZnO

Authors: Manisha Muduli, Nathan Gajaowski, Hyemin Jung, Neha Nooman, Bhupesh Bhardwaj, Mariah Schwartz, Seunghyun Lee, Sanjay Krishna

Abstract: Avalanche photodiodes used for greenhouse gas sensing often use a mesa-structure that suffers from high surface leakage currents and edge breakdown. In this paper, we report 2-micron InGaAs/GaAsSb superlattice (SL) based planar PIN diodes to eliminate the challenges posed by conventional mesa diodes. An alternate way to fabricate planar diodes using atomic layer deposited ZnO was explored and the… ▽ More Avalanche photodiodes used for greenhouse gas sensing often use a mesa-structure that suffers from high surface leakage currents and edge breakdown. In this paper, we report 2-micron InGaAs/GaAsSb superlattice (SL) based planar PIN diodes to eliminate the challenges posed by conventional mesa diodes. An alternate way to fabricate planar diodes using atomic layer deposited ZnO was explored and the effect of the diffusion process on the superlattice was studied using X-ray diffraction. The optimum diffusion conditions were then used to make planar PIN diodes. The diffused Zn concentration was measured to be approximately 1E20 cm-3 with a diffusion depth of 50 nm and a lateral diffusion ranging between 18 microns to 30 microns. A background doping of 5.8 x 1E14 cm-3 for the UID layer was determined by analyzing the capacitance-voltage measurements of the superlattice PIN diodes. The room temperature dark current for a device with a designed diameter of 30 microns is 1E-6 A at -2V. The quantum efficiency of the diode with a designed diameter of 200 microns was obtained to be 11.11% at 2-micron illumination. Further optimization of this diffusion process may lead to a rapid, manufacturable, and cost-effective method of developing planar diodes. △ Less

Submitted 30 September, 2024; originally announced September 2024.

arXiv:2409.19939 [pdf, other]

doi 10.1038/s41597-025-04825-z

A database of upper limb surface electromyogram signals from demographically diverse individuals

Authors: Harshavardhana T. Gowda, Neha Kaul, Carlos Carrasco, Marcus A. Battraw, Safa Amer, Saniya Kotwal, Selena Lam, Zachary McNaughton, Ferdous Rahimi, Sana Shehabi, Jonathon S. Schofield, Lee M. Miller

Abstract: Upper limb based neuromuscular interfaces aim to provide a seamless way for humans to interact with technology. Among noninvasive interfaces, surface electromyogram (EMG) signals hold significant promise. However, their sensitivity to physiological and anatomical factors remains poorly understood, raising questions about how these factors influence gesture decoding across individuals or groups. To… ▽ More Upper limb based neuromuscular interfaces aim to provide a seamless way for humans to interact with technology. Among noninvasive interfaces, surface electromyogram (EMG) signals hold significant promise. However, their sensitivity to physiological and anatomical factors remains poorly understood, raising questions about how these factors influence gesture decoding across individuals or groups. To facilitate the study of signal distribution shifts across individuals or groups of individuals, we present a dataset of upper limb EMG signals and physiological measures from 91 demographically diverse adults. Participants were selected to represent a range of ages (18 to 92 years) and body mass indices (healthy, overweight, and obese). The dataset also includes measures such as skin hydration and elasticity, which may affect EMG signals. This dataset provides a basis to study demographic confounds in EMG signals and serves as a benchmark to test the development of fair and unbiased algorithms that enable accurate hand gesture decoding across demographically diverse subjects. Additionally, we validate the quality of the collected data using state-of-the-art gesture decoding techniques. △ Less

Submitted 2 April, 2025; v1 submitted 30 September, 2024; originally announced September 2024.

Journal ref: Sci Data 12, 517 (2025)

arXiv:2409.16560 [pdf, other]

Dynamic-Width Speculative Beam Decoding for Efficient LLM Inference

Authors: Zongyue Qin, Zifan He, Neha Prakriya, Jason Cong, Yizhou Sun

Abstract: Large language models (LLMs) have shown outstanding performance across numerous real-world tasks. However, the autoregressive nature of these models makes the inference process slow and costly. Speculative decoding has emerged as a promising solution, leveraging a smaller auxiliary model to draft future tokens, which are then validated simultaneously by the larger model, achieving a speed-up of 1-… ▽ More Large language models (LLMs) have shown outstanding performance across numerous real-world tasks. However, the autoregressive nature of these models makes the inference process slow and costly. Speculative decoding has emerged as a promising solution, leveraging a smaller auxiliary model to draft future tokens, which are then validated simultaneously by the larger model, achieving a speed-up of 1-2x. Although speculative decoding matches the same distribution as multinomial sampling, multinomial sampling itself is prone to suboptimal outputs, whereas beam sampling is widely recognized for producing higher-quality results by maintaining multiple candidate sequences at each step. This paper explores the novel integration of speculative decoding with beam sampling. However, there are four key challenges: (1) how to generate multiple sequences from the larger model's distribution given drafts sequences from the small model; (2) how to dynamically optimize the number of beams to balance efficiency and accuracy; (3) how to efficiently verify the multiple drafts in parallel; and (4) how to address the extra memory costs inherent in beam sampling. To address these challenges, we propose dynamic-width speculative beam decoding (DSBD). Specifically, we first introduce a novel draft and verification scheme that generates multiple sequences following the large model's distribution based on beam sampling trajectories from the small model. Then, we introduce an adaptive mechanism to dynamically tune the number of beams based on the context, optimizing efficiency and effectiveness. Besides, we extend tree-based parallel verification to handle multiple trees simultaneously, accelerating the verification process. Finally, we illustrate a simple modification to our algorithm to mitigate the memory overhead of beam sampling... △ Less

Submitted 14 March, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

arXiv:2409.14788 [pdf, ps, other]

The Frobenius number for the triple of the 2-step star numbers

Authors: Takao Komatsu, Ritika Goel, Neha Gupta

Abstract: In this paper, we give closed form expressions of the Frobenius number for the triple of the $2$-step star numbers $an(n-2) + 1$ for an integer $a \geq 4$. These numbers have been studied from different aspects for some $a$'s. These numbers can also be considered as variations of the well known star numbers of the form $6n(n-1) + 1$. We also give closed form expressions of the Sylvester number (ge… ▽ More In this paper, we give closed form expressions of the Frobenius number for the triple of the $2$-step star numbers $an(n-2) + 1$ for an integer $a \geq 4$. These numbers have been studied from different aspects for some $a$'s. These numbers can also be considered as variations of the well known star numbers of the form $6n(n-1) + 1$. We also give closed form expressions of the Sylvester number (genus) for the triple of the $2$-step star numbers. △ Less

Submitted 23 September, 2024; originally announced September 2024.

arXiv:2409.13025 [pdf, other]

doi 10.1038/s41586-025-08642-7

Hardware-efficient quantum error correction via concatenated bosonic qubits

Authors: Harald Putterman, Kyungjoo Noh, Connor T. Hann, Gregory S. MacCabe, Shahriar Aghaeimeibodi, Rishi N. Patel, Menyoung Lee, William M. Jones, Hesam Moradinejad, Roberto Rodriguez, Neha Mahuli, Jefferson Rose, John Clai Owens, Harry Levine, Emma Rosenfeld, Philip Reinhold, Lorenzo Moncelsi, Joshua Ari Alcid, Nasser Alidoust, Patricio Arrangoiz-Arriola, James Barnett, Przemyslaw Bienias, Hugh A. Carson, Cliff Chen, Li Chen , et al. (96 additional authors not shown)

Abstract: In order to solve problems of practical importance, quantum computers will likely need to incorporate quantum error correction, where a logical qubit is redundantly encoded in many noisy physical qubits. The large physical-qubit overhead typically associated with error correction motivates the search for more hardware-efficient approaches. Here, using a microfabricated superconducting quantum circ… ▽ More In order to solve problems of practical importance, quantum computers will likely need to incorporate quantum error correction, where a logical qubit is redundantly encoded in many noisy physical qubits. The large physical-qubit overhead typically associated with error correction motivates the search for more hardware-efficient approaches. Here, using a microfabricated superconducting quantum circuit, we realize a logical qubit memory formed from the concatenation of encoded bosonic cat qubits with an outer repetition code of distance $d=5$. The bosonic cat qubits are passively protected against bit flips using a stabilizing circuit. Cat-qubit phase-flip errors are corrected by the repetition code which uses ancilla transmons for syndrome measurement. We realize a noise-biased CX gate which ensures bit-flip error suppression is maintained during error correction. We study the performance and scaling of the logical qubit memory, finding that the phase-flip correcting repetition code operates below threshold, with logical phase-flip error decreasing with code distance from $d=3$ to $d=5$. Concurrently, the logical bit-flip error is suppressed with increasing cat-qubit mean photon number. The minimum measured logical error per cycle is on average $1.75(2)\%$ for the distance-3 code sections, and $1.65(3)\%$ for the longer distance-5 code, demonstrating the effectiveness of bit-flip error suppression throughout the error correction cycle. These results, where the intrinsic error suppression of the bosonic encodings allows us to use a hardware-efficient outer error correcting code, indicate that concatenated bosonic codes are a compelling paradigm for reaching fault-tolerant quantum computation. △ Less

Submitted 23 March, 2025; v1 submitted 19 September, 2024; originally announced September 2024.

Journal ref: Nature 638, 927-934 (2025)

arXiv:2409.12971 [pdf, other]

Reducing transmission expansion by co-optimizing sizing of wind, solar, storage and grid connection capacity

Authors: Aneesha Manocha, Gabriel Mantegna, Neha Patankar, Jesse D. Jenkins

Abstract: Expanding transmission capacity is likely a bottleneck that will restrict variable renewable energy (VRE) deployment required to achieve ambitious emission reduction goals. Interconnection and inter-zonal transmission buildout may be displaced by the optimal sizing of VRE to grid connection capacity and by the co-location of VRE and battery resources behind interconnection. However, neither of the… ▽ More Expanding transmission capacity is likely a bottleneck that will restrict variable renewable energy (VRE) deployment required to achieve ambitious emission reduction goals. Interconnection and inter-zonal transmission buildout may be displaced by the optimal sizing of VRE to grid connection capacity and by the co-location of VRE and battery resources behind interconnection. However, neither of these capabilities is commonly captured in macro-energy system models. We develop two new functionalities to explore the substitutability of storage for transmission and the optimal capacity and siting decisions of renewable energy and battery resources through 2030 in the Western Interconnection of the United States. Our findings indicate that modeling optimized interconnection and storage co-location better captures the full value of energy storage and its ability to substitute for transmission. Optimizing interconnection capacity and co-location can reduce total grid connection and shorter-distance transmission capacity expansion on the order of 10% at storage penetration equivalent to 2.5-10% of peak system demand. The decline in interconnection capacity corresponds with greater ratios of VRE to grid connection capacity (an average of 1.5-1.6 megawatt (MW) PV:1 MW inverter capacity, 1.2-1.3 MW wind:1 MW interconnection). Co-locating storage with VREs also results in a 10-15% increase in wind capacity, as wind sites tend to require longer and more costly interconnection. Finally, co-located storage exhibits higher value than standalone storage in our model setup (22-25%). Given the coarse representation of transmission networks in our modeling, this outcome likely overstates the real-world importance of storage co-location with VREs. However, it highlights how siting storage in grid-constrained locations can maximize the value of storage and reduce transmission expansion. △ Less

Submitted 2 September, 2024; originally announced September 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2303.11586

arXiv:2409.07298 [pdf, other]

What is the nature of GW230529? An exploration of the gravitational lensing hypothesis

Authors: Justin Janquart, David Keitel, Rico K. L. Lo, Juno C. L. Chan, Jose Marìa Ezquiaga, Otto A. Hannuksela, Alvin K. Y. Li, Anupreeta More, Hemantakumar Phurailatpam, Neha Singh, Laura E. Uronen, Mick Wright, Naresh Adhikari, Sylvia Biscoveanu, Tomasz Bulik, Amanda M. Farah, Anna Heffernan, Prathamesh Joshi, Vincent Juste, Atul Kedia, Shania A. Nichols, Geraint Pratten, C. Rawcliffe, Soumen Roy, Elise M. Sänger , et al. (4 additional authors not shown)

Abstract: On the 29th of May 2023, the LIGO-Virgo-KAGRA Collaboration observed a compact binary coalescence event consistent with a neutron star-black hole merger, though the heavier object of mass 2.5-4.5 $M_\odot$ would fall into the purported lower mass gap. An alternative explanation for apparent observations of events in this mass range has been suggested as strongly gravitationally lensed binary neutr… ▽ More On the 29th of May 2023, the LIGO-Virgo-KAGRA Collaboration observed a compact binary coalescence event consistent with a neutron star-black hole merger, though the heavier object of mass 2.5-4.5 $M_\odot$ would fall into the purported lower mass gap. An alternative explanation for apparent observations of events in this mass range has been suggested as strongly gravitationally lensed binary neutron stars. In this scenario, magnification would lead to the source appearing closer and heavier than it really is. Here, we investigate the chances and possible consequences for the GW230529 event to be gravitationally lensed. We find this would require high magnifications and we obtain low rates for observing such an event, with a relative fraction of lensed versus unlensed observed events of $2 \times 10^{-3}$ at most. When comparing the lensed and unlensed hypotheses accounting for the latest rates and population model, we find a 1/58 chance of lensing, disfavoring this option. Moreover, when the magnification is assumed to be strong enough to bring the mass of the heavier binary component below the standard limits on neutron star masses, we find high probability for the lighter object to have a sub-solar mass, making the binary even more exotic than a mass-gap neutron star-black hole system. Even when the secondary is not sub-solar, its tidal deformability would likely be measurable, which is not the case for GW230529. Finally, we do not find evidence for extra lensing signatures such as the arrival of additional lensed images, type-II image dephasing, or microlensing. Therefore, we conclude it is unlikely for GW230529 to be a strongly gravitationally lensed binary neutron star signal. △ Less

Submitted 17 September, 2024; v1 submitted 11 September, 2024; originally announced September 2024.

Comments: 15 pages, 11 figures

Report number: LIGO-P2400353

arXiv:2409.06410 [pdf]

LaB6 aided spontaneous conversion of bulk graphite into carbon nanotubes at normal atmospheric conditions

Authors: Shalaka A. Kamble, Soumen Karmakar, Somnath R. Bhopale, Sanket D. Jangale, Neha P. Gadke, Srikumar Ghorui, S. V. Bhoraskar, M. A. More, V. L. Mathe

Abstract: Herein, we report a case study in which we saw the spontaneous conversion of commercial bulk graphite into LaB6 decorated carbon nanotubes (CNTs) under normal atmospheric conditions. The feedstock graphite was used as a hollow cylindrical anode filled with LaB6 powder and partially eroded in a DC electric-arc plasma reactor in pure nitrogen atmosphere. An unusual and spontaneous deformation of the… ▽ More Herein, we report a case study in which we saw the spontaneous conversion of commercial bulk graphite into LaB6 decorated carbon nanotubes (CNTs) under normal atmospheric conditions. The feedstock graphite was used as a hollow cylindrical anode filled with LaB6 powder and partially eroded in a DC electric-arc plasma reactor in pure nitrogen atmosphere. An unusual and spontaneous deformation of the plasma-treated residual anode into a fluffy powder was seen to continue for months when left to ambient atmospheric conditions. The existence of LaB6 decorated multi-walled CNTs at large quantity was confirmed in the as-generated powder by using electron microscopy, Raman spectroscopy and x-ray diffraction. The as-synthesized CNT-based large-area field emitter showed promising field-emitting properties with a low turn-on electric field of ~1.5 V per micrometer, and a current density of ~1.17 mA per square cm at an applied electric field of 3.24 V per micrometer. △ Less

Submitted 10 September, 2024; originally announced September 2024.

Comments: 6 pages, 5figures

arXiv:2409.06131 [pdf, other]

Accelerating Large Language Model Pretraining via LFR Pedagogy: Learn, Focus, and Review

Authors: Neha Prakriya, Jui-Nan Yen, Cho-Jui Hsieh, Jason Cong

Abstract: Traditional Large Language Model (LLM) pretraining relies on autoregressive language modeling with randomly sampled data from web-scale datasets. Inspired by human learning techniques like spaced repetition, we hypothesize that random sampling leads to high training costs, lower-quality models, and significant data forgetting. To address these inefficiencies, we propose the Learn-Focus-Review (LFR… ▽ More Traditional Large Language Model (LLM) pretraining relies on autoregressive language modeling with randomly sampled data from web-scale datasets. Inspired by human learning techniques like spaced repetition, we hypothesize that random sampling leads to high training costs, lower-quality models, and significant data forgetting. To address these inefficiencies, we propose the Learn-Focus-Review (LFR) paradigm -- a dynamic training approach that adapts to the model's learning progress. LFR tracks the model's learning performance across data blocks (sequences of tokens) and prioritizes revisiting challenging regions of the dataset that are more prone to being forgotten, enabling better retention and more efficient learning. Using the LFR paradigm, we pretrained Llama and GPT models on the SlimPajama and OpenWebText datasets, respectively. These models were evaluated on downstream tasks across various domains, including question answering, problem-solving, commonsense reasoning, language modeling, and translation. Compared to baseline models trained on the full datasets, LFR consistently achieved lower perplexity and higher accuracy, while using only 5%--19% of the training tokens. Furthermore, LFR matched the performance of industry-standard Pythia models with up to 2$\times$ the parameter count, using just 3.2% of the training tokens, demonstrating its effectiveness and efficiency. △ Less

Submitted 28 January, 2025; v1 submitted 9 September, 2024; originally announced September 2024.

arXiv:2409.04880 [pdf, other]

Towards identifying Source credibility on Information Leakage in Digital Gadget Market

Authors: Neha Kumaru, Garvit Gupta, Shreyas Mongia, Shubham Singh, Ponnurangam Kumaraguru, Arun Balaji Buduru

Abstract: The use of Social media to share content is on a constant rise. One of the capsize effect of information sharing on Social media includes the spread of sensitive information on the public domain. With the digital gadget market becoming highly competitive and ever-evolving, the trend of an increasing number of sensitive posts leaking information on devices in social media is observed. Many web-blog… ▽ More The use of Social media to share content is on a constant rise. One of the capsize effect of information sharing on Social media includes the spread of sensitive information on the public domain. With the digital gadget market becoming highly competitive and ever-evolving, the trend of an increasing number of sensitive posts leaking information on devices in social media is observed. Many web-blogs on digital gadget market have mushroomed recently, making the problem of information leak all pervasive. Credible leaks on specifics of an upcoming device can cause a lot of financial damage to the respective organization. Hence, it is crucial to assess the credibility of the platforms that continuously post about a smartphone or digital gadget leaks. In this work, we analyze the headlines of leak web-blog posts and their corresponding official press-release. We first collect 54, 495 leak and press-release headlines for different smartphones. We train our custom NER model to capture the evolving smartphone names with an accuracy of 82.14% on manually annotated results. We further propose a credibility score metric for the web-blog, based on the number of falsified and authentic smartphone leak posts. △ Less

Submitted 7 September, 2024; originally announced September 2024.

arXiv:2408.11695 [pdf, other]

Hawkes process with tempered Mittag-Leffler kernel

Authors: Neha Gupta, Aditya Maheshwari

Abstract: In this paper, we propose an extension of the Hawkes process by incorporating a kernel based on the tempered Mittag-Leffler distribution. This is the generalization of the work presented in [10]. We derive analytical results for the expectation of the conditional intensity and the expected number of events in the counting process. Additionally, we investigate the limiting behavior of the expectati… ▽ More In this paper, we propose an extension of the Hawkes process by incorporating a kernel based on the tempered Mittag-Leffler distribution. This is the generalization of the work presented in [10]. We derive analytical results for the expectation of the conditional intensity and the expected number of events in the counting process. Additionally, we investigate the limiting behavior of the expectation of the conditional intensity. Finally, we present an empirical comparison of the studied process with its limiting special cases. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Comments: 9 pages, 5 figures

MSC Class: 60G55; 60G22

arXiv:2408.10239 [pdf, ps, other]

A Conceptual Framework for Ethical Evaluation of Machine Learning Systems

Authors: Neha R. Gupta, Jessica Hullman, Hari Subramonyam

Abstract: Research in Responsible AI has developed a range of principles and practices to ensure that machine learning systems are used in a manner that is ethical and aligned with human values. However, a critical yet often neglected aspect of ethical ML is the ethical implications that appear when designing evaluations of ML systems. For instance, teams may have to balance a trade-off between highly infor… ▽ More Research in Responsible AI has developed a range of principles and practices to ensure that machine learning systems are used in a manner that is ethical and aligned with human values. However, a critical yet often neglected aspect of ethical ML is the ethical implications that appear when designing evaluations of ML systems. For instance, teams may have to balance a trade-off between highly informative tests to ensure downstream product safety, with potential fairness harms inherent to the implemented testing procedures. We conceptualize ethics-related concerns in standard ML evaluation techniques. Specifically, we present a utility framework, characterizing the key trade-off in ethical evaluation as balancing information gain against potential ethical harms. The framework is then a tool for characterizing challenges teams face, and systematically disentangling competing considerations that teams seek to balance. Differentiating between different types of issues encountered in evaluation allows us to highlight best practices from analogous domains, such as clinical trials and automotive crash testing, which navigate these issues in ways that can offer inspiration to improve evaluation processes in ML. Our analysis underscores the critical need for development teams to deliberately assess and manage ethical complexities that arise during the evaluation of ML systems, and for the industry to move towards designing institutional policies to support ethical evaluations. △ Less

Submitted 4 August, 2024; originally announced August 2024.

arXiv:2407.21584 [pdf, other]

Quantum Thermodynamics of Open Quantum Systems: Nature of Thermal Fluctuations

Authors: Neha Pathania, Devvrat Tiwari, Subhashish Banerjee

Abstract: We investigate the thermodynamic behavior of open quantum systems through the Hamiltonian of Mean Force, focusing on two models: a two-qubit system interacting with a thermal bath and a Jaynes-Cummings Model without the rotating wave approximation. By analyzing both weak and strong coupling regimes, we uncover the impact of environmental interactions on quantum thermodynamic quantities, including… ▽ More We investigate the thermodynamic behavior of open quantum systems through the Hamiltonian of Mean Force, focusing on two models: a two-qubit system interacting with a thermal bath and a Jaynes-Cummings Model without the rotating wave approximation. By analyzing both weak and strong coupling regimes, we uncover the impact of environmental interactions on quantum thermodynamic quantities, including specific heat capacity, internal energy, and entropy. Further, the ergotropy and entropy production are computed. We also explore the thermodynamic uncertainty relation, which sets an upper bound on the signal-to-noise ratio. △ Less

Submitted 31 July, 2024; originally announced July 2024.

Comments: 10 pages, 9 figures

arXiv:2407.20978 [pdf]

Are gene-by-environment interactions leveraged in multi-modality neural networks for breast cancer prediction?

Authors: Monica Isgut, Andrew Hornback, Yunan Luo, Asma Khimani, Neha Jain, May D. Wang

Abstract: Polygenic risk scores (PRSs) can significantly enhance breast cancer risk prediction when combined with clinical risk factor data. While many studies have explored the value-add of PRSs, little is known about the potential impact of gene-by-gene or gene-by-environment interactions towards enhancing the risk discrimination capabilities of multi-modal models combining PRSs with clinical data. In thi… ▽ More Polygenic risk scores (PRSs) can significantly enhance breast cancer risk prediction when combined with clinical risk factor data. While many studies have explored the value-add of PRSs, little is known about the potential impact of gene-by-gene or gene-by-environment interactions towards enhancing the risk discrimination capabilities of multi-modal models combining PRSs with clinical data. In this study, we integrated data on 318 individual genotype variants along with clinical data in a neural network to explore whether gene-by-gene (i.e., between individual variants) and/or gene-by-environment (between clinical risk factors and variants) interactions could be leveraged jointly during training to improve breast cancer risk prediction performance. We benchmarked our approach against a baseline model combining traditional univariate PRSs with clinical data in a logistic regression model and ran an interpretability analysis to identify feature interactions. While our model did not demonstrate improved performance over the baseline, we discovered 248 (<1%) statistically significant gene-by-gene and gene-by-environment interactions out of the ~53.6k possible feature pairs, the most contributory of which included rs6001930 (MKL1) and rs889312 (MAP3K1), with age and menopause being the most heavily interacting non-genetic risk factors. We also modeled the significant interactions as a network of highly connected features, suggesting that potential higher-order interactions are captured by the model. Although gene-by-environment (or gene-by-gene) interactions did not enhance breast cancer risk prediction performance in neural networks, our study provides evidence that these interactions can be leveraged by these models to inform their predictions. This study represents the first application of neural networks to screen for interactions impacting breast cancer risk using real-world data. △ Less

Submitted 30 July, 2024; originally announced July 2024.

arXiv:2407.20393 [pdf, other]

Validating Mean Field Theory in a New Complex, Disordered High-Entropy Spinel Oxide

Authors: Neha Sharma, Nikita Sharma, Jyoti Sharma, S. D. Kaushik, Sanjoy Kr. Mahatha, Tirthankar Chakraborty, Sourav Marik

Abstract: The advent of novel high-entropy oxides has sparked substantial research interest due to their exceptional functional properties, which often surpass the mere sum of their constituent elements' characteristics. This study introduces a complex high-entropy spinel oxide with composition (Ni$_{0.2}$Mg$_{0.2}$Co$_{0.2}$Cu$_{0.2}$Zn$_{0.2}$)(Mn$_{0.66}$Fe$_{0.66}$Cr$_{0.66}$)O$_{4}$. We performed compr… ▽ More The advent of novel high-entropy oxides has sparked substantial research interest due to their exceptional functional properties, which often surpass the mere sum of their constituent elements' characteristics. This study introduces a complex high-entropy spinel oxide with composition (Ni$_{0.2}$Mg$_{0.2}$Co$_{0.2}$Cu$_{0.2}$Zn$_{0.2}$)(Mn$_{0.66}$Fe$_{0.66}$Cr$_{0.66}$)O$_{4}$. We performed comprehensive structural (X-ray and Neutron diffraction), microstructural, magnetic, and local electronic structure investigations on this material. Despite the material's high degree of disorder, detailed magnetization measurements and low temperature neutron powder diffraction studies reveal long-range ferrimagnetic ordering beginning at 293 K. The sample exhibits a high saturation magnetization of 766 emu-cm${^3}$ (at 50 K), a low coercivity (H$_C$) of 100 Oe (50 K), a high transition temperature (T$_C$) around room temperature, and high resistivity value of 4000 Ohm-cm at room temperature, indicating its potential for high density memory devices. The magnetic structure is determined using a collinear-type ferrimagnetic model with a propagation vector k = 0,0,0. Various analytical techniques, including modified Arrott plots, Kouvel-Fischer analysis, and critical isotherm analysis, are employed to investigate the phase transitions and magnetic properties of this complex system. Our results indicate a second-order phase transition. Remarkably, despite the complex structure and significant disorder, the critical exponents obtained are consistent with the mean field model. The high entropy leads to a remarkably homogeneous distribution of multiple cations, validating the approximation of average local magnetic environments and supporting the mean field theory. △ Less

Submitted 29 July, 2024; originally announced July 2024.

Showing 51–100 of 577 results for author: Neha