-
Hopping Forcing Number in Random $d$-regular Graphs
Authors:
Pawel Pralat,
Harjas Singh
Abstract:
Hopping forcing is a single player combinatorial game in which the player is presented a graph on $n$ vertices, some of which are initially blue with the remaining vertices being white. In each round $t$, a blue vertex $v$ with all neighbours blue may hop and colour a white vertex blue in the second neighbourhood, provided that $v$ has not performed a hop in the previous $t-1$ rounds. The objectiv…
▽ More
Hopping forcing is a single player combinatorial game in which the player is presented a graph on $n$ vertices, some of which are initially blue with the remaining vertices being white. In each round $t$, a blue vertex $v$ with all neighbours blue may hop and colour a white vertex blue in the second neighbourhood, provided that $v$ has not performed a hop in the previous $t-1$ rounds. The objective of the game is to eventually colour every vertex blue by repeatedly applying the hopping forcing rule. Subsequently, for a given graph $G$, the hopping forcing number is the minimum number of initial blue vertices that are required to achieve the objective.
In this paper, we study the hopping forcing number for random $d$-regular graphs. Specifically, we aim to derive asymptotic upper and lower bounds for the hopping forcing number for various values of $d \geq 2$.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Heterogeneous Graph Auto-Encoder for CreditCard Fraud Detection
Authors:
Moirangthem Tiken Singh,
Rabinder Kumar Prasad,
Gurumayum Robert Michael,
N K Kaphungkui,
N. Hemarjit Singh
Abstract:
The digital revolution has significantly impacted financial transactions, leading to a notable increase in credit card usage. However, this convenience comes with a trade-off: a substantial rise in fraudulent activities. Traditional machine learning methods for fraud detection often struggle to capture the inherent interconnectedness within financial data. This paper proposes a novel approach for…
▽ More
The digital revolution has significantly impacted financial transactions, leading to a notable increase in credit card usage. However, this convenience comes with a trade-off: a substantial rise in fraudulent activities. Traditional machine learning methods for fraud detection often struggle to capture the inherent interconnectedness within financial data. This paper proposes a novel approach for credit card fraud detection that leverages Graph Neural Networks (GNNs) with attention mechanisms applied to heterogeneous graph representations of financial data. Unlike homogeneous graphs, heterogeneous graphs capture intricate relationships between various entities in the financial ecosystem, such as cardholders, merchants, and transactions, providing a richer and more comprehensive data representation for fraud analysis. To address the inherent class imbalance in fraud data, where genuine transactions significantly outnumber fraudulent ones, the proposed approach integrates an autoencoder. This autoencoder, trained on genuine transactions, learns a latent representation and flags deviations during reconstruction as potential fraud. This research investigates two key questions: (1) How effectively can a GNN with an attention mechanism detect and prevent credit card fraud when applied to a heterogeneous graph? (2) How does the efficacy of the autoencoder with attention approach compare to traditional methods? The results are promising, demonstrating that the proposed model outperforms benchmark algorithms such as Graph Sage and FI-GRL, achieving a superior AUC-PR of 0.89 and an F1-score of 0.81. This research significantly advances fraud detection systems and the overall security of financial transactions by leveraging GNNs with attention mechanisms and addressing class imbalance through an autoencoder.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Post-Training Quantization in Brain-Computer Interfaces based on Event-Related Potential Detection
Authors:
Hubert Cecotti,
Dalvir Dhaliwal,
Hardip Singh,
Yogesh Kumar Meena
Abstract:
Post-training quantization (PTQ) is a technique used to optimize and reduce the memory footprint and computational requirements of machine learning models. It has been used primarily for neural networks. For Brain-Computer Interfaces (BCI) that are fully portable and usable in various situations, it is necessary to provide approaches that are lightweight for storage and computation. In this paper,…
▽ More
Post-training quantization (PTQ) is a technique used to optimize and reduce the memory footprint and computational requirements of machine learning models. It has been used primarily for neural networks. For Brain-Computer Interfaces (BCI) that are fully portable and usable in various situations, it is necessary to provide approaches that are lightweight for storage and computation. In this paper, we propose the evaluation of post-training quantization on state-of-the-art approaches in brain-computer interfaces and assess their impact on accuracy. We evaluate the performance of the single-trial detection of event-related potentials representing one major BCI paradigm. The area under the receiver operating characteristic curve drops from 0.861 to 0.825 with PTQ when applied on both spatial filters and the classifier, while reducing the size of the model by about $\times$ 15. The results support the conclusion that PTQ can substantially reduce the memory footprint of the models while keeping roughly the same level of accuracy.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks
Authors:
Ann Huang,
Satpreet H. Singh,
Flavio Martinelli,
Kanaka Rajan
Abstract:
Task-trained recurrent neural networks (RNNs) are widely used in neuroscience and machine learning to model dynamical computations. To gain mechanistic insight into how neural systems solve tasks, prior work often reverse-engineers individual trained networks. However, different RNNs trained on the same task and achieving similar performance can exhibit strikingly different internal solutions-a ph…
▽ More
Task-trained recurrent neural networks (RNNs) are widely used in neuroscience and machine learning to model dynamical computations. To gain mechanistic insight into how neural systems solve tasks, prior work often reverse-engineers individual trained networks. However, different RNNs trained on the same task and achieving similar performance can exhibit strikingly different internal solutions-a phenomenon known as solution degeneracy. Here, we develop a unified framework to systematically quantify and control solution degeneracy across three levels: behavior, neural dynamics, and weight space. We apply this framework to 3,400 RNNs trained on four neuroscience-relevant tasks-flip-flop memory, sine wave generation, delayed discrimination, and path integration-while systematically varying task complexity, learning regime, network size, and regularization. We find that higher task complexity and stronger feature learning reduce degeneracy in neural dynamics but increase it in weight space, with mixed effects on behavior. In contrast, larger networks and structural regularization reduce degeneracy at all three levels. These findings empirically validate the Contravariance Principle and provide practical guidance for researchers aiming to tailor RNN solutions-whether to uncover shared neural mechanisms or to model individual variability observed in biological systems. This work provides a principled framework for quantifying and controlling solution degeneracy in task-trained RNNs, offering new tools for building more interpretable and biologically grounded models of neural computation.
△ Less
Submitted 28 May, 2025; v1 submitted 4 October, 2024;
originally announced October 2024.
-
Optimised Hybrid Classical-Quantum Algorithm for Accelerated Solution of Sparse Linear Systems
Authors:
Hakikat Singh
Abstract:
Efficiently solving large-scale sparse linear systems poses a significant challenge in computational science, especially in fields such as physics, engineering, machine learning, and finance. Traditional classical algorithms face scalability issues as the size of these systems increases, leading to performance degradation. On the other hand, quantum algorithms, like the Harrow-Hassidim-Lloyd (HHL)…
▽ More
Efficiently solving large-scale sparse linear systems poses a significant challenge in computational science, especially in fields such as physics, engineering, machine learning, and finance. Traditional classical algorithms face scalability issues as the size of these systems increases, leading to performance degradation. On the other hand, quantum algorithms, like the Harrow-Hassidim-Lloyd (HHL) algorithm, offer exponential speedups for solving linear systems, yet they are constrained by the current state of quantum hardware and sensitivity to matrix condition numbers. This paper introduces a hybrid classical-quantum algorithm that combines CUDA-accelerated preconditioning techniques with the HHL algorithm to solve sparse linear systems more efficiently. The classical GPU parallelism is utilised to preprocess and precondition the matrix, reducing its condition number, while quantum computing is employed to solve the preconditioned system using the HHL algorithm. Additionally, the algorithm integrates machine learning models, particularly reinforcement learning, to dynamically optimise system parameters, such as block sizes and preconditioning stratgies, based on real-time performance data. Our experimental results show that the proposed approach not only surpasses traditional methods in speed and scalability but also mitigates some of the inherent limitations of quantum algorithms. This work pushes the boundaries of efficient computing and provides a foundation for future advancements in hybrid computational frameworks.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
Energy-efficient picosecond spin-orbit torque magnetization switching in ferro- and ferrimagnetic films
Authors:
Eva Díaz,
Alberto Anadón,
Pablo Olleros-Rodríguez,
Harjinder Singh,
Héloïse Damas,
Paolo Perna,
Martina Morassi,
Aristide Lemaître,
Michel Hehn,
Jon Gorchon
Abstract:
Electrical current pulses can be used to manipulate magnetization efficiently via spin-orbit torques (SOTs). Pulse durations as short as a few picoseconds have been used to switch the magnetization of ferromagnetic films, reaching the THz regime. However, little is known about the reversal mechanisms and energy requirements in the ultrafast switching regime. In this work, we quantify the energy co…
▽ More
Electrical current pulses can be used to manipulate magnetization efficiently via spin-orbit torques (SOTs). Pulse durations as short as a few picoseconds have been used to switch the magnetization of ferromagnetic films, reaching the THz regime. However, little is known about the reversal mechanisms and energy requirements in the ultrafast switching regime. In this work, we quantify the energy cost for magnetization reversal over 7 orders of magnitude in pulse duration, in both ferromagnetic and ferrimagnetic samples, bridging quasi-static spintronics and femtomagnetism. To this end, we develop a method to stretch picosecond pulses generated by a photoconductive switch by an order of magnitude. Thereby, we can create current pulses from picoseconds to durations approaching pulse width available with commercial instruments. We show that the energy cost for SOT switching decreases by more than an order of magnitude in all samples when the pulse duration enters the picosecond range. We project an energy cost of 9 fJ for a 100 x 100 nm 2 ferrimagnetic device. Micromagnetic and macrospin simulations unveil a transition from a non-coherent to a coherent magnetization reversal with a strong modification of the magnetization dynamical trajectories as pulse duration is reduced. Our results cement the potential for high-speed magnetic spin-orbit torque memories and highlights alternative magnetization reversal pathways at fast time scales.
△ Less
Submitted 1 October, 2024;
originally announced October 2024.
-
Colour-Based Disentangling of Mira Variables and Ultra-Cool Dwarfs
Authors:
Aleksandra Avdeeva,
Kefeng Tan,
Santosh Joshi,
Dana Kovaleva,
Harinder P. Singh,
Ali Luo,
Oleg Malkov
Abstract:
Despite having different astronomical characteristics, the studies of mira variables and ultra-cool dwarfs frequently show similar red colors, which could cause leading to photometric misclassification. This study uses photometric data from the WISE, 2MASS, and Pan-STARRS surveys to construct color-based selection criteria for red dwarfs, brown dwarfs, and Mira variables. On analyzing the color in…
▽ More
Despite having different astronomical characteristics, the studies of mira variables and ultra-cool dwarfs frequently show similar red colors, which could cause leading to photometric misclassification. This study uses photometric data from the WISE, 2MASS, and Pan-STARRS surveys to construct color-based selection criteria for red dwarfs, brown dwarfs, and Mira variables. On analyzing the color indices, we developed empirical rules that separate these objects with an overall classification accuracy of approximately 91%-92%. While the differentiation between red dwarfs and both Mira variables and brown dwarfs is effective, challenges remain in distinguishing Mira variables from brown dwarfs due to overlapping color indices. The robustness of our classification technique was validated by a bootstrap analysis, highlighting the significance of color indices in large photometric surveys for stellar classification.
△ Less
Submitted 17 September, 2024;
originally announced September 2024.
-
Hand-Object Interaction Pretraining from Videos
Authors:
Himanshu Gaurav Singh,
Antonio Loquercio,
Carmelo Sferrazza,
Jane Wu,
Haozhi Qi,
Pieter Abbeel,
Jitendra Malik
Abstract:
We present an approach to learn general robot manipulation priors from 3D hand-object interaction trajectories. We build a framework to use in-the-wild videos to generate sensorimotor robot trajectories. We do so by lifting both the human hand and the manipulated object in a shared 3D space and retargeting human motions to robot actions. Generative modeling on this data gives us a task-agnostic ba…
▽ More
We present an approach to learn general robot manipulation priors from 3D hand-object interaction trajectories. We build a framework to use in-the-wild videos to generate sensorimotor robot trajectories. We do so by lifting both the human hand and the manipulated object in a shared 3D space and retargeting human motions to robot actions. Generative modeling on this data gives us a task-agnostic base policy. This policy captures a general yet flexible manipulation prior. We empirically demonstrate that finetuning this policy, with both reinforcement learning (RL) and behavior cloning (BC), enables sample-efficient adaptation to downstream tasks and simultaneously improves robustness and generalizability compared to prior approaches. Qualitative experiments are available at: \url{https://hgaurav2k.github.io/hop/}.
△ Less
Submitted 12 September, 2024;
originally announced September 2024.
-
Pareto Set Prediction Assisted Bilevel Multi-objective Optimization
Authors:
Bing Wang,
Hemant K. Singh,
Tapabrata Ray
Abstract:
Bilevel optimization problems comprise an upper level optimization task that contains a lower level optimization task as a constraint. While there is a significant and growing literature devoted to solving bilevel problems with single objective at both levels using evolutionary computation, there is relatively scarce work done to address problems with multiple objectives (BLMOP) at both levels. Fo…
▽ More
Bilevel optimization problems comprise an upper level optimization task that contains a lower level optimization task as a constraint. While there is a significant and growing literature devoted to solving bilevel problems with single objective at both levels using evolutionary computation, there is relatively scarce work done to address problems with multiple objectives (BLMOP) at both levels. For black-box BLMOPs, the existing evolutionary techniques typically utilize nested search, which in its native form consumes large number of function evaluations. In this work, we propose to reduce this expense by predicting the lower level Pareto set for a candidate upper level solution directly, instead of conducting an optimization from scratch. Such a prediction is significantly challenging for BLMOPs as it involves one-to-many mapping scenario. We resolve this bottleneck by supplementing the dataset using a helper variable and construct a neural network, which can then be trained to map the variables in a meaningful manner. Then, we embed this initialization within a bilevel optimization framework, termed Pareto set prediction assisted evolutionary bilevel multi-objective optimization (PSP-BLEMO). Systematic experiments with existing state-of-the-art methods are presented to demonstrate its benefit. The experiments show that the proposed approach is competitive across a range of problems, including both deceptive and non-deceptive problems
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
STAB: Speech Tokenizer Assessment Benchmark
Authors:
Shikhar Vashishth,
Harman Singh,
Shikhar Bharadwaj,
Sriram Ganapathy,
Chulayuth Asawaroengchai,
Kartik Audhkhasi,
Andrew Rosenberg,
Ankur Bapna,
Bhuvana Ramabhadran
Abstract:
Representing speech as discrete tokens provides a framework for transforming speech into a format that closely resembles text, thus enabling the use of speech as an input to the widely successful large language models (LLMs). Currently, while several speech tokenizers have been proposed, there is ambiguity regarding the properties that are desired from a tokenizer for specific downstream tasks and…
▽ More
Representing speech as discrete tokens provides a framework for transforming speech into a format that closely resembles text, thus enabling the use of speech as an input to the widely successful large language models (LLMs). Currently, while several speech tokenizers have been proposed, there is ambiguity regarding the properties that are desired from a tokenizer for specific downstream tasks and its overall generalizability. Evaluating the performance of tokenizers across different downstream tasks is a computationally intensive effort that poses challenges for scalability. To circumvent this requirement, we present STAB (Speech Tokenizer Assessment Benchmark), a systematic evaluation framework designed to assess speech tokenizers comprehensively and shed light on their inherent characteristics. This framework provides a deeper understanding of the underlying mechanisms of speech tokenization, thereby offering a valuable resource for expediting the advancement of future tokenizer models and enabling comparative analysis using a standardized benchmark. We evaluate the STAB metrics and correlate this with downstream task performance across a range of speech tasks and tokenizer choices.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
Disease Classification and Impact of Pretrained Deep Convolution Neural Networks on Diverse Medical Imaging Datasets across Imaging Modalities
Authors:
Jutika Borah,
Kumaresh Sarmah,
Hidam Kumarjit Singh
Abstract:
Imaging techniques such as Chest X-rays, whole slide images, and optical coherence tomography serve as the initial screening and detection for a wide variety of medical pulmonary and ophthalmic conditions respectively. This paper investigates the intricacies of using pretrained deep convolutional neural networks with transfer learning across diverse medical imaging datasets with varying modalities…
▽ More
Imaging techniques such as Chest X-rays, whole slide images, and optical coherence tomography serve as the initial screening and detection for a wide variety of medical pulmonary and ophthalmic conditions respectively. This paper investigates the intricacies of using pretrained deep convolutional neural networks with transfer learning across diverse medical imaging datasets with varying modalities for binary and multiclass classification. We conducted a comprehensive performance analysis with ten network architectures and model families each with pretraining and random initialization. Our finding showed that the use of pretrained models as fixed feature extractors yields poor performance irrespective of the datasets. Contrary, histopathology microscopy whole slide images have better performance. It is also found that deeper and more complex architectures did not necessarily result in the best performance. This observation implies that the improvements in ImageNet are not parallel to the medical imaging tasks. Within a medical domain, the performance of the network architectures varies within model families with shifts in datasets. This indicates that the performance of models within a specific modality may not be conclusive for another modality within the same domain. This study provides a deeper understanding of the applications of deep learning techniques in medical imaging and highlights the impact of pretrained networks across different medical imaging datasets under five different experimental settings.
△ Less
Submitted 2 September, 2024; v1 submitted 30 August, 2024;
originally announced August 2024.
-
Pixels to Prose: Understanding the art of Image Captioning
Authors:
Hrishikesh Singh,
Aarti Sharma,
Millie Pant
Abstract:
In the era of evolving artificial intelligence, machines are increasingly emulating human-like capabilities, including visual perception and linguistic expression. Image captioning stands at the intersection of these domains, enabling machines to interpret visual content and generate descriptive text. This paper provides a thorough review of image captioning techniques, catering to individuals ent…
▽ More
In the era of evolving artificial intelligence, machines are increasingly emulating human-like capabilities, including visual perception and linguistic expression. Image captioning stands at the intersection of these domains, enabling machines to interpret visual content and generate descriptive text. This paper provides a thorough review of image captioning techniques, catering to individuals entering the field of machine learning who seek a comprehensive understanding of available options, from foundational methods to state-of-the-art approaches. Beginning with an exploration of primitive architectures, the review traces the evolution of image captioning models to the latest cutting-edge solutions. By dissecting the components of these architectures, readers gain insights into the underlying mechanisms and can select suitable approaches tailored to specific problem requirements without duplicating efforts. The paper also delves into the application of image captioning in the medical domain, illuminating its significance in various real-world scenarios.
Furthermore, the review offers guidance on evaluating the performance of image captioning systems, highlighting key metrics for assessment. By synthesizing theoretical concepts with practical application, this paper equips readers with the knowledge needed to navigate the complex landscape of image captioning and harness its potential for diverse applications in machine learning and beyond.
△ Less
Submitted 28 August, 2024;
originally announced August 2024.
-
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices
Authors:
Zhiyong Zhang,
Aniket Gupta,
Huaizu Jiang,
Hanumant Singh
Abstract:
Real-time high-accuracy optical flow estimation is crucial for various real-world applications. While recent learning-based optical flow methods have achieved high accuracy, they often come with significant computational costs. In this paper, we propose a highly efficient optical flow method that balances high accuracy with reduced computational demands. Building upon NeuFlow v1, we introduce new…
▽ More
Real-time high-accuracy optical flow estimation is crucial for various real-world applications. While recent learning-based optical flow methods have achieved high accuracy, they often come with significant computational costs. In this paper, we propose a highly efficient optical flow method that balances high accuracy with reduced computational demands. Building upon NeuFlow v1, we introduce new components including a much more light-weight backbone and a fast refinement module. Both these modules help in keeping the computational demands light while providing close to state of the art accuracy. Compares to other state of the art methods, our model achieves a 10x-70x speedup while maintaining comparable performance on both synthetic and real-world data. It is capable of running at over 20 FPS on 512x384 resolution images on a Jetson Orin Nano. The full training and evaluation code is available at https://github.com/neufieldrobotics/NeuFlow_v2.
△ Less
Submitted 21 August, 2024; v1 submitted 19 August, 2024;
originally announced August 2024.
-
Work and Activation in a Nematic Polymer Network Ribbon
Authors:
H. Singh,
K. Suryanarayanan,
E. G. Virga
Abstract:
We study spontaneous deformations of a ribbon made of nematic polymer networks and activated under the action of a mechanical load. We show that when such ribbons are activated appropriately, the deformations produced can pull back and perform work against the externally applied load. We perform two numerical experiments to demonstrate this effect: (1) the \emph{pulling} experiment, where the ribb…
▽ More
We study spontaneous deformations of a ribbon made of nematic polymer networks and activated under the action of a mechanical load. We show that when such ribbons are activated appropriately, the deformations produced can pull back and perform work against the externally applied load. We perform two numerical experiments to demonstrate this effect: (1) the \emph{pulling} experiment, where the ribbon is pulled longitudinally by a point force, and (2) the \emph{bending} experiment, where the ribbon is bent out of plane by a terminally applied point force. We quantify the capacity of the ribbon to work against external loads, and compute its dependence on both the ribbon thickness and the imprinted nematic texture (that is, the distribution of the nematic directors across the ribbon's length). Finally, we compute the efficiency of the activation process. Building on the outcomes of our numerical explorations, we formulate two educated conjectures on how the activation efficiency can in general be improved by acting on both the applied load and the imprinted nematic texture.
△ Less
Submitted 14 August, 2024;
originally announced August 2024.
-
Symmetric Encryption Scheme Based on Quasigroup Using Chained Mode of Operation
Authors:
Satish Kumar,
Harshdeep Singh,
Indivar Gupta,
Ashok Ji Gupta
Abstract:
In this paper, we propose a novel construction for a symmetric encryption scheme, referred as SEBQ which is based on the structure of quasigroup. We utilize concepts of chaining like mode of operation and present a block cipher with in-built properties. We prove that SEBQ shows resistance against chosen plaintext attack (CPA) and by applying unbalanced Feistel transformation [19], it achieves secu…
▽ More
In this paper, we propose a novel construction for a symmetric encryption scheme, referred as SEBQ which is based on the structure of quasigroup. We utilize concepts of chaining like mode of operation and present a block cipher with in-built properties. We prove that SEBQ shows resistance against chosen plaintext attack (CPA) and by applying unbalanced Feistel transformation [19], it achieves security against chosen ciphertext attacks (CCA). Subsequently, we conduct an assessment of the randomness of the proposed scheme by running the NIST test suite and we analyze the impact of the initial vector, secret key and plaintext on ciphertext through an avalanche effect analysis. We also compare the results with existing schemes based on quasigroups [11,46]. Moreover, we analyze the computational complexity in terms of number of operations needed for encryption and decryption process.
△ Less
Submitted 8 August, 2024;
originally announced August 2024.
-
The Dusty Aftermath of a Rapid Nova: V5579 Sgr
Authors:
A. Raj,
M. S. Bisht,
F. M. Walter,
R. Pandey,
C. E. Woodward,
D. E. Harker,
D. Bisht,
H. P. Singh,
A. Agarwal,
J. C. Pandey,
A. Joshi,
K. Belwal,
Christian Buil
Abstract:
V5579 Sgr was a fast nova discovered in 2008 April 18.784 UT. We present the optical spectroscopic observations of the nova observed from the Castanet Tolosan, SMARTS and CTIO observatories spanning over 2008 April 23 to 2015 May 11. The spectra are dominated by hydrogen Balmer, Fe II and O I lines with P-Cygni profiles in the early phase, typical of an Fe II class nova. The spectra show He I and…
▽ More
V5579 Sgr was a fast nova discovered in 2008 April 18.784 UT. We present the optical spectroscopic observations of the nova observed from the Castanet Tolosan, SMARTS and CTIO observatories spanning over 2008 April 23 to 2015 May 11. The spectra are dominated by hydrogen Balmer, Fe II and O I lines with P-Cygni profiles in the early phase, typical of an Fe II class nova. The spectra show He I and He II lines along with forbidden lines from N, Ar, S, and O in the nebular phase. The nova showed a pronounced dust formation episode that began about 20 days after the outburst. The dust temperature and mass were estimated using the WISE data from spectral energy distribution (SED) fits. The PAH-like features are also seen in the nova ejecta in the mid-IR Gemini spectra taken 522 d after the discovery. Analysis of the light curve indicates values of t$_2$ and t$_3$ about 9 and 13 days, respectively, placing the nova in the category of fast nova. The best fit cloudy model of the early decline phase JHK spectra obtained on 2008 May 3 and the nebular optical spectrum obtained on 2011 June 2 shows a hot white dwarf source with T$_{BB}$ $\sim$ 2.6 $\times$ 10$^5$ K having a luminosity of 9.8 $\times$ 10$^{36}$ ergs s$^{-1}$. Our abundance analysis shows that the ejecta is significantly enhanced relative to solar, O/H = 32.2, C/H = 15.5 and N/H = 40.0 in the early decline phase and O/H = 5.8, He/H = 1.5 and N/H = 22.0 in the nebular phase.
△ Less
Submitted 7 August, 2024; v1 submitted 4 August, 2024;
originally announced August 2024.
-
Inverse Particle Filter
Authors:
Himali Singh,
Arpan Chattopadhyay,
Kumar Vijay Mishra
Abstract:
In cognitive systems, recent emphasis has been placed on studying the cognitive processes of the subject whose behavior was the primary focus of the system's cognitive response. This approach, known as inverse cognition, arises in counter-adversarial applications and has motivated the development of inverse Bayesian filters. In this context, a cognitive adversary, such as a radar, uses a forward B…
▽ More
In cognitive systems, recent emphasis has been placed on studying the cognitive processes of the subject whose behavior was the primary focus of the system's cognitive response. This approach, known as inverse cognition, arises in counter-adversarial applications and has motivated the development of inverse Bayesian filters. In this context, a cognitive adversary, such as a radar, uses a forward Bayesian filter to track its target of interest. An inverse filter is then employed to infer the adversary's estimate of the target's or defender's state. Previous studies have addressed this inverse filtering problem by introducing methods like the inverse Kalman filter (KF), inverse extended KF, and inverse unscented KF. However, these filters typically assume additive Gaussian noise models and/or rely on local approximations of non-linear dynamics at the state estimates, limiting their practical application. In contrast, this paper adopts a global filtering approach and presents the development of an inverse particle filter (I-PF). The particle filter framework employs Monte Carlo (MC) methods to approximate arbitrary posterior distributions. Moreover, under mild system-level conditions, the proposed I-PF demonstrates convergence to the optimal inverse filter. Additionally, we propose the differentiable I-PF to address scenarios where system information is unknown to the defender. Using the recursive Cramer-Rao lower bound and non-credibility index (NCI), our numerical experiments for different systems demonstrate the estimation performance and time complexity of the proposed filter.
△ Less
Submitted 29 March, 2025; v1 submitted 23 July, 2024;
originally announced July 2024.
-
Kaluza-Klein discreteness of the entropy: Symmetrical bath and CFT subsystem
Authors:
Harvendra Singh
Abstract:
We explore the entanglement entropy of CFT systems in contact with large bath system, such that the complete system lives on the boundary of $AdS_{d+1}$ spacetime. We are interested in finding the HEE of a bath (system-B) in contact with a central subsystem-A. We assume that the net size of systems A and B together remains fixed while allowing variation in individual sizes. This assumption is simp…
▽ More
We explore the entanglement entropy of CFT systems in contact with large bath system, such that the complete system lives on the boundary of $AdS_{d+1}$ spacetime. We are interested in finding the HEE of a bath (system-B) in contact with a central subsystem-A. We assume that the net size of systems A and B together remains fixed while allowing variation in individual sizes. This assumption is simply guided by the conservation laws. It is found that for large bath size the island entropy term are important. However other subleading (icebergs) terms do also contribute to bath entropy. The contributions are generally not separable from each other and all such contributions add together to give rise a fixed quantity. Further when accounted properly all such contributions will form part of higher entropy branch for the bath. Nevertheless the HEE of bath system should be subjected to minimality principle. The quantum minimality principle $ S_{quantum}[B]=\{S[A], S_{total}+S[A]\}_{min}$, is local in nature and gives rise to the Page curve. It is shown that the changes in bath entropy do capture Kaluza-Klein discreteness. The minimality principle would be applicable in finite temperature systems as well.
△ Less
Submitted 22 October, 2024; v1 submitted 18 July, 2024;
originally announced July 2024.
-
SHARC-VQE: Simplified Hamiltonian Approach with Refinement and Correction enabled Variational Quantum Eigensolver for Molecular Simulation
Authors:
Harshdeep Singh,
Sonjoy Majumder,
Sabyashachi Mishra
Abstract:
The transformation of a molecular Hamiltonian from the fermionic space to the qubit space results in a series of Pauli strings. Calculating the energy then involves evaluating the expectation values of each of these strings, which presents a significant bottleneck for applying variational quantum eigensolvers (VQEs) in quantum chemistry. Unlike fermionic Hamiltonians, the terms in a qubit Hamilton…
▽ More
The transformation of a molecular Hamiltonian from the fermionic space to the qubit space results in a series of Pauli strings. Calculating the energy then involves evaluating the expectation values of each of these strings, which presents a significant bottleneck for applying variational quantum eigensolvers (VQEs) in quantum chemistry. Unlike fermionic Hamiltonians, the terms in a qubit Hamiltonian are additive. This work leverages this property to introduce a novel method for extracting information from the partial qubit Hamiltonian, thereby enhancing the efficiency of VQEs. This work introduces the SHARC-VQE (Simplified Hamiltonian Approximation, Refinement, and Correction-VQE) method, where the full molecular Hamiltonian is partitioned into two parts based on the ease of quantum execution. The easy-to-execute part constitutes the Partial Hamiltonian, and the remaining part, while more complex to execute, is generally less significant. The latter is approximated by a refined operator and added up as a correction into the partial Hamiltonian. SHARC-VQE significantly reduces computational costs for molecular simulations. The cost of a single energy measurement can be reduced from $O(\frac{N^4}{ε^2})$ to $O(\frac{1}{ε^2})$ for a system of $N$ qubits and accuracy $ε$, while the overall cost of VQE can be reduced from $O(\frac{N^7}{ε^2})$ to $O(\frac{N^3}{ε^2})$. Furthermore, measurement outcomes using SHARC-VQE are less prone to errors induced by noise from quantum circuits, reducing the errors from 20-40% to 5-10% without any additional error correction or mitigation technique. Additionally, the SHARC-VQE is demonstrated as an initialization technique, where the simplified partial Hamiltonian is used to identify an optimal starting point for a complex problem.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
Subdiffusive transport in the Fredkin dynamical universality class
Authors:
Catherine McCarthy,
Hansveer Singh,
Sarang Gopalakrishnan,
Romain Vasseur
Abstract:
We identify a pseudolocal conserved charge in the Fredkin and Motzkin quantum spin chains and explore its consequences for the hydrodynamics of systems with Fredkin- or Motzkin-type kinetic constraints. We use this quantity to formulate an exact upper bound ${\cal O}(L^{-5/2})$ on the gap of the Fredkin and Motzkin spin chains. Our results establish that transport in kinetically constrained dynami…
▽ More
We identify a pseudolocal conserved charge in the Fredkin and Motzkin quantum spin chains and explore its consequences for the hydrodynamics of systems with Fredkin- or Motzkin-type kinetic constraints. We use this quantity to formulate an exact upper bound ${\cal O}(L^{-5/2})$ on the gap of the Fredkin and Motzkin spin chains. Our results establish that transport in kinetically constrained dynamical systems with Fredkin or Motzkin constraints is subdiffusive, with dynamical exponent $z \geq 5/2$.
△ Less
Submitted 18 March, 2025; v1 submitted 15 July, 2024;
originally announced July 2024.
-
Cross-Lingual Multi-Hop Knowledge Editing
Authors:
Aditi Khandelwal,
Harman Singh,
Hengrui Gu,
Tianlong Chen,
Kaixiong Zhou
Abstract:
Large language models are often expected to constantly adapt to new sources of knowledge and knowledge editing techniques aim to efficiently patch the outdated model knowledge, with minimal modification. Most prior works focus on monolingual knowledge editing in English, even though new information can emerge in any language from any part of the world. We propose the Cross-Lingual Multi-Hop Knowle…
▽ More
Large language models are often expected to constantly adapt to new sources of knowledge and knowledge editing techniques aim to efficiently patch the outdated model knowledge, with minimal modification. Most prior works focus on monolingual knowledge editing in English, even though new information can emerge in any language from any part of the world. We propose the Cross-Lingual Multi-Hop Knowledge Editing paradigm, for measuring and analyzing the performance of various SoTA knowledge editing techniques in a cross-lingual setup. Specifically, we create a parallel cross-lingual benchmark, CROLIN-MQUAKE for measuring the knowledge editing capabilities. Our extensive analysis over various knowledge editing techniques uncover significant gaps in performance between the cross-lingual and English-centric setting. Following this, we propose a significantly improved system for cross-lingual multi-hop knowledge editing, CLEVER-CKE. CLEVER-CKE is based on a retrieve, verify and generate knowledge editing framework, where a retriever is formulated to recall edited facts and support an LLM to adhere to knowledge edits. We develop language-aware and hard-negative based contrastive objectives for improving the cross-lingual and fine-grained fact retrieval and verification process used in this framework. Extensive experiments on three LLMs, eight languages, and two datasets show CLEVER-CKE's significant gains of up to 30% over prior methods.
△ Less
Submitted 15 February, 2025; v1 submitted 14 July, 2024;
originally announced July 2024.
-
Entanglement asymmetry in conformal field theory and holography
Authors:
Francesco Benini,
Victor Godet,
Amartya Harsh Singh
Abstract:
Entanglement asymmetry is a measure of symmetry breaking in quantum subsystems, inspired by quantum information theory, particularly suited to study out-of-equilibrium states. We study the entanglement asymmetry of a class of excited "coherent states" in conformal quantum field theories with a U(1) symmetry, employing Euclidean path-integral methods with topological symmetry defects and the replic…
▽ More
Entanglement asymmetry is a measure of symmetry breaking in quantum subsystems, inspired by quantum information theory, particularly suited to study out-of-equilibrium states. We study the entanglement asymmetry of a class of excited "coherent states" in conformal quantum field theories with a U(1) symmetry, employing Euclidean path-integral methods with topological symmetry defects and the replica formalism. We compute, at leading order in perturbation theory, the asymmetry for a variety of subsystems, including finite spherical subregions in flat space, in finite volume, and at positive temperature. We also study its Lorentzian time evolution, showcasing the dynamical restoration of the symmetry due to thermalization, as well as the presence of a quantum Mpemba effect. Our results are universal, and apply in any number of dimensions. We also show that the perturbative entanglement asymmetry is related to the Fisher information metric, which has a known holographic dual called Hollands-Wald canonical energy, and that it is captured by the AdS bulk charge contained in the entanglement wedge.
△ Less
Submitted 9 June, 2025; v1 submitted 10 July, 2024;
originally announced July 2024.
-
QMViT: A Mushroom is worth 16x16 Words
Authors:
Siddhant Dutta,
Hemant Singh,
Kalpita Shankhdhar,
Sridhar Iyer
Abstract:
Consuming poisonous mushrooms can have severe health consequences, even resulting in fatality and accurately distinguishing edible from toxic mushroom varieties remains a significant challenge in ensuring food safety. So, it's crucial to distinguish between edible and poisonous mushrooms within the existing species. This is essential due to the significant demand for mushrooms in people's daily me…
▽ More
Consuming poisonous mushrooms can have severe health consequences, even resulting in fatality and accurately distinguishing edible from toxic mushroom varieties remains a significant challenge in ensuring food safety. So, it's crucial to distinguish between edible and poisonous mushrooms within the existing species. This is essential due to the significant demand for mushrooms in people's daily meals and their potential contributions to medical science. This work presents a novel Quantum Vision Transformer architecture that leverages quantum computing to enhance mushroom classification performance. By implementing specialized quantum self-attention mechanisms using Variational Quantum Circuits, the proposed architecture achieved 92.33% and 99.24% accuracy based on their category and their edibility respectively. This demonstrates the success of the proposed architecture in reducing false negatives for toxic mushrooms, thus ensuring food safety. Our research highlights the potential of QMViT for improving mushroom classification as a whole.
△ Less
Submitted 10 May, 2024;
originally announced July 2024.
-
Decomposition of Difficulties in Complex Optimization Problems Using a Bilevel Approach
Authors:
Ankur Sinha,
Dhaval Pujara,
Hemant Kumar Singh
Abstract:
Practical optimization problems may contain different kinds of difficulties that are often not tractable if one relies on a particular optimization method. Different optimization approaches offer different strengths that are good at tackling one or more difficulty in an optimization problem. For instance, evolutionary algorithms have a niche in handling complexities like discontinuity, non-differe…
▽ More
Practical optimization problems may contain different kinds of difficulties that are often not tractable if one relies on a particular optimization method. Different optimization approaches offer different strengths that are good at tackling one or more difficulty in an optimization problem. For instance, evolutionary algorithms have a niche in handling complexities like discontinuity, non-differentiability, discreteness and non-convexity. However, evolutionary algorithms may get computationally expensive for mathematically well behaved problems with large number of variables for which classical mathematical programming approaches are better suited. In this paper, we demonstrate a decomposition strategy that allows us to synergistically apply two complementary approaches at the same time on a complex optimization problem. Evolutionary algorithms are useful in this context as their flexibility makes pairing with other solution approaches easy. The decomposition idea is a special case of bilevel optimization that separates the difficulties into two levels and assigns different approaches at each level that is better equipped at handling them. We demonstrate the benefits of the proposed decomposition idea on a wide range of test problems.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Conforming/Non-Conforming Mixed Finite Element Methods for Optimal Control of Velocity-Vorticity-Pressure Formulation for the Oseen Problem with Variable Viscosity
Authors:
Harpal Singh,
Arbaz Khan
Abstract:
This work examines the distributed optimal control of generalized Oseen equations with non-constant viscosity. We propose and analyze a new conforming augmented mixed finite element method and a Discontinuous Galerkin (DG) method for the velocity-vorticity-pressure formulation. The continuous formulation, which incorporates least-squares terms from both the constitutive equation and the incompress…
▽ More
This work examines the distributed optimal control of generalized Oseen equations with non-constant viscosity. We propose and analyze a new conforming augmented mixed finite element method and a Discontinuous Galerkin (DG) method for the velocity-vorticity-pressure formulation. The continuous formulation, which incorporates least-squares terms from both the constitutive equation and the incompressibility condition, is well-posed under certain assumptions on the viscosity parameter. The CG method is divergence-conforming and suits any Stokes inf-sup stable velocity-pressure finite element pair, while a generic discrete space approximates vorticity. The DG scheme employs a stabilization technique, and a piecewise constant discretization estimates the control variable. We establish optimal a priori and residual-based a posteriori error estimates for the proposed schemes. Finally, we provide numerical experiments to showcase the method's performance and effectiveness.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
PopAlign: Population-Level Alignment for Fair Text-to-Image Generation
Authors:
Shufan Li,
Harkanwar Singh,
Aditya Grover
Abstract:
Text-to-image (T2I) models achieve high-fidelity generation through extensive training on large datasets. However, these models may unintentionally pick up undesirable biases of their training data, such as over-representation of particular identities in gender or ethnicity neutral prompts. Existing alignment methods such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference O…
▽ More
Text-to-image (T2I) models achieve high-fidelity generation through extensive training on large datasets. However, these models may unintentionally pick up undesirable biases of their training data, such as over-representation of particular identities in gender or ethnicity neutral prompts. Existing alignment methods such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO) fail to address this problem effectively because they operate on pairwise preferences consisting of individual samples, while the aforementioned biases can only be measured at a population level. For example, a single sample for the prompt "doctor" could be male or female, but a model generating predominantly male doctors even with repeated sampling reflects a gender bias. To address this limitation, we introduce PopAlign, a novel approach for population-level preference optimization, while standard optimization would prefer entire sets of samples over others. We further derive a stochastic lower bound that directly optimizes for individual samples from preferred populations over others for scalable training. Using human evaluation and standard image quality and bias metrics, we show that PopAlign significantly mitigates the bias of pretrained T2I models while largely preserving the generation quality. Code is available at https://github.com/jacklishufan/PopAlignSDXL.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Center and radius of a subset of metric space
Authors:
Akhilesh Badra,
Hemant Kumar Singh
Abstract:
In this paper, we introduce a notion of the center and radius of a subset A of metric space X. In the Euclidean spaces, this notion can be seen as the extension of the center and radius of open/closed balls. The center and radius of a finite product of subsets of metric spaces, and a finite union of subsets of a metric space are also determined. For any subset A of metric space X, there is a natur…
▽ More
In this paper, we introduce a notion of the center and radius of a subset A of metric space X. In the Euclidean spaces, this notion can be seen as the extension of the center and radius of open/closed balls. The center and radius of a finite product of subsets of metric spaces, and a finite union of subsets of a metric space are also determined. For any subset A of metric space X, there is a natural question to identify the open balls of X with the largest radius that are entirely contained in A. To answer this question, we introduce a notion of quasi-center and quasi-radius of a subset A of metric space X. We prove that the center of the largest open balls contained in A belongs to the quasi-center of A, and its radius is equal to the quasi-radius of A. In particular, for the Euclidean spaces, we see that the center of largest open balls contained in A belongs to the center of A, and its radius is equal to the radius of A.
△ Less
Submitted 21 August, 2024; v1 submitted 22 June, 2024;
originally announced June 2024.
-
Low-mass stellar and substellar content of the young cluster Berkeley 59
Authors:
Neelam Panwar,
Rishi C.,
Saurabh Sharma,
Devendra K. Ojha,
Manash R. Samal,
H. P. Singh,
Ram Kesh Yadav
Abstract:
We present a multi-wavelength analysis of the young star cluster Berkeley 59 (Be 59) based on the $Gaia$ data and deep infrared (IR) observations with the 3.58-m Telescopio Nazionale Galileo and $Spitzer$ space telescope. The mean proper motion of the cluster is found to be $μ$$_α$cos$δ$ $\sim$ -0.63 mas yr$^{-1}$ and $μ$$_δ$ $\sim$ -1.83 mas yr$^{-1}$ and the kinematic distance of the cluster,…
▽ More
We present a multi-wavelength analysis of the young star cluster Berkeley 59 (Be 59) based on the $Gaia$ data and deep infrared (IR) observations with the 3.58-m Telescopio Nazionale Galileo and $Spitzer$ space telescope. The mean proper motion of the cluster is found to be $μ$$_α$cos$δ$ $\sim$ -0.63 mas yr$^{-1}$ and $μ$$_δ$ $\sim$ -1.83 mas yr$^{-1}$ and the kinematic distance of the cluster, $\sim$ 1 kpc, is in agreement with previous photometric studies. Present data is the deepest available near-IR observations for the cluster so far and reached below 0.03 M$_\odot$. The mass function of the cluster region is calculated using the statistically cleaned color-magnitude diagram and is similar to the Salpeter value for the member stars above 0.4 M$_\odot$. In contrast, the slope becomes shallower ($Γ$ $\sim$ 0.01 $\pm$ 0.18) in the mass range 0.04 - 0.4 M$_\odot$, comparable to other nearby clusters. The spatial distribution of young brown dwarfs (BDs) and stellar candidates shows a non-homogeneous distribution. This suggests that the radiation feedback from massive stars may be a prominent factor contributing to the BD population in the cluster Be 59. We also estimated the star-to-BD ratio for the cluster, which is found to be $\sim$ 3.6. The Kolomogorov-Smirnov test shows that stellar and BD populations significantly differ, and stellar candidates are near the cluster center compared to the BDs, suggesting mass segregation in the cluster toward the substellar mass regime.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition
Authors:
Shijian Deng,
Erin E. Kosloski,
Siddhi Patel,
Zeke A. Barnett,
Yiyang Nan,
Alexander Kaplan,
Sisira Aarukapalli,
William T. Doan,
Matthew Wang,
Harsh Singh,
Pamela R. Rollins,
Yapeng Tian
Abstract:
In this article, we introduce a novel problem of audio-visual autism behavior recognition, which includes social behavior recognition, an essential aspect previously omitted in AI-assisted autism screening research. We define the task at hand as one that is audio-visual autism behavior recognition, which uses audio and visual cues, including any speech present in the audio, to recognize autism-rel…
▽ More
In this article, we introduce a novel problem of audio-visual autism behavior recognition, which includes social behavior recognition, an essential aspect previously omitted in AI-assisted autism screening research. We define the task at hand as one that is audio-visual autism behavior recognition, which uses audio and visual cues, including any speech present in the audio, to recognize autism-related behaviors. To facilitate this new research direction, we collected an audio-visual autism spectrum dataset (AV-ASD), currently the largest video dataset for autism screening using a behavioral approach. It covers an extensive range of autism-associated behaviors, including those related to social communication and interaction. To pave the way for further research on this new problem, we intensively explored leveraging foundation models and multimodal large language models across different modalities. Our experiments on the AV-ASD dataset demonstrate that integrating audio, visual, and speech modalities significantly enhances the performance in autism behavior recognition. Additionally, we explored the use of a post-hoc to ad-hoc pipeline in a multimodal large language model to investigate its potential to augment the model's explanatory capability during autism behavior recognition. We will release our dataset, code, and pre-trained models.
△ Less
Submitted 22 March, 2024;
originally announced June 2024.
-
Learning to Recover from Plan Execution Errors during Robot Manipulation: A Neuro-symbolic Approach
Authors:
Namasivayam Kalithasan,
Arnav Tuli,
Vishal Bindal,
Himanshu Gaurav Singh,
Parag Singla,
Rohan Paul
Abstract:
Automatically detecting and recovering from failures is an important but challenging problem for autonomous robots. Most of the recent work on learning to plan from demonstrations lacks the ability to detect and recover from errors in the absence of an explicit state representation and/or a (sub-) goal check function. We propose an approach (blending learning with symbolic search) for automated er…
▽ More
Automatically detecting and recovering from failures is an important but challenging problem for autonomous robots. Most of the recent work on learning to plan from demonstrations lacks the ability to detect and recover from errors in the absence of an explicit state representation and/or a (sub-) goal check function. We propose an approach (blending learning with symbolic search) for automated error discovery and recovery, without needing annotated data of failures. Central to our approach is a neuro-symbolic state representation, in the form of dense scene graph, structured based on the objects present within the environment. This enables efficient learning of the transition function and a discriminator that not only identifies failures but also localizes them facilitating fast re-planning via computation of heuristic distance function. We also present an anytime version of our algorithm, where instead of recovering to the last correct state, we search for a sub-goal in the original plan minimizing the total distance to the goal given a re-planning budget. Experiments on a physics simulator with a variety of simulated failures show the effectiveness of our approach compared to existing baselines, both in terms of efficiency as well as accuracy of our recovery mechanism.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Rough geometric integration
Authors:
Ajay Chandra,
Harprit Singh
Abstract:
We introduce a notion of distributional $k$-forms on $d$-dimensional manifolds which can be integrated against suitably regular $k$-submanifolds. Our approach combines ideas from Whitney's geometric integration [Whi57] with those of sewing approaches to rough integration [Gub04, FdLP06].
We introduce a notion of distributional $k$-forms on $d$-dimensional manifolds which can be integrated against suitably regular $k$-submanifolds. Our approach combines ideas from Whitney's geometric integration [Whi57] with those of sewing approaches to rough integration [Gub04, FdLP06].
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Anti-plane segregation and diffusion in dense, bidisperse granular shear flow
Authors:
Harkirat Singh,
David L. Henann
Abstract:
Many dense granular systems are non-monodisperse, consisting of particles of different sizes, and will segregate based on size during flow. This phenomenon is an important aspect of many industrial and geophysical processes, necessitating predictive continuum models. This paper systematically studies a key aspect of the three-dimensional nature of segregation and diffusion in flowing, dense, bidis…
▽ More
Many dense granular systems are non-monodisperse, consisting of particles of different sizes, and will segregate based on size during flow. This phenomenon is an important aspect of many industrial and geophysical processes, necessitating predictive continuum models. This paper systematically studies a key aspect of the three-dimensional nature of segregation and diffusion in flowing, dense, bidisperse granular mixtures -- namely, segregation and diffusion acting along the direction perpendicular to the plane of shearing, which we refer to as the anti-plane modes of segregation and diffusion. To this end, we consider discrete-element method (DEM) simulations of flows of dense, bidisperse mixtures of frictional spheres in an idealized configuration that isolates anti-plane segregation and diffusion. We find that previously-developed constitutive equations, calibrated to DEM simulation results from flows in which both the segregation and diffusion processes occur within the plane of shearing, do not capture aspects of the anti-plane segregation dynamics. Accordingly, we utilize DEM simulation results to inform and calibrate constitutive equations for the segregation and diffusion fluxes in their anti-plane modes. Predictions of the resulting continuum model for the anti-plane segregation dynamics are tested against additional DEM simulation results across different cases, while parameters such as the shear strain rate and mixture composition are varied, and we find that the calibrated model predictions match well with the DEM simulation results. Finally, we suggest a strategy for generalizing the constitutive forms for the segregation and diffusion fluxes to obtain three-dimensional constitutive equations that account for both the in-plane and anti-plane modes of the segregation and diffusion processes.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Emergence of Navier-Stokes hydrodynamics in chaotic quantum circuits
Authors:
Hansveer Singh,
Ewan McCulloch,
Sarang Gopalakrishnan,
Romain Vasseur
Abstract:
We construct an ensemble of two-dimensional nonintegrable quantum circuits that are chaotic but have a conserved particle current, and thus a finite Drude weight. The long-wavelength hydrodynamics of such systems is given by the incompressible Navier-Stokes equations. By analyzing circuit-to-circuit fluctuations in the ensemble we argue that these are negligible, so the circuit-averaged value of t…
▽ More
We construct an ensemble of two-dimensional nonintegrable quantum circuits that are chaotic but have a conserved particle current, and thus a finite Drude weight. The long-wavelength hydrodynamics of such systems is given by the incompressible Navier-Stokes equations. By analyzing circuit-to-circuit fluctuations in the ensemble we argue that these are negligible, so the circuit-averaged value of transport coefficients like the viscosity is also (in the long-time limit) the value in a typical circuit. The circuit-averaged transport coefficients can be mapped onto a classical irreversible Markov process. Therefore, remarkably, our construction allows us to efficiently compute the viscosity of a family of strongly interacting chaotic two-dimensional quantum systems.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Multi-wavelength Photometric Study of RR Lyrae Variables in the Globular Cluster NGC 5272 (Messier 3)
Authors:
Nitesh Kumar,
Anupam Bhardwaj,
Harinder P. Singh,
Marina Rejkuba,
Marcella Marconi,
Philippe Prugniel
Abstract:
We present a comprehensive photometric study of RR Lyrae stars in the M3 globular cluster, utilising a vast dataset of 3140 optical ($UBVRI$) CCD images spanning 35 years from astronomical data archives. We have successfully identified previously known 238 RR Lyrae stars from the photometric data, comprising 178 RRab, 49 RRc, and 11 RRd stars. Multi-band periodogram was used to significantly impro…
▽ More
We present a comprehensive photometric study of RR Lyrae stars in the M3 globular cluster, utilising a vast dataset of 3140 optical ($UBVRI$) CCD images spanning 35 years from astronomical data archives. We have successfully identified previously known 238 RR Lyrae stars from the photometric data, comprising 178 RRab, 49 RRc, and 11 RRd stars. Multi-band periodogram was used to significantly improve the long-term periods of $65\%$ of RR Lyrae stars in our sample, thanks to the unprecedentedly long temporal coverage of the observations. The light curve templates were used to obtain accurate and precise mean magnitudes and amplitudes of all RR Lyrae variables. We combined optical ($UBVRI$) and near-infrared (NIR, $JHK_{s}$) photometry of RR Lyrae variables to investigate their location in the colour-magnitude diagrams as well as the pulsation properties such as period distributions, Bailey diagrams and amplitude ratios. The Period-Luminosity relations in $R$ and $I$ bands and Period-Wesenheit relations were derived after excluding outliers identified in CMDs. The Period-Wesenheit relations calibrated via the theoretically predicted relations were used to determine a distance modulus of $μ= 15.04 \pm 0.04 \,{\rm (stats)} \pm 0.19 \,{\rm {(syst.)}}$ mag (using metal-independent $W_{BV}$ Wesenheit) and $μ= 15.03 \pm 0.04 \,{\rm (stats)} \pm 0.17 \,{\rm {(syst.)}}$ mag (using metal-dependent $W_{VI}$ Wesenheit). Our distance measurements are in excellent agreement with published distances to M3 in the literature. We also employed an artificial neural network based comparison of theoretical and observed light curves to determine physical parameters (mass, luminosity, and effective temperature) for $79$ non-Blazhko RRab stars that agree with limited literature measurements.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Chiral symmetry and Atiyah-Patodi-Singer index theorem for staggered fermions
Authors:
Mendel Nguyen,
Hersh Singh
Abstract:
We consider the Atiyah-Patodi-Singer (APS) index theorem corresponding to the chiral symmetry of a continuum formulation of staggered fermions called Kähler-Dirac fermions, which have been recently investigated as an ingredient in lattice constructions of chiral gauge theories. We point out that there are two notions of chiral symmetry for Kähler-Dirac fermions, both having a mixed perturbative an…
▽ More
We consider the Atiyah-Patodi-Singer (APS) index theorem corresponding to the chiral symmetry of a continuum formulation of staggered fermions called Kähler-Dirac fermions, which have been recently investigated as an ingredient in lattice constructions of chiral gauge theories. We point out that there are two notions of chiral symmetry for Kähler-Dirac fermions, both having a mixed perturbative anomaly with gravity leading to index theorems on closed manifolds. By formulating these theories on a manifold with boundary, we find the APS index theorems corresponding to each of these symmetries, necessary for a complete picture of anomaly inflow, using a recently discovered physics-motivated proof. We comment on a fundamental difference between the nature of these two symmetries by showing that a sensible local, symmetric boundary condition only exists for one of the two symmetries. This sheds light on how these symmetries behave under lattice discretization, and in particular on their use for recent symmetric-mass generation proposals.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages
Authors:
Harman Singh,
Nitish Gupta,
Shikhar Bharadwaj,
Dinesh Tewari,
Partha Talukdar
Abstract:
As large language models (LLMs) see increasing adoption across the globe, it is imperative for LLMs to be representative of the linguistic diversity of the world. India is a linguistically diverse country of 1.4 Billion people. To facilitate research on multilingual LLM evaluation, we release IndicGenBench - the largest benchmark for evaluating LLMs on user-facing generation tasks across a diverse…
▽ More
As large language models (LLMs) see increasing adoption across the globe, it is imperative for LLMs to be representative of the linguistic diversity of the world. India is a linguistically diverse country of 1.4 Billion people. To facilitate research on multilingual LLM evaluation, we release IndicGenBench - the largest benchmark for evaluating LLMs on user-facing generation tasks across a diverse set 29 of Indic languages covering 13 scripts and 4 language families. IndicGenBench is composed of diverse generation tasks like cross-lingual summarization, machine translation, and cross-lingual question answering. IndicGenBench extends existing benchmarks to many Indic languages through human curation providing multi-way parallel evaluation data for many under-represented Indic languages for the first time. We evaluate a wide range of proprietary and open-source LLMs including GPT-3.5, GPT-4, PaLM-2, mT5, Gemma, BLOOM and LLaMA on IndicGenBench in a variety of settings. The largest PaLM-2 models performs the best on most tasks, however, there is a significant performance gap in all languages compared to English showing that further research is needed for the development of more inclusive multilingual language models. IndicGenBench is released at www.github.com/google-research-datasets/indic-gen-bench
△ Less
Submitted 7 August, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models
Authors:
Shashi Kant Gupta,
Aditya Basu,
Mauro Nievas,
Jerrin Thomas,
Nathan Wolfrath,
Adhitya Ramamurthi,
Bradley Taylor,
Anai N. Kothari,
Regina Schwind,
Therica M. Miller,
Sorena Nadaf-Rahrov,
Yanshan Wang,
Hrituraj Singh
Abstract:
Clinical trial matching is the task of identifying trials for which patients may be potentially eligible. Typically, this task is labor-intensive and requires detailed verification of patient electronic health records (EHRs) against the stringent inclusion and exclusion criteria of clinical trials. This process is manual, time-intensive, and challenging to scale up, resulting in many patients miss…
▽ More
Clinical trial matching is the task of identifying trials for which patients may be potentially eligible. Typically, this task is labor-intensive and requires detailed verification of patient electronic health records (EHRs) against the stringent inclusion and exclusion criteria of clinical trials. This process is manual, time-intensive, and challenging to scale up, resulting in many patients missing out on potential therapeutic options. Recent advancements in Large Language Models (LLMs) have made automating patient-trial matching possible, as shown in multiple concurrent research studies. However, the current approaches are confined to constrained, often synthetic datasets that do not adequately mirror the complexities encountered in real-world medical data. In this study, we present the first, end-to-end large-scale empirical evaluation of clinical trial matching using real-world EHRs. Our study showcases the capability of LLMs to accurately match patients with appropriate clinical trials. We perform experiments with proprietary LLMs, including GPT-4 and GPT-3.5, as well as our custom fine-tuned model called OncoLLM and show that OncoLLM, despite its significantly smaller size, not only outperforms GPT-3.5 but also matches the performance of qualified medical doctors. All experiments were carried out on real-world EHRs that include clinical notes and available clinical trials from a single cancer center in the United States.
△ Less
Submitted 26 April, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
Matrix Multiplication Reductions
Authors:
Ashish Gola,
Igor Shinkar,
Harsimran Singh
Abstract:
In this paper we study a worst case to average case reduction for the problem of matrix multiplication over finite fields. Suppose we have an efficient average case algorithm, that given two random matrices $A,B$ outputs a matrix that has a non-trivial correlation with their product $A \cdot B$. Can we transform it into a worst case algorithm, that outputs the correct answer for all inputs without…
▽ More
In this paper we study a worst case to average case reduction for the problem of matrix multiplication over finite fields. Suppose we have an efficient average case algorithm, that given two random matrices $A,B$ outputs a matrix that has a non-trivial correlation with their product $A \cdot B$. Can we transform it into a worst case algorithm, that outputs the correct answer for all inputs without incurring a significant overhead in the running time? We present two results in this direction.
(1) Two-sided error in the high agreement regime: We begin with a brief remark about a reduction for high agreement algorithms, i.e., an algorithm which agrees with the correct output on a large (say $>0.9$) fraction of entries, and show that the standard self-correction of linearity allows us to transform such algorithms into algorithms that work in worst case.
(2) One-sided error in the low agreement regime: Focusing on average case algorithms with one-sided error, we show that over $\mathbb{F}_2$ there is a reduction that gets an $O(T)$ time average case algorithm that given a random input $A,B$ outputs a matrix that agrees with $A \cdot B$ on at least $51\%$ of the entries (i.e., has only a slight advantage over the trivial algorithm), and transforms it into an $\widetilde{O}(T)$ time worst case algorithm, that outputs the correct answer for all inputs with high probability.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Sketch-Plan-Generalize: Learning and Planning with Neuro-Symbolic Programmatic Representations for Inductive Spatial Concepts
Authors:
Namasivayam Kalithasan,
Sachit Sachdeva,
Himanshu Gaurav Singh,
Vishal Bindal,
Arnav Tuli,
Gurarmaan Singh Panjeta,
Harsh Himanshu Vora,
Divyanshu Aggarwal,
Rohan Paul,
Parag Singla
Abstract:
Effective human-robot collaboration requires the ability to learn personalized concepts from a limited number of demonstrations, while exhibiting inductive generalization, hierarchical composition, and adaptability to novel constraints. Existing approaches that use code generation capabilities of pre-trained large (vision) language models as well as purely neural models show poor generalization to…
▽ More
Effective human-robot collaboration requires the ability to learn personalized concepts from a limited number of demonstrations, while exhibiting inductive generalization, hierarchical composition, and adaptability to novel constraints. Existing approaches that use code generation capabilities of pre-trained large (vision) language models as well as purely neural models show poor generalization to \emph{a-priori} unseen complex concepts. Neuro-symbolic methods (Grand et al., 2023) offer a promising alternative by searching in program space, but face challenges in large program spaces due to the inability to effectively guide the search using demonstrations. Our key insight is to factor inductive concept learning as: (i) {\it Sketch:} detecting and inferring a coarse signature of a new concept (ii) {\it Plan:} performing an MCTS search over grounded action sequences guided by human demonstrations (iii) {\it Generalize:} abstracting out grounded plans as inductive programs. Our pipeline facilitates generalization and modular re-use, enabling continual concept learning. Our approach combines the benefits of code generation ability of large language models (LLMs) along with grounded neural representations, resulting in neuro-symbolic programs that show stronger inductive generalization on the task of constructing complex structures vis-á-vis LLM-only and purely neural approaches. Further, we demonstrate reasoning and planning capabilities with learned concepts for embodied instruction following.
△ Less
Submitted 17 June, 2025; v1 submitted 11 April, 2024;
originally announced April 2024.
-
Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology
Authors:
Shashi Kant Gupta,
Aditya Basu,
Bradley Taylor,
Anai Kothari,
Hrituraj Singh
Abstract:
Retrieving information from EHR systems is essential for answering specific questions about patient journeys and improving the delivery of clinical care. Despite this fact, most EHR systems still rely on keyword-based searches. With the advent of generative large language models (LLMs), retrieving information can lead to better search and summarization capabilities. Such retrievers can also feed R…
▽ More
Retrieving information from EHR systems is essential for answering specific questions about patient journeys and improving the delivery of clinical care. Despite this fact, most EHR systems still rely on keyword-based searches. With the advent of generative large language models (LLMs), retrieving information can lead to better search and summarization capabilities. Such retrievers can also feed Retrieval-augmented generation (RAG) pipelines to answer any query. However, the task of retrieving information from EHR real-world clinical data contained within EHR systems in order to solve several downstream use cases is challenging due to the difficulty in creating query-document support pairs. We provide a blueprint for creating such datasets in an affordable manner using large language models. Our method results in a retriever that is 30-50 F-1 points better than propriety counterparts such as Ada and Mistral for oncology data elements. We further compare our model, called Onco-Retriever, against fine-tuned PubMedBERT model as well. We conduct an extensive manual evaluation on real-world EHR data along with latency analysis of the different models and provide a path forward for healthcare organizations to build domain-specific retrievers.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Effective Luttinger parameter and Kane-Fisher effect in quasiperiodic systems
Authors:
T. J. Vongkovit,
Hansveer Singh,
Romain Vasseur,
Sarang Gopalakrishnan
Abstract:
The ground states of interacting one-dimensional metals are generically Luttinger liquids. Luttinger liquid theory is usually considered for translation invariant systems. The Luttinger liquid description remains valid for weak quasiperiodic modulations; however, as the quasiperiodic modulation gets increasingly strong, it is increasingly renormalized and eventually fails, as the system becomes lo…
▽ More
The ground states of interacting one-dimensional metals are generically Luttinger liquids. Luttinger liquid theory is usually considered for translation invariant systems. The Luttinger liquid description remains valid for weak quasiperiodic modulations; however, as the quasiperiodic modulation gets increasingly strong, it is increasingly renormalized and eventually fails, as the system becomes localized. We explore how quasiperiodic modulation renormalizes the Luttinger parameter characterizing this emergent Luttinger liquid, using the renormalization of transmission coefficients across a barrier as a proxy that remains valid for general quasiperiodic modulation. We find, unexpectedly, that quasiperiodic modulation weakens the effects of short-range interactions, but enhances those of long-range interactions. We support the former finding with matrix-product numerics. We also discuss how interactions affect the localization phase boundary.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Authors:
Elita Lobo,
Harvineet Singh,
Marek Petrik,
Cynthia Rudin,
Himabindu Lakkaraju
Abstract:
Off-policy Evaluation (OPE) methods are a crucial tool for evaluating policies in high-stakes domains such as healthcare, where exploration is often infeasible, unethical, or expensive. However, the extent to which such methods can be trusted under adversarial threats to data quality is largely unexplored. In this work, we make the first attempt at investigating the sensitivity of OPE methods to m…
▽ More
Off-policy Evaluation (OPE) methods are a crucial tool for evaluating policies in high-stakes domains such as healthcare, where exploration is often infeasible, unethical, or expensive. However, the extent to which such methods can be trusted under adversarial threats to data quality is largely unexplored. In this work, we make the first attempt at investigating the sensitivity of OPE methods to marginal adversarial perturbations to the data. We design a generic data poisoning attack framework leveraging influence functions from robust statistics to carefully construct perturbations that maximize error in the policy value estimates. We carry out extensive experimentation with multiple healthcare and control datasets. Our results demonstrate that many existing OPE methods are highly prone to generating value estimates with large errors when subject to data poisoning attacks, even for small adversarial perturbations. These findings question the reliability of policy values derived using OPE methods and motivate the need for developing OPE methods that are statistically robust to train-time data poisoning attacks.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Stellar Variability and Distance Indicators in the Near-infrared in Nearby Galaxies. I. RR Lyrae and Anomalous Cepheids in Draco dwarf spheroidal
Authors:
Anupam Bhardwaj,
Marina Rejkuba,
Chow-Choong Ngeow,
Marcella Marconi,
Vincenzo Ripepi,
Abhinna Sundar Samantaray,
Harinder P. Singh
Abstract:
Draco dwarf Spheroidal galaxy (dSph) is one of the nearest and the most dark matter dominated satellites of the Milky Way. We obtained multi-epoch near-infrared (NIR, $JHK_s$) observations of the central region of Draco dSph covering a sky area of $\sim 21'\times21'$ using the WIRCam instrument at the 3.6-m Canada-France-Hawaii Telescope. Homogeneous $JHK_s$ time-series photometry for 212 RR Lyrae…
▽ More
Draco dwarf Spheroidal galaxy (dSph) is one of the nearest and the most dark matter dominated satellites of the Milky Way. We obtained multi-epoch near-infrared (NIR, $JHK_s$) observations of the central region of Draco dSph covering a sky area of $\sim 21'\times21'$ using the WIRCam instrument at the 3.6-m Canada-France-Hawaii Telescope. Homogeneous $JHK_s$ time-series photometry for 212 RR Lyrae (173 fundamental-mode, 24 first-overtone, and 15 mixed-mode variables) and 5 Anomalous Cepheids in Draco dSph is presented and used to derive their period-luminosity relations at NIR wavelengths for the first-time. The small scatter of $\sim 0.05$~mag in these empirical relations for RR Lyrae stars is consistent with those in globular clusters and suggests a very small metallicity spread, up to $\sim0.2$~dex, among these centrally located variables. Based on empirically calibrated NIR period-luminosity-metallicity relations for RR Lyrae in globular clusters, we determined a distance modulus to Draco dSph of $μ_\textrm{RRL} = 19.557 \pm 0.026$ mag. The calibrated $K_s$-band period-luminosity relations for Anomalous Cepheids in the Draco dSph and the Large Magellanic Cloud exhibit statistically consistent slopes but systematically different zero-points, hinting at possible metallicity dependence of $\sim-0.3$ mag~dex$^{-1}$. Finally, the apparent magnitudes of the tip of the red giant branch in $I$ and $J$ bands also agree well with their absolute calibrations with the adopted RR Lyrae distance to Draco. Our recommended $\sim1.5\%$ precise RR Lyrae distance, $D_\textrm{Draco} = 81.55 \pm 0.98 \textrm{(statistical)} \pm 1.17 \textrm{(systematic)}$~kpc, is the most accurate and precise distance to Draco dSph galaxy.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Towards Long Term SLAM on Thermal Imagery
Authors:
Colin Keil,
Aniket Gupta,
Pushyami Kaveti,
Hanumant Singh
Abstract:
Visual SLAM with thermal imagery, and other low contrast visually degraded environments such as underwater, or in areas dominated by snow and ice, remain a difficult problem for many state of the art (SOTA) algorithms. In addition to challenging front-end data association, thermal imagery presents an additional difficulty for long term relocalization and map reuse. The relative temperatures of obj…
▽ More
Visual SLAM with thermal imagery, and other low contrast visually degraded environments such as underwater, or in areas dominated by snow and ice, remain a difficult problem for many state of the art (SOTA) algorithms. In addition to challenging front-end data association, thermal imagery presents an additional difficulty for long term relocalization and map reuse. The relative temperatures of objects in thermal imagery change dramatically from day to night. Feature descriptors typically used for relocalization in SLAM are unable to maintain consistency over these diurnal changes. We show that learned feature descriptors can be used within existing Bag of Word based localization schemes to dramatically improve place recognition across large temporal gaps in thermal imagery. In order to demonstrate the effectiveness of our trained vocabulary, we have developed a baseline SLAM system, integrating learned features and matching into a classical SLAM algorithm. Our system demonstrates good local tracking on challenging thermal imagery, and relocalization that overcomes dramatic day to night thermal appearance changes. Our code and datasets are available here: https://github.com/neufieldrobotics/IRSLAM_Baseline
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Divergence conforming DG method for the optimal control of the Oseen equation with variable viscosity
Authors:
Harpal Singh,
Arbaz Khan
Abstract:
This study introduces the divergence-conforming discontinuous Galerkin finite element method (DGFEM) for numerically approximating optimal control problems with distributed constraints, specifically those governed by stationary generalized Oseen equations. We provide optimal a priori error estimates in energy norms for such problems using the divergence-conforming DGFEM approach. Moreover, we thor…
▽ More
This study introduces the divergence-conforming discontinuous Galerkin finite element method (DGFEM) for numerically approximating optimal control problems with distributed constraints, specifically those governed by stationary generalized Oseen equations. We provide optimal a priori error estimates in energy norms for such problems using the divergence-conforming DGFEM approach. Moreover, we thoroughly analyze $L^2$ error estimates for scenarios dominated by diffusion and convection. Additionally, we establish the new reliable and efficient a posteriori error estimators for the optimal control of the Oseen equation with variable viscosity. Theoretical findings are validated through numerical experiments conducted in both two and three dimensions.
△ Less
Submitted 23 March, 2024;
originally announced March 2024.
-
Hawking radiation from a hot NUT-Kerr-Newman-Kasuya-Anti-de Sitter black hole
Authors:
Heisnam Shanjit Singh,
Chiranjeeb Singha,
Sraban Kumar Upadhyaya
Abstract:
In this work, we investigate the tunneling phenomenon of a charged Dirac particle emerging from a thermal horizon of a hot NUT-Kerr-Newman-Kasuya-Anti-de Sitter (HNKNK-AdS) black hole. Considering the tunneling formalism, we report the Hawking temperature of the charged Dirac particle through the horizon and the heat capacity of the HNKNK-AdS black hole. It is found that the effective Hawking temp…
▽ More
In this work, we investigate the tunneling phenomenon of a charged Dirac particle emerging from a thermal horizon of a hot NUT-Kerr-Newman-Kasuya-Anti-de Sitter (HNKNK-AdS) black hole. Considering the tunneling formalism, we report the Hawking temperature of the charged Dirac particle through the horizon and the heat capacity of the HNKNK-AdS black hole. It is found that the effective Hawking temperature for the black hole depends on the black hole's parameters, such as electric and magnetic charges, magnetic mass, the cosmological constant, and the angular momentum of the black hole. The Hawking radiation for the Kerr-Newman black hole is reduced. It is shown that there is a small correction to the Hawking thermal spectrum. The heat capacity for the HNKNK-AdS black hole is found different from the previous work, and for the Kerr-Newman black hole is also reduced.
△ Less
Submitted 23 March, 2025; v1 submitted 20 March, 2024;
originally announced March 2024.
-
On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine
Authors:
Jagatpreet Singh Nir,
Dennis Giaya,
Hanumant Singh
Abstract:
Deep learning techniques have significantly advanced in providing accurate visual odometry solutions by leveraging large datasets. However, generating uncertainty estimates for these methods remains a challenge. Traditional sensor fusion approaches in a Bayesian framework are well-established, but deep learning techniques with millions of parameters lack efficient methods for uncertainty estimatio…
▽ More
Deep learning techniques have significantly advanced in providing accurate visual odometry solutions by leveraging large datasets. However, generating uncertainty estimates for these methods remains a challenge. Traditional sensor fusion approaches in a Bayesian framework are well-established, but deep learning techniques with millions of parameters lack efficient methods for uncertainty estimation.
This paper addresses the issue of uncertainty estimation for pre-trained deep-learning models in monocular visual odometry. We propose formulating a factor graph on an implicit layer of the deep learning network to recover relative covariance estimates, which allows us to determine the covariance of the Visual Odometry (VO) solution. We showcase the consistency of the deep learning engine's covariance approximation with an empirical analysis of the covariance model on the EUROC datasets to demonstrate the correctness of our formulation.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Computational Study on the Impact of Gasoline-Ethanol Blending on Autoignition and Soot/NOx Emissions under Gasoline Compression Ignition Conditions
Authors:
Krishna C. Kalvakala,
Harsimran Singh,
Pinaki Pal,
Jorge P. Gonzalez,
Christopher P. Kolodziej,
Suresh K. Aggarwal
Abstract:
Computational fluid dynamics (CFD) simulations of a single-cylinder gasoline compression ignition engine are performed to investigate the impact of gasoline-ethanol blending on autoignition, nitrogen oxide (NOx), and soot emissions under low-load conditions. A four-component toluene primary reference fuel (TPRF) + ethanol (ETPRF) surrogate (with 10% ethanol by volume; E10) is employed to represent…
▽ More
Computational fluid dynamics (CFD) simulations of a single-cylinder gasoline compression ignition engine are performed to investigate the impact of gasoline-ethanol blending on autoignition, nitrogen oxide (NOx), and soot emissions under low-load conditions. A four-component toluene primary reference fuel (TPRF) + ethanol (ETPRF) surrogate (with 10% ethanol by volume; E10) is employed to represent the test gasoline (RD5-87). A 3D engine CFD model employing finite-rate chemistry with a skeletal kinetic mechanism, adaptive mesh refinement (AMR), and hybrid method of moments (HMOM) is adopted to capture in-cylinder combustion and soot/NOx emissions. The engine CFD model is validated against experimental data for three gasoline-ethanol blends: E10, E30 and E100, with varying ethanol content by volume. Model validation is carried out for multiple start-of-injection (SOI) timings (-21, -27, -36, and -45 crank angle degrees after top-dead-center (aTDC)) with respect to in-cylinder pressure, heat release rate, combustion phasing, NOx and soot emissions. For late injection timings (-21 and -27oaTDC), E30 yields higher soot than E10; while the trend reverses for early injection cases (-36 and -45oaTDC). E100 yields the lowest amount of soot among all fuels irrespective of SOI timing. Further, E10 shows a non-monotonic trend in soot emissions with SOI timing: SOI-36>SOI-45>SOI-21>SOI-27, while soot emissions from E30 exhibit monotonic decrease with advancing SOI timing. NOx emissions from various fuels follow a trend of E10>E30>E100. NOx emissions increase as SOI timing is advanced for all fuels, with an anomaly for E10 and E100 where NOx decreases when SOI is advanced beyond -36oaTDC. Detailed analysis of the numerical results is performed to investigate the emission trends and elucidate the impact of chemical composition and physical properties on autoignition and emissions characteristics.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
NeuFlow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge Devices
Authors:
Zhiyong Zhang,
Huaizu Jiang,
Hanumant Singh
Abstract:
Real-time high-accuracy optical flow estimation is a crucial component in various applications, including localization and mapping in robotics, object tracking, and activity recognition in computer vision. While recent learning-based optical flow methods have achieved high accuracy, they often come with heavy computation costs. In this paper, we propose a highly efficient optical flow architecture…
▽ More
Real-time high-accuracy optical flow estimation is a crucial component in various applications, including localization and mapping in robotics, object tracking, and activity recognition in computer vision. While recent learning-based optical flow methods have achieved high accuracy, they often come with heavy computation costs. In this paper, we propose a highly efficient optical flow architecture, called NeuFlow, that addresses both high accuracy and computational cost concerns. The architecture follows a global-to-local scheme. Given the features of the input images extracted at different spatial resolutions, global matching is employed to estimate an initial optical flow on the 1/16 resolution, capturing large displacement, which is then refined on the 1/8 resolution with lightweight CNN layers for better accuracy. We evaluate our approach on Jetson Orin Nano and RTX 2080 to demonstrate efficiency improvements across different computing platforms. We achieve a notable 10x-80x speedup compared to several state-of-the-art methods, while maintaining comparable accuracy. Our approach achieves around 30 FPS on edge computing platforms, which represents a significant breakthrough in deploying complex computer vision tasks such as SLAM on small robots like drones. The full training and evaluation code is available at https://github.com/neufieldrobotics/NeuFlow.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Recent Advances, Applications, and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2023 Symposium
Authors:
Hyewon Jeong,
Sarah Jabbour,
Yuzhe Yang,
Rahul Thapta,
Hussein Mozannar,
William Jongwon Han,
Nikita Mehandru,
Michael Wornow,
Vladislav Lialin,
Xin Liu,
Alejandro Lozano,
Jiacheng Zhu,
Rafal Dariusz Kocielnik,
Keith Harrigian,
Haoran Zhang,
Edward Lee,
Milos Vukadinovic,
Aparna Balagopalan,
Vincent Jeanselme,
Katherine Matton,
Ilker Demirel,
Jason Fries,
Parisa Rashidi,
Brett Beaulieu-Jones,
Xuhai Orson Xu
, et al. (18 additional authors not shown)
Abstract:
The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the \ac{ML4H} community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four vir…
▽ More
The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the \ac{ML4H} community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four virtual roundtables at ML4H 2022. The organization of the research roundtables at the conference involved 17 Senior Chairs and 19 Junior Chairs across 11 tables. Each roundtable session included invited senior chairs (with substantial experience in the field), junior chairs (responsible for facilitating the discussion), and attendees from diverse backgrounds with interest in the session's topic. Herein we detail the organization process and compile takeaways from these roundtable discussions, including recent advances, applications, and open challenges for each topic. We conclude with a summary and lessons learned across all roundtables. This document serves as a comprehensive review paper, summarizing the recent advancements in machine learning for healthcare as contributed by foremost researchers in the field.
△ Less
Submitted 5 April, 2024; v1 submitted 3 March, 2024;
originally announced March 2024.