Search | arXiv e-print repository

Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation

Authors: Mohammad Shoaib Ansari, Mohd Sohail Ali Khan, Shubham Revankar, Aditya Varma, Anil S. Mokhade

Abstract: This research paper investigates the application of Large Language Models (LLMs) in healthcare, specifically focusing on enhancing medical decision support through Retrieval-Augmented Generation (RAG) integrated with hospital-specific data and fine-tuning using Quantized Low-Rank Adaptation (QLoRA). The system utilizes Llama 3.2-3B-Instruct as its foundation model. By embedding and retrieving cont… ▽ More This research paper investigates the application of Large Language Models (LLMs) in healthcare, specifically focusing on enhancing medical decision support through Retrieval-Augmented Generation (RAG) integrated with hospital-specific data and fine-tuning using Quantized Low-Rank Adaptation (QLoRA). The system utilizes Llama 3.2-3B-Instruct as its foundation model. By embedding and retrieving context-relevant healthcare information, the system significantly improves response accuracy. QLoRA facilitates notable parameter efficiency and memory optimization, preserving the integrity of medical information through specialized quantization techniques. Our research also shows that our model performs relatively well on various medical benchmarks, indicating that it can be used to make basic medical suggestions. This paper details the system's technical components, including its architecture, quantization methods, and key healthcare applications such as enhanced disease prediction from patient symptoms and medical history, treatment suggestions, and efficient summarization of complex medical reports. We touch on the ethical considerations-patient privacy, data security, and the need for rigorous clinical validation-as well as the practical challenges of integrating such systems into real-world healthcare workflows. Furthermore, the lightweight quantized weights ensure scalability and ease of deployment even in low-resource hospital environments. Finally, the paper concludes with an analysis of the broader impact of LLMs on healthcare and outlines future directions for LLMs in medical settings. △ Less

Submitted 6 May, 2025; originally announced May 2025.

Comments: 12 pages

arXiv:2502.02462 [pdf, other]

Hydroelastic scattering and trapping of microswimmers

Authors: Sagnik Garai, Ursy Makanga, Akhil Varma, Christina Kurzthaler

Abstract: Deformable boundaries are omnipresent in the habitats of swimming microorganisms, leading to intricate hydroelastic couplings. Employing a perturbation theory, valid for small deformations, we study the swimming dynamics of pushers and pullers near instantaneously deforming boundaries, endowed with a bending rigidity and surface tension. Our results reveal that pushers can both reorient away from… ▽ More Deformable boundaries are omnipresent in the habitats of swimming microorganisms, leading to intricate hydroelastic couplings. Employing a perturbation theory, valid for small deformations, we study the swimming dynamics of pushers and pullers near instantaneously deforming boundaries, endowed with a bending rigidity and surface tension. Our results reveal that pushers can both reorient away from the boundary, leading to overall hydroelastic scattering, or become trapped by the boundary, akin to the enhanced trapping found for pullers. These findings demonstrate that the complex hydroelastic interactions can generate behaviors that are in striking contrast to swimming near planar walls. △ Less

Submitted 4 February, 2025; originally announced February 2025.

arXiv:2501.08226 [pdf, other]

Efficient Deep Learning-based Forward Solvers for Brain Tumor Growth Models

Authors: Zeineb Haouari, Jonas Weidner, Ivan Ezhov, Aswathi Varma, Daniel Rueckert, Bjoern Menze, Benedikt Wiestler

Abstract: Glioblastoma, a highly aggressive brain tumor, poses major challenges due to its poor prognosis and high morbidity rates. Partial differential equation-based models offer promising potential to enhance therapeutic outcomes by simulating patient-specific tumor behavior for improved radiotherapy planning. However, model calibration remains a bottleneck due to the high computational demands of optimi… ▽ More Glioblastoma, a highly aggressive brain tumor, poses major challenges due to its poor prognosis and high morbidity rates. Partial differential equation-based models offer promising potential to enhance therapeutic outcomes by simulating patient-specific tumor behavior for improved radiotherapy planning. However, model calibration remains a bottleneck due to the high computational demands of optimization methods like Monte Carlo sampling and evolutionary algorithms. To address this, we recently introduced an approach leveraging a neural forward solver with gradient-based optimization to significantly reduce calibration time. This approach requires a highly accurate and fully differentiable forward model. We investigate multiple architectures, including (i) an enhanced TumorSurrogate, (ii) a modified nnU-Net, and (iii) a 3D Vision Transformer (ViT). The optimized TumorSurrogate achieved the best overall results, excelling in both tumor outline matching and voxel-level prediction of tumor cell concentration. It halved the MSE relative to the baseline model and achieved the highest Dice score across all tumor cell concentration thresholds. Our study demonstrates significant enhancement in forward solver performance and outlines important future research directions. △ Less

Submitted 14 January, 2025; originally announced January 2025.

arXiv:2412.21085 [pdf, other]

Chaos-Driven Quantum State Discrimination Near Unit Fidelity

Authors: Sourav Paul, Anant Vijay Varma, Yogesh N. Joglekar, Sourin Das

Abstract: Distinguishing quantum states becomes exponentially difficult as their fidelity approaches unity, with diminishing success probabilities. This study revisits chaotic dynamics, leveraging their extreme sensitivity to initial conditions for rapid amplification of state discrimination measures. The discrete-time chaotic evolution of qubit states is generated via iterative application of a nonlinear c… ▽ More Distinguishing quantum states becomes exponentially difficult as their fidelity approaches unity, with diminishing success probabilities. This study revisits chaotic dynamics, leveraging their extreme sensitivity to initial conditions for rapid amplification of state discrimination measures. The discrete-time chaotic evolution of qubit states is generated via iterative application of a nonlinear conformal map on the Julia set. The "quantum microscope" is characterized by a magnification power quantified through a temporal Bell-type inequality. Fixed points of the conformal map are shown to dictate optimal measurement operators, enabling (a) well-defined magnification power and (b) bounded Bell-type inequality values, providing a device-independent framework for self-testing the microscopes performance. △ Less

Submitted 30 December, 2024; originally announced December 2024.

Comments: 12 pages, 9 figures

arXiv:2409.00952 [pdf, other]

doi 10.1103/PhysRevLett.134.053201

Many-body adiabatic passage: Instability, chaos, and quantum classical correspondence

Authors: Anant Vijay Varma, Amichay Vardi, Doron Cohen

Abstract: Adiabatic passage in systems of interacting bosons is substantially affected by interactions and inter-particle entanglement. We consider STIRAP-like schemes in Bose-Hubbard chains that exhibit low-dimensional chaos (a 3 site chain), and high-dimensional chaos (more than 3 sites). The dynamics that is generated by a transfer protocol exhibits striking classical and quantum chaos fingerprints that… ▽ More Adiabatic passage in systems of interacting bosons is substantially affected by interactions and inter-particle entanglement. We consider STIRAP-like schemes in Bose-Hubbard chains that exhibit low-dimensional chaos (a 3 site chain), and high-dimensional chaos (more than 3 sites). The dynamics that is generated by a transfer protocol exhibits striking classical and quantum chaos fingerprints that are manifest in the mean-field classical treatment, in the truncated-Wigner semiclassical treatment, and in the full many-body quantum simulations. △ Less

Submitted 6 February, 2025; v1 submitted 2 September, 2024; originally announced September 2024.

Comments: 14 pages, 14 figures, including SM

Journal ref: Phys. Rev. Lett. 134, 053201 (2025)

arXiv:2408.17345 [pdf, ps, other]

Dimensional confinement and superdiffusive rotational motion of uniaxial colloids in the presence of cylindrical obstacles

Authors: Vikki Anand Varma, Sujin B Babu

Abstract: In biological system like cell the macromolecules which are anisotropic particles diffuse in a crowded medium. In the present work we have studied the diffusion of spheroidal particles diffusing between cylindrical obstacles by varying the density of the obstacles as well as the spheroidal particles. Analytical calculation of the free energy showed that the orientational vector of a single oblate… ▽ More In biological system like cell the macromolecules which are anisotropic particles diffuse in a crowded medium. In the present work we have studied the diffusion of spheroidal particles diffusing between cylindrical obstacles by varying the density of the obstacles as well as the spheroidal particles. Analytical calculation of the free energy showed that the orientational vector of a single oblate particle will be aligned perpendicular and a prolate particle will be aligned parallel to the symmetry axis of the cylindrical obstacles in equilibrium. The nematic transition of the system with and without obstacle remained the same, but in the case of obstacles the nematic vector of the spheroid system always remained parallel to the cylindrical axis. The component of the translational diffusion coefficient of the spheroidal particle perpendicular to the axis of the cylinder is calculated for isotropic system which agrees with analytical calculation. When the cylinders overlap such that the spheroidal particles can only diffuse along the direction parallel to the axis of the cylinder we could observe dimensional confinement. This was observed by the discontinuous fall of the diffusion coefficient, when plotted against the chemical potential both for single particle as well as for finite volume fraction. The rotational diffusion coefficient quickly reached the bulk value as the distance between the obstacle increased in the isotropic phase. In the nematic phase the rotational motion of the spheroid should be arrested. We observed that even though the entire system remained in the nematic phase the oblate particle close to the cylinder underwent flipping motion. The consequence is that when the rotational mean squared displacement was calculated it showed a super-diffusive behavior even though the orientational self correlation function never relaxed to zero. △ Less

Submitted 30 August, 2024; originally announced August 2024.

Comments: 13 pages, 14 figures

arXiv:2408.05045 [pdf, other]

Weak-inertial effects on destabilized receding contact lines

Authors: Akhil Varma

Abstract: It is known that beyond a critical speed, the straight contact line of a partially-wetting liquid destabilizes into a corner. In one of the earliest theoretical works exploring this phenomenon, [L. Limat and H. A. Stone, Europhys. Lett. 65(3), 2004] elicited a self-similar conical structure of the interface in the viscous regime. However, noting that inertia is not expected to be negligible at con… ▽ More It is known that beyond a critical speed, the straight contact line of a partially-wetting liquid destabilizes into a corner. In one of the earliest theoretical works exploring this phenomenon, [L. Limat and H. A. Stone, Europhys. Lett. 65(3), 2004] elicited a self-similar conical structure of the interface in the viscous regime. However, noting that inertia is not expected to be negligible at contact line speeds close to, and beyond the critical value for many common liquids, we provide the leading-order inertial correction to their solution. In particular, we find the self-similar corrections to the interface shape as well as the flow-field, and also determine their scaling with the capillary number. We find that inertia invariably modifies the interface into a cusp-like shape with an increased film thickness. Furthermore, when incorporating contact line dynamics into the model, resulting in a narrowing of the corner as the contact line speed increases, we still observe an overall increase in the inertial contribution with speed despite the increased confinement. △ Less

Submitted 9 August, 2024; originally announced August 2024.

Comments: Accepted in Physical Review Fluids

arXiv:2408.03158 [pdf, other]

Inner core heterogeneity induced by a large variation in lower mantle heat flux

Authors: Aditya Varma, Binod Sreenivasan

Abstract: Seismic mapping of the top of the inner core indicates two distinct areas of high P-wave velocity, the stronger one located beneath Asia, and the other located beneath the Atlantic. This two-fold pattern supports the idea that a lower-mantle heterogeneity can be transmitted to the inner core through outer core convection. In this study, a two-component convective dynamo model, where thermal convec… ▽ More Seismic mapping of the top of the inner core indicates two distinct areas of high P-wave velocity, the stronger one located beneath Asia, and the other located beneath the Atlantic. This two-fold pattern supports the idea that a lower-mantle heterogeneity can be transmitted to the inner core through outer core convection. In this study, a two-component convective dynamo model, where thermal convection is near critical and compositional convection is strongly supercritical, produces a substantial inner core heterogeneity in the rapidly rotating strongly driven regime of Earth's core. While the temperature profile that models secular cooling ensures that the mantle heterogeneity propagates as far as the inner core boundary (ICB), the distribution of heat flux at the ICB is determined by the strength of compositional buoyancy. A large heat flux variation $q^*$ of $O(10)$ at the core-mantle boundary (CMB), where $q^*$ is the ratio of the maximum heat flux difference to the mean heat flux at the CMB, produces a core flow regime of long-lived convection in the east and time-varying convection in the west. Here, the P-wave velocity estimated from the ICB heat flux in the dynamo is higher in the east than in the west, with the hemispherical difference of the same order as the observed lower bound, 0.5%. Additional observational constraints are satisfied in this regime -- the variability of high-latitude magnetic flux in the east is lower than that in the west; and the stratified F-layer at the base of the outer core, which is fed by the mass flux from regional melting of the inner core and magnetically damped, attains a steady-state height of $\sim$ 200 km. △ Less

Submitted 3 August, 2024; originally announced August 2024.

Comments: 29 pages, 4 figures, 4 tables

arXiv:2403.06212 [pdf, other]

doi 10.1103/PhysRevE.109.064207

Characterization of hybrid quantum eigenstates in systems with mixed classical phasespace

Authors: Anant Vijay Varma, Amichay Vardi, Doron Cohen

Abstract: Generic low-dimensional Hamiltonian systems feature a structured, mixed classical phase-space. The traditional Percival classification of quantum spectra into regular states supported by quasi-integrable regions and irregular states supported by quasi-chaotic regions turns out to be insufficient to capture the richness of the Hilbert space. Berry's conjecture and the eigenstate thermalization hypo… ▽ More Generic low-dimensional Hamiltonian systems feature a structured, mixed classical phase-space. The traditional Percival classification of quantum spectra into regular states supported by quasi-integrable regions and irregular states supported by quasi-chaotic regions turns out to be insufficient to capture the richness of the Hilbert space. Berry's conjecture and the eigenstate thermalization hypothesis are not applicable and quantum effects such as tunneling, scarring, and localization, do not obey the standard paradigms. We demonstrate these statements for a prototype Bose-Hubbard model. We highlight the hybridization of chaotic and regular regions from opposing perspectives of ergodicity and localization. △ Less

Submitted 10 June, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

Comments: 11 pages, 13 figures

Journal ref: Phys. Rev. E 109, 064207 (2024)

arXiv:2401.15724 [pdf, other]

RE-GAINS & EnChAnT: Intelligent Tool Manipulation Systems For Enhanced Query Responses

Authors: Sahil Girhepuje, Siva Sankar Sajeev, Purvam Jain, Arya Sikder, Adithya Rama Varma, Ryan George, Akshay Govind Srinivasan, Mahendra Kurup, Ashmit Sinha, Sudip Mondal

Abstract: Large Language Models (LLMs) currently struggle with tool invocation and chaining, as they often hallucinate or miss essential steps in a sequence. We propose RE-GAINS and EnChAnT, two novel frameworks that empower LLMs to tackle complex user queries by making API calls to external tools based on tool descriptions and argument lists. Tools are chained based on the expected output, without receivin… ▽ More Large Language Models (LLMs) currently struggle with tool invocation and chaining, as they often hallucinate or miss essential steps in a sequence. We propose RE-GAINS and EnChAnT, two novel frameworks that empower LLMs to tackle complex user queries by making API calls to external tools based on tool descriptions and argument lists. Tools are chained based on the expected output, without receiving the actual results from each individual call. EnChAnT, an open-source solution, leverages an LLM format enforcer, OpenChat 3.5 (an LLM), and ToolBench's API Retriever. RE-GAINS utilizes OpenAI models and embeddings with a specialized prompt based on the $\underline{R}$easoning vi$\underline{a}$ $\underline{P}$lanning $(RAP)$ framework. Both frameworks are low cost (0.01\$ per query). Our key contribution is enabling LLMs for tool invocation and chaining using modifiable, externally described tools. △ Less

Submitted 20 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

arXiv:2401.10602 [pdf, other]

Fractional Conformal Map, Qubit Dynamics and the Leggett-Garg Inequality

Authors: Sourav Paul, Anant Vijay Varma, Sourin Das

Abstract: Any pure state of a qubit can be geometrically represented as a point on the extended complex plane through stereographic projection. By employing successive conformal maps on the extended complex plane, we can generate an effective discrete-time evolution of the pure states of the qubit. This work focuses on a subset of analytic maps known as fractional linear conformal maps. We show that these m… ▽ More Any pure state of a qubit can be geometrically represented as a point on the extended complex plane through stereographic projection. By employing successive conformal maps on the extended complex plane, we can generate an effective discrete-time evolution of the pure states of the qubit. This work focuses on a subset of analytic maps known as fractional linear conformal maps. We show that these maps serve as a unifying framework for a diverse range of quantum-inspired conceivable dynamics, including (i) unitary dynamics,(ii) non-unitary but linear dynamics and (iii) non-unitary and non-linear dynamics where linearity (non-linearity) refers to the action of the discrete time evolution operator on the Hilbert space. We provide a characterization of these maps in terms of Leggett-Garg Inequality complemented with No-signaling in Time (NSIT) and Arrow of Time (AoT) conditions. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 9 pages, 1 figure

arXiv:2312.10529 [pdf, other]

doi 10.1007/978-3-031-45725-8_14

Transformers in Unsupervised Structure-from-Motion

Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

Abstract: Transformers have revolutionized deep learning based computer vision with improved performance as well as robustness to natural corruptions and adversarial attacks. Transformers are used predominantly for 2D vision tasks, including image classification, semantic segmentation, and object detection. However, robots and advanced driver assistance systems also require 3D scene understanding for decisi… ▽ More Transformers have revolutionized deep learning based computer vision with improved performance as well as robustness to natural corruptions and adversarial attacks. Transformers are used predominantly for 2D vision tasks, including image classification, semantic segmentation, and object detection. However, robots and advanced driver assistance systems also require 3D scene understanding for decision making by extracting structure-from-motion (SfM). We propose a robust transformer-based monocular SfM method that learns to predict monocular pixel-wise depth, ego vehicle's translation and rotation, as well as camera's focal length and principal point, simultaneously. With experiments on KITTI and DDAD datasets, we demonstrate how to adapt different vision transformers and compare them against contemporary CNN-based methods. Our study shows that transformer-based architecture, though lower in run-time efficiency, achieves comparable performance while being more robust against natural corruptions, as well as untargeted and targeted attacks. △ Less

Submitted 16 December, 2023; originally announced December 2023.

Comments: International Joint Conference on Computer Vision, Imaging and Computer Graphics. Cham: Springer Nature Switzerland, 2022. Published at "Communications in Computer and Information Science, vol 1815. Springer Nature". arXiv admin note: text overlap with arXiv:2202.03131

arXiv:2311.02393 [pdf, other]

Continual Learning of Unsupervised Monocular Depth from Videos

Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

Abstract: Spatial scene understanding, including monocular depth estimation, is an important problem in various applications, such as robotics and autonomous driving. While improvements in unsupervised monocular depth estimation have potentially allowed models to be trained on diverse crowdsourced videos, this remains underexplored as most methods utilize the standard training protocol, wherein the models a… ▽ More Spatial scene understanding, including monocular depth estimation, is an important problem in various applications, such as robotics and autonomous driving. While improvements in unsupervised monocular depth estimation have potentially allowed models to be trained on diverse crowdsourced videos, this remains underexplored as most methods utilize the standard training protocol, wherein the models are trained from scratch on all data after new data is collected. Instead, continual training of models on sequentially collected data would significantly reduce computational and memory costs. Nevertheless, naive continual training leads to catastrophic forgetting, where the model performance deteriorates on older domains as it learns on newer domains, highlighting the trade-off between model stability and plasticity. While several techniques have been proposed to address this issue in image classification, the high-dimensional and spatiotemporally correlated outputs of depth estimation make it a distinct challenge. To the best of our knowledge, no framework or method currently exists focusing on the problem of continual learning in depth estimation. Thus, we introduce a framework that captures the challenges of continual unsupervised depth estimation (CUDE), and define the necessary metrics to evaluate model performance. We propose a rehearsal-based dual-memory method, MonoDepthCL, which utilizes spatiotemporal consistency for continual learning in depth estimation, even when the camera intrinsics are unknown. △ Less

Submitted 4 November, 2023; originally announced November 2023.

Comments: Accepted at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

arXiv:2306.13098 [pdf, other]

doi 10.1140/epjc/s10052-023-11923-y

Three generations of colored fermions with $S_3$ family symmetry from Cayley-Dickson sedenions

Authors: Niels G. Gresnigt, Liam Gourlay, Abhinav Varma

Abstract: An algebraic representation of three generations of fermions with $SU(3)_C$ color symmetry based on the Cayley-Dickson algebra of sedenions $\mathbb{S}$ is constructed. Recent constructions based on division algebras convincingly describe a single generation of leptons and quarks with Standard Model gauge symmetries. Nonetheless, an algebraic origin for the existence of exactly three generations h… ▽ More An algebraic representation of three generations of fermions with $SU(3)_C$ color symmetry based on the Cayley-Dickson algebra of sedenions $\mathbb{S}$ is constructed. Recent constructions based on division algebras convincingly describe a single generation of leptons and quarks with Standard Model gauge symmetries. Nonetheless, an algebraic origin for the existence of exactly three generations has proven difficult to substantiate. We motivate $\mathbb{S}$ as a natural algebraic candidate to describe three generations with $SU(3)_C$ gauge symmetry. We initially represent one generation of leptons and quarks in terms of two minimal left ideals of $\mathbb{C}\ell(6)$, generated from a subset of all left actions of the complex sedenions on themselves. Subsequently we employ the finite group $S_3$, which are automorphisms of $\mathbb{S}$ but not of $\mathbb{O}$ to generate two additional generations. Given the relative obscurity of sedenions, efforts have been made to present the material in a self-contained manner. △ Less

Submitted 15 February, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

Comments: 18 pages, 1 figure

Journal ref: Eur. Phys. J. C 83, 747 (2023)

arXiv:2301.00620 [pdf, other]

Dynamically Modular and Sparse General Continual Learning

Authors: Arnav Varma, Elahe Arani, Bahram Zonooz

Abstract: Real-world applications often require learning continuously from a stream of data under ever-changing conditions. When trying to learn from such non-stationary data, deep neural networks (DNNs) undergo catastrophic forgetting of previously learned information. Among the common approaches to avoid catastrophic forgetting, rehearsal-based methods have proven effective. However, they are still prone… ▽ More Real-world applications often require learning continuously from a stream of data under ever-changing conditions. When trying to learn from such non-stationary data, deep neural networks (DNNs) undergo catastrophic forgetting of previously learned information. Among the common approaches to avoid catastrophic forgetting, rehearsal-based methods have proven effective. However, they are still prone to forgetting due to task-interference as all parameters respond to all tasks. To counter this, we take inspiration from sparse coding in the brain and introduce dynamic modularity and sparsity (Dynamos) for rehearsal-based general continual learning. In this setup, the DNN learns to respond to stimuli by activating relevant subsets of neurons. We demonstrate the effectiveness of Dynamos on multiple datasets under challenging continual learning evaluation protocols. Finally, we show that our method learns representations that are modular and specialized, while maintaining reusability by activating subsets of neurons with overlaps corresponding to the similarity of stimuli. △ Less

Submitted 2 January, 2023; originally announced January 2023.

Comments: Camera ready version - 18th International Conference on Computer Vision Theory and Applications (VISAPP 2023)

arXiv:2210.03570 [pdf]

AI-Driven Road Maintenance Inspection v2: Reducing Data Dependency & Quantifying Road Damage

Authors: Haris Iqbal, Hemang Chawla, Arnav Varma, Terence Brouns, Ahmed Badar, Elahe Arani, Bahram Zonooz

Abstract: Road infrastructure maintenance inspection is typically a labor-intensive and critical task to ensure the safety of all road users. Existing state-of-the-art techniques in Artificial Intelligence (AI) for object detection and segmentation help automate a huge chunk of this task given adequate annotated data. However, annotating videos from scratch is cost-prohibitive. For instance, it can take an… ▽ More Road infrastructure maintenance inspection is typically a labor-intensive and critical task to ensure the safety of all road users. Existing state-of-the-art techniques in Artificial Intelligence (AI) for object detection and segmentation help automate a huge chunk of this task given adequate annotated data. However, annotating videos from scratch is cost-prohibitive. For instance, it can take an annotator several days to annotate a 5-minute video recorded at 30 FPS. Hence, we propose an automated labelling pipeline by leveraging techniques like few-shot learning and out-of-distribution detection to generate labels for road damage detection. In addition, our pipeline includes a risk factor assessment for each damage by instance quantification to prioritize locations for repairs which can lead to optimal deployment of road maintenance machinery. We show that the AI models trained with these techniques can not only generalize better to unseen real-world data with reduced requirement for human annotation but also provide an estimate of maintenance urgency, thereby leading to safer roads. △ Less

Submitted 7 October, 2022; originally announced October 2022.

Comments: Accepted at IRF Global R2T Conference & Exhibition 2022

arXiv:2207.07032 [pdf, other]

doi 10.1109/IROS47612.2022.9982154

Adversarial Attacks on Monocular Pose Estimation

Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

Abstract: Advances in deep learning have resulted in steady progress in computer vision with improved accuracy on tasks such as object detection and semantic segmentation. Nevertheless, deep neural networks are vulnerable to adversarial attacks, thus presenting a challenge in reliable deployment. Two of the prominent tasks in 3D scene-understanding for robotics and advanced drive assistance systems are mono… ▽ More Advances in deep learning have resulted in steady progress in computer vision with improved accuracy on tasks such as object detection and semantic segmentation. Nevertheless, deep neural networks are vulnerable to adversarial attacks, thus presenting a challenge in reliable deployment. Two of the prominent tasks in 3D scene-understanding for robotics and advanced drive assistance systems are monocular depth and pose estimation, often learned together in an unsupervised manner. While studies evaluating the impact of adversarial attacks on monocular depth estimation exist, a systematic demonstration and analysis of adversarial perturbations against pose estimation are lacking. We show how additive imperceptible perturbations can not only change predictions to increase the trajectory drift but also catastrophically alter its geometry. We also study the relation between adversarial perturbations targeting monocular depth and pose estimation networks, as well as the transferability of perturbations to other networks with different architectures and losses. Our experiments show how the generated perturbations lead to notable errors in relative rotation and translation predictions and elucidate vulnerabilities of the networks. △ Less

Submitted 14 July, 2022; originally announced July 2022.

Comments: Accepted at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

arXiv:2207.06554 [pdf, other]

doi 10.1109/VIS54862.2022.00012

Streamlining Visualization Authoring in D3 Through User-Driven Templates

Authors: Hannah Bako, Alisha Varma, Anuoluwapo Faboro, Mahreen Haider, Favour Nerrise, Bissaka Kenah, Leilani Battle

Abstract: D3 is arguably the most popular tool for implementing web based visualizations. Yet D3 has a steep learning curve that may hinder its adoption and continued use. To simplify the process of programming D3 visualizations, we must first understand the space of implementation practices that D3 users engage in. We present a qualitative analysis of 2500 D3 visualizations and their corresponding implemen… ▽ More D3 is arguably the most popular tool for implementing web based visualizations. Yet D3 has a steep learning curve that may hinder its adoption and continued use. To simplify the process of programming D3 visualizations, we must first understand the space of implementation practices that D3 users engage in. We present a qualitative analysis of 2500 D3 visualizations and their corresponding implementations. We find that 5 visualization types (Bar Charts, Geomaps, Line Charts, Scatterplots, and Force Directed Graphs) account for 80% of D3 visualizations found in our corpus. While implementation styles vary slightly across designs, the underlying code structure for all visualization types remains the same; presenting an opportunity for code reuse. Using our corpus of D3 examples, we synthesize reusable code templates for eight popular D3 visualization types and share them in our open source repository. Based on our results, we discuss design considerations for leveraging users' implementation patterns to reduce visualization design effort through design templates and auto-generated code recommendations. △ Less

Submitted 13 July, 2022; originally announced July 2022.

Comments: 5 pages, 3 figures, VIS 2022 Short Paper. arXiv admin note: text overlap with arXiv:2112.03179

arXiv:2204.08321 [pdf, other]

doi 10.1021/acsomega.2c08244

Electron-phonon interaction contribution to the total energy of group IV semiconductor polymorphs: evaluation and implications

Authors: R. Arjun Varma, Shilpa Paul, Anup Itale, Pranav Pable, Radhika Tibrewala, Samruddhi Dodal, Harshal Yerunkar, Saurav Bhaumik, Vaishali Shah, M. P. Gururajan, T. R. S. Prasanna

Abstract: In density functional theory (DFT) based total energy studies, the van der Waals (vdW) and zero-point vibrational energy (ZPVE) correction terms are included to obtain energy differences between polymorphs. We propose and compute a new correction term to the total energy, due to electron-phonon interactions (EPI). We rely on Allen's general formalism, which goes beyond the Quasi-Harmonic Approxima… ▽ More In density functional theory (DFT) based total energy studies, the van der Waals (vdW) and zero-point vibrational energy (ZPVE) correction terms are included to obtain energy differences between polymorphs. We propose and compute a new correction term to the total energy, due to electron-phonon interactions (EPI). We rely on Allen's general formalism, which goes beyond the Quasi-Harmonic Approximation (QHA), to include the free energy contributions due to quasiparticle interactions. We show that, for semiconductors and insulators, the EPI contributions to the free energies of electrons and phonons are the corresponding zero-point energy contributions. Using an approximate version of Allen's formalism in combination with the Allen-Heine theory for EPI corrections, we calculate the zero-point EPI corrections to the total energy for cubic and hexagonal polytypes of Carbon, Silicon and Silicon Carbide. The EPI corrections alter the energy differences between polytypes. In SiC polytypes, the EPI correction term is more sensitive to crystal structure than the vdW and ZPVE terms and is thus essential in determining their energy differences. It clearly establishes that the cubic SiC-3C is metastable and hexagonal SiC-4H is the stable polytype. Our results are consistent with the experimental results of Kleykamp. Our study enables the inclusion of EPI corrections as a separate term in the free energy expression. This opens the way to go beyond the QHA by including the contribution of EPI on all thermodynamic properties. △ Less

Submitted 16 March, 2023; v1 submitted 14 April, 2022; originally announced April 2022.

Comments: 30 pages and 2 figures

Journal ref: ACS Omega 2023, 8, 12, 11251-11260

arXiv:2204.06579 [pdf, other]

doi 10.1007/s11128-022-03718-z

Non-local spin entanglement in a fermionic chain

Authors: Sayan Jana, Anant V. Varma, Arijit Saha, Sourin Das

Abstract: An effective two-spin density matrix (TSDM) for a pair of spin-$1/2$ degree of freedom, residing at a distance of $R$ in a spinful Fermi sea, can be obtained from the two-electron density matrix following the framework prescribed in Phys. Rev. A 69, 054305 (2004). We note that the single spin density matrix (SSDM) obtained from this TSDM for generic spin-degenerate systems of free fermions is alwa… ▽ More An effective two-spin density matrix (TSDM) for a pair of spin-$1/2$ degree of freedom, residing at a distance of $R$ in a spinful Fermi sea, can be obtained from the two-electron density matrix following the framework prescribed in Phys. Rev. A 69, 054305 (2004). We note that the single spin density matrix (SSDM) obtained from this TSDM for generic spin-degenerate systems of free fermions is always pinned to the maximally mixed state $i.e.$ $(1/2) \ \mathbb{I}$, independent of the distance $R$ while the TSDM confirms to the form for the set of maximally entangled mixed state (the so called "X-state") at finite $R$. The X-state reduces to a pure state (a singlet) in the $R\rightarrow 0$ limit while it saturates to an X-state with largest allowed value of von-Neumann entropy of $2 \ln2$ as $R \rightarrow \infty$ independent of the value of chemical potential. However, once an external magnetic field is applied to lift the spin-degeneracy, we find that the von-Neumann entropy of SSDM becomes a function of the distance $R$ between the two spins. We also show that the von-Neumann entropy of TSDM in the $R\rightarrow \infty$ limit becomes a function of the chemical potential and it saturate to $2 \ln2$ only when the band in completely filled unlike the spin-degenerate case. Finally we extend our study to include spin-orbit coupling and show that it does effect these asymptotic results. Our findings are in sharp contrast with previous works which were based on continuum models owing to physics which stem from the lattice model. △ Less

Submitted 13 April, 2022; originally announced April 2022.

Comments: 7 pages, 5 figures

Journal ref: Quantum Inf Process 21, 374 (2022)

arXiv:2203.04991 [pdf, other]

doi 10.1103/PhysRevA.108.032202

Essential role of quantum speed limit in violation of Leggett-Garg inequality across a PT-transition

Authors: Anant V. Varma, Jacob E. Muldoon, Sourav Paul, Yogesh N. Joglekar, Sourin Das

Abstract: We study Leggett-Garg inequality (LGI) of a two level system (TLS) undergoing non-Hermitian dynamics governed by a non-linear Bloch equation (derived in J. Phys. A: Math. Theor. 54, 115301 (2021)) across a PT-transition. We present an algebraic identification of the parameter space for the maximum violation of LGI (in particular $K_{3}$). In the PT-symmetric regime the maximum allowed value for… ▽ More We study Leggett-Garg inequality (LGI) of a two level system (TLS) undergoing non-Hermitian dynamics governed by a non-linear Bloch equation (derived in J. Phys. A: Math. Theor. 54, 115301 (2021)) across a PT-transition. We present an algebraic identification of the parameter space for the maximum violation of LGI (in particular $K_{3}$). In the PT-symmetric regime the maximum allowed value for $K_{3}$ is always found to be greater than the quantum bound (Lüders bound) of $3/2$ but it does not reach the algebraic maximum of $K_{3}=3$ in general. However, in the limit where PT-symmetry breaking parameter approaches the exceptional point from the PT-symmetric side, $K_{3}$ is found to asymptotically approach its algebraic maximum of 3. In contrast, the maximum value of $K_{3}$ always reaches its algebraic maximum in the PT-broken phase $i.e.$ $K_{3}\rightarrow 3$. We find that (i) the speed of evolution (SOE) must reach its maximum value (in the parameter space of initial state and the time interval between successive measurements) to facilitate the value of $K_{3} \rightarrow 3$, (ii) together with the constraint that its minimum value must run into SOE equals to zero during the evolution of the state. In fact we show that the minimum speed of evolution can serve as an order parameter which is finite on the PT-symmetric side and identically zero on the PT-broken side. Finally, we discuss a possible experimental realization of this dynamics by quantum measurement followed by post-selection procedure in a three level atom coupled to cavity mode undergoing a Lindbladian dynamics. △ Less

Submitted 9 March, 2022; originally announced March 2022.

Comments: 9 pages, 5 figures

Journal ref: Phys. Rev. A 108, 032202 (2023)

arXiv:2202.08784 [pdf, other]

doi 10.1016/j.pepi.2022.106944

The role of slow magnetostrophic waves in the formation of the axial dipole in planetary dynamos

Authors: Aditya Varma, Binod Sreenivasan

Abstract: The preference for the axial dipole in planetary dynamos is investigated through the analysis of wave motions in spherical dynamo models. Our study focuses on the role of slow magnetostrophic waves, which are generated from localized balances between the Lorentz, Coriolis and buoyancy (MAC) forces. Since the slow waves are known to intensify with increasing field strength, simulations in which the… ▽ More The preference for the axial dipole in planetary dynamos is investigated through the analysis of wave motions in spherical dynamo models. Our study focuses on the role of slow magnetostrophic waves, which are generated from localized balances between the Lorentz, Coriolis and buoyancy (MAC) forces. Since the slow waves are known to intensify with increasing field strength, simulations in which the field grows from a small seed towards saturation are useful in understanding the role of these waves in dynamo action. Axial group velocity measurements in the energy-containing scales show that fast inertial waves slightly modified by the magnetic field and buoyancy are dominant under weak fields. However, the dominance of the slow waves is evident for strong fields satisfying $|ω_M/ω_C| \sim $ 0.1, where $ω_M$ and $ω_C$ are the frequencies of the Alfvén and inertial waves respectively. A MAC wave window of azimuthal wavenumbers is identified wherein helicity generation by the slow waves strongly correlates with dipole generation. Analysis of the magnetic induction equation suggests a poloidal--poloidal field conversion in the formation of the dipole. Finally, the attenuation of slow waves may result in polarity reversals in a strongly driven Earth's core. △ Less

Submitted 17 February, 2022; originally announced February 2022.

Comments: 26 pages, 14 figures

arXiv:2202.03131 [pdf, other]

doi 10.5220/0010884000003124

Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics

Authors: Arnav Varma, Hemang Chawla, Bahram Zonooz, Elahe Arani

Abstract: The advent of autonomous driving and advanced driver assistance systems necessitates continuous developments in computer vision for 3D scene understanding. Self-supervised monocular depth estimation, a method for pixel-wise distance estimation of objects from a single camera without the use of ground truth labels, is an important task in 3D scene understanding. However, existing methods for this t… ▽ More The advent of autonomous driving and advanced driver assistance systems necessitates continuous developments in computer vision for 3D scene understanding. Self-supervised monocular depth estimation, a method for pixel-wise distance estimation of objects from a single camera without the use of ground truth labels, is an important task in 3D scene understanding. However, existing methods for this task are limited to convolutional neural network (CNN) architectures. In contrast with CNNs that use localized linear operations and lose feature resolution across the layers, vision transformers process at constant resolution with a global receptive field at every stage. While recent works have compared transformers against their CNN counterparts for tasks such as image classification, no study exists that investigates the impact of using transformers for self-supervised monocular depth estimation. Here, we first demonstrate how to adapt vision transformers for self-supervised monocular depth estimation. Thereafter, we compare the transformer and CNN-based architectures for their performance on KITTI depth prediction benchmarks, as well as their robustness to natural corruptions and adversarial attacks, including when the camera intrinsics are unknown. Our study demonstrates how transformer-based architecture, though lower in run-time efficiency, achieves comparable performance while being more robust and generalizable. △ Less

Submitted 7 February, 2022; originally announced February 2022.

Comments: Published in 17th International Conference on Computer Vision Theory and Applications (VISAP, 2022)

arXiv:2201.08683 [pdf, other]

A Comprehensive Study of Vision Transformers on Dense Prediction Tasks

Authors: Kishaan Jeeveswaran, Senthilkumar Kathiresan, Arnav Varma, Omar Magdy, Bahram Zonooz, Elahe Arani

Abstract: Convolutional Neural Networks (CNNs), architectures consisting of convolutional layers, have been the standard choice in vision tasks. Recent studies have shown that Vision Transformers (VTs), architectures based on self-attention modules, achieve comparable performance in challenging tasks such as object detection and semantic segmentation. However, the image processing mechanism of VTs is differ… ▽ More Convolutional Neural Networks (CNNs), architectures consisting of convolutional layers, have been the standard choice in vision tasks. Recent studies have shown that Vision Transformers (VTs), architectures based on self-attention modules, achieve comparable performance in challenging tasks such as object detection and semantic segmentation. However, the image processing mechanism of VTs is different from that of conventional CNNs. This poses several questions about their generalizability, robustness, reliability, and texture bias when used to extract features for complex tasks. To address these questions, we study and compare VT and CNN architectures as feature extractors in object detection and semantic segmentation. Our extensive empirical results show that the features generated by VTs are more robust to distribution shifts, natural corruptions, and adversarial attacks in both tasks, whereas CNNs perform better at higher image resolutions in object detection. Furthermore, our results demonstrate that VTs in dense prediction tasks produce more reliable and less texture-biased predictions. △ Less

Submitted 21 January, 2022; originally announced January 2022.

Comments: 17th International Conference on Computer Vision Theory and Applications (VISAP, 2022)

arXiv:2201.02787 [pdf, other]

Age-of-information minimization via opportunistic sampling by an energy harvesting source

Authors: Akanksha Jaiswal, Arpan Chattopadhyay, Amokh Varma

Abstract: Herein, minimization of time-averaged age-of-information (AoI) in an energy harvesting (EH) source setting is considered. The EH source opportunistically samples one or multiple processes over discrete time instants and sends the status updates to a sink node over a wireless fading channel. Each time, the EH node decides whether to probe the link quality and then decides whether to sample a proces… ▽ More Herein, minimization of time-averaged age-of-information (AoI) in an energy harvesting (EH) source setting is considered. The EH source opportunistically samples one or multiple processes over discrete time instants and sends the status updates to a sink node over a wireless fading channel. Each time, the EH node decides whether to probe the link quality and then decides whether to sample a process and communicate based on the channel probe outcome. The trade-off is between the freshness of information available at the sink node and the available energy at the source node. We use infinite horizon Markov decision process (MDP) to formulate the AoI minimization problem for two scenarios where energy arrival and channel fading processes are: (i) independent and identically distributed (i.i.d.), (ii) Markovian. In i.i.d. setting, after channel probing, the optimal source sampling policy is shown to be a threshold policy. Also, for unknown channel state and EH characteristics, a variant of the Q-learning algorithm is proposed for the two-stage action model, that seeks to learn the optimal policy. For Markovian system, the problem is again formulated as an MDP, and a learning algorithm is provided for unknown dynamics. Finally, numerical results demonstrate the policy structures and performance trade-offs. △ Less

Submitted 31 May, 2024; v1 submitted 8 January, 2022; originally announced January 2022.

Comments: 15 pages, 8 figures. arXiv admin note: text overlap with arXiv:2010.07626

arXiv:2112.11854 [pdf]

Movie Recommender System using critic consensus

Authors: A Nayan Varma, Kedareshwara Petluri

Abstract: Recommendation systems are perhaps one of the most important agents for industry growth through the modern Internet world. Previous approaches on recommendation systems include collaborative filtering and content based filtering recommendation systems. These 2 methods are disjointed in nature and require the continuous storage of user preferences for a better recommendation. To provide better inte… ▽ More Recommendation systems are perhaps one of the most important agents for industry growth through the modern Internet world. Previous approaches on recommendation systems include collaborative filtering and content based filtering recommendation systems. These 2 methods are disjointed in nature and require the continuous storage of user preferences for a better recommendation. To provide better integration of the two processes, we propose a hybrid recommendation system based on the integration of collaborative and content-based content, taking into account the top critic consensus and movie rating score. We would like to present a novel model that recommends movies based on the combination of user preferences and critical consensus scores. △ Less

Submitted 22 December, 2021; originally announced December 2021.

Comments: 4 pages, IEEE 2021 International Conference on Advances in Computing, Communication and Control (ICAC3'21) 7thEdition (3rd and 4th December 2021)

arXiv:2112.03179 [pdf, other]

doi 10.1145/3581641.3584041

User-Driven Support for Visualization Prototyping in D3

Authors: Hannah K. Bako, Alisha Varma, Anuoluwapo Faboro, Mahreen Haider, Favour Nerrise, Bissaka Kenah, John P. Dickerson, Leilani Battle

Abstract: Templates have emerged as an effective approach to simplifying the visualization design and programming process. For example, they enable users to quickly generate multiple visualization designs even when using complex toolkits like D3. However, these templates are often treated as rigid artifacts that respond poorly to changes made outside of the template's established parameters, limiting user c… ▽ More Templates have emerged as an effective approach to simplifying the visualization design and programming process. For example, they enable users to quickly generate multiple visualization designs even when using complex toolkits like D3. However, these templates are often treated as rigid artifacts that respond poorly to changes made outside of the template's established parameters, limiting user creativity. Preserving the user's creative flow requires a more dynamic approach to template-based visualization design, where tools can respond gracefully to users' edits when they modify templates in unexpected ways. In this paper, we leverage the structural similarities revealed by templates to design resilient support features for prototyping D3 visualizations: recommendations to suggest complementary interactions for a user's D3 program; and code augmentation to implement recommended interactions with a single click, even when users deviate from pre-defined templates. We demonstrate the utility of these features in Mirny, a d design-focused prototyping environment for D3. In a user study with 20 D3 users, we find that these automated features enable participants to prototype their design ideas with significantly fewer programming iterations. We also characterize key modification strategies used by participants to customize D3 templates. Informed by our findings and participants' feedback, we discuss the key implications of the use of templates for interleaving visualization programming and design. △ Less

Submitted 21 February, 2023; v1 submitted 6 December, 2021; originally announced December 2021.

Comments: 15 pages, 7 figures, In 28th International Conference on Intelligent User Interfaces (IUI 23), March, 2023, Sydney, NSW, Australia

arXiv:2110.10696 [pdf, other]

doi 10.1088/1751-8121/acc912

Leggett-Garg inequality in Markovian quantum dynamics: role of temporal sequencing of coupling to bath

Authors: Sayan Ghosh, Anant V. Varma, Sourin Das

Abstract: We study Leggett-Garg inequalities (LGIs) for a two level system (TLS) undergoing Markovian dynamics described by unital maps. We find analytic expression of LG parameter $K_{3}$ (simplest variant of LGIs) in terms of the parameters of two distinct unital maps representing time evolution for intervals: $t_{1}$ to $t_{2}$ and $t_{2}$ to $t_{3}$. We show that the maximum violation of LGI for these m… ▽ More We study Leggett-Garg inequalities (LGIs) for a two level system (TLS) undergoing Markovian dynamics described by unital maps. We find analytic expression of LG parameter $K_{3}$ (simplest variant of LGIs) in terms of the parameters of two distinct unital maps representing time evolution for intervals: $t_{1}$ to $t_{2}$ and $t_{2}$ to $t_{3}$. We show that the maximum violation of LGI for these maps can never exceed well known Lüders bound of $K_{3}^{L\ddot{u}ders}=3/2$ over the full parameter space. We further show that if the map for the time interval $t_{1}$ to $t_{2}$ is non-unitary unital then irrespective of the choice of the map for interval $t_{2}$ to $t_{3}$ we can never reach Lüders bound. On the other hand, if the measurement operator eigenstates remain pure upon evolution from $t_{1}$ to $t_{2}$, then depending on the degree of decoherence induced by the unital map for the interval $t_{2}$ to $t_{3}$ we may or may not obtain Lüders bound. Specifically, we find that if the unital map for interval $t_{2}$ to $t_{3}$ leads to the shrinking of the Bloch vector beyond half of its unit length, then achieving the bound $K_{3}^{L\ddot{u}ders}$ is not possible. Hence our findings not only establish a threshold for decoherence which will allow for $K_{3} = K_{3}^{L\ddot{u}ders}$, but also demonstrate the importance of temporal sequencing of the exposure of a TLS to Markovian baths in obtaining Lüders bound. △ Less

Submitted 20 October, 2021; originally announced October 2021.

Comments: 9 pages, 2 figures

Report number: Journal of Physics A: Mathematical and Theoretical, Volume 56, Number 20

Journal ref: J. Phys. A: Math. Theor. 56 205302 (2023)

arXiv:2107.14139 [pdf, other]

Vaccination Worldwide: Strategies, Distribution and Challenges

Authors: Chirag Samal, Kasia Jakimowicz, Krishnendu Dasgupta, Aniket Vashishtha, Francisco O., Arunakiry Natarajan, Haris Nazir, Alluri Siddhartha Varma, Tejal Dahake, Amitesh Anand Pandey, Ishaan Singh, John Sangyeob Kim, Mehrab Singh Gill, Saurish Srivastava, Orna Mukhopadhyay, Parth Patwa, Qamil Mirza, Sualeha Irshad, Sheshank Shankar, Rohan Iyer, Rohan Sukumaran, Ashley Mehra, Anshuman Sharma, Abhishek Singh, Maurizio Arseni , et al. (4 additional authors not shown)

Abstract: The Coronavirus 2019 (Covid-19) pandemic caused by the SARS-CoV-2 virus represents an unprecedented crisis for our planet. It is a bane of the über connected world that we live in that this virus has affected almost all countries and caused mortality and economic upheaval at a scale whose effects are going to be felt for generations to come. While we can all be buoyed at the pace at which vaccines… ▽ More The Coronavirus 2019 (Covid-19) pandemic caused by the SARS-CoV-2 virus represents an unprecedented crisis for our planet. It is a bane of the über connected world that we live in that this virus has affected almost all countries and caused mortality and economic upheaval at a scale whose effects are going to be felt for generations to come. While we can all be buoyed at the pace at which vaccines have been developed and brought to market, there are still challenges ahead for all countries to get their populations vaccinated equitably and effectively. This paper provides an overview of ongoing immunization efforts in various countries. In this early draft, we have identified a few key factors that we use to review different countries' current COVID-19 immunization strategies and their strengths and draw conclusions so that policymakers worldwide can learn from them. Our paper focuses on processes related to vaccine approval, allocation and prioritization, distribution strategies, population to vaccine ratio, vaccination governance, accessibility and use of digital solutions, and government policies. The statistics and numbers are dated as per the draft date [June 24th, 2021]. △ Less

Submitted 21 July, 2021; originally announced July 2021.

arXiv:2106.03242 [pdf, other]

Highlighting the Importance of Reducing Research Bias and Carbon Emissions in CNNs

Authors: Ahmed Badar, Arnav Varma, Adrian Staniec, Mahmoud Gamal, Omar Magdy, Haris Iqbal, Elahe Arani, Bahram Zonooz

Abstract: Convolutional neural networks (CNNs) have become commonplace in addressing major challenges in computer vision. Researchers are not only coming up with new CNN architectures but are also researching different techniques to improve the performance of existing architectures. However, there is a tendency to over-emphasize performance improvement while neglecting certain important variables such as si… ▽ More Convolutional neural networks (CNNs) have become commonplace in addressing major challenges in computer vision. Researchers are not only coming up with new CNN architectures but are also researching different techniques to improve the performance of existing architectures. However, there is a tendency to over-emphasize performance improvement while neglecting certain important variables such as simplicity, versatility, the fairness of comparisons, and energy efficiency. Overlooking these variables in architectural design and evaluation has led to research bias and a significantly negative environmental impact. Furthermore, this can undermine the positive impact of research in using deep learning models to tackle climate change. Here, we perform an extensive and fair empirical study of a number of proposed techniques to gauge the utility of each technique for segmentation and classification. Our findings restate the importance of favoring simplicity over complexity in model design (Occam's Razor). Furthermore, our results indicate that simple standardized practices can lead to a significant reduction in environmental impact with little drop in performance. We highlight that there is a need to rethink the design and evaluation of CNNs to alleviate the issue of research bias and carbon emissions. △ Less

Submitted 6 June, 2021; originally announced June 2021.

arXiv:2103.02451 [pdf, other]

doi 10.1109/ICRA48506.2021.9561441

Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation

Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

Abstract: Dense depth estimation is essential to scene-understanding for autonomous driving. However, recent self-supervised approaches on monocular videos suffer from scale-inconsistency across long sequences. Utilizing data from the ubiquitously copresent global positioning systems (GPS), we tackle this challenge by proposing a dynamically-weighted GPS-to-Scale (g2s) loss to complement the appearance-base… ▽ More Dense depth estimation is essential to scene-understanding for autonomous driving. However, recent self-supervised approaches on monocular videos suffer from scale-inconsistency across long sequences. Utilizing data from the ubiquitously copresent global positioning systems (GPS), we tackle this challenge by proposing a dynamically-weighted GPS-to-Scale (g2s) loss to complement the appearance-based losses. We emphasize that the GPS is needed only during the multimodal training, and not at inference. The relative distance between frames captured through the GPS provides a scale signal that is independent of the camera setup and scene distribution, resulting in richer learned feature representations. Through extensive evaluation on multiple datasets, we demonstrate scale-consistent and -aware depth estimation during inference, improving the performance even when training with low-frequency GPS data. △ Less

Submitted 3 March, 2021; originally announced March 2021.

Comments: Accepted at 2021 IEEE International Conference on Robotics and Automation (ICRA)

arXiv:2012.13415 [pdf, other]

doi 10.1103/PhysRevB.104.035153

Simulating non-Hermitian dynamics of a multi-spin quantum system and an emergent central spin model

Authors: Anant V. Varma, Sourin Das

Abstract: It is possible to simulate the dynamics of a single spin-$1/2$ ($\mathsf{PT~}$ symmetric) system by conveniently embedding it into a subspace of a larger Hilbert space with unitary dynamics. Our goal is to formulate a many body generalization of this idea i.e., embedding many body non-Hermitian dynamics. As a first step in this direction, we investigate embedding of "$N$" non-interacting spin-… ▽ More It is possible to simulate the dynamics of a single spin-$1/2$ ($\mathsf{PT~}$ symmetric) system by conveniently embedding it into a subspace of a larger Hilbert space with unitary dynamics. Our goal is to formulate a many body generalization of this idea i.e., embedding many body non-Hermitian dynamics. As a first step in this direction, we investigate embedding of "$N$" non-interacting spin-$1/2$ ($\mathsf{PT~}$ symmetric) degrees of freedom, thereby unfolding the complex nature of such an embedding procedure. It turns out that the resulting Hermitian Hamiltonian represents a cluster of $N+1$ spin halves with "all to all", $q$-body interaction terms ($q=1,...,N+1$) in which the additional spin-$1/2$ is a part of the larger embedding space. We can visualize it as a strongly correlated central spin model with the additional spin-$1/2$ playing the role of central spin. We find that due to the orthogonality catastrophe, even a vanishing small exchange field applied along the anisotropy axis of the central spin leads to a strong suppression of its decoherence arising from spin-flipping perturbations. △ Less

Submitted 24 December, 2020; originally announced December 2020.

Comments: 10 pages, 5 figures

Journal ref: Phys. Rev. B 104, 035153 (2021)

arXiv:2006.06834 [pdf, other]

Attention improves concentration when learning node embeddings

Authors: Matthew Dippel, Adam Kiezun, Tanay Mehta, Ravi Sundaram, Srikanth Thirumalai, Akshar Varma

Abstract: We consider the problem of predicting edges in a graph from node attributes in an e-commerce setting. Specifically, given nodes labelled with search query text, we want to predict links to related queries that share products. Experiments with a range of deep neural architectures show that simple feedforward networks with an attention mechanism perform best for learning embeddings. The simplicity o… ▽ More We consider the problem of predicting edges in a graph from node attributes in an e-commerce setting. Specifically, given nodes labelled with search query text, we want to predict links to related queries that share products. Experiments with a range of deep neural architectures show that simple feedforward networks with an attention mechanism perform best for learning embeddings. The simplicity of these models allows us to explain the performance of attention. We propose an analytically tractable model of query generation, AttEST, that views both products and the query text as vectors embedded in a latent space. We prove (and empirically validate) that the point-wise mutual information (PMI) matrix of the AttEST query text embeddings displays a low-rank behavior analogous to that observed in word embeddings. This low-rank property allows us to derive a loss function that maximizes the mutual information between related queries which is used to train an attention network to learn query embeddings. This AttEST network beats traditional memory-based LSTM architectures by over 20% on F-1 score. We justify this out-performance by showing that the weights from the attention mechanism correlate strongly with the weights of the best linear unbiased estimator (BLUE) for the product vectors, and conclude that attention plays an important role in variance reduction. △ Less

Submitted 11 June, 2020; originally announced June 2020.

Comments: 18 pages, 3 figures

arXiv:2005.02310 [pdf, other]

Testing Compilers for Programmable Switches Through Switch Hardware Simulation

Authors: Michael D. Wong, Aatish Kishan Varma, Anirudh Sivaraman

Abstract: Programmable switches have emerged as powerful and flexible alternatives to fixed-function forwarding devices. But because of the unique hardware constraints of network switches, the design and implementation of compilers targeting these devices is tedious and error prone. Despite the important role that compilers play in software development, there is a dearth of tools for testing compilers for p… ▽ More Programmable switches have emerged as powerful and flexible alternatives to fixed-function forwarding devices. But because of the unique hardware constraints of network switches, the design and implementation of compilers targeting these devices is tedious and error prone. Despite the important role that compilers play in software development, there is a dearth of tools for testing compilers for programmable network devices. We present Druzhba, a programmable switch simulator used for testing compilers targeting programmable packet-processing substrates. We show that we can model the low-level behavior of a switch's programmable hardware. We further show how our machine model can be used by compiler developers to target Druzhba as a compiler backend. Generated machine code programs are fed into Druzhba and tested using a fuzzing-based approach that allows compiler developers to test the correctness of their compilers. Using a program-synthesis-based compiler as a case study, we demonstrate how Druzhba has been successful in testing compiler-generated machine code for our simulated switch pipeline instruction set. △ Less

Submitted 27 October, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

Comments: 7 pages, 4 figures

ACM Class: B.4.4; C.2.0; D.2.5; D.3.4

arXiv:1911.02464 [pdf, other]

doi 10.1103/PhysRevFluids.4.124204

Modeling chemo-hydrodynamic interactions of phoretic particles: a unified framework

Authors: Akhil Varma, Sebastien Michelin

Abstract: Phoretic particles exploit local self-generated physico-chemical gradients to achieve self-propulsion at the micron scale. The collective dynamics of a large number of such particles is currently the focus of intense research efforts, both from a physical perspective to understand the precise mechanisms of the interactions and their respective roles, as well as from an experimental point of view t… ▽ More Phoretic particles exploit local self-generated physico-chemical gradients to achieve self-propulsion at the micron scale. The collective dynamics of a large number of such particles is currently the focus of intense research efforts, both from a physical perspective to understand the precise mechanisms of the interactions and their respective roles, as well as from an experimental point of view to explain the observations of complex dynamics as well as formation of coherent large-scale structures. However, an exact modelling of such multi-particle problems is difficult and most efforts so far rely on the superposition of far-field approximations for each particle's signature, which are only valid asymptotically in the dilute suspension limit. A systematic and unified analytical framework based on the classical Method of Reflections (MoR) is developed here for both Laplace and Stokes' problems to obtain the higher-order interactions and the resulting velocities of multiple phoretic particles, up to any order of accuracy in the radius-to-distance ratio $\varepsilon$ of the particles. Beyond simple pairwise chemical or hydrodynamic interactions, this model allows us to account for the generic chemo-hydrodynamic couplings as well as $N$-particle interactions ($N\geq 3$). The $\varepsilon^5$-accurate interaction velocities are then explicitly obtained and the resulting implementation of this MoR model is discussed and validated quantitatively against exact solutions of a few canonical problems. △ Less

Submitted 6 November, 2019; originally announced November 2019.

Comments: 31 pages, 13 figures, to appear in Physical Review Fluids

Journal ref: Phys. Rev. Fluids, 2019, 4, 124204

arXiv:1909.00260 [pdf, ps, other]

SCALABLE INTERNETWORKING: Final Technical Report

Authors: JJ Garcia-Luna-Aceves, A. Varma

Abstract: This document describes the work completed at the University of California, Santa Cruz under the project Scalable Internetworking sponsored by ARPA under Contract No. F19628-93-C-0175. This report covers work performed from 1 April 1993 to 31 December 1995. Results on routing and multicasting for large-scale internets are summarized. The technical material discussed assumes familiarity with the co… ▽ More This document describes the work completed at the University of California, Santa Cruz under the project Scalable Internetworking sponsored by ARPA under Contract No. F19628-93-C-0175. This report covers work performed from 1 April 1993 to 31 December 1995. Results on routing and multicasting for large-scale internets are summarized. The technical material discussed assumes familiarity with the content of our proposal and previous quarterly reports submitted in this project. △ Less

Submitted 31 August, 2019; originally announced September 2019.

Comments: 14 pages

Report number: TR-CCRG-95-F19628-93-C-0175

arXiv:1907.13400 [pdf, other]

doi 10.1088/1751-8121/abde76

Temporal correlation beyond quantum bounds in non-hermitian dynamics

Authors: Anant V. Varma, Ipsika Mohanty, Sourin Das

Abstract: We study the dynamics of two level systems described by non-hermitian Hamiltonians with real eigenvalues. Within the framework of hermitian quantum mechanics, it is known that maximal violation of Leggett-Garg inequality is bounded by $3/2$ (Luder's bound). We show that this absolute bound can be evaded when dynamics is governed by non-hermitian Hamiltonians. Moreover, the extent of violation can… ▽ More We study the dynamics of two level systems described by non-hermitian Hamiltonians with real eigenvalues. Within the framework of hermitian quantum mechanics, it is known that maximal violation of Leggett-Garg inequality is bounded by $3/2$ (Luder's bound). We show that this absolute bound can be evaded when dynamics is governed by non-hermitian Hamiltonians. Moreover, the extent of violation can be optimized to reach its algebraic maximum of $3$ which is otherwise only feasible when the Hilbert space is infinite dimensional in the hermitian case. The extreme violation of Leggett-Garg inequality is shown to be directly related to the two basic ingredients: (i) The Bloch equation for the two level system has non-linear terms which allow for accelerated dynamics of states on the Bloch sphere exceeding all known quantum speed limits of state evolution; and (ii) We need to ensure that the quantum trajectory of states always lies on a single great circle (geodesic path) on the Bloch sphere at all times. △ Less

Submitted 28 July, 2020; v1 submitted 31 July, 2019; originally announced July 2019.

Comments: v2: Includes (a) A numerical comparison of our predictions with existing experimental results (Ref. 44 in the article) ; and (b) An extensive discussion of Leggett-Garg Inequality in the context of possible embedding of the non-hermitian dynamics within a higher dimensional Hilbert space following unitary time evolution and postselection

Journal ref: J. Phys. A: Math. Theor. 54 115301 (2021)

arXiv:1903.11066 [pdf, other]

doi 10.1142/S0218271819440036

Quantum gravity as an emergent phenomenon

Authors: Shounak De, Tejinder P. Singh, Abhinav Varma

Abstract: There ought to exist a reformulation of quantum theory which does not depend on classical time. To achieve such a reformulation, we introduce the concept of an atom of space-time-matter (STM). An STM atom is a classical non-commutative geometry, based on an asymmetric metric, and sourced by a closed string. Different such atoms interact via entanglement. The statistical thermodynamics of a large n… ▽ More There ought to exist a reformulation of quantum theory which does not depend on classical time. To achieve such a reformulation, we introduce the concept of an atom of space-time-matter (STM). An STM atom is a classical non-commutative geometry, based on an asymmetric metric, and sourced by a closed string. Different such atoms interact via entanglement. The statistical thermodynamics of a large number of such atoms gives rise, at equilibrium, to a theory of quantum gravity. Far from equilibrium, where statistical fluctuations are large, the emergent theory reduces to classical general relativity. In this theory, classical black holes are far-from-equilibrium low entropy states, and their Hawking evaporation represents an attempt to return to the (maximum entropy) equilibrium quantum gravitational state. △ Less

Submitted 22 May, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

Comments: 8 pages, 1 figure, Essay written for the Gravity Research Foundation 2019 Awards for Essays on Gravitation. arXiv admin note: substantial text overlap with arXiv:1903.05402; v2: this essay is a significantly condensed version of arXiv:1903.05402, Ref. 2 updated, Honorable Mention, to appear in Int. J. Mod. Phys

arXiv:1811.11452 [pdf, other]

doi 10.1007/JHEP05(2019)154

LHC Constraints on a $B-L$ Gauge Model using Contur

Authors: S. Amrith, J. M. Butterworth, F. F. Deppisch, W. Liu, A. Varma, D. Yallup

Abstract: The large and growing library of measurements from the Large Hadron Collider has significant power to constrain extensions of the Standard Model. We consider such constraints on a well-motivated model involving a gauged and spontaneously-broken $B-L$ symmetry, within the Contur framework. The model contains an extra Higgs boson, a gauge boson, and right-handed neutrinos with Majorana masses. This… ▽ More The large and growing library of measurements from the Large Hadron Collider has significant power to constrain extensions of the Standard Model. We consider such constraints on a well-motivated model involving a gauged and spontaneously-broken $B-L$ symmetry, within the Contur framework. The model contains an extra Higgs boson, a gauge boson, and right-handed neutrinos with Majorana masses. This new particle content implies a varied phenomenology highly dependent on the parameters of the model, very well-suited to a general study of this kind. We find that existing LHC measurements significantly constrain the model in interesting regions of parameter space. Other regions remain open, some of which are within reach of future LHC data. △ Less

Submitted 7 January, 2020; v1 submitted 28 November, 2018; originally announced November 2018.

Comments: 25 pages, 7 figures, accepted by JHEP, plots updated with Rivet version 3

Report number: MCnet-18-30

Journal ref: JHEP 1905 (2019) 154

arXiv:1806.03812 [pdf, other]

Clustering-induced self-propulsion of isotropic autophoretic particles

Authors: Akhil Varma, Thomas D. Montenegro-Johnson, Sebastien Michelin

Abstract: Self-diffusiophoretic particles exploit local concentration gradients of a solute species in order to self-propel at the micron scale. While an isolated chemically- and geometrically-isotropic particle cannot swim, we show that it can achieve self-propulsion through interactions with other individually-non-motile particles by forming geometrically-anisotropic clusters via phoretic and hydrodynamic… ▽ More Self-diffusiophoretic particles exploit local concentration gradients of a solute species in order to self-propel at the micron scale. While an isolated chemically- and geometrically-isotropic particle cannot swim, we show that it can achieve self-propulsion through interactions with other individually-non-motile particles by forming geometrically-anisotropic clusters via phoretic and hydrodynamic interactions. This result identifies a new route to symmetry-breaking for the concentration field and to self-propulsion, that is not based on an anisotropic design, but on the collective dynamics of identical and homogeneous active particles. Using full numerical simulations as well as theoretical modelling of the clustering process, the statistics of the propulsion properties are obtained for arbitrary initial arrangement of the particles. The robustness of these results to thermal noise, and more generally the effect of Brownian motion of the particles, is also discussed. △ Less

Submitted 11 June, 2018; originally announced June 2018.

Comments: 27 pages, 15 figures, to appear in Soft Matter

arXiv:1804.11334 [pdf, other]

doi 10.1103/PhysRevD.98.064046

Einstein-Cartan-Dirac equations in the Newman-Penrose formalism

Authors: Swanand Khanapurkar, Abhinav Varma, Nehal Mittal, Navya Gupta, Tejinder P. Singh

Abstract: We formulate the Einstein-Cartan-Dirac equations in the Newman-Penrose (NP) formalism, thereby presenting a more accurate and explicit analysis of previous such studies. The equations show in a transparent way how the Einstein-Dirac equations are modified by the inclusion of torsion. In particular, the Hehl-Datta equation is presented in NP notation. We then describe a few solutions of the Hehl-Da… ▽ More We formulate the Einstein-Cartan-Dirac equations in the Newman-Penrose (NP) formalism, thereby presenting a more accurate and explicit analysis of previous such studies. The equations show in a transparent way how the Einstein-Dirac equations are modified by the inclusion of torsion. In particular, the Hehl-Datta equation is presented in NP notation. We then describe a few solutions of the Hehl-Datta equation on Minkowski space-time, and in particular report a solitonic solution which removes the unphysical behavioiur of the corresponding Dirac solution. The present work serves as a prelude to similar studies for non-degenerate Poincare gauge gravity. △ Less

Submitted 28 June, 2018; v1 submitted 30 April, 2018; originally announced April 2018.

Comments: 32 pages, 4 figures, Section 5.4 on plane waves rewritten

Journal ref: Phys. Rev. D 98, 064046 (2018)

arXiv:1801.02900 [pdf, other]

Majorana zero energy modes in silicene

Authors: Anant Vijay Varma, Prasanta K. Panigrahi

Abstract: Zero energy modes are shown to exist in silicene under suitably chosen magnetic as well as electric fields. Two Majorana modes are found on the application of opposite local magnetization on the silicene sub-lattices. We identify a non-Majorana zero energy mode carrying pure spin current under the influence of inhomogeneous gate electric field. In both cases, wave functions reveal subtle interfere… ▽ More Zero energy modes are shown to exist in silicene under suitably chosen magnetic as well as electric fields. Two Majorana modes are found on the application of opposite local magnetization on the silicene sub-lattices. We identify a non-Majorana zero energy mode carrying pure spin current under the influence of inhomogeneous gate electric field. In both cases, wave functions reveal subtle interference pattern in phase space, showing structures finer than the Planck constant $\hbar$. A spin system coupled through spin-spin ($σ_{z}\otimesσ_{z}$) interaction with the Majorana modes exhibits periodic revival of coherence with a minimum period $\sim $1/n. The same system shows decoherence-free evolution in the case of gate electric field. Under a momentum dependent interaction, one Majorana mode having a bound state in the continuum character is found to be more robust as compared to the other. The mode arising on application of electric field shows rapid loss of coherence for such an interaction. △ Less

Submitted 9 January, 2018; originally announced January 2018.

arXiv:1701.06356 [pdf, other]

Let's HPC: A web-based interactive platform to aid High Performance Computing education

Authors: Akshar Varma, Yashwant Keswani, Yashodhan Bhatnagar, Bhaskar Chaudhury

Abstract: Let's HPC (www.letshpc.org) is an open-access online platform to supplement conventional classroom oriented High Performance Computing (HPC) and Parallel & Distributed Computing (PDC) education. The web based platform provides online plotting and analysis tools which allow users to learn, evaluate, teach and see the performance of parallel algorithms from a system's viewpoint. The user can quantit… ▽ More Let's HPC (www.letshpc.org) is an open-access online platform to supplement conventional classroom oriented High Performance Computing (HPC) and Parallel & Distributed Computing (PDC) education. The web based platform provides online plotting and analysis tools which allow users to learn, evaluate, teach and see the performance of parallel algorithms from a system's viewpoint. The user can quantitatively compare and understand the importance of numerous deterministic as well as non-deterministic factors of both the software and the hardware that impact the performance of parallel programs. At the heart of this platform is a database archiving the performance and execution environment related data of standard parallel algorithms executed on different computing architectures using different programming environments, this data is contributed by various stakeholders in the HPC community. The plotting and analysis tools of our platform can be combined seamlessly with the database to aid self-learning, teaching, evaluation and discussion of different HPC related topics. Instructors of HPC/PDC related courses can use the platform's tools to illustrate the importance of proper analysis in understanding factors impacting performance, to encourage peer learning among students, as well as to allow students to prepare a standard lab/project report aiding the instructor in uniform evaluation. The platform's modular design enables easy inclusion of performance related data from contributors as well as addition of new features in the future. △ Less

Submitted 23 January, 2017; originally announced January 2017.

Comments: 8 pages, 4 figures. Submitted to EduPar-17. This paper is regarding the Let's HPC platform which can be found here: http://www.letshpc.org

arXiv:1510.00958 [pdf, other]

Existence of k-ary Trees: Subtree Sizes, Heights and Depths

Authors: Akshar Varma

Abstract: The rooted tree is an important data structure, and the subtree size, height, and depth are naturally defined attributes of every node. We consider the problem of the existence of a k-ary tree given a list of attribute sequences. We give polynomial time (O(nlog(n))) algorithms for the existence of a k-ary tree given depth and/or height sequences. Our most significant results are the Strong NP-Comp… ▽ More The rooted tree is an important data structure, and the subtree size, height, and depth are naturally defined attributes of every node. We consider the problem of the existence of a k-ary tree given a list of attribute sequences. We give polynomial time (O(nlog(n))) algorithms for the existence of a k-ary tree given depth and/or height sequences. Our most significant results are the Strong NP-Completeness of the decision problems of existence of k-ary trees given subtree sizes sequences. We prove this by multi-stage reductions from NUMERICAL MATCHING WITH TARGET SUMS. In the process, we also prove a generalized version of the 3-PARTITION problem to be Strongly NP-Complete. By looking at problems where a combination of attribute sequences are given, we are able to draw the boundary between easy and hard problems related to existence of trees given attribute sequences and enhance our understanding of where the difficulty lies in such problems. △ Less

Submitted 17 July, 2016; v1 submitted 4 October, 2015; originally announced October 2015.

Comments: Revised version, 12 pages (excluding references)

ACM Class: F.2.2; G.2.2

Showing 1–44 of 44 results for author: Varma, A