Search | arXiv e-print repository

Machine-Learned Potentials for Solvation Modeling

Authors: Roopshree Banchode, Surajit Das, Shampa Raghunathan, Raghunathan Ramakrishnan

Abstract: Solvent environments play a central role in determining molecular structure, energetics, reactivity, and interfacial phenomena. However, modeling solvation from first principles remains difficult due to the complex interplay of interactions and unfavorable computational scaling of first-principles treatment with system size. Machine-learned potentials (MLPs) have recently emerged as efficient surr… ▽ More Solvent environments play a central role in determining molecular structure, energetics, reactivity, and interfacial phenomena. However, modeling solvation from first principles remains difficult due to the complex interplay of interactions and unfavorable computational scaling of first-principles treatment with system size. Machine-learned potentials (MLPs) have recently emerged as efficient surrogates for quantum chemistry methods, offering first-principles accuracy at greatly reduced computational cost. MLPs approximate the underlying potential energy surface, enabling efficient computation of energies and forces in solvated systems, and are capable of accounting for effects such as hydrogen bonding, long-range polarization, and conformational changes. This review surveys the development and application of MLPs in solvation modeling. We summarize the theoretical basis of MLP-based energy and force predictions and present a classification of MLPs based on training targets, model types, and design choices related to architectures, descriptors, and training protocols. Integration into established solvation workflows is discussed, with case studies spanning small molecules, interfaces, and reactive systems. We conclude by outlining open challenges and future directions toward transferable, robust, and physically grounded MLPs for solvation-aware atomistic modeling. △ Less

Submitted 29 May, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

Comments: minor edit, Fig. 1 revised

arXiv:2504.04186 [pdf, other]

doi 10.1145/3722212.3724430

AutoComp: Automated Data Compaction for Log-Structured Tables in Data Lakes

Authors: Anja Gruenheid, Jesús Camacho-Rodríguez, Carlo Curino, Raghu Ramakrishnan, Stanislav Pak, Sumedh Sakdeo, Lenisha Gandhi, Sandeep K. Singhal, Pooja Nilangekar, Daniel J. Abadi

Abstract: The proliferation of small files in data lakes poses significant challenges, including degraded query performance, increased storage costs, and scalability bottlenecks in distributed storage systems. Log-structured table formats (LSTs) such as Delta Lake, Apache Iceberg, and Apache Hudi exacerbate this issue due to their append-only write patterns and metadata-intensive operations. While compactio… ▽ More The proliferation of small files in data lakes poses significant challenges, including degraded query performance, increased storage costs, and scalability bottlenecks in distributed storage systems. Log-structured table formats (LSTs) such as Delta Lake, Apache Iceberg, and Apache Hudi exacerbate this issue due to their append-only write patterns and metadata-intensive operations. While compaction--the process of consolidating small files into fewer, larger files--is a common solution, existing automation mechanisms often lack the flexibility and scalability to adapt to diverse workloads and system requirements while balancing the trade-offs between compaction benefits and costs. In this paper, we present AutoComp, a scalable framework for automatic data compaction tailored to the needs of modern data lakes. Drawing on deployment experience at LinkedIn, we analyze the operational impact of small file proliferation, establish key requirements for effective automatic compaction, and demonstrate how AutoComp addresses these challenges. Our evaluation, conducted using synthetic benchmarks and production environments via integration with OpenHouse--a control plane for catalog management, schema governance, and data services--shows significant improvements in file count reduction and query performance. We believe AutoComp's built-in extensibility provides a robust foundation for evolving compaction systems, facilitating future integration of refined multi-objective optimization approaches, workload-aware compaction strategies, and expanded support for broader data layout optimizations. △ Less

Submitted 5 April, 2025; originally announced April 2025.

Journal ref: ACM SIGMOD 2025

arXiv:2504.01793 [pdf, ps, other]

Optimal shift-invariant spaces from uniform measurements

Authors: Rohan Joy, Radha Ramakrishnan

Abstract: Let $m$ be a positive integer and $\mathcal{C}$ be a collection of closed subspaces in $L^2(\mathbb{R})$. Given the measurements $\mathcal{F}_Y=\left\lbrace \left\lbrace y_k^1 \right\rbrace_{k\in \mathbb{Z}},\ldots, \left\lbrace y_k^m \right\rbrace_{k\in \mathbb{Z}} \right\rbrace \subset \ell^2(\mathbb{Z})$ of unknown functions… ▽ More Let $m$ be a positive integer and $\mathcal{C}$ be a collection of closed subspaces in $L^2(\mathbb{R})$. Given the measurements $\mathcal{F}_Y=\left\lbrace \left\lbrace y_k^1 \right\rbrace_{k\in \mathbb{Z}},\ldots, \left\lbrace y_k^m \right\rbrace_{k\in \mathbb{Z}} \right\rbrace \subset \ell^2(\mathbb{Z})$ of unknown functions $\mathcal{F}=\left\{f_1, \ldots,f_m \right\} \subset L^2( \mathbb{R})$, in this paper we study the problem of finding an optimal space $S$ in $\mathcal{C}$ that is ``closest" to the measurements $\mathcal{F}_Y$ of $\mathcal{F}$. Since the class of finitely generated shift-invariant spaces (FSISs) is popularly used for modelling signals, we assume $\mathcal{C}$ consists of FSISs. We will be considering three cases. In the first case, $\mathcal{C}$ consists of FSISs without any assumption on extra invariance. In the second case, we assume $\mathcal{C}$ consists of extra invariant FSISs, and in the third case, we assume $\mathcal{C}$ has translation-invariant FSISs. In all three cases, we prove the existence of an optimal space. △ Less

Submitted 2 April, 2025; originally announced April 2025.

arXiv:2503.20369 [pdf, other]

Unlocking Inverted Singlet-Triplet Gap in Alternant Hydrocarbons with Heteroatoms

Authors: Atreyee Majumdar, Surajit Das, Raghunathan Ramakrishnan

Abstract: Fifth-generation organic light-emitting diodes exhibit delayed fluorescence even at low temperatures, enabled by exothermic reverse intersystem crossing from a negative singlet-triplet gap (STG), where the first excited singlet lies anomalously below the triplet. This phenomenon -- termed delayed fluorescence from inverted singlet and triplet states (DFIST) -- has been experimentally confirmed onl… ▽ More Fifth-generation organic light-emitting diodes exhibit delayed fluorescence even at low temperatures, enabled by exothermic reverse intersystem crossing from a negative singlet-triplet gap (STG), where the first excited singlet lies anomalously below the triplet. This phenomenon -- termed delayed fluorescence from inverted singlet and triplet states (DFIST) -- has been experimentally confirmed only in two triangular molecules with a 12-annulene periphery and a central nitrogen atom. Here, we report a high-throughput virtual screening of 30,797 BN-substituted polycyclic aromatic hydrocarbons derived from 77 parent scaffolds (2--6 rings). Using a multi-level workflow combining structural stability criteria with accurate L-CC2 excited-state calculations, we identify 72 heteroaromatic candidates with STGs$<0$. Notably, this includes BN-helicenes, where inversion arises from through-space charge-transfer states. Several systems exhibit non-zero oscillator strengths, supporting their potential as fluorescent emitters. Our findings reveal new design motifs for DFIST beyond known frameworks, expanding the chemical space for next-generation emitters based on heteroatom-embedded aromatic systems. △ Less

Submitted 26 March, 2025; originally announced March 2025.

Comments: First draft (SI not included)

arXiv:2503.14712 [pdf, other]

Distribution and Purification of Entanglement States in Quantum Networks

Authors: Xiaojie Fan, Yukun Yang, Himanshu Gupta, C. R. Ramakrishnan

Abstract: We consider problems of distributing high-fidelity entangled states across nodes of a quantum network. We consider a repeater-based network architecture with entanglement swapping (fusion) operations for generating long-distance entanglements, and purification operations that produce high-fidelity states from several lower-fidelity states. The contributions of this paper are two-fold: First, while… ▽ More We consider problems of distributing high-fidelity entangled states across nodes of a quantum network. We consider a repeater-based network architecture with entanglement swapping (fusion) operations for generating long-distance entanglements, and purification operations that produce high-fidelity states from several lower-fidelity states. The contributions of this paper are two-fold: First, while there have been several works on fidelity-aware routing and incorporating purification into routing for generating EPs, this paper presents the first algorithms for optimal solutions to the high-fidelity EP distribution problem. We provide a dynamic programming algorithm for generating the optimal tree of operations to produce a high-fidelity EP, and an LP-based algorithm for generating an optimal collection of trees. Second, following the EP algorithms, this paper presents the first algorithms for the high-fidelity GHZ-state distribution problem and characterizes its optimality. We evaluate our techniques via simulations over NetSquid, a quantum network simulator. △ Less

Submitted 23 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

Comments: 9 pages, 8 figures

arXiv:2503.10904 [pdf, other]

Transferring Kinesthetic Demonstrations across Diverse Objects for Manipulation Planning

Authors: Dibyendu Das, Aditya Patankar, Nilanjan Chakraborty, C. R. Ramakrishnan, I. V. Ramakrishnan

Abstract: Given a demonstration of a complex manipulation task such as pouring liquid from one container to another, we seek to generate a motion plan for a new task instance involving objects with different geometries. This is non-trivial since we need to simultaneously ensure that the implicit motion constraints are satisfied (glass held upright while moving), the motion is collision-free, and that the ta… ▽ More Given a demonstration of a complex manipulation task such as pouring liquid from one container to another, we seek to generate a motion plan for a new task instance involving objects with different geometries. This is non-trivial since we need to simultaneously ensure that the implicit motion constraints are satisfied (glass held upright while moving), the motion is collision-free, and that the task is successful (e.g. liquid is poured into the target container). We solve this problem by identifying positions of critical locations and associating a reference frame (called motion transfer frames) on the manipulated object and the target, selected based on their geometries and the task at hand. By tracking and transferring the path of the motion transfer frames, we generate motion plans for arbitrary task instances with objects of different geometries and poses. We show results from simulation as well as robot experiments on physical objects to evaluate the effectiveness of our solution. △ Less

Submitted 13 March, 2025; originally announced March 2025.

arXiv:2502.09330 [pdf, ps, other]

Leveraging the Bias-Variance Tradeoff in Quantum Chemistry for Accurate Negative Singlet-Triplet Gap Predictions: A Case for Double-Hybrid DFT

Authors: Atreyee Majumdar, Raghunathan Ramakrishnan

Abstract: Molecules that have been suggested to violate the Hund's rule, having a first excited singlet state (S$_1$) energetically below the triplet state (T$_1$), are rare. Yet, they hold the promise to be efficient light emitters. Their high-throughput identification demands exceptionally accurate excited-state modeling to minimize qualitatively wrong predictions. We benchmark twelve S$_1$-T$_1$ energy g… ▽ More Molecules that have been suggested to violate the Hund's rule, having a first excited singlet state (S$_1$) energetically below the triplet state (T$_1$), are rare. Yet, they hold the promise to be efficient light emitters. Their high-throughput identification demands exceptionally accurate excited-state modeling to minimize qualitatively wrong predictions. We benchmark twelve S$_1$-T$_1$ energy gaps to find that the local-correlated versions of ADC(2) and CC2 excited state methods deliver excellent accuracy and speed for screening medium-sized molecules. Notably, we find that double-hybrid DFT approximations (e.g., B2GP-PLYP and PBE-QIDH) exhibit high mean absolute errors ($>100$ meV) despite very low standard deviations ($\approx10$ meV). Exploring their parameter space reveals that a configuration with 75% exchange and 55% correlation, which reduces the mean absolute error to below 5 meV, but with an increased variance. Using this low-bias parameterization as an internal reference, we correct the systematic error while maintaining low variance, effectively combining the strengths of both low-bias and low-variance DFT parameterizations to enhance overall accuracy. Our findings suggest that low-variance DFT methods, often overlooked due to their high bias, can serve as reliable tools for predictive modeling in first-principles molecular design. The bias-correction data-fitting procedure can be applied to any general problem where two flavors of a method, one with low bias and another with low variance, have been identified a priori. △ Less

Submitted 11 June, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

Comments: Please download version 2 to access SI

arXiv:2502.03429 [pdf, other]

On Fairness of Unified Multimodal Large Language Model for Image Generation

Authors: Ming Liu, Hao Chen, Jindong Wang, Liwen Wang, Bhiksha Raj Ramakrishnan, Wensheng Zhang

Abstract: Unified multimodal large language models (U-MLLMs) have demonstrated impressive performance in visual understanding and generation in an end-to-end pipeline. Compared with generation-only models (e.g., Stable Diffusion), U-MLLMs may raise new questions about bias in their outputs, which can be affected by their unified capabilities. This gap is particularly concerning given the under-explored risk… ▽ More Unified multimodal large language models (U-MLLMs) have demonstrated impressive performance in visual understanding and generation in an end-to-end pipeline. Compared with generation-only models (e.g., Stable Diffusion), U-MLLMs may raise new questions about bias in their outputs, which can be affected by their unified capabilities. This gap is particularly concerning given the under-explored risk of propagating harmful stereotypes. In this paper, we benchmark the latest U-MLLMs and find that most exhibit significant demographic biases, such as gender and race bias. To better understand and mitigate this issue, we propose a locate-then-fix strategy, where we audit and show how the individual model component is affected by bias. Our analysis shows that bias originates primarily from the language model. More interestingly, we observe a "partial alignment" phenomenon in U-MLLMs, where understanding bias appears minimal, but generation bias remains substantial. Thus, we propose a novel balanced preference model to balance the demographic distribution with synthetic data. Experiments demonstrate that our approach reduces demographic bias while preserving semantic fidelity. We hope our findings underscore the need for more holistic interpretation and debiasing strategies of U-MLLMs in the future. △ Less

Submitted 5 February, 2025; originally announced February 2025.

arXiv:2501.12107 [pdf, other]

The Quantum Internet (Technical Version)

Authors: Peter P. Rohde, Zixin Huang, Yingkai Ouyang, He-Liang Huang, Zu-En Su, Simon Devitt, Rohit Ramakrishnan, Atul Mantri, Si-Hui Tan, Nana Liu, Scott Harrison, Chandrashekar Radhakrishnan, Gavin K. Brennen, Ben Q. Baragiola, Jonathan P. Dowling, Tim Byrnes, William J. Munro

Abstract: Following the emergence of quantum computing, the subsequent quantum revolution will be that of interconnecting individual quantum computers at global level. In the same way that classical computers only realised their full potential with the emergence of the internet, a fully realised quantum internet is the next stage of evolution for quantum computation. This work examines in detail how the qua… ▽ More Following the emergence of quantum computing, the subsequent quantum revolution will be that of interconnecting individual quantum computers at global level. In the same way that classical computers only realised their full potential with the emergence of the internet, a fully realised quantum internet is the next stage of evolution for quantum computation. This work examines in detail how the quantum internet would evolve in practice, focusing not only on the technology itself but also on the implications it will have economically and politically. We present both original ideas, as well as an extensive review of relevant and related background material. This work begins with a description of classical networks before introducing the key concepts behind quantum networks, such as quantum internet protocols, quantum cryptography, and cloud quantum computing. The work is divided into technical sections (requiring only a basic knowledge of the notation of quantum mechanics), for those interested in mathematical details, as well as non-technical sections for those seeking a more general understanding. We target this work very broadly at quantum and classical computer scientists, classical computer systems, software and network engineers, physicists, economists, artists, musicians, and those just generally curious about the future of quantum technologies and what they might bring to humanity. △ Less

Submitted 22 January, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

Comments: 370 pages, comments are welcome; note that apart from a few sections, most of this project has been written before 2021

arXiv:2411.04036 [pdf, other]

Stepping Forward on the Last Mile

Authors: Chen Feng, Shaojie Zhuo, Xiaopeng Zhang, Ramchalam Kinattinkara Ramakrishnan, Zhaocong Yuan, Andrew Zou Li

Abstract: Continuously adapting pre-trained models to local data on resource constrained edge devices is the $\emph{last mile}$ for model deployment. However, as models increase in size and depth, backpropagation requires a large amount of memory, which becomes prohibitive for edge devices. In addition, most existing low power neural processing engines (e.g., NPUs, DSPs, MCUs, etc.) are designed as fixed-po… ▽ More Continuously adapting pre-trained models to local data on resource constrained edge devices is the $\emph{last mile}$ for model deployment. However, as models increase in size and depth, backpropagation requires a large amount of memory, which becomes prohibitive for edge devices. In addition, most existing low power neural processing engines (e.g., NPUs, DSPs, MCUs, etc.) are designed as fixed-point inference accelerators, without training capabilities. Forward gradients, solely based on directional derivatives computed from two forward calls, have been recently used for model training, with substantial savings in computation and memory. However, the performance of quantized training with fixed-point forward gradients remains unclear. In this paper, we investigate the feasibility of on-device training using fixed-point forward gradients, by conducting comprehensive experiments across a variety of deep learning benchmark tasks in both vision and audio domains. We propose a series of algorithm enhancements that further reduce the memory footprint, and the accuracy gap compared to backpropagation. An empirical study on how training with forward gradients navigates in the loss landscape is further explored. Our results demonstrate that on the last mile of model customization on edge devices, training with fixed-point forward gradients is a feasible and practical approach. △ Less

Submitted 6 November, 2024; originally announced November 2024.

arXiv:2411.02329 [pdf, other]

Probabilistic Parallels in the Classical Limit of Quantum Mechanical Models

Authors: Raghunathan Ramakrishnan

Abstract: At large quantum numbers, the probability densities for particle-in-a-box or simple harmonic oscillator converge to the classical result upon coarse-graining the quantum mechanical probability densities by introducing a finite resolution in the measurement of the particle's position. This resolution in the position can be related to the resolution of the secondary total angular momentum quantum nu… ▽ More At large quantum numbers, the probability densities for particle-in-a-box or simple harmonic oscillator converge to the classical result upon coarse-graining the quantum mechanical probability densities by introducing a finite resolution in the measurement of the particle's position. This resolution in the position can be related to the resolution of the secondary total angular momentum quantum number ($m$) when interpreting the probabilistic outcomes of the Stern--Gerlach-type thought experiments for large values of the angular momentum quantum numbers ($j$). △ Less

Submitted 4 November, 2024; originally announced November 2024.

Comments: first draft

arXiv:2410.18275 [pdf, other]

Screw Geometry Meets Bandits: Incremental Acquisition of Demonstrations to Generate Manipulation Plans

Authors: Dibyendu Das, Aditya Patankar, Nilanjan Chakraborty, C. R. Ramakrishnan, I. V. Ramakrishnan

Abstract: In this paper, we study the problem of methodically obtaining a sufficient set of kinesthetic demonstrations, one at a time, such that a robot can be confident of its ability to perform a complex manipulation task in a given region of its workspace. Although Learning from Demonstrations has been an active area of research, the problems of checking whether a set of demonstrations is sufficient, and… ▽ More In this paper, we study the problem of methodically obtaining a sufficient set of kinesthetic demonstrations, one at a time, such that a robot can be confident of its ability to perform a complex manipulation task in a given region of its workspace. Although Learning from Demonstrations has been an active area of research, the problems of checking whether a set of demonstrations is sufficient, and systematically seeking additional demonstrations have remained open. We present a novel approach to address these open problems using (i) a screw geometric representation to generate manipulation plans from demonstrations, which makes the sufficiency of a set of demonstrations measurable; (ii) a sampling strategy based on PAC-learning from multi-armed bandit optimization to evaluate the robot's ability to generate manipulation plans in a subregion of its task space; and (iii) a heuristic to seek additional demonstration from areas of weakness. Thus, we present an approach for the robot to incrementally and actively ask for new demonstration examples until the robot can assess with high confidence that it can perform the task successfully. We present experimental results on two example manipulation tasks, namely, pouring and scooping, to illustrate our approach. A short video on the method: https://youtu.be/R-qICICdEos △ Less

Submitted 23 October, 2024; originally announced October 2024.

Comments: 8 pages, 6 figures, under review in IEEE Robotics and Automation Letters

arXiv:2410.18221 [pdf, other]

Data Augmentation for Automated Adaptive Rodent Training

Authors: Dibyendu Das, Alfredo Fontanini, Joshua F. Kogan, Haibin Ling, C. R. Ramakrishnan, I. V. Ramakrishnan

Abstract: Fully optimized automation of behavioral training protocols for lab animals like rodents has long been a coveted goal for researchers. It is an otherwise labor-intensive and time-consuming process that demands close interaction between the animal and the researcher. In this work, we used a data-driven approach to optimize the way rodents are trained in labs. In pursuit of our goal, we looked at da… ▽ More Fully optimized automation of behavioral training protocols for lab animals like rodents has long been a coveted goal for researchers. It is an otherwise labor-intensive and time-consuming process that demands close interaction between the animal and the researcher. In this work, we used a data-driven approach to optimize the way rodents are trained in labs. In pursuit of our goal, we looked at data augmentation, a technique that scales well in data-poor environments. Using data augmentation, we built several artificial rodent models, which in turn would be used to build an efficient and automatic trainer. Then we developed a novel similarity metric based on the action probability distribution to measure the behavioral resemblance of our models to that of real rodents. △ Less

Submitted 23 October, 2024; originally announced October 2024.

Comments: 5 pages, 3 figures

arXiv:2410.14357 [pdf, other]

Efficient charge-preserving excited state preparation with variational quantum algorithms

Authors: Zohim Chandani, Kazuki Ikeda, Zhong-Bo Kang, Dmitri E. Kharzeev, Alexander McCaskey, Andrea Palermo, C. R. Ramakrishnan, Pooja Rao, Ranjani G. Sundaram, Kwangmin Yu

Abstract: Determining the spectrum and wave functions of excited states of a system is crucial in quantum physics and chemistry. Low-depth quantum algorithms, such as the Variational Quantum Eigensolver (VQE) and its variants, can be used to determine the ground-state energy. However, current approaches to computing excited states require numerous controlled unitaries, making the application of the original… ▽ More Determining the spectrum and wave functions of excited states of a system is crucial in quantum physics and chemistry. Low-depth quantum algorithms, such as the Variational Quantum Eigensolver (VQE) and its variants, can be used to determine the ground-state energy. However, current approaches to computing excited states require numerous controlled unitaries, making the application of the original Variational Quantum Deflation (VQD) algorithm to problems in chemistry or physics suboptimal. In this study, we introduce a charge-preserving VQD (CPVQD) algorithm, designed to incorporate symmetry and the corresponding conserved charge into the VQD framework. This results in dimension reduction, significantly enhancing the efficiency of excited-state computations. We present benchmark results with GPU-accelerated simulations using systems up to 24 qubits, showcasing applications in high-energy physics, nuclear physics, and quantum chemistry. This work is performed on NERSC's Perlmutter system using NVIDIA's open-source platform for accelerated quantum supercomputing - CUDA-Q. △ Less

Submitted 18 October, 2024; originally announced October 2024.

Comments: 20 pages, 6 figures, 1 table

arXiv:2410.09135 [pdf, other]

Enabling Advanced Land Cover Analytics: An Integrated Data Extraction Pipeline for Predictive Modeling with the Dynamic World Dataset

Authors: Victor Radermecker, Andrea Zanon, Nancy Thomas, Annita Vapsi, Saba Rahimi, Rama Ramakrishnan, Daniel Borrajo

Abstract: Understanding land cover holds considerable potential for a myriad of practical applications, particularly as data accessibility transitions from being exclusive to governmental and commercial entities to now including the broader research community. Nevertheless, although the data is accessible to any community member interested in exploration, there exists a formidable learning curve and no stan… ▽ More Understanding land cover holds considerable potential for a myriad of practical applications, particularly as data accessibility transitions from being exclusive to governmental and commercial entities to now including the broader research community. Nevertheless, although the data is accessible to any community member interested in exploration, there exists a formidable learning curve and no standardized process for accessing, pre-processing, and leveraging the data for subsequent tasks. In this study, we democratize this data by presenting a flexible and efficient end to end pipeline for working with the Dynamic World dataset, a cutting-edge near-real-time land use/land cover (LULC) dataset. This includes a pre-processing and representation framework which tackles noise removal, efficient extraction of large amounts of data, and re-representation of LULC data in a format well suited for several downstream tasks. To demonstrate the power of our pipeline, we use it to extract data for an urbanization prediction problem and build a suite of machine learning models with excellent performance. This task is easily generalizable to the prediction of any type of land cover and our pipeline is also compatible with a series of other downstream tasks. △ Less

Submitted 11 October, 2024; originally announced October 2024.

arXiv:2410.04846 [pdf, ps, other]

Discrete Calderon condition for the twisted wavelet system

Authors: Radha Ramakrishnan, Rabeetha Velsamy

Abstract: In this paper, we obtain an analogue of the discrete Calderon condition and prove that this condition is sufficient for an orthonormal twisted wavelet system to be complete in L^{2}(R^{2}). In this paper, we obtain an analogue of the discrete Calderon condition and prove that this condition is sufficient for an orthonormal twisted wavelet system to be complete in L^{2}(R^{2}). △ Less

Submitted 7 October, 2024; originally announced October 2024.

arXiv:2408.06757 [pdf, other]

Deep convolutional neural networks and data approximation using the fractional Fourier transform

Authors: M. H. A. Biswas, P. Massopust, R. Ramakrishnan

Abstract: In the first part of this paper, we define a deep convolutional neural network connected with the fractional Fourier transform (FrFT) using the $θ$-translation operator, the translation operator associated with the FrFT. Subsequently, we study $θ$-translation invariance properties of this network. Unlike the classical case, these networks are not translation invariant. \par In the second part, we… ▽ More In the first part of this paper, we define a deep convolutional neural network connected with the fractional Fourier transform (FrFT) using the $θ$-translation operator, the translation operator associated with the FrFT. Subsequently, we study $θ$-translation invariance properties of this network. Unlike the classical case, these networks are not translation invariant. \par In the second part, we study data approximation problems using the FrFT. More precisely, given a data set $\fl=\{f_1,\cdots, f_m\}\subset L^2(\R^n)$, we obtain $Φ=\{φ_1,\cdots,φ_\ell\}$ such that \[ V_θ(Φ)=\argmin\sum_{j=1}^m \|f_j-P_{V}f_j\|^2, \] where the minimum is taken over all $θ$-shift invariant spaces generated by at most $\ell$ elements. Moreover, we prove the existence of a space of bandlimited functions in the FrFT domain which is ``closest" to $\fl$ in the above sense. △ Less

Submitted 13 August, 2024; originally announced August 2024.

arXiv:2407.04552 [pdf, other]

Influence of Pseudo-Jahn-Teller Activity on the Singlet-Triplet Gap of Azaphenalenes

Authors: Atreyee Majumdar, Komal Jindal, Surajit Das, Raghunathan Ramakrishnan

Abstract: We analyze the possibility of symmetry-lowering induced by pseudo-Jahn--Teller interactions in six previously studied azaphenalenes that are known to have their first excited singlet state (S$_1$) lower in energy than the triplet state (T$_1$). The primary aim of this study is to explore whether Hund's rule violation is observed in these molecules when their structures are distorted from… ▽ More We analyze the possibility of symmetry-lowering induced by pseudo-Jahn--Teller interactions in six previously studied azaphenalenes that are known to have their first excited singlet state (S$_1$) lower in energy than the triplet state (T$_1$). The primary aim of this study is to explore whether Hund's rule violation is observed in these molecules when their structures are distorted from $C_{\rm 2v}$ or $D_{\rm 3h}$ point group symmetries by vibronic coupling. Along two interatomic distances connecting these point groups to their subgroups $C_{\rm s}$ or $C_{\rm 3h}$, we relaxed the other internal degrees of freedom and calculated two-dimensional potential energy subsurfaces. The many-body perturbation theory (MP2) suggests that the high-symmetry structures are the energy minima for all six systems. However, single-point energy calculations using the coupled-cluster method (CCSD(T)) indicate symmetry lowering in four cases. The singlet-triplet energy gap plotted on the potential energy surface also shows variations when deviating from high-symmetry structures. A full geometry optimization at the CCSD(T) level with the cc-pVTZ basis set reveals that the $D_{\rm 3h}$ structure of cyclazine (1AP) is a saddle point, connecting two equivalent minima of $C_{\rm 3h}$ symmetry undergoing rapid automerization. The combined effects of symmetry lowering and high-level corrections result in a nearly zero singlet-triplet gap for the $C_{\rm 3h}$ structure of cyclazine. Azaphenalenes containing nitrogen atoms at electron-deficient sites -- 2AP, 3AP, and 4AP -- exhibit more pronounced in-plane structural distortion; the effect is captured by the long-range exchange-interaction corrected DFT method, $ω$B97XD. Excited state calculations of these systems indicate that in their low-symmetry energy minima, T$_1$ is indeed lower in energy than S$_1$, upholding the validity of Hund's rule. △ Less

Submitted 5 October, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

Comments: second version

arXiv:2405.20033 [pdf, other]

Chemical Space-Informed Machine Learning Models for Rapid Predictions of X-ray Photoelectron Spectra of Organic Molecules

Authors: Susmita Tripathy, Surajit Das, Shweta Jindal, Raghunathan Ramakrishnan

Abstract: We present machine learning models based on kernel-ridge regression for predicting X-ray photoelectron spectra of organic molecules originating from the $K$-shell ionization energies of carbon (C), nitrogen (N), oxygen (O), and fluorine (F) atoms. We constructed the training dataset through high-throughput calculations of $K$-shell core-electron binding energies (CEBEs) for 12,880 small organic mo… ▽ More We present machine learning models based on kernel-ridge regression for predicting X-ray photoelectron spectra of organic molecules originating from the $K$-shell ionization energies of carbon (C), nitrogen (N), oxygen (O), and fluorine (F) atoms. We constructed the training dataset through high-throughput calculations of $K$-shell core-electron binding energies (CEBEs) for 12,880 small organic molecules in the bigQM7$ω$ dataset, employing the $Δ$-SCF formalism coupled with meta-GGA-DFT and a variationally converged basis set. The models are cost-effective, as they require the atomic coordinates of a molecule generated using universal force fields while estimating the target-level CEBEs corresponding to DFT-level equilibrium geometry. We explore transfer learning by utilizing the atomic environment feature vectors learned using a graph neural network framework in kernel-ridge regression. Additionally, we enhance accuracy within the $Δ$-machine learning framework by leveraging inexpensive baseline spectra derived from Kohn--Sham eigenvalues. When applied to 208 combinatorially substituted uracil molecules larger than those in the training set, our analyses suggest that the models may not provide quantitatively accurate predictions of CEBEs but offer a strong linear correlation relevant for virtual high-throughput screening. We present the dataset and models as the Python module, ${\tt cebeconf}$, to facilitate further explorations. △ Less

Submitted 15 August, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

Comments: Major Revision, New Figures and Tables added in the SI, Figures 1 and 4 revised

arXiv:2405.07499 [pdf, other]

Distributed Quantum Computation with Minimum Circuit Execution Time over Quantum Networks

Authors: Ranjani G Sundaram, Himanshu Gupta, C. R. Ramakrishnan

Abstract: Present quantum computers are constrained by limited qubit capacity and restricted physical connectivity, leading to challenges in large-scale quantum computations. Distributing quantum computations across a network of quantum computers is a promising way to circumvent these challenges and facilitate large quantum computations. However, distributed quantum computations require entanglements (to ex… ▽ More Present quantum computers are constrained by limited qubit capacity and restricted physical connectivity, leading to challenges in large-scale quantum computations. Distributing quantum computations across a network of quantum computers is a promising way to circumvent these challenges and facilitate large quantum computations. However, distributed quantum computations require entanglements (to execute remote gates) which can incur significant generation latency and, thus, lead to decoherence of qubits. In this work, we consider the problem of distributing quantum circuits across a quantum network to minimize the execution time. The problem entails mapping the circuit qubits to network memories, including within each computer since limited connectivity within computers can affect the circuit execution time. We provide two-step solutions for the above problem: In the first step, we allocate qubits to memories to minimize the estimated execution time; for this step, we design an efficient algorithm based on an approximation algorithm for the max-quadratic-assignment problem. In the second step, we determine an efficient execution scheme, including generating required entanglements with minimum latency under the network resource and decoherence constraints; for this step, we develop two algorithms with appropriate performance guarantees under certain settings or assumptions. We consider multiple protocols for executing remote gates, viz., telegates and cat-entanglements. With extensive simulations over NetSquid, a quantum network simulator, we demonstrate the effectiveness of our developed techniques and show that they outperform a scheme based on prior work by up to 95%. △ Less

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.01813 [pdf, other]

doi 10.1145/3555041.3589674

Towards Building Autonomous Data Services on Azure

Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to… ▽ More Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to gain the most value from them. For cloud providers, managing every aspect of an ever-increasing set of data services, while meeting customer SLAs and minimizing operational cost is becoming more challenging. Cloud technology enables the collection of significant amounts of workload traces and system telemetry. With the progress in data science (DS) and machine learning (ML), it is feasible and desirable to utilize a data-driven, ML-based approach to automate various aspects of data services, resulting in the creation of autonomous data services. This paper presents our perspectives and insights on creating autonomous data services on Azure. It also covers the future endeavors we plan to undertake and unresolved issues that still need attention. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

arXiv:2405.00222 [pdf, other]

Optimized Distribution of Entanglement Graph States in Quantum Networks

Authors: Xiaojie Fan, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan

Abstract: Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantu… ▽ More Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantum networks lie at the heart of the success of future quantum information technologies. In quantum networks, multipartite entangled states distributed over the network help implement and support many quantum network applications for communications, sensing, and computing. Our work focuses on developing optimal techniques to generate and distribute multipartite entanglement states efficiently. Prior works on generating general multipartite entanglement states have focused on the objective of minimizing the number of maximally entangled pairs (EPs) while ignoring the heterogeneity of the network nodes and links as well as the stochastic nature of underlying processes. In this work, we develop a hypergraph based linear programming framework that delivers optimal (under certain assumptions) generation schemes for general multipartite entanglement represented by graph states, under the network resources, decoherence, and fidelity constraints, while considering the stochasticity of the underlying processes. We illustrate our technique by developing generation schemes for the special cases of path and tree graph states, and discuss optimized generation schemes for more general classes of graph states. Using extensive simulations over a quantum network simulator (NetSquid), we demonstrate the effectiveness of our developed techniques and show that they outperform prior known schemes by up to orders of magnitude. △ Less

Submitted 18 March, 2025; v1 submitted 30 April, 2024; originally announced May 2024.

Comments: 16 pages, 20 figures

arXiv:2402.13801 [pdf, other]

doi 10.1039/D4CP00886C

Resilience of Hund's rule in the Chemical Space of Small Organic Molecules

Authors: Atreyee Majumdar, Raghunathan Ramakrishnan

Abstract: We embark on a quest to identify small molecules in the chemical space that can potentially violate Hund's rule. Utilizing twelve TDDFT approximations and the ADC(2) many-body method, we report the energies of S$_1$ and T$_1$ excited states of 12,880 closed-shell organic molecules within the bigQM7$ω$ dataset with up to 7 CONF atoms. In this comprehensive dataset, none of the molecules, in their m… ▽ More We embark on a quest to identify small molecules in the chemical space that can potentially violate Hund's rule. Utilizing twelve TDDFT approximations and the ADC(2) many-body method, we report the energies of S$_1$ and T$_1$ excited states of 12,880 closed-shell organic molecules within the bigQM7$ω$ dataset with up to 7 CONF atoms. In this comprehensive dataset, none of the molecules, in their minimum energy geometry, exhibit a negative S$_1$-T$_1$ energy gap at the ADC($2$) level while several molecules display values $<0.1$ eV. The spin-component-scaled double-hybrid method, SCS-PBE-QIDH, demonstrates the best agreement with ADC(2). Yet, at this level, a few molecules with a strained $sp^3$-N center turn out as false-positives with the S$_1$ state lower in energy than T$_1$. We investigate a prototypical cage molecule with an energy gap $<-0.2$ eV, which a closer examination revealed as another false positive. We conclude that in the chemical space of small closed-shell organic molecules, it is possible to identify geometric and electronic structural features giving rise to S$_1$-T$_1$ degeneracy; still, there is no evidence of a negative gap. We share the dataset generated for this study as a module, to facilitate seamless molecular discovery through data mining. △ Less

Submitted 3 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

Comments: Minor revision. Fig.5 revised, SI Tables reorganized

arXiv:2401.11162 [pdf]

Extending Polaris to Support Transactions

Authors: Josep Aguilar-Saborit, Raghu Ramakrishnan, Kevin Bocksrocker, Alan Halverson, Konstantin Kosinsky, Ryan O'Connor, Nadejda Poliakova, Moe Shafiei, Taewoo Kim, Phil Kon-Kim, Haris Mahmud-Ansari, Blazej Matuszyk, Matt Miles, Sumin Mohanan, Cristian Petculescu, Ishan Rahesh-Madan, Emma Rose-Wirshing, Elias Yousefi

Abstract: In Polaris, we introduced a cloud-native distributed query processor to perform analytics at scale. In this paper, we extend the underlying Polaris distributed computation framework, which can be thought of as a read-only transaction engine, to execute general transactions (including updates, deletes, inserts and bulk loads, in addition to queries) for Tier 1 warehousing workloads in a highly perf… ▽ More In Polaris, we introduced a cloud-native distributed query processor to perform analytics at scale. In this paper, we extend the underlying Polaris distributed computation framework, which can be thought of as a read-only transaction engine, to execute general transactions (including updates, deletes, inserts and bulk loads, in addition to queries) for Tier 1 warehousing workloads in a highly performant and predictable manner. We take advantage of the immutability of data files in log-structured data stores and build on SQL Server transaction management to deliver full transactional support with Snapshot Isolation semantics, including multi-table and multi-statement transactions. With the enhancements described in this paper, Polaris supports both query processing and transactions for T-SQL in Microsoft Fabric. △ Less

Submitted 20 January, 2024; originally announced January 2024.

Comments: 12 pages, 12 Figures

arXiv:2401.09621 [pdf, other]

XTable in Action: Seamless Interoperability in Data Lakes

Authors: Ashvin Agrawal, Tim Brown, Anoop Johnson, Jesús Camacho-Rodríguez, Kyle Weller, Carlo Curino, Raghu Ramakrishnan

Abstract: Contemporary approaches to data management are increasingly relying on unified analytics and AI platforms to foster collaboration, interoperability, seamless access to reliable data, and high performance. Data Lakes featuring open standard table formats such as Delta Lake, Apache Hudi, and Apache Iceberg are central components of these data architectures. Choosing the right format for managing a t… ▽ More Contemporary approaches to data management are increasingly relying on unified analytics and AI platforms to foster collaboration, interoperability, seamless access to reliable data, and high performance. Data Lakes featuring open standard table formats such as Delta Lake, Apache Hudi, and Apache Iceberg are central components of these data architectures. Choosing the right format for managing a table is crucial for achieving the objectives mentioned above. The challenge lies in selecting the best format, a task that is onerous and can yield temporary results, as the ideal choice may shift over time with data growth, evolving workloads, and the competitive development of table formats and processing engines. Moreover, restricting data access to a single format can hinder data sharing resulting in diminished business value over the long term. The ability to seamlessly interoperate between formats and with negligible overhead can effectively address these challenges. Our solution in this direction is an innovative omni-directional translator, XTable, that facilitates writing data in one format and reading it in any format, thus achieving the desired format interoperability. In this work, we demonstrate the effectiveness of XTable through application scenarios inspired by real-world use cases. △ Less

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2312.11569 [pdf]

doi 10.5281/zenodo.10500601

Application of AI in Nutrition

Authors: Ritu Ramakrishnan, Tianxiang Xing, Tianfeng Chen, Ming-Hao Lee, Jinzhu Gao

Abstract: In healthcare, artificial intelligence (AI) has been changing the way doctors and health experts take care of people. This paper will cover how AI is making major changes in the health care system, especially with nutrition. Various machine learning and deep learning algorithms have been developed to extract valuable information from healthcare data which help doctors, nutritionists, and health ex… ▽ More In healthcare, artificial intelligence (AI) has been changing the way doctors and health experts take care of people. This paper will cover how AI is making major changes in the health care system, especially with nutrition. Various machine learning and deep learning algorithms have been developed to extract valuable information from healthcare data which help doctors, nutritionists, and health experts to make better decisions and make our lifestyle healthy. This paper provides an overview of the current state of AI applications in healthcare with a focus on the utilization of AI-driven recommender systems in nutrition. It will discuss the positive outcomes and challenges that arise when AI is used in this field. This paper addresses the challenges to develop AI recommender systems in healthcare, providing a well-rounded perspective on the complexities. Real-world examples and research findings are presented to underscore the tangible and significant impact AI recommender systems have in the field of healthcare, particularly in nutrition. The ongoing efforts of applying AI in nutrition lay the groundwork for a future where personalized recommendations play a pivotal role in guiding individuals toward healthier lifestyles. △ Less

Submitted 17 December, 2023; originally announced December 2023.

Journal ref: Journal of Advances in Information Science and Technology, Volume 1, Issue 1, 2023, Pages 7-12

arXiv:2312.08598 [pdf, other]

MotherNet: Fast Training and Inference via Hyper-Network Transformers

Authors: Andreas Müller, Carlo Curino, Raghu Ramakrishnan

Abstract: Foundation models are transforming machine learning across many modalities, with in-context learning replacing classical model training. Recent work on tabular data hints at a similar opportunity to build foundation models for classification for numerical data. However, existing meta-learning approaches can not compete with tree-based methods in terms of inference time. In this paper, we propose M… ▽ More Foundation models are transforming machine learning across many modalities, with in-context learning replacing classical model training. Recent work on tabular data hints at a similar opportunity to build foundation models for classification for numerical data. However, existing meta-learning approaches can not compete with tree-based methods in terms of inference time. In this paper, we propose MotherNet, a hypernetwork architecture trained on synthetic classification tasks that, once prompted with a never-seen-before training set generates the weights of a trained ``child'' neural-network by in-context learning using a single forward pass. In contrast to most existing hypernetworks that are usually trained for relatively constrained multi-task settings, MotherNet can create models for multiclass classification on arbitrary tabular datasets without any dataset specific gradient descent. The child network generated by MotherNet outperforms neural networks trained using gradient descent on small datasets, and is comparable to predictions by TabPFN and standard ML methods like Gradient Boosting. Unlike a direct application of TabPFN, MotherNet generated networks are highly efficient at inference time. We also demonstrate that HyperFast is unable to perform effective in-context learning on small datasets, and heavily relies on dataset specific fine-tuning and hyper-parameter tuning, while MotherNet requires no fine-tuning or per-dataset hyper-parameters. △ Less

Submitted 9 May, 2025; v1 submitted 13 December, 2023; originally announced December 2023.

Comments: 17 pages, 13 figures

ACM Class: I.2.6

arXiv:2311.09593 [pdf, other]

Multi-Step Dialogue Workflow Action Prediction

Authors: Ramya Ramakrishnan, Ethan R. Elenberg, Hashan Narangodage, Ryan McDonald

Abstract: In task-oriented dialogue, a system often needs to follow a sequence of actions, called a workflow, that complies with a set of guidelines in order to complete a task. In this paper, we propose the novel problem of multi-step workflow action prediction, in which the system predicts multiple future workflow actions. Accurate prediction of multiple steps allows for multi-turn automation, which can f… ▽ More In task-oriented dialogue, a system often needs to follow a sequence of actions, called a workflow, that complies with a set of guidelines in order to complete a task. In this paper, we propose the novel problem of multi-step workflow action prediction, in which the system predicts multiple future workflow actions. Accurate prediction of multiple steps allows for multi-turn automation, which can free up time to focus on more complex tasks. We propose three modeling approaches that are simple to implement yet lead to more action automation: 1) fine-tuning on a training dataset, 2) few-shot in-context learning leveraging retrieval and large language model prompting, and 3) zero-shot graph traversal, which aggregates historical action sequences into a graph for prediction. We show that multi-step action prediction produces features that improve accuracy on downstream dialogue tasks like predicting task success, and can increase automation of steps by 20% without requiring as much feedback from a human overseeing the system. △ Less

Submitted 12 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

arXiv:2311.08300 [pdf, other]

Workflow-Guided Response Generation for Task-Oriented Dialogue

Authors: Do June Min, Paloma Sodhi, Ramya Ramakrishnan

Abstract: Task-oriented dialogue (TOD) systems aim to achieve specific goals through interactive dialogue. Such tasks usually involve following specific workflows, i.e. executing a sequence of actions in a particular order. While prior work has focused on supervised learning methods to condition on past actions, they do not explicitly optimize for compliance to a desired workflow. In this paper, we propose… ▽ More Task-oriented dialogue (TOD) systems aim to achieve specific goals through interactive dialogue. Such tasks usually involve following specific workflows, i.e. executing a sequence of actions in a particular order. While prior work has focused on supervised learning methods to condition on past actions, they do not explicitly optimize for compliance to a desired workflow. In this paper, we propose a novel framework based on reinforcement learning (RL) to generate dialogue responses that are aligned with a given workflow. Our framework consists of ComplianceScorer, a metric designed to evaluate how well a generated response executes the specified action, combined with an RL opimization process that utilizes an interactive sampling technique. We evaluate our approach on two TOD datasets, Action-Based Conversations Dataset (ABCD) (Chen et al., 2021a) and MultiWOZ 2.2 (Zang et al., 2020) on a range of automated and human evaluation metrics. Our findings indicate that our RL-based framework outperforms baselines and is effective at enerating responses that both comply with the intended workflows while being expressed in a natural and fluent manner. △ Less

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2308.13238 [pdf, ps, other]

Twisted shift preserving operators on $L^{2}(\mathbb{R}^{2n})$

Authors: Rabeetha Velsamy, Radha Ramakrishnan

Abstract: We introduce the J map using the Zak transform associated with the Weyl transform on $L^{2}(\mathbb{R}^{2n})$. We obtain a decomposition for a twisted shift-invariant subspace of $L^{2}(\mathbb{R}^{2n})$ as a direct sum of mutually orthogonal principal twisted shift-invariant spaces such that the respective system of twisted translates forms a Parseval frame sequence. We establish that the twisted… ▽ More We introduce the J map using the Zak transform associated with the Weyl transform on $L^{2}(\mathbb{R}^{2n})$. We obtain a decomposition for a twisted shift-invariant subspace of $L^{2}(\mathbb{R}^{2n})$ as a direct sum of mutually orthogonal principal twisted shift-invariant spaces such that the respective system of twisted translates forms a Parseval frame sequence. We establish that the twisted shift preserving operators and the corresponding range operators simultaneously share some properties in common, namely, self-adjoint, unitary, range of the spectrum and bounded below properties. We prove that the frame operator and its inverse associated with a system of twisted translates of {\varphi_s}_{s\in Z} are shift preserving. We also show that the corresponding range operators turn out to be the dual Gramian and its inverse associated with the collection {J\varphi_s(. , .)}_{s\in Z}. △ Less

Submitted 10 May, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

MSC Class: Primary 42C15; Secondary 42B10; 47B02

arXiv:2307.00732 [pdf, other]

Variational augmentation of Gaussian continuum basis sets for calculating atomic higher harmonic generation spectra

Authors: Sai Vijay Bhaskar Mocherla, Raghunathan Ramakrishnan

Abstract: We present a variational augmentation procedure to optimize the exponents of Gaussian continuum basis sets for simulating strong-field laser ionization phenomena such as higher harmonic generation (HHG) in atoms and ions using the time-dependent configuration interaction (TDCI) method. We report the distribution of the optimized exponents and discuss how efficiently the resulting basis functions s… ▽ More We present a variational augmentation procedure to optimize the exponents of Gaussian continuum basis sets for simulating strong-field laser ionization phenomena such as higher harmonic generation (HHG) in atoms and ions using the time-dependent configuration interaction (TDCI) method. We report the distribution of the optimized exponents and discuss how efficiently the resulting basis functions span the variational space to describe the near-continuum states involved in HHG. Further, we calculated the higher harmonic spectra of three two-electron systems -- H$^{-}$, He and Li$^{+}$ -- generated by 800nm driving laser-pulses with pulse-width of 54fs and peak intensities in the tunnel ionization regime of each system. We analyze the performance of these basis sets with an increasing number of higher angular momentum functions and show that up to $g$-type functions are required to obtain qualitatively accurate harmonic spectra. Additionally, we also comment on the impact of electron correlation on the HHG spectra. Finally, we show that by systematically augmenting additional shells we model the strong-field dynamics at higher laser peak intensities. △ Less

Submitted 2 July, 2023; originally announced July 2023.

Comments: 7 figures, 2 tables, 27 SI figures. Working paper

arXiv:2306.17756 [pdf, other]

doi 10.1063/5.0166149

Band gaps of long-period polytypes of IV, IV-IV, and III-V semiconductors estimated with an Ising-type additivity model

Authors: Raghunathan Ramakrishnan, Shruti Jain

Abstract: We apply an Ising-type model to estimate the band gaps of the polytypes of group IV elements (C, Si, and Ge) and binary compounds of groups: IV-IV (SiC, GeC, and GeSi), and III-V (nitride, phosphide, and arsenide of B, Al, and Ga). The models use reference band gaps of the simplest polytypes comprising 2--6 bilayers calculated with the hybrid density functional approximation, HSE06. We report four… ▽ More We apply an Ising-type model to estimate the band gaps of the polytypes of group IV elements (C, Si, and Ge) and binary compounds of groups: IV-IV (SiC, GeC, and GeSi), and III-V (nitride, phosphide, and arsenide of B, Al, and Ga). The models use reference band gaps of the simplest polytypes comprising 2--6 bilayers calculated with the hybrid density functional approximation, HSE06. We report four models capable of estimating band gaps of nine polytypes containing 7 and 8 bilayers with an average error of $\lesssim0.05$ eV. We apply the best model with an error of $<0.04$ eV to predict the band gaps of 497 polytypes with up to 15 bilayers in the unit cell, providing a comprehensive view of the variation in the electronic structure with the degree of hexagonality of the crystal structure. Within our enumeration, we identify four rhombohedral polytypes of SiC -- 9$R$, 12$R$, 15$R$(1), and 15$R$(2) -- and perform detailed stability and band structure analysis. Of these, 15$R$(1) that has not been experimentally characterized has the widest band gap ($>3.4$ eV); phonon analysis and cohesive energy reveal 15$R$(1)-SiC to be metastable. Additionally, we model the energies of valence and conduction bands of the rhombohedral SiC phases at the high-symmetry points of the Brillouin zone and predict band structure characteristics around the Fermi level. The models presented in this study may aid in identifying polytypic phases suitable for various applications, such as the design of wide-gap materials, that are relevant to high-voltage applications. In particular, the method holds promise for forecasting electronic properties of long-period and ultra-long-period polytypes for which accurate first-principles modeling is computationally challenging. △ Less

Submitted 28 August, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

Comments: Major revision with new figure (FIG.2), new table (TABLE VI)

Journal ref: J. Chem. Phys. 159, 124702 (2023)

arXiv:2306.15758 [pdf, ps, other]

On the reconstruction of bandlimited signals from random samples quantized via noise-shaping

Authors: Rohan Joy, Felix Krahmer, Alessandro Lupoli, Radha Ramakrishnan

Abstract: Noise-shaping quantization techniques are widely used for converting bandlimited signals from the analog to the digital domain. They work by "shaping" the quantization noise so that it falls close to the reconstruction operator's null space. We investigate the compatibility of two such schemes, specifically $ΣΔ$ quantization and distributed noise-shaping quantization, with random samples of bandli… ▽ More Noise-shaping quantization techniques are widely used for converting bandlimited signals from the analog to the digital domain. They work by "shaping" the quantization noise so that it falls close to the reconstruction operator's null space. We investigate the compatibility of two such schemes, specifically $ΣΔ$ quantization and distributed noise-shaping quantization, with random samples of bandlimited functions. Let $f$ be a real-valued $π$-bandlimited function. Suppose $R>1$ is a real number and assume that $\{x_i\}_{i=1}^m$ is a sequence of i.i.d random variables uniformly distributed on $[-\tilde{R},\tilde{R}]$, where $\tilde{R}>R$ is appropriately chosen. We show that by using a noise-shaping quantizer to quantize the values of $f$ at $\{x_i\}_{i=1}^m$, a function $f^{\sharp}$ can be reconstructed from these quantized values such that $\|f-f^{\sharp}\|_{L^2[-R, R]}$ decays with high probability as $m$ and $\tilde{R}$ increase. We emphasize that the sample points $\{x_i\}_{i=1}^m$ are completely random, i.e., they have no predefined structure, which makes our findings the first of their kind. △ Less

Submitted 1 July, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

Comments: 31 pages, 3 figures

MSC Class: 94A20; 94A12; 42C15; 41A29

arXiv:2306.00394 [pdf, ps, other]

Coupled Nonlinear Schrödinger System: Role of Four-Wave Mixing Effect on Nondegenerate Vector Solitons

Authors: R. Ramakrishnan, M. Kirane, S. Stalin, M. Lakshmanan

Abstract: In this paper, we investigate the role of four-wave mixing effect on the structure of nondegenerate vector solitons and their collision dynamics. For this purpose, we consider the generalized coupled nonlinear Schrödinger (GCNLS) system which describes the evolution and nonlinear interaction of the two optical modes. The fundamental as well as higher-order nondegenerate vector soliton solutions ar… ▽ More In this paper, we investigate the role of four-wave mixing effect on the structure of nondegenerate vector solitons and their collision dynamics. For this purpose, we consider the generalized coupled nonlinear Schrödinger (GCNLS) system which describes the evolution and nonlinear interaction of the two optical modes. The fundamental as well as higher-order nondegenerate vector soliton solutions are derived through the Hirota bilinear method and their forms are rewritten in a compact way using Gram determinants. Very interestingly, we find that the presence of four-wave mixing effect induces a breathing vector soliton state in both the optical modes. Such breather formation is not possible in the fundamental vector bright solitons of the Manakov system. Then, for both strong and weak four-wave mixing effects, we show that the nondegenerate solitons in the GCNLS system undergo, in general, novel shape changing collisions, in addition to shape preserving collision under suitable choice of wave numbers. Further, we analyze the degenerate soliton collision induced novel shape changing property of nondegenerate vector soliton by deriving the partially nondegenerate two-soliton solution. For completeness, the various collision scenarios related to the pure degenerate bright solitons are indicated. We believe that the results reported in this paper will be useful in nonlinear optics for manipulating light by light through collision. △ Less

Submitted 29 February, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

Comments: 37 pages, 15 figures, Accepted for publication in Nonlinear Dynamics

arXiv:2305.12706 [pdf, other]

doi 10.1039/D3CP03598K

Stereo-Electronic Factors Influencing the Stability of Hydroperoxyalkyl Radicals: Transferability of Chemical Trends across Hydrocarbons and ab initio Methods

Authors: Saurabh Chandra Kandpal, Kgalaletso P. Otukile, Shweta Jindal, Salini Senthil, Cameron Matthews, Sabyasachi Chakraborty, Lyudmila V. Moskaleva, Raghunathan Ramakrishnan

Abstract: The hydroperoxyalkyl radicals (.QOOH) are known to play a significant role in combustion and tropospheric processes, yet their direct spectroscopic detection remains challenging. In this study, we investigate molecular stereo-electronic effects influencing the kinetic and thermodynamic stability of a .QOOH along its formation path from the precursor, alkylperoxyl radical (ROO.), and the depletion… ▽ More The hydroperoxyalkyl radicals (.QOOH) are known to play a significant role in combustion and tropospheric processes, yet their direct spectroscopic detection remains challenging. In this study, we investigate molecular stereo-electronic effects influencing the kinetic and thermodynamic stability of a .QOOH along its formation path from the precursor, alkylperoxyl radical (ROO.), and the depletion path resulting in the formation of cyclic ether + .OH. We focus on reactive intermediates encountered in the oxidation of acyclic hydrocarbon radicals: ethyl, isopropyl, isobutyl, tert-butyl, neopentyl, and their alicyclic counterparts: cyclohexyl, cyclohexenyl, and cyclohexadienyl. We report reaction energies and barriers calculated with the highly accurate method Weizmann-1 (W1) for the channels: ROO. <=> .QOOH, ROO. <=> alkene + .OOH, .QOOH <=> alkene + .OOH, and .QOOH <=> cyclic ether + .OH. Using W1 results as a reference, we have systematically benchmarked the accuracy of popular density functional theory (DFT), composite thermochemistry methods, and an explicitly correlated coupled-cluster method. We ascertain inductive, resonance, and steric effects on the overall stability of .QOOH and computationally investigate the possibility of forming more stable species. With new reactions as test cases, we probe the capacity of various ab initio methods to yield quantitative insights on the elementary steps of combustion. △ Less

Submitted 21 September, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

Comments: Final version, major revision. SI is enclosed

Journal ref: Phys. Chem. Chem. Phys., 2023

arXiv:2305.04488 [pdf, ps, other]

Zak transform associated with the Weyl transform and the system of twisted translates on R^{2n}

Authors: Radha Ramakrishnan, Rabeetha Velsamy

Abstract: We introduce the Zak transform on $L^{2}(\mathbb{R}^{2n})$ associated with the Weyl transform. By making use of this transform, we define a bracket map and prove that the system of twisted translates $\{T^{t}_{(k,l)}φ: k,l\in \mathbb{Z}^{n}\}$ is a frame sequence iff $0<A\leq \left[φ,φ\right](ξ,ξ^{'})\leq B<\infty,$ for a.e $(ξ,ξ^{'})\in Ω_φ,$ where… ▽ More We introduce the Zak transform on $L^{2}(\mathbb{R}^{2n})$ associated with the Weyl transform. By making use of this transform, we define a bracket map and prove that the system of twisted translates $\{T^{t}_{(k,l)}φ: k,l\in \mathbb{Z}^{n}\}$ is a frame sequence iff $0<A\leq \left[φ,φ\right](ξ,ξ^{'})\leq B<\infty,$ for a.e $(ξ,ξ^{'})\in Ω_φ,$ where $Ω_φ=\{(ξ,ξ^{'})\in \mathbb{T}^{n}\times\mathbb{T}^{n} : \left[φ,φ\right](ξ,ξ^{'})\neq 0\}$. We also prove a similar result for the system $\{T^{t}_{(k,l)}φ: k,l\in \mathbb{Z}^{n}\}$ to be a Riesz sequence. For a given function belonging to the principal twisted shift-invariant space $V^{t}(φ)$, we find a necessary and sufficient condition for the existence of a canonical biorthogonal function. Further, we obtain a characterization for the system $\{T^{t}_{(k,l)}φ: k,l\in\mathbb{Z}\}$ to be a Schauder basis for $V^{t}(φ)$ in terms of a Muckenhoupt $\mathcal{A}_{2}$ weight function. △ Less

Submitted 8 May, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

MSC Class: Primary 42C15; Secondary 43A30

arXiv:2305.01120 [pdf, other]

doi 10.1145/3639314

LST-Bench: Benchmarking Log-Structured Tables in the Cloud

Authors: Jesús Camacho-Rodríguez, Ashvin Agrawal, Anja Gruenheid, Ashit Gosalia, Cristian Petculescu, Josep Aguilar-Saborit, Avrilia Floratou, Carlo Curino, Raghu Ramakrishnan

Abstract: Data processing engines increasingly leverage distributed file systems for scalable, cost-effective storage. While the Apache Parquet columnar format has become a popular choice for data storage and retrieval, the immutability of Parquet files renders it impractical to meet the demands of frequent updates in contemporary analytical workloads. Log-Structured Tables (LSTs), such as Delta Lake, Apach… ▽ More Data processing engines increasingly leverage distributed file systems for scalable, cost-effective storage. While the Apache Parquet columnar format has become a popular choice for data storage and retrieval, the immutability of Parquet files renders it impractical to meet the demands of frequent updates in contemporary analytical workloads. Log-Structured Tables (LSTs), such as Delta Lake, Apache Iceberg, and Apache Hudi, offer an alternative for scenarios requiring data mutability, providing a balance between efficient updates and the benefits of columnar storage. They provide features like transactions, time-travel, and schema evolution, enhancing usability and enabling access from multiple engines. Moreover, engines like Apache Spark and Trino can be configured to leverage the optimizations and controls offered by LSTs to meet specific business needs. Conventional benchmarks and tools are inadequate for evaluating the transformative changes in the storage layer resulting from these advancements, as they do not allow us to measure the impact of design and optimization choices in this new setting. In this paper, we propose a novel benchmarking approach and metrics that build upon existing benchmarks, aiming to systematically assess LSTs. We develop a framework, LST-Bench, which facilitates effective exploration and evaluation of the collaborative functioning of LSTs and data processing engines through tailored benchmark packages. A package is a mix of use patterns reflecting a target workload; LST-Bench makes it easy to define a wide range of use patterns and combine them into a package, and we include a baseline package for completeness. Our assessment demonstrates the effectiveness of our framework and benchmark packages in extracting valuable insights across diverse environments. The code for LST-Bench is open-sourced and is available at https://github.com/microsoft/lst-bench/ . △ Less

Submitted 19 January, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

Journal ref: Proceedings of the ACM on Management of Data (2024) Volume 2 Issue 1

arXiv:2212.05678 [pdf, other]

The system of translates and the special affine Fourier transform

Authors: Md Hasan Ali Biswas, Frank Filbir, Radha Ramakrishnan

Abstract: The translation operator $T^A$ associated with the special affine Fourier transform (SAFT) $\mathscr{F}_A$ is introduced from harmonic analysis point of view. The analogues of Wendel's theorem, Wiener theorem, Weiner-Tauberian theorem and Bernstein type inequality in the context of the SAFT are established. The shift invariant space $V_A$ associated with the special affine Fourier transform is int… ▽ More The translation operator $T^A$ associated with the special affine Fourier transform (SAFT) $\mathscr{F}_A$ is introduced from harmonic analysis point of view. The analogues of Wendel's theorem, Wiener theorem, Weiner-Tauberian theorem and Bernstein type inequality in the context of the SAFT are established. The shift invariant space $V_A$ associated with the special affine Fourier transform is introduced and studied along with sampling problems. △ Less

Submitted 21 July, 2024; v1 submitted 11 December, 2022; originally announced December 2022.

MSC Class: Primary 42A38; Secondary 42A85; 42C15

arXiv:2210.14047 [pdf, other]

OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance from Database Logs [Technical Report]

Authors: Fotis Psallidas, Ashvin Agrawal, Chandru Sugunan, Khaled Ibrahim, Konstantinos Karanasos, Jesús Camacho-Rodríguez, Avrilia Floratou, Carlo Curino, Raghu Ramakrishnan

Abstract: Provenance encodes information that connects datasets, their generation workflows, and associated metadata (e.g., who or when executed a query). As such, it is instrumental for a wide range of critical governance applications (e.g., observability and auditing). Unfortunately, in the context of database systems, extracting coarse-grained provenance is a long-standing problem due to the complexity a… ▽ More Provenance encodes information that connects datasets, their generation workflows, and associated metadata (e.g., who or when executed a query). As such, it is instrumental for a wide range of critical governance applications (e.g., observability and auditing). Unfortunately, in the context of database systems, extracting coarse-grained provenance is a long-standing problem due to the complexity and sheer volume of database workflows. Provenance extraction from query event logs has been recently proposed as favorable because, in principle, can result in meaningful provenance graphs for provenance applications. Current approaches, however, (a) add substantial overhead to the database and provenance extraction workflows and (b)~extract provenance that is noisy, omits query execution dependencies, and is not rich enough for upstream applications. To address these problems, we introduce OneProvenance: an efficient provenance extraction system from query event logs. OneProvenance addresses the unique challenges of log-based extraction by (a)~identifying query execution dependencies through efficient log analysis, (b) extracting provenance through novel event transformations that account for query dependencies, and (c)~introducing effective filtering optimizations. Our thorough experimental analysis shows that OneProvenance can improve extraction by up to ~18X compared to state-of-the-art baselines; our optimizations reduce the extraction noise and optimize performance even further. OneProvenance is deployed at scale by Microsoft Purview and actively supports customer provenance extraction needs (https://bit.ly/3N2JVGF). △ Less

Submitted 3 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

ACM Class: H.2

arXiv:2207.03696 [pdf, ps, other]

Modulation spaces, multipliers associated with the special affine Fourier transform

Authors: M. H. A. Biswas, H. G. Feichtinger, R. Ramakrishnan

Abstract: We study some fundamental properties of the special affine Fourier transform (SAFT) in connection with the Fourier analysis and time-frequency analysis. We introduce the modulation space $\boldsymbol {M}^{r,s}_A$ in connection with SAFT and prove that if a bounded linear operator between new modulation spaces commutes with $A$-translation, then it is a $A$-convolution operator. We also establish H… ▽ More We study some fundamental properties of the special affine Fourier transform (SAFT) in connection with the Fourier analysis and time-frequency analysis. We introduce the modulation space $\boldsymbol {M}^{r,s}_A$ in connection with SAFT and prove that if a bounded linear operator between new modulation spaces commutes with $A$-translation, then it is a $A$-convolution operator. We also establish Hörmander multiplier theorem and Littlewood-Paley theorem associated with the SAFT. △ Less

Submitted 8 July, 2022; originally announced July 2022.

Comments: 26 pages

MSC Class: 42A38; 42B25; 42B35

arXiv:2206.15383 [pdf, other]

doi 10.1007/s41683-023-00115-1

Integrated Photonic Platforms for Quantum Technology: A Review

Authors: Rohit K Ramakrishnan, Aravinth Balaji Ravichandran, Arpita Mishra, Archana Kaushalram, Gopalkrishna Hegde, Srinivas Talabattula, Peter P Rohde

Abstract: Quantum information processing has conceptually changed the way we process and transmit information. Quantum physics, which explains the strange behaviour of matter at the microscopic dimensions, has matured into a quantum technology that can harness this strange behaviour for technological applications with far-reaching consequences, which uses quantum bits (qubits) for information processing. Ex… ▽ More Quantum information processing has conceptually changed the way we process and transmit information. Quantum physics, which explains the strange behaviour of matter at the microscopic dimensions, has matured into a quantum technology that can harness this strange behaviour for technological applications with far-reaching consequences, which uses quantum bits (qubits) for information processing. Experiments suggest that photons are the most successful candidates for realising qubits, which indicates that integrated photonic platforms will play a crucial role in realising quantum technology. This paper surveys the various photonic platforms based on different materials for quantum information processing. The future of this technology depends on the successful materials that can be used to universally realise quantum devices, similar to silicon, which shaped the industry towards the end of the last century. Though a prediction is implausible at this point, we provide an overview of the current status of research on the platforms based on various materials. △ Less

Submitted 30 June, 2022; originally announced June 2022.

Comments: 48 pages, 3 figures

arXiv:2206.15376 [pdf, other]

doi 10.1007/s41745-022-00336-7

The Quantum Internet: A Hardware Review

Authors: Rohit K. Ramakrishnan, Aravinth Balaji Ravichandran, Ishwar Kaushik, Gopalkrishna Hegde, Srinivas Talabattula, Peter P. Rohde

Abstract: In the century following its discovery, applications for quantum physics are opening a new world of technological possibilities. With the current decade witnessing quantum supremacy, quantum technologies are already starting to change the ways information is generated, transmitted, stored and processed. The next major milestone in quantum technology is already rapidly emerging -- the quantum inter… ▽ More In the century following its discovery, applications for quantum physics are opening a new world of technological possibilities. With the current decade witnessing quantum supremacy, quantum technologies are already starting to change the ways information is generated, transmitted, stored and processed. The next major milestone in quantum technology is already rapidly emerging -- the quantum internet. Since light is the most logical candidate for quantum communication, quantum photonics is a critical enabling technology. This paper reviews the hardware aspects of the quantum internet, mainly from a photonics perspective. Though a plethora of quantum technologies and devices have emerged in recent years, we are more focused on devices or components that may enable the quantum internet. Our approach is primarily qualitative, providing a broad overview of the necessary technologies for a large-scale quantum internet. △ Less

Submitted 1 June, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

Comments: 38 pages, 1 table

arXiv:2206.13018 [pdf, other]

doi 10.1209/0295-5075/ac9d01

Learning stochastic filtering

Authors: Rahul O. Ramakrishnan, Andrea Auconi, Benjamin M. Friedrich

Abstract: We quantify the performance of approximations to stochastic filtering by the Kullback-Leibler divergence to the optimal Bayesian filter. Using a two-state Markov process that drives a Brownian measurement process as prototypical test case, we compare two stochastic filtering approximations: a static low-pass filter as baseline, and machine learning of Voltera expansions using nonlinear Vector Auto… ▽ More We quantify the performance of approximations to stochastic filtering by the Kullback-Leibler divergence to the optimal Bayesian filter. Using a two-state Markov process that drives a Brownian measurement process as prototypical test case, we compare two stochastic filtering approximations: a static low-pass filter as baseline, and machine learning of Voltera expansions using nonlinear Vector Auto Regression (nVAR). We highlight the crucial role of the chosen performance metric, and present two solutions to the specific challenge of predicting a likelihood bounded between $0$ and $1$. △ Less

Submitted 26 June, 2022; originally announced June 2022.

Comments: 15 pages, 3 figures

arXiv:2206.06437 [pdf, other]

Distribution of Quantum Circuits Over General Quantum Networks

Authors: Ranjani G Sundaram, Himanshu Gupta, C. R. Ramakrishnan

Abstract: Near-term quantum computers can hold only a small number of qubits. One way to facilitate large-scale quantum computations is through a distributed network of quantum computers. In this work, we consider the problem of distributing quantum programs represented as quantum circuits across a quantum network of heterogeneous quantum computers, in a way that minimizes the overall communication cost req… ▽ More Near-term quantum computers can hold only a small number of qubits. One way to facilitate large-scale quantum computations is through a distributed network of quantum computers. In this work, we consider the problem of distributing quantum programs represented as quantum circuits across a quantum network of heterogeneous quantum computers, in a way that minimizes the overall communication cost required to execute the distributed circuit. We consider two ways of communicating: cat-entanglement that creates linked copies of qubits across pairs of computers, and teleportation. The heterogeneous computers impose constraints on cat-entanglement and teleportation operations that can be chosen by an algorithm. We first focus on a special case that only allows cat-entanglements and not teleportations for communication. We provide a two-step heuristic for solving this specialized setting: (i) finding an assignment of qubits to computers using Tabu search, and (ii) using an iterative greedy algorithm designed for a constrained version of the set cover problem to determine cat-entanglement operations required to execute gates locally. For the general case, which allows both forms of communication, we propose two algorithms that subdivide the quantum circuit into several portions and apply the heuristic for the specialized setting on each portion. Teleportations are then used to stitch together the solutions for each portion. Finally, we simulate our algorithms on a wide range of randomly generated quantum networks and circuits, and study the properties of their results with respect to several varying parameters. △ Less

Submitted 13 June, 2022; originally announced June 2022.

arXiv:2205.07352 [pdf, other]

Long-term Control for Dialogue Generation: Methods and Evaluation

Authors: Ramya Ramakrishnan, Hashan Buddhika Narangodage, Mauro Schilman, Kilian Q. Weinberger, Ryan McDonald

Abstract: Current approaches for controlling dialogue response generation are primarily focused on high-level attributes like style, sentiment, or topic. In this work, we focus on constrained long-term dialogue generation, which involves more fine-grained control and requires a given set of control words to appear in generated responses. This setting requires a model to not only consider the generation of t… ▽ More Current approaches for controlling dialogue response generation are primarily focused on high-level attributes like style, sentiment, or topic. In this work, we focus on constrained long-term dialogue generation, which involves more fine-grained control and requires a given set of control words to appear in generated responses. This setting requires a model to not only consider the generation of these control words in the immediate context, but also produce utterances that will encourage the generation of the words at some time in the (possibly distant) future. We define the problem of constrained long-term control for dialogue generation, identify gaps in current methods for evaluation, and propose new metrics that better measure long-term control. We also propose a retrieval-augmented method that improves performance of long-term controlled generation via logit modification techniques. We show through experiments on three task-oriented dialogue datasets that our metrics better assess dialogue control relative to current alternatives and that our method outperforms state-of-the-art constrained generation baselines. △ Less

Submitted 15 May, 2022; originally announced May 2022.

arXiv:2205.04036 [pdf, other]

doi 10.1109/QCE53715.2022.00064

Pre-Distribution of Entanglements in Quantum Networks

Authors: Mohammad Ghaderibaneh, Himanshu Gupta, C. R. Ramakrishnan, Ertai Luo

Abstract: Quantum network communication is challenging, as the No-Cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the low probability… ▽ More Quantum network communication is challenging, as the No-Cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the low probability of success of the underlying physical processes. To reduce EP generation latency, prior works have looked at selection of efficient entanglement-routing paths and simultaneous use of multiple such paths for EP generation. In this paper, we propose and investigate a complementary technique to reduce EP generation latency--to pre-distribute EPs over certain (pre-determined) pairs of network nodes; these pre-distributed EPs can then be used to generate EPs for the requested pairs, when needed, with lower generation latency. For such an pre-distribution approach to be most effective, we need to address an optimization problem of selection of node-pairs where the EPs should be pre-distributed to minimize the generation latency of expected EP requests, under a given cost constraint. In this paper, we appropriately formulate the above optimization problem and design two efficient algorithms, one of which is a greedy approach based on an approximation algorithm for a special case. Via extensive evaluations over the NetSquid simulator, we demonstrate the effectiveness of our approach and developed techniques; we show that our developed algorithms outperform a naive approach by up to an order of magnitude. △ Less

Submitted 9 May, 2022; originally announced May 2022.

Comments: 11 pages, 9 figures

arXiv:2203.05492 [pdf, other]

An Empirical Study of Low Precision Quantization for TinyML

Authors: Shaojie Zhuo, Hongyu Chen, Ramchalam Kinattinkara Ramakrishnan, Tommy Chen, Chen Feng, Yicheng Lin, Parker Zhang, Liang Shen

Abstract: Tiny machine learning (tinyML) has emerged during the past few years aiming to deploy machine learning models to embedded AI processors with highly constrained memory and computation capacity. Low precision quantization is an important model compression technique that can greatly reduce both memory consumption and computation cost of model inference. In this study, we focus on post-training quanti… ▽ More Tiny machine learning (tinyML) has emerged during the past few years aiming to deploy machine learning models to embedded AI processors with highly constrained memory and computation capacity. Low precision quantization is an important model compression technique that can greatly reduce both memory consumption and computation cost of model inference. In this study, we focus on post-training quantization (PTQ) algorithms that quantize a model to low-bit (less than 8-bit) precision with only a small set of calibration data and benchmark them on different tinyML use cases. To achieve a fair comparison, we build a simulated quantization framework to investigate recent PTQ algorithms. Furthermore, we break down those algorithms into essential components and re-assembled a generic PTQ pipeline. With ablation study on different alternatives of components in the pipeline, we reveal key design choices when performing low precision quantization. We hope this work could provide useful data points and shed lights on the future research of low precision quantization. △ Less

Submitted 10 March, 2022; originally announced March 2022.

Comments: tinyML Research Symposium 2022

arXiv:2203.03478 [pdf, other]

doi 10.1021/acs.jpcb.2c06803

Understanding the role of intramolecular ion-pair interactions in conformational stability using an ab initio thermodynamic cycle

Authors: Sabyasachi Chakraborty, Kalyaneswar Mandal, Raghunathan Ramakrishnan

Abstract: Intramolecular ion-pair interactions yield shape and functionality to many molecules. With proper orientation, these interactions overcome steric factors and are responsible for the compact structures of several peptides. In this study, we present a thermodynamic cycle based on isoelectronic and alchemical mutation to estimate intramolecular ion-pair interaction energy. We determine these energies… ▽ More Intramolecular ion-pair interactions yield shape and functionality to many molecules. With proper orientation, these interactions overcome steric factors and are responsible for the compact structures of several peptides. In this study, we present a thermodynamic cycle based on isoelectronic and alchemical mutation to estimate intramolecular ion-pair interaction energy. We determine these energies for 26 benchmark molecules with common ion-pair combinations and compare them with results obtained using intramolecular symmetry-adapted perturbation theory. For systems with long linkers, the ion-pair energies evaluated using both approaches deviate by less than 2.5% in vacuum phase. The thermodynamic cycle based on density functional theory facilitates calculations of salt-bridge interactions in model tripeptides with continuum/microsolvation modeling, and four large peptides: 1EJG (crambin), 1BDK (bradykinin), 1L2Y (a mini-protein with a tryptophan cage), and 1SCO (a toxin from the scorpion venom). △ Less

Submitted 21 December, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

Comments: version 3, experimental relevance of salt-bridge interaction energy is discussed in the introduction, and minor revisions of figures

arXiv:2202.03487 [pdf]

Targeted-BEHRT: Deep learning for observational causal inference on longitudinal electronic health records

Authors: Shishir Rao, Mohammad Mamouei, Gholamreza Salimi-Khorshidi, Yikuan Li, Rema Ramakrishnan, Abdelaali Hassaine, Dexter Canoy, Kazem Rahimi

Abstract: Observational causal inference is useful for decision making in medicine when randomized clinical trials (RCT) are infeasible or non generalizable. However, traditional approaches fail to deliver unconfounded causal conclusions in practice. The rise of "doubly robust" non-parametric tools coupled with the growth of deep learning for capturing rich representations of multimodal data, offers a uniqu… ▽ More Observational causal inference is useful for decision making in medicine when randomized clinical trials (RCT) are infeasible or non generalizable. However, traditional approaches fail to deliver unconfounded causal conclusions in practice. The rise of "doubly robust" non-parametric tools coupled with the growth of deep learning for capturing rich representations of multimodal data, offers a unique opportunity to develop and test such models for causal inference on comprehensive electronic health records (EHR). In this paper, we investigate causal modelling of an RCT-established null causal association: the effect of antihypertensive use on incident cancer risk. We develop a dataset for our observational study and a Transformer-based model, Targeted BEHRT coupled with doubly robust estimation, we estimate average risk ratio (RR). We compare our model to benchmark statistical and deep learning models for causal inference in multiple experiments on semi-synthetic derivations of our dataset with various types and intensities of confounding. In order to further test the reliability of our approach, we test our model on situations of limited data. We find that our model provides more accurate estimates of RR (least sum absolute error from ground truth) compared to benchmarks for risk ratio estimation on high-dimensional EHR across experiments. Finally, we apply our model to investigate the original case study: antihypertensives' effect on cancer and demonstrate that our model generally captures the validated null association. △ Less

Submitted 7 February, 2022; originally announced February 2022.

Comments: This work has been submitted to the IEEE for possible publication

arXiv:2112.11002 [pdf, other]

doi 10.1109/TQE.2022.3168784

Efficient Quantum Network Communication using Optimized Entanglement-Swapping Trees

Authors: Mohammad Ghaderibaneh, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan

Abstract: Quantum network communication is challenging, as the No-cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable communication approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the lo… ▽ More Quantum network communication is challenging, as the No-cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable communication approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the low probability of success of the underlying physical processes. The focus of our work is to develop efficient techniques that minimize EP generation latency. Prior works have focused on selecting entanglement paths; in contrast, we select entanglement swapping trees--a more accurate representation of the entanglement generation structure. We develop a dynamic programming algorithm to select an optimal swapping-tree for a single pair of nodes, under the given capacity and fidelity constraints. For the general setting, we develop an efficient iterative algorithm to compute a set of swapping trees. We present simulation results which show that our solutions outperform the prior approaches by an order of magnitude and are viable for long-distance entanglement generation. △ Less

Submitted 4 April, 2024; v1 submitted 21 December, 2021; originally announced December 2021.

Showing 1–50 of 105 results for author: Ramakrishnan, R