Search | arXiv e-print repository

doi 10.1109/ICRA57147.2024.10610519

EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning

Authors: Kallol Saha, Vishal Mandadi, Jayaram Reddy, Ajit Srikanth, Aditya Agarwal, Bipasha Sen, Arun Singh, Madhava Krishna

Abstract: Classical motion planning for robotic manipulation includes a set of general algorithms that aim to minimize a scene-specific cost of executing a given plan. This approach offers remarkable adaptability, as they can be directly used off-the-shelf for any new scene without needing specific training datasets. However, without a prior understanding of what diverse valid trajectories are and without s… ▽ More Classical motion planning for robotic manipulation includes a set of general algorithms that aim to minimize a scene-specific cost of executing a given plan. This approach offers remarkable adaptability, as they can be directly used off-the-shelf for any new scene without needing specific training datasets. However, without a prior understanding of what diverse valid trajectories are and without specially designed cost functions for a given scene, the overall solutions tend to have low success rates. While deep-learning-based algorithms tremendously improve success rates, they are much harder to adopt without specialized training datasets. We propose EDMP, an Ensemble-of-costs-guided Diffusion for Motion Planning that aims to combine the strengths of classical and deep-learning-based motion planning. Our diffusion-based network is trained on a set of diverse kinematically valid trajectories. Like classical planning, for any new scene at the time of inference, we compute scene-specific costs such as "collision cost" and guide the diffusion to generate valid trajectories that satisfy the scene-specific constraints. Further, instead of a single cost function that may be insufficient in capturing diversity across scenes, we use an ensemble of costs to guide the diffusion process, significantly improving the success rate compared to classical planners. EDMP performs comparably with SOTA deep-learning-based methods while retaining the generalization capabilities primarily associated with classical planners. △ Less

Submitted 20 September, 2023; originally announced September 2023.

Comments: 8 pages, 8 figures, submitted to ICRA 2024 (International Conference on Robotics and Automation)

Journal ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)

arXiv:2309.05896 [pdf, other]

doi 10.1103/PhysRevApplied.21.014040

Silicon charge pump operation limit above and below liquid helium temperature

Authors: Ajit Dash, Steve Yianni, MengKe Feng, Fay Hudson, Andre Saraiva, Andrew S. Dzurak, Tuomo Tanttu

Abstract: Semiconductor tunable barrier single-electron pumps can produce output current of hundreds of picoamperes at sub ppm precision, approaching the metrological requirement for the direct implementation of the current standard. Here, we operate a silicon metal-oxide-semiconductor electron pump up to a temperature of 14 K to understand the temperature effect on charge pumping accuracy. The uncertainty… ▽ More Semiconductor tunable barrier single-electron pumps can produce output current of hundreds of picoamperes at sub ppm precision, approaching the metrological requirement for the direct implementation of the current standard. Here, we operate a silicon metal-oxide-semiconductor electron pump up to a temperature of 14 K to understand the temperature effect on charge pumping accuracy. The uncertainty of the charge pump is tunnel limited below liquid helium temperature, implying lowering the temperature further does not greatly suppress errors. Hence, highly accurate charge pumps could be confidently achieved in a $^4$He cryogenic system, further promoting utilization of the revised quantum current standard across the national measurement institutes and industries worldwide. △ Less

Submitted 11 September, 2023; originally announced September 2023.

arXiv:2309.01397 [pdf, other]

Unlabelled Sensing with Priors: Algorithm and Bounds

Authors: Garweet Sresth, Ajit Rajwade, Satish Mulleti

Abstract: In this study, we consider a variant of unlabelled sensing where the measurements are sparsely permuted, and additionally, a few correspondences are known. We present an estimator to solve for the unknown vector. We derive a theoretical upper bound on the $\ell_2$ reconstruction error of the unknown vector. Through numerical experiments, we demonstrate that the additional known correspondences res… ▽ More In this study, we consider a variant of unlabelled sensing where the measurements are sparsely permuted, and additionally, a few correspondences are known. We present an estimator to solve for the unknown vector. We derive a theoretical upper bound on the $\ell_2$ reconstruction error of the unknown vector. Through numerical experiments, we demonstrate that the additional known correspondences result in a significant improvement in the reconstruction error. Additionally, we compare our estimator with the classical robust regression estimator and we find that our method outperforms it on the normalized reconstruction error metric by up to $20\%$ in the high permutation regimes $(>30\%)$. Lastly, we showcase the practical utility of our framework on a non-rigid motion estimation problem. We show that using a few manually annotated points along point pairs with the key-point (SIFT-based) descriptor pairs with unknown or incorrectly known correspondences can improve motion estimation. △ Less

Submitted 4 September, 2023; originally announced September 2023.

Comments: 14 pages, 6 figures

arXiv:2308.08588 [pdf, other]

Entanglement and Topology in Su-Schrieffer-Heeger Cavity Quantum Electrodynamics

Authors: Daniel Shaffer, Martin Claassen, Ajit Srivastava, Luiz H. Santos

Abstract: Cavity materials are a frontier to investigate the role of light-matter interactions on the properties of electronic phases of matter. In this work, we raise a fundamental question: can non-local interactions mediated by cavity photons destabilize a topological electronic phase? We investigate this question by characterizing entanglement, energy spectrum and correlation functions of the topologica… ▽ More Cavity materials are a frontier to investigate the role of light-matter interactions on the properties of electronic phases of matter. In this work, we raise a fundamental question: can non-local interactions mediated by cavity photons destabilize a topological electronic phase? We investigate this question by characterizing entanglement, energy spectrum and correlation functions of the topological Su-Schrieffer-Heeger (SSH) chain interacting with an optical cavity mode. Employing density-matrix renormalization group (DMRG) and exact diagonalization (ED), we demonstrate the stability of the edge state and establish an area law scaling for the ground state entanglement entropy, despite long-range correlations induced by light-matter interactions. These features are linked to gauge invariance and the scaling of virtual photon excitations entangled with matter, effectively computed in a low-dimensional Krylov subspace of the full Hilbert space. This work provides a framework for characterizing novel equilibrium phenomena in topological cavity materials. △ Less

Submitted 16 August, 2023; originally announced August 2023.

Comments: 4.5 pages and 3 figures

arXiv:2308.03124 [pdf]

Substrate temperature dependent dielectric and ferroelectric properties of (100) oriented lead-free Na$_{0.4}$K$_{0.1}$Bi$_{0.5}$TiO$_3$ thin films grown by pulsed laser deposition

Authors: Krishnarjun Banerjee, Adityanarayan H. Pandey, Pravin Varade, Ajit R. Kulkarni, Abhijeet L. Sangle, N. Venkataramani

Abstract: Pb-free ferroelectric thin films are gaining attention due to their applicability in memory, sensor, actuator, and microelectromechanical system. In this work, Na$_{0.4}$K$_{0.1}$Bi$_{0.5}$TiO$_3$ (NKBT0.1) ferroelectric thin films were deposited on Pt(111)/Ti/SiO$_2$/Si substrates using the pulsed laser deposition technique at various substrate temperatures (600-750 $^\circ$C). The comprehensive… ▽ More Pb-free ferroelectric thin films are gaining attention due to their applicability in memory, sensor, actuator, and microelectromechanical system. In this work, Na$_{0.4}$K$_{0.1}$Bi$_{0.5}$TiO$_3$ (NKBT0.1) ferroelectric thin films were deposited on Pt(111)/Ti/SiO$_2$/Si substrates using the pulsed laser deposition technique at various substrate temperatures (600-750 $^\circ$C). The comprehensive structural, microstructural, and ferroelectric properties characterizations depicted that the grain size, dielectric constant, and remnant polarization increased with higher deposition temperatures. The influence of higher substrate temperatures on the control of (100)-preferential orientations was observed, indicating the importance of deposition conditions. Significantly, films deposited at 700 deg C exhibited reduced dielectric loss of 0.08 (at 1kHz), high dielectric constant of 673, and remnant polarization of 17 microC/cm2 at room temperature. At this deposition temperature, a maximum effective piezoelectric coefficient of 76 pm/V was availed. Based on the structural analysis, dielectric properties, and ferroelectric behavior, the optimal deposition temperature for the NKBT0.1 thin films was 700 $^\circ$C. This study contributes to the understanding of the influence of substrate temperature on the structural and ferroelectric properties of Pb-free NKBT0.1 thin films, providing insights for the development of high-performance ferroelectric devices. △ Less

Submitted 6 August, 2023; originally announced August 2023.

Comments: 16 pages, 4 figures

arXiv:2307.14910 [pdf, other]

doi 10.1109/MedComNet58619.2023.10168852

Low-Latency Massive Access with Multicast Wake Up Radio

Authors: Anay Ajit Deshpande, Federico Chiariotti, Andrea Zanella

Abstract: The use of Wake-Up Radio (WUR) in Internet of Things (IoT) networks can significantly improve their energy efficiency: battery-powered sensors can remain in a low-power (sleep) mode while listening for wake-up messages using their WUR and reactivate only when polled, saving energy. However, polling-based Time Division Multiple Access (TDMA) may significantly increase data transmission delay if pac… ▽ More The use of Wake-Up Radio (WUR) in Internet of Things (IoT) networks can significantly improve their energy efficiency: battery-powered sensors can remain in a low-power (sleep) mode while listening for wake-up messages using their WUR and reactivate only when polled, saving energy. However, polling-based Time Division Multiple Access (TDMA) may significantly increase data transmission delay if packets are generated sporadically, as nodes with no information still need to be polled. In this paper, we examine the effect of multicast polling for WUR-enabled wireless nodes. The idea is to assign nodes to multicast groups so that all nodes in the same group can be solicited by a multicast polling message. This may cause collisions, which can be solved by requesting retransmissions from the involved nodes. We analyze the performance of different multicast polling and retransmission strategies, showing that the optimal approach can significantly reduce the delay over TDMA and ALOHA in low-traffic scenarios while keeping good energy efficiency. △ Less

Submitted 27 July, 2023; originally announced July 2023.

Comments: 2023 21st Mediterranean Communication and Computer Networking Conference (MedComNet)

arXiv:2307.14519 [pdf, other]

doi 10.1103/PhysRevB.108.235153

Competition between fractional quantum Hall liquid and electron solid phases in the Landau levels of multilayer graphene

Authors: Rakesh K. Dora, Ajit C. Balram

Abstract: We study the competition between the electron liquid and solid phases, such as Wigner crystal and bubbles, in partially filled Landau levels (LLs) of multilayer graphene. Graphene systems offer a versatile platform for controlling band dispersion by varying the number of its stacked layers. The band dispersion determines the LL wave functions, and consequently, the LL-projected Coulomb interaction… ▽ More We study the competition between the electron liquid and solid phases, such as Wigner crystal and bubbles, in partially filled Landau levels (LLs) of multilayer graphene. Graphene systems offer a versatile platform for controlling band dispersion by varying the number of its stacked layers. The band dispersion determines the LL wave functions, and consequently, the LL-projected Coulomb interaction in graphene and its multilayers is different from that in conventional semiconductors like GaAs. As a result, the energies of the liquid and solid phases are different in the different LLs of multilayer graphene, leading to an alternative phase diagram for the stability of these phases, which we work out. The phase diagram of competing solid and liquid phases in the LLs of monolayer graphene has been studied previously. Here, we primarily consider $AB{-}$ or Bernal$-$stacked bilayer graphene (BLG) and $ABC{-}$stacked trilayer graphene (TLG) and focus on the Laughlin fractions. We determine the cohesive energy of the solid phase using the Hartree-Fock approximation, and the energy of the Laughlin liquid is computed analytically via the plasma sum rules. We find that at the Laughlin fillings, the electron liquid phase has the lowest energy among the phases considered in the $\mathcal{N}{=}0, 1, 2$ LLs of BLG, as well as in the $\mathcal{N}{=}3, 4$ LLs of TLG, while in the $\mathcal{N}{>}2$ LLs of BLG and $\mathcal{N}{>}4$ LLs of TLG, the solid phases are more favorable. We also discuss the effect of impurities on the above-mentioned phase diagram. △ Less

Submitted 19 December, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

Comments: 27 pages, 37 figures

Journal ref: Phys. Rev. B 108, 235153 (2023)

arXiv:2307.06034 [pdf]

Temperature dependent magnetoelectric response of lead-free Na$_{0.4}$K$_{0.1}$Bi$_{0.5}$TiO$_3$-NiFe$_2$O$_4$ laminated composites

Authors: Adityanarayan Pandey, Amritesh Kumar, Pravin Varade, K. Miriyala, A. Arockiarajan, Ajit. R. Kulkarni, N. Venkataramani

Abstract: This study investigates the temperature-dependent quasi-static magnetoelectric (ME) response of electrically poled lead-free Na$_{0.4}$K$_{0.1}$Bi$_{0.5}$TiO$_3$-NiFe$_2$O$_4$ (NKBT-NFO) laminated composites. The aim is to understand the temperature stability of ME-based sensors and devices. The relaxor ferroelectric nature of NKBT is confirmed through impedance and polarization-electric (PE) hyst… ▽ More This study investigates the temperature-dependent quasi-static magnetoelectric (ME) response of electrically poled lead-free Na$_{0.4}$K$_{0.1}$Bi$_{0.5}$TiO$_3$-NiFe$_2$O$_4$ (NKBT-NFO) laminated composites. The aim is to understand the temperature stability of ME-based sensors and devices. The relaxor ferroelectric nature of NKBT is confirmed through impedance and polarization-electric (PE) hysteresis loop studies, with a depolarization temperature (Td) of approximately 110$^\circ$C. Heating causes a decrease and disappearance of planar electromechanical coupling, charge coefficient, and remnant polarization above Td. The temperature rise also leads to a reduction in magnetostriction and magnetostriction coefficient of NFO by approximately 33% and 25%, respectively, up to approximately 125$^\circ$C. At room temperature, the bilayer and trilayer configurations exhibit maximum ME responses of approximately 33 mV/cm.Oe and 80 mV/cm.Oe, respectively, under low magnetic field conditions (300-450 Oe). The ME response of NKBT/NFO is highly sensitive to temperature, decreasing with heating in accordance with the individual temperature-dependent properties of NKBT and NFO. This study demonstrates a temperature window for the effective utilization of NKBT-NFO-based laminated composite ME devices. △ Less

Submitted 12 July, 2023; originally announced July 2023.

Comments: 14 pages, 4 figures

arXiv:2306.10797 [pdf, other]

Variability of echo state network prediction horizon for partially observed dynamical systems

Authors: Ajit Mahata, Reetish Padhi, Amit Apte

Abstract: Study of dynamical systems using partial state observation is an important problem due to its applicability to many real-world systems. We address the problem by studying an echo state network (ESN) framework with partial state input with partial or full state output. Application to the Lorenz system and Chua's oscillator (both numerically simulated and experimental systems) demonstrate the effect… ▽ More Study of dynamical systems using partial state observation is an important problem due to its applicability to many real-world systems. We address the problem by studying an echo state network (ESN) framework with partial state input with partial or full state output. Application to the Lorenz system and Chua's oscillator (both numerically simulated and experimental systems) demonstrate the effectiveness of our method. We show that the ESN, as an autonomous dynamical system, is capable of making short-term predictions up to a few Lyapunov times. However, the prediction horizon has high variability depending on the initial condition-an aspect that we explore in detail using the distribution of the prediction horizon. Further, using a variety of statistical metrics to compare the long-term dynamics of the ESN predictions with numerically simulated or experimental dynamics and observed similar results, we show that the ESN can effectively learn the system's dynamics even when trained with noisy numerical or experimental datasets. Thus, we demonstrate the potential of ESNs to serve as cheap surrogate models for simulating the dynamics of systems where complete observations are unavailable. △ Less

Submitted 5 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

arXiv:2306.05495 [pdf, other]

Is Attentional Channel Processing Design Required? Comprehensive Analysis Of Robustness Between Vision Transformers And Fully Attentional Networks

Authors: Abhishri Ajit Medewar, Swanand Ashokrao Kavitkar

Abstract: The robustness testing has been performed for standard CNN models and Vision Transformers, however there is a lack of comprehensive study between the robustness of traditional Vision Transformers without an extra attentional channel design and the latest fully attentional network(FAN) models. So in this paper, we use the ImageNet dataset to compare the robustness of fully attentional network(FAN)… ▽ More The robustness testing has been performed for standard CNN models and Vision Transformers, however there is a lack of comprehensive study between the robustness of traditional Vision Transformers without an extra attentional channel design and the latest fully attentional network(FAN) models. So in this paper, we use the ImageNet dataset to compare the robustness of fully attentional network(FAN) models with traditional Vision Transformers to understand the role of an attentional channel processing design using white box attacks and also study the transferability between the same using black box attacks. △ Less

Submitted 8 June, 2023; originally announced June 2023.

Comments: 4 pages, 12 figures

arXiv:2306.04944 [pdf, ps, other]

Colouring planar graphs with a precoloured induced cycle

Authors: Ajit Diwan

Abstract: Let $C$ be a cycle and $f : V(C) \rightarrow \{c_1,c_2,\ldots,c_k\}$ a proper $k$-colouring of $C$ for some $k \ge 4$. We say the colouring $f$ is safe if for any planar graph $G$ in which $C$ is an induced cycle, there exists a proper $k$-colouring $f'$ of $G$ such that $f'(v) = f(v)$ for all $v \in V(C)$. The only safe $4$-colouring is any proper colouring of a triangle. We give a simple necessa… ▽ More Let $C$ be a cycle and $f : V(C) \rightarrow \{c_1,c_2,\ldots,c_k\}$ a proper $k$-colouring of $C$ for some $k \ge 4$. We say the colouring $f$ is safe if for any planar graph $G$ in which $C$ is an induced cycle, there exists a proper $k$-colouring $f'$ of $G$ such that $f'(v) = f(v)$ for all $v \in V(C)$. The only safe $4$-colouring is any proper colouring of a triangle. We give a simple necessary condition for a $k$-colouring of a cycle to be safe and conjecture that it is sufficient for all $k \ge 4$. The sufficiency for $k=4$ follows from the four colour theorem and we prove it for $k = 5$, independent of the four colour theorem. We show that a stronger condition is sufficient for all $k \ge 4$. As a consequence, it follows that any proper $k$-colouring of a cycle that uses at most $k-3$ distinct colours is safe. Also, any proper $k$-colouring of a cycle of length at most $2k-5$ that uses at most $k-1$ distinct colours is safe. △ Less

Submitted 8 June, 2023; originally announced June 2023.

Comments: 18 pages

MSC Class: 05C10; 05C15

arXiv:2305.17429 [pdf, other]

doi 10.1016/j.sigpro.2023.109233

Performance Bounds for LASSO under Multiplicative Noise: Applications to Pooled RT-PCR Testing

Authors: Richeek Das, Aaron Jerry Ninan, Adithya Bhaskar, Ajit Rajwade

Abstract: Group testing is a technique which avoids individually testing $n$ samples for a rare disease and instead tests $n < p$ pools, where a pool consists of a mixture of small, equal portions of a subset of the $p$ samples. Group testing saves testing time and resources in many applications, including RT-PCR, with guarantees for the recovery of the status of the $p$ samples from results on $n$ pools. T… ▽ More Group testing is a technique which avoids individually testing $n$ samples for a rare disease and instead tests $n < p$ pools, where a pool consists of a mixture of small, equal portions of a subset of the $p$ samples. Group testing saves testing time and resources in many applications, including RT-PCR, with guarantees for the recovery of the status of the $p$ samples from results on $n$ pools. The noise in quantitative RT- PCR is inherently known to follow a multiplicative data-dependent model. In recent literature, the corresponding linear systems for inferring the health status of $p$ samples from results on $n$ pools have been solved using the Lasso estimator and its variants, which have been typically used in additive Gaussian noise settings. There is no existing literature which establishes performance bounds for Lasso for the multiplicative noise model associated with RT-PCR. After noting that a recent general technique, Hunt et al., works for Poisson inverse problems, we adapt it to handle sparse signal reconstruction from compressive measurements with multiplicative noise: we present high probability performance bounds and data-dependent weights for the Lasso and its weighted version. We also show numerical results on simulated pooled RT-PCR data to empirically validate our bounds. △ Less

Submitted 28 August, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

Comments: Elsevier Signal Processing Journal

Journal ref: Signal Processing 2023

arXiv:2305.16850 [pdf, other]

doi 10.1142/S0217732324300040

Pulsar as a Weber detector of gravitational waves and a probe to its internal phase transitions

Authors: Partha Bagchi, Oindrila Ganguly, Biswanath Layek, Anjishnu Sarkar, Ajit M. Srivastava

Abstract: It is believed that cores of neutron stars provide a natural laboratory where exotic high baryon density QCD phases may exist.The theoretically well established {\it neutron superfluid phase} is also believed to be found only inside neutron stars. Focus on neutron stars has intensified in recent years with the direct detection of gravitational waves (GWs) from binary neutron star (BNS) merger, whi… ▽ More It is believed that cores of neutron stars provide a natural laboratory where exotic high baryon density QCD phases may exist.The theoretically well established {\it neutron superfluid phase} is also believed to be found only inside neutron stars. Focus on neutron stars has intensified in recent years with the direct detection of gravitational waves (GWs) from binary neutron star (BNS) merger, which has allowed the possibility of directly probing the properties of the interior of a neutron star. A remarkable phenomenon manifested by rapidly rotating neutron stars is in their {\it avatar} as {\it Pulsars}. The accuracy of pulsar timing allowed the first indirect detection of GWs from a BNS system and opened up a few exciting possibilities. Any pulsar deformation, even if incredibly tiny, can leave imprints on the pulses by introducing tiny perturbations of the moment of inertia (MI) tensor components. While the diagonal MI components of the perturbed MI tensor affect the pulse timings, the off-diagonal components lead to the pulsar's wobbling and affecting the pulse profile. This opens up an opportunity to explore various phase transitions inside a pulsar core by induced density fluctuations through the observable effects on the pulse timing and profile. Such perturbations also naturally induce a rapidly changing quadrupole moment of the star, thereby providing a new source of GW emission. Another remarkable possibility arises when we consider the effect of an external GW on a neutron star. With the possibility of detecting any minute changes in its configuration through pulse observations, the neutron star has the potential to perform as a Weber detector of GWs. This brief review focuses on these specific aspects of a pulsar, specifically on the type of physics that can be probed by utilizing the effect of changes in the MI tensor on pulse properties. △ Less

Submitted 20 May, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

Comments: Review article (revised), 31 pages, title changed, abstract reduced, fig.6 is corrected (larger fonts), references added, accepted for publication in Modern Physics Letters A (MPLA)

arXiv:2305.16820 [pdf, other]

Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization

Authors: Pranav Ajit Nair, Sukomal Pal, Pradeepika Verma

Abstract: Domain generalization is hitherto an underexplored area applied in abstractive summarization. Moreover, most existing works on domain generalization have sophisticated training algorithms. In this paper, we propose a lightweight, weight averaging based, Domain Aligned Prefix Averaging approach to domain generalization for abstractive summarization. Given a number of source domains, our method firs… ▽ More Domain generalization is hitherto an underexplored area applied in abstractive summarization. Moreover, most existing works on domain generalization have sophisticated training algorithms. In this paper, we propose a lightweight, weight averaging based, Domain Aligned Prefix Averaging approach to domain generalization for abstractive summarization. Given a number of source domains, our method first trains a prefix for each one of them. These source prefixes generate summaries for a small number of target domain documents. The similarity of the generated summaries to their corresponding documents is used for calculating weights required to average source prefixes. In DAPA, prefix tuning allows for lightweight finetuning, and weight averaging allows for the computationally efficient addition of new source domains. When evaluated on four diverse summarization domains, DAPA shows comparable or better performance against the baselines, demonstrating the effectiveness of its prefix averaging scheme. △ Less

Submitted 29 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

Comments: 13 pages, Accepted to ACL 2023 Findings

arXiv:2305.15108 [pdf, other]

The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

Authors: Debayan Banerjee, Pranav Ajit Nair, Ricardo Usbeck, Chris Biemann

Abstract: In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs)… ▽ More In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: Accepted as a short paper to ACL 2023 findings

arXiv:2305.07639 [pdf, other]

Efficient Neural Network based Classification and Outlier Detection for Image Moderation using Compressed Sensing and Group Testing

Authors: Sabyasachi Ghosh, Sanyam Saxena, Ajit Rajwade

Abstract: Popular social media platforms employ neural network based image moderation engines to classify images uploaded on them as having potentially objectionable content. Such moderation engines must answer a large number of queries with heavy computational cost, even though the actual number of images with objectionable content is usually a tiny fraction. Inspired by recent work on Neural Group Testing… ▽ More Popular social media platforms employ neural network based image moderation engines to classify images uploaded on them as having potentially objectionable content. Such moderation engines must answer a large number of queries with heavy computational cost, even though the actual number of images with objectionable content is usually a tiny fraction. Inspired by recent work on Neural Group Testing, we propose an approach which exploits this fact to reduce the overall computational cost of such engines using the technique of Compressed Sensing (CS). We present the quantitative matrix-pooled neural network (QMPNN), which takes as input $n$ images, and a $m \times n$ binary pooling matrix with $m < n$, whose rows indicate $m$ pools of images i.e. selections of $r$ images out of $n$. The QMPNN efficiently outputs the product of this matrix with the unknown sparse binary vector indicating whether each image is objectionable or not, i.e. it outputs the number of objectionable images in each pool. For suitable matrices, this is decoded using CS decoding algorithms to predict which images were objectionable. The computational cost of running the QMPNN and the CS algorithms is significantly lower than the cost of using a neural network with the same number of parameters separately on each image to classify the images, which we demonstrate via extensive experiments. Our technique is inherently resilient to moderate levels of errors in the prediction from the QMPNN. Furthermore, we present pooled deep outlier detection, which brings CS and group testing techniques to deep outlier detection, to provide for the case when the objectionable images do not belong to a set of pre-defined classes. This technique enables efficient automated moderation of off-topic images shared on topical forums dedicated to sharing images of a certain single class, many of which are currently human-moderated. △ Less

Submitted 12 May, 2023; originally announced May 2023.

arXiv:2305.04883 [pdf, other]

Fuzzy Gene Selection and Cancer Classification Based on Deep Learning Model

Authors: Mahmood Khalsan, Mu Mu, Eman Salih Al-Shamery, Lee Machado, Suraj Ajit, Michael Opoku Agyeman

Abstract: Machine learning (ML) approaches have been used to develop highly accurate and efficient applications in many fields including bio-medical science. However, even with advanced ML techniques, cancer classification using gene expression data is still complicated because of the high dimensionality of the datasets employed. We developed a new fuzzy gene selection technique (FGS) to identify informativ… ▽ More Machine learning (ML) approaches have been used to develop highly accurate and efficient applications in many fields including bio-medical science. However, even with advanced ML techniques, cancer classification using gene expression data is still complicated because of the high dimensionality of the datasets employed. We developed a new fuzzy gene selection technique (FGS) to identify informative genes to facilitate cancer classification and reduce the dimensionality of the available gene expression data. Three feature selection methods (Mutual Information, F-ClassIf, and Chi-squared) were evaluated and employed to obtain the score and rank for each gene. Then, using Fuzzification and Defuzzification methods to obtain the best single score for each gene, which aids in the identification of significant genes. Our study applied the fuzzy measures to six gene expression datasets including four Microarray and two RNA-seq datasets for evaluating the proposed algorithm. With our FGS-enhanced method, the cancer classification model achieved 96.5%,96.2%,96%, and 95.9% for accuracy, precision, recall, and f1-score respectively, which is significantly higher than 69.2% accuracy, 57.8% precision, 66% recall, and 58.2% f1-score when the standard MLP method was used. In examining the six datasets that were used, the proposed model demonstrates it's capacity to classify cancer effectively. △ Less

Submitted 4 May, 2023; originally announced May 2023.

Comments: Journal of Intelligent Information Systems (25,17)

arXiv:2304.11507 [pdf, other]

Machine learning framework for end-to-end implementation of Incident duration prediction

Authors: Smrithi Ajit, Varsha R Mouli, Skylar Knickerbocker, Jonathan S. Wood

Abstract: Traffic congestion caused by non-recurring incidents such as vehicle crashes and debris is a key issue for Traffic Management Centers (TMCs). Clearing incidents in a timely manner is essential for improving safety and reducing delays and emissions for the traveling public. However, TMCs and other responders face a challenge in predicting the duration of incidents (until the roadway is clear), maki… ▽ More Traffic congestion caused by non-recurring incidents such as vehicle crashes and debris is a key issue for Traffic Management Centers (TMCs). Clearing incidents in a timely manner is essential for improving safety and reducing delays and emissions for the traveling public. However, TMCs and other responders face a challenge in predicting the duration of incidents (until the roadway is clear), making decisions of what resources to deploy difficult. To address this problem, this research developed an analytical framework and end-to-end machine-learning solution for predicting incident duration based on information available as soon as an incident report is received. Quality predictions of incident duration can help TMCs and other responders take a proactive approach in deploying responder services such as tow trucks, maintenance crews or activating alternative routes. The predictions use a combination of classification and regression machine learning modules. The performance of the developed solution has been evaluated based on the Mean Absolute Error (MAE), or deviation from the actual incident duration as well as Area Under the Curve (AUC) and Mean Absolute Percentage Error (MAPE). The results showed that the framework significantly improved incident duration prediction compared to methods from previous research. △ Less

Submitted 22 April, 2023; originally announced April 2023.

arXiv:2304.11277 [pdf, other]

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

Authors: Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-Chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Pritam Damania, Bernard Nguyen, Geeta Chauhan, Yuchen Hao, Ajit Mathews, Shen Li

Abstract: It is widely acknowledged that large models have the potential to deliver superior performance across a broad range of domains. Despite the remarkable progress made in the field of machine learning systems research, which has enabled the development and exploration of large models, such abilities remain confined to a small group of advanced users and industry leaders, resulting in an implicit tech… ▽ More It is widely acknowledged that large models have the potential to deliver superior performance across a broad range of domains. Despite the remarkable progress made in the field of machine learning systems research, which has enabled the development and exploration of large models, such abilities remain confined to a small group of advanced users and industry leaders, resulting in an implicit technical barrier for the wider community to access and leverage these technologies. In this paper, we introduce PyTorch Fully Sharded Data Parallel (FSDP) as an industry-grade solution for large model training. FSDP has been closely co-designed with several key PyTorch core components including Tensor implementation, dispatcher system, and CUDA memory caching allocator, to provide non-intrusive user experiences and high training efficiency. Additionally, FSDP natively incorporates a range of techniques and settings to optimize resource utilization across a variety of hardware configurations. The experimental results demonstrate that FSDP is capable of achieving comparable performance to Distributed Data Parallel while providing support for significantly larger models with near-linear scalability in terms of TFLOPS. △ Less

Submitted 12 September, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

arXiv:2304.10799 [pdf, other]

A scalable solution for the extended multi-channel facility location problem

Authors: Etika Agarwal, Karthik S. Gurumoorthy, Ankit Ajit Jain, Shantala Manchenahally

Abstract: We study the extended version of the non-uniform, capacitated facility location problem with multiple fulfilment channels between the facilities and clients, each with their own channel capacities and service cost. Though the problem has been extensively studied in the literature, all the prior works assume a single channel of fulfilment, and the existing methods based on linear programming, prima… ▽ More We study the extended version of the non-uniform, capacitated facility location problem with multiple fulfilment channels between the facilities and clients, each with their own channel capacities and service cost. Though the problem has been extensively studied in the literature, all the prior works assume a single channel of fulfilment, and the existing methods based on linear programming, primal-dual relationships, local search heuristics etc. do not scale for a large supply chain system involving millions of decision variables. Using the concepts of sub-modularity and optimal transport theory, we present a scalable algorithm for determining the set of facilities to be opened under a cardinality constraint. By introducing various schemes such as: (i) iterative facility selection using incremental gain, (ii) approximation of the linear program using novel multi-stage Sinkhorn iterations, (iii) creation of facilities one for each fulfilment channel etc., we develop a fast but a tight approximate solution, requiring $\mathcal{O}\left(\frac{3+k}{m}ln\left(\frac{1}ε\right)\right)$ instances of optimal transport problems to select k facilities from m options, each solvable in linear time. Our algorithm is implicitly endowed with all the theoretical guarantees enjoyed by submodular maximisation problems and the Sinkhorn distances. When compared against the state-of-the-art commercial MILP solvers, we obtain a 100-fold speedup in computation, while the difference in objective values lies within a narrow range of 3%. △ Less

Submitted 21 April, 2023; originally announced April 2023.

arXiv:2304.08769 [pdf, ps, other]

Cooperative Multi-Agent Reinforcement Learning for Inventory Management

Authors: Madhav Khirwar, Karthik S. Gurumoorthy, Ankit Ajit Jain, Shantala Manchenahally

Abstract: With Reinforcement Learning (RL) for inventory management (IM) being a nascent field of research, approaches tend to be limited to simple, linear environments with implementations that are minor modifications of off-the-shelf RL algorithms. Scaling these simplistic environments to a real-world supply chain comes with a few challenges such as: minimizing the computational requirements of the enviro… ▽ More With Reinforcement Learning (RL) for inventory management (IM) being a nascent field of research, approaches tend to be limited to simple, linear environments with implementations that are minor modifications of off-the-shelf RL algorithms. Scaling these simplistic environments to a real-world supply chain comes with a few challenges such as: minimizing the computational requirements of the environment, specifying agent configurations that are representative of dynamics at real world stores and warehouses, and specifying a reward framework that encourages desirable behavior across the whole supply chain. In this work, we present a system with a custom GPU-parallelized environment that consists of one warehouse and multiple stores, a novel architecture for agent-environment dynamics incorporating enhanced state and action spaces, and a shared reward specification that seeks to optimize for a large retailer's supply chain needs. Each vertex in the supply chain graph is an independent agent that, based on its own inventory, able to place replenishment orders to the vertex upstream. The warehouse agent, aside from placing orders from the supplier, has the special property of also being able to constrain replenishment to stores downstream, which results in it learning an additional allocation sub-policy. We achieve a system that outperforms standard inventory control policies such as a base-stock policy and other RL-based specifications for 1 product, and lay out a future direction of work for multiple products. △ Less

Submitted 18 April, 2023; originally announced April 2023.

Comments: 14 pages, 5 figures

arXiv:2304.08740 [pdf, other]

Estimating Joint Probability Distribution With Low-Rank Tensor Decomposition, Radon Transforms and Dictionaries

Authors: Pranava Singhal, Waqar Mirza, Ajit Rajwade, Karthik S. Gurumoorthy

Abstract: In this paper, we describe a method for estimating the joint probability density from data samples by assuming that the underlying distribution can be decomposed as a mixture of product densities with few mixture components. Prior works have used such a decomposition to estimate the joint density from lower-dimensional marginals, which can be estimated more reliably with the same number of samples… ▽ More In this paper, we describe a method for estimating the joint probability density from data samples by assuming that the underlying distribution can be decomposed as a mixture of product densities with few mixture components. Prior works have used such a decomposition to estimate the joint density from lower-dimensional marginals, which can be estimated more reliably with the same number of samples. We combine two key ideas: dictionaries to represent 1-D densities, and random projections to estimate the joint distribution from 1-D marginals, explored separately in prior work. Our algorithm benefits from improved sample complexity over the previous dictionary-based approach by using 1-D marginals for reconstruction. We evaluate the performance of our method on estimating synthetic probability densities and compare it with the previous dictionary-based approach and Gaussian Mixture Models (GMMs). Our algorithm outperforms these other approaches in all the experimental settings. △ Less

Submitted 18 April, 2023; originally announced April 2023.

MSC Class: 62G07

arXiv:2304.06376 [pdf, other]

Signal Reconstruction from Samples at Unknown Locations with Application to 2D Unknown View Tomography

Authors: Sheel Shah, Kaishva Shah, Karthik S. Gurumoorthy, Ajit Rajwade

Abstract: It is well known that a band-limited signal can be reconstructed from its uniformly spaced samples if the sampling rate is sufficiently high. More recently, it has been proved that one can reconstruct a 1D band-limited signal even if the exact sample locations are unknown, but given a uniform distribution of the sample locations and their ordering in 1D. In this work, we extend the analytical erro… ▽ More It is well known that a band-limited signal can be reconstructed from its uniformly spaced samples if the sampling rate is sufficiently high. More recently, it has been proved that one can reconstruct a 1D band-limited signal even if the exact sample locations are unknown, but given a uniform distribution of the sample locations and their ordering in 1D. In this work, we extend the analytical error bounds in such scenarios for quasi-bandlimited (QBL) signals, and for the case of arbitrary but known sampling distributions. We also prove that such reconstruction methods are resilient to a certain proportion of errors in the specification of the sample location ordering. We then express the problem of tomographic reconstruction of 2D images from 1D Radon projections under unknown angles (2D UVT) with known angle distribution, as a special case for reconstruction of QBL signals from samples at unknown locations with known distribution. Building upon our theoretical background, we present asymptotic bounds for 2D QBL image reconstruction from 1D Radon projections in the unknown angles setting, and present an extensive set of simulations to verify these bounds in varied parameter regimes. To the best of our knowledge, this is the first piece of work to perform such an analysis for 2D UVT and explicitly relate it to advances in sampling theory, even though the associated reconstruction algorithms have been known for a long time. △ Less

Submitted 18 December, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

Comments: This is a preprint of a paper accepted to Signal Processing (Elsevier)

arXiv:2304.00086 [pdf, other]

Machine Learning for Economics Research: When What and How?

Authors: Ajit Desai

Abstract: This article provides a curated review of selected papers published in prominent economics journals that use machine learning (ML) tools for research and policy analysis. The review focuses on three key questions: (1) when ML is used in economics, (2) what ML models are commonly preferred, and (3) how they are used for economic applications. The review highlights that ML is particularly used to pr… ▽ More This article provides a curated review of selected papers published in prominent economics journals that use machine learning (ML) tools for research and policy analysis. The review focuses on three key questions: (1) when ML is used in economics, (2) what ML models are commonly preferred, and (3) how they are used for economic applications. The review highlights that ML is particularly used to process nontraditional and unstructured data, capture strong nonlinearity, and improve prediction accuracy. Deep learning models are suitable for nontraditional data, whereas ensemble learning models are preferred for traditional datasets. While traditional econometric models may suffice for analyzing low-complexity data, the increasing complexity of economic data due to rapid digitalization and the growing literature suggests that ML is becoming an essential addition to the econometrician's toolbox. △ Less

Submitted 20 April, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

arXiv:2303.18039 [pdf, other]

doi 10.1103/PhysRevD.108.124035

Laying the foundation of the effective-one-body waveform models SEOBNRv5: improved accuracy and efficiency for spinning non-precessing binary black holes

Authors: Lorenzo Pompili, Alessandra Buonanno, Héctor Estellés, Mohammed Khalil, Maarten van de Meent, Deyan P. Mihaylov, Serguei Ossokine, Michael Pürrer, Antoni Ramos-Buades, Ajit Kumar Mehta, Roberto Cotesta, Sylvain Marsat, Michael Boyle, Lawrence E. Kidder, Harald P. Pfeiffer, Mark A. Scheel, Hannes R. Rüter, Nils Vu, Reetika Dudi, Sizheng Ma, Keefe Mitman, Denyz Melchor, Sierra Thomas, Jennifer Sanchez

Abstract: We present SEOBNRv5HM, a more accurate and faster inspiral-merger-ringdown gravitational waveform model for quasi-circular, spinning, nonprecessing binary black holes within the effective-one-body (EOB) formalism. Compared to its predecessor, SEOBNRv4HM, the waveform model i) incorporates recent high-order post- Newtonian results in the inspiral, with improved resummations, ii) includes the gravit… ▽ More We present SEOBNRv5HM, a more accurate and faster inspiral-merger-ringdown gravitational waveform model for quasi-circular, spinning, nonprecessing binary black holes within the effective-one-body (EOB) formalism. Compared to its predecessor, SEOBNRv4HM, the waveform model i) incorporates recent high-order post- Newtonian results in the inspiral, with improved resummations, ii) includes the gravitational modes (l, |m|) = (3, 2), (4, 3), in addition to the (2, 2), (3, 3), (2, 1), (4, 4), (5, 5) modes already implemented in SEOBNRv4HM, iii) is calibrated to larger mass-ratios and spins using a catalog of 442 numerical-relativity (NR) simulations and 13 additional waveforms from black-hole perturbation theory, iv) incorporates information from second-order gravitational self-force (2GSF) in the nonspinning modes and radiation-reaction force. Computing the unfaithfulness against NR simulations, we find that for the dominant (2, 2) mode the maximum unfaithfulness in the total mass range $10-300 M_{\odot}$ is below $10^{-3}$ for 90% of the cases (38% for SEOBNRv4HM). When including all modes up to l = 5 we find 98% (49%) of the cases with unfaithfulness below $10^{-2} (10^{-3})$, while these numbers reduce to 88% (5%) when using SEOBNRv4HM. Furthermore, the model shows improved agreement with NR in other dynamical quantities (e.g., the angular momentum flux and binding energy), providing a powerful check of its physical robustness. We implemented the waveform model in a high-performance Python package (pySEOBNR), which leads to evaluation times faster than SEOBNRv4HM by a factor 10 to 50, depending on the configuration, and provides the flexibility to easily include spin-precession and eccentric effects, thus making it the starting point for a new generation of EOBNR waveform models (SEOBNRv5) to be employed for upcoming observing runs of the LIGO-Virgo-KAGRA detectors. △ Less

Submitted 31 March, 2023; originally announced March 2023.

Journal ref: Phys. Rev. D 108, 124035 (2023)

arXiv:2303.13284 [pdf, other]

GETT-QA: Graph Embedding based T2T Transformer for Knowledge Graph Question Answering

Authors: Debayan Banerjee, Pranav Ajit Nair, Ricardo Usbeck, Chris Biemann

Abstract: In this work, we present an end-to-end Knowledge Graph Question Answering (KGQA) system named GETT-QA. GETT-QA uses T5, a popular text-to-text pre-trained language model. The model takes a question in natural language as input and produces a simpler form of the intended SPARQL query. In the simpler form, the model does not directly produce entity and relation IDs. Instead, it produces correspondin… ▽ More In this work, we present an end-to-end Knowledge Graph Question Answering (KGQA) system named GETT-QA. GETT-QA uses T5, a popular text-to-text pre-trained language model. The model takes a question in natural language as input and produces a simpler form of the intended SPARQL query. In the simpler form, the model does not directly produce entity and relation IDs. Instead, it produces corresponding entity and relation labels. The labels are grounded to KG entity and relation IDs in a subsequent step. To further improve the results, we instruct the model to produce a truncated version of the KG embedding for each entity. The truncated KG embedding enables a finer search for disambiguation purposes. We find that T5 is able to learn the truncated KG embeddings without any change of loss function, improving KGQA performance. As a result, we report strong results for LC-QuAD 2.0 and SimpleQuestions-Wikidata datasets on end-to-end KGQA over Wikidata. △ Less

Submitted 28 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

Comments: 16 pages single column format accepted at ESWC 2023 research track

arXiv:2303.06277 [pdf, other]

SPOTR: Spatio-temporal Pose Transformers for Human Motion Prediction

Authors: Avinash Ajit Nargund, Misha Sra

Abstract: 3D human motion prediction is a research area of high significance and a challenge in computer vision. It is useful for the design of many applications including robotics and autonomous driving. Traditionally, autogregressive models have been used to predict human motion. However, these models have high computation needs and error accumulation that make it difficult to use them for realtime applic… ▽ More 3D human motion prediction is a research area of high significance and a challenge in computer vision. It is useful for the design of many applications including robotics and autonomous driving. Traditionally, autogregressive models have been used to predict human motion. However, these models have high computation needs and error accumulation that make it difficult to use them for realtime applications. In this paper, we present a non-autogressive model for human motion prediction. We focus on learning spatio-temporal representations non-autoregressively for generation of plausible future motions. We propose a novel architecture that leverages the recently proposed Transformers. Human motion involves complex spatio-temporal dynamics with joints affecting the position and rotation of each other even though they are not connected directly. The proposed model extracts these dynamics using both convolutions and the self-attention mechanism. Using specialized spatial and temporal self-attention to augment the features extracted through convolution allows our model to generate spatio-temporally coherent predictions in parallel independent of the activity. Our contributions are threefold: (i) we frame human motion prediction as a sequence-to-sequence problem and propose a non-autoregressive Transformer to forecast a sequence of poses in parallel; (ii) our method is activity agnostic; (iii) we show that despite its simplicity, our approach is able to make accurate predictions, achieving better or comparable results compared to the state-of-the-art on two public datasets, with far fewer parameters and much faster inference. △ Less

Submitted 10 March, 2023; originally announced March 2023.

arXiv:2303.04259 [pdf, other]

doi 10.1103/PhysRevB.107.235121

Robust quantum many-body scars in the one-dimensional spin-1 Kitaev model

Authors: Sashikanta Mohapatra, Ajit C. Balram

Abstract: Experimental observation of coherent oscillations in a Rydberg atom chain [Bernien et al., Nature 551, 579 (2017)] has led to the discovery of quantum many-body scars (QMBS) which is a new paradigm for ergodicity-breaking. The experimental findings in the Rydberg chain can be well captured by a kinetically constrained model called the "PXP" model, which has been shown to host the Eigenstate Therma… ▽ More Experimental observation of coherent oscillations in a Rydberg atom chain [Bernien et al., Nature 551, 579 (2017)] has led to the discovery of quantum many-body scars (QMBS) which is a new paradigm for ergodicity-breaking. The experimental findings in the Rydberg chain can be well captured by a kinetically constrained model called the "PXP" model, which has been shown to host the Eigenstate Thermalization Hypothesis (ETH)-violating scar states in the middle of the spectrum. Much effort has been put into identifying similar kinetically restricted systems that show a violation of ETH. In this work, we study the QMBS that can arise in one such model, namely the spin-$1$ Kitaev chain, where owing to some conserved quantities, the Hilbert space gets fragmented into unequal disconnected subspaces. Recently, You et al. [Phys. Rev. Research 4, 013103 (2022)] showed that the ground state sector of this chain can be mapped exactly onto the prototypical PXP model and thus hosts QMBSs. Here, we demonstrate that the phenomenon of scarring is also present in other sectors, and in particular, we identify a sector that exhibits substantially more scarring than the ground state one. We propose an initial state and numerically demonstrate that its fidelity revivals are robust and longer-lived than those in the PXP model. △ Less

Submitted 7 March, 2023; originally announced March 2023.

arXiv:2303.03647 [pdf, ps, other]

Parity distribution and divisibility of Mex-related partition functions

Authors: Subhrajyoti Bhattacharyya, Rupam Barman, Ajit Singh, Apu Kumar Saha

Abstract: Andrews and Newman introduced the mex-function $\text{mex}_{A,a}(λ)$ for an integer partition $λ$ of a positive integer $n$ as the smallest positive integer congruent to $a$ modulo $A$ that is not a part of $λ$. They then defined $p_{A,a}(n)$ to be the number of partitions $λ$ of $n$ satisfying $\text{mex}_{A,a}(λ)\equiv a\pmod{2A}$. They found the generating function for $p_{t,t}(n)$ and… ▽ More Andrews and Newman introduced the mex-function $\text{mex}_{A,a}(λ)$ for an integer partition $λ$ of a positive integer $n$ as the smallest positive integer congruent to $a$ modulo $A$ that is not a part of $λ$. They then defined $p_{A,a}(n)$ to be the number of partitions $λ$ of $n$ satisfying $\text{mex}_{A,a}(λ)\equiv a\pmod{2A}$. They found the generating function for $p_{t,t}(n)$ and $p_{2t,t}(n)$ for any positive integer $t$, and studied their arithmetic properties for some small values of $t$. In this article, we study the partition function $p_{mt,t}(n)$ for all positive integers $m$ and $t$. We show that for sufficiently large $X$, the number of all positive integer $n\leq X$ such that $p_{mt,t}(n)$ is an even number is at least $\mathcal{O}(\sqrt{X/3})$ for all positive integers $m$ and $t$. We also prove that for sufficiently large $X$, the number of all positive integer $n\leq X$ such that $p_{mp,p}(n)$ is an odd number is at least $\mathcal{O}(\log \log X)$ for all $m\not \equiv 0\pmod{3}$ and all primes $p\equiv 1\pmod{3}$. Finally, we establish identities connecting the ordinary partition function to $p_{mt,t}(n)$. △ Less

Submitted 6 March, 2023; originally announced March 2023.

Comments: 9 pages

arXiv:2302.14635 [pdf, other]

H-AES: Towards Automated Essay Scoring for Hindi

Authors: Shubhankar Singh, Anirudh Pupneja, Shivaansh Mital, Cheril Shah, Manish Bawkar, Lakshman Prasad Gupta, Ajit Kumar, Yaman Kumar, Rushali Gupta, Rajiv Ratn Shah

Abstract: The use of Natural Language Processing (NLP) for Automated Essay Scoring (AES) has been well explored in the English language, with benchmark models exhibiting performance comparable to human scorers. However, AES in Hindi and other low-resource languages remains unexplored. In this study, we reproduce and compare state-of-the-art methods for AES in the Hindi domain. We employ classical feature-ba… ▽ More The use of Natural Language Processing (NLP) for Automated Essay Scoring (AES) has been well explored in the English language, with benchmark models exhibiting performance comparable to human scorers. However, AES in Hindi and other low-resource languages remains unexplored. In this study, we reproduce and compare state-of-the-art methods for AES in the Hindi domain. We employ classical feature-based Machine Learning (ML) and advanced end-to-end models, including LSTM Networks and Fine-Tuned Transformer Architecture, in our approach and derive results comparable to those in the English language domain. Hindi being a low-resource language, lacks a dedicated essay-scoring corpus. We train and evaluate our models using translated English essays and empirically measure their performance on our own small-scale, real-world Hindi corpus. We follow this up with an in-depth analysis discussing prompt-specific behavior of different language models implemented. △ Less

Submitted 28 February, 2023; originally announced February 2023.

Comments: 9 pages, 3 Tables, To be published as a part of Proceedings of the 37th AAAI Conference on Artificial Intelligence

arXiv:2302.09383 [pdf, ps, other]

Polynomial representation for multipartite entanglement of resonating valence bond ladders

Authors: Ajit Iqbal Singh, Aditi Sen De, Ujjwal Sen

Abstract: A resonating valence bond (RVB) state of a lattice of quantum systems is a potential resource for quantum computing and communicating devices. It is a superposition of singlet, i.e., dimer, coverings - often restricted to nearest-neighbour ones - of the lattice. We develop a polynomial representation of multipartite quantum states to prove that RVB states on ladder lattices possess genuine multipa… ▽ More A resonating valence bond (RVB) state of a lattice of quantum systems is a potential resource for quantum computing and communicating devices. It is a superposition of singlet, i.e., dimer, coverings - often restricted to nearest-neighbour ones - of the lattice. We develop a polynomial representation of multipartite quantum states to prove that RVB states on ladder lattices possess genuine multipartite entanglement. The multipartite entanglement of doped RVB states and RVB states that are superposed with varying weights for singlet coverings of ladder lattices can both be detected by using this technique. △ Less

Submitted 18 February, 2023; originally announced February 2023.

Comments: 43 pages, 16 figures

arXiv:2302.02478 [pdf, other]

doi 10.1103/PhysRevB.107.235111

Prediction of non-Abelian fractional quantum Hall effect at $ν= 2 + \frac{4}{11}$

Authors: Koyena Bose, Ajit C. Balram

Abstract: The fractional quantum Hall effect (FQHE) in the second Landau level (SLL) likely stabilizes non-Abelian topological orders. Recently, a parton sequence has been proposed to capture many of the fractions observed in the SLL [Ajit C. Balram, SciPost Phys. {\bf 10}, 083 (2021)]. We consider the first member of this sequence which has not yet been studied, which is a non-Abelian state that occurs at… ▽ More The fractional quantum Hall effect (FQHE) in the second Landau level (SLL) likely stabilizes non-Abelian topological orders. Recently, a parton sequence has been proposed to capture many of the fractions observed in the SLL [Ajit C. Balram, SciPost Phys. {\bf 10}, 083 (2021)]. We consider the first member of this sequence which has not yet been studied, which is a non-Abelian state that occurs at $4/11$. As yet FQHE in the SLL at this fraction has not been observed in experiments. Nevertheless, by studying its competition with other candidate FQHE states in the SLL we show that this parton state might be viable. We also make predictions for experimentally measurable properties of the parton state which can distinguish it from other topological orders. △ Less

Submitted 6 June, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

Comments: 7 pages, 2 figures

arXiv:2301.11192 [pdf, ps, other]

Certain Diophantine equations and new parity results for $21$-regular partitions

Authors: Ajit Singh, Gurinder Singh, Rupam Barman

Abstract: For a positive integer $t\geq 2$, let $b_{t}(n)$ denote the number of $t$-regular partitions of a nonnegative integer $n$. In a recent paper, Keith and Zanello investigated the parity of $b_{t}(n)$ when $t\leq 28$. They discovered new infinite families of Ramanujan type congruences modulo 2 for $b_{21}(n)$ involving every prime $p$ with $p\equiv 13, 17, 19, 23 \pmod{24}$. In this paper, we investi… ▽ More For a positive integer $t\geq 2$, let $b_{t}(n)$ denote the number of $t$-regular partitions of a nonnegative integer $n$. In a recent paper, Keith and Zanello investigated the parity of $b_{t}(n)$ when $t\leq 28$. They discovered new infinite families of Ramanujan type congruences modulo 2 for $b_{21}(n)$ involving every prime $p$ with $p\equiv 13, 17, 19, 23 \pmod{24}$. In this paper, we investigate the parity of $b_{21}(n)$ involving the primes $p$ with $p\equiv 1, 5, 7, 11 \pmod{24}$. We prove new infinite families of Ramanujan type congruences modulo 2 for $b_{21}(n)$ involving the odd primes $p$ for which the Diophantine equation $8x^2+27y^2=jp$ has primitive solutions for some $j\in\left\lbrace1,4,8\right\rbrace$, and we also prove that the Dirichlet density of such primes is equal to $1/6$. Recently, Yao provided new infinite families of congruences modulo $2$ for $b_{3}(n)$ and those congruences involve every prime $p\geq 5$ based on Newman's results. Following a similar approach, we prove new infinite families of congruences modulo $2$ for $b_{21}(n)$, and these congruences imply that $b_{21}(n)$ is odd infinitely often. △ Less

Submitted 26 January, 2023; originally announced January 2023.

Comments: 15 pages

arXiv:2301.07338 [pdf, ps, other]

Generalizations of Chainability and Compactness, and the Hypertopologies

Authors: Ajit Kumar Gupta, Saikat Mukherjee

Abstract: We study two properties for subsets of a metric space. One of them is generalization of chainability, finite chainability, and Menger convexity for metric spaces; while the other is a generalization of compactness. We explore the basic results related to these two properties. Further, in the perspective of these properties, we explore relations among the Hausdorff, Vietoris, and locally finite hyp… ▽ More We study two properties for subsets of a metric space. One of them is generalization of chainability, finite chainability, and Menger convexity for metric spaces; while the other is a generalization of compactness. We explore the basic results related to these two properties. Further, in the perspective of these properties, we explore relations among the Hausdorff, Vietoris, and locally finite hypertopologies. △ Less

Submitted 18 January, 2023; originally announced January 2023.

MSC Class: 54B20

arXiv:2301.04169 [pdf, other]

doi 10.1103/PhysRevLett.130.176501

Signatures of Supersymmetry in the $ν{=}5/2$ Fractional Quantum Hall Effect

Authors: Songyang Pu, Ajit C. Balram, Mikael Fremling, Andrey Gromov, Zlatko Papić

Abstract: The Moore-Read state, one of the leading candidates for describing the fractional quantum Hall effect at filling factor $ν{=}5/2$, is a paradigmatic $p$-wave superconductor with non-Abelian topological order. Among its many exotic properties, the state hosts two collective modes: a bosonic density wave and a neutral fermion mode that arises from an unpaired electron in the condensate. It has recen… ▽ More The Moore-Read state, one of the leading candidates for describing the fractional quantum Hall effect at filling factor $ν{=}5/2$, is a paradigmatic $p$-wave superconductor with non-Abelian topological order. Among its many exotic properties, the state hosts two collective modes: a bosonic density wave and a neutral fermion mode that arises from an unpaired electron in the condensate. It has recently been proposed that the descriptions of the two modes can be unified by postulating supersymmetry (SUSY) that relates them in the long-wavelength limit. Here we extend the SUSY description to construct wave functions of the two modes on closed surfaces, such as the sphere and torus, and we test the resulting states in large-scale numerical simulations. We demonstrate the equivalence in the long-wavelength limit between SUSY wave functions and previous descriptions of collective modes based on the Girvin-MacDonald-Platzman ansatz, Jack polynomials, and bipartite composite fermions. Leveraging the first-quantized form of the SUSY wave functions, we study their energies using the Monte Carlo method and show that realistic $ν{=}5/2$ systems are close to the putative SUSY point, where the two collective modes become degenerate in energy. △ Less

Submitted 7 May, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

Comments: Main text 6 pages, 4 figures with attached supplementary information

Journal ref: Phys. Rev. Lett. 130, 176501 (2023)

arXiv:2212.01706 [pdf, other]

doi 10.1103/PhysRevLett.130.126201

Fractional quantum Hall effect with unconventional pairing in monolayer graphene

Authors: Anirban Sharma, Songyang Pu, Ajit C. Balram, Jainendra K. Jain

Abstract: Motivated by the observation of even denominator fractional quantum Hall effect in the $n=3$ Landau level of monolayer graphene [Y. Kim $\textit{et al.}$, Nature Physics $\textbf{15}$, 154 (2019)], we consider a Bardeen-Cooper-Schrieffer variational state for composite fermions and find that the composite-fermion Fermi sea in this Landau level is unstable to an $f$-wave pairing. Analogous calculat… ▽ More Motivated by the observation of even denominator fractional quantum Hall effect in the $n=3$ Landau level of monolayer graphene [Y. Kim $\textit{et al.}$, Nature Physics $\textbf{15}$, 154 (2019)], we consider a Bardeen-Cooper-Schrieffer variational state for composite fermions and find that the composite-fermion Fermi sea in this Landau level is unstable to an $f$-wave pairing. Analogous calculation suggests the possibility of a $p$-wave pairing of composite fermions at half filling in the $n=2$ graphene Landau level, whereas no pairing instability is found at half filling in the $n=0$ and $1$ graphene Landau levels. The relevance of these results to experiments is discussed. △ Less

Submitted 3 December, 2022; originally announced December 2022.

Comments: 13 pages, 7 figures

arXiv:2212.00686 [pdf, other]

doi 10.1103/PhysRevB.107.125119

Supergravity model of the Haldane-Rezayi fractional quantum Hall state

Authors: Dung Xuan Nguyen, Kartik Prabhu, Ajit C. Balram, Andrey Gromov

Abstract: Supersymmetry and supergravity were invented in the 1970s to solve fundamental problems in high-energy physics. Even though neither of these ideas has yet been confirmed in high-energy and cosmology experiments, they have been beneficial in constructing numerous theoretical models, including superstring theory. Despite the absence of supersymmetry in particle physics, it can potentially emerge in… ▽ More Supersymmetry and supergravity were invented in the 1970s to solve fundamental problems in high-energy physics. Even though neither of these ideas has yet been confirmed in high-energy and cosmology experiments, they have been beneficial in constructing numerous theoretical models, including superstring theory. Despite the absence of supersymmetry in particle physics, it can potentially emerge in exotic phases of strongly correlated condensed matter systems. In this paper, we propose a supergravity model that describes the low-energy physics of the Haldane-Rezayi state, a gapless quantum Hall state that occurs in a half-filled Landau level. We show that the corresponding edge modes of the Haldane-Rezayi state and the Girvin-MacDonald-Platzman algebra appear naturally in the supergravity model. Finally, we substantiate our theoretical findings with numerical exact diagonalization calculations that support the appearance of the emergent graviton and gravitino excitations in the Haldane-Rezayi state. △ Less

Submitted 9 March, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

Comments: v2: new appendix B deriving the explicit action, updated references, matches published version. v1: 11 pages 1 figure. Comments are welcome

Journal ref: Phys. Rev. B 107, 125119 (2023)

arXiv:2211.08388 [pdf, other]

doi 10.1093/mnras/stac3336

Photometric identification of compact galaxies, stars and quasars using multiple neural networks

Authors: Siddharth Chaini, Atharva Bagul, Anish Deshpande, Rishi Gondkar, Kaushal Sharma, M. Vivek, Ajit Kembhavi

Abstract: We present MargNet, a deep learning-based classifier for identifying stars, quasars and compact galaxies using photometric parameters and images from the Sloan Digital Sky Survey (SDSS) Data Release 16 (DR16) catalogue. MargNet consists of a combination of Convolutional Neural Network (CNN) and Artificial Neural Network (ANN) architectures. Using a carefully curated dataset consisting of 240,000 c… ▽ More We present MargNet, a deep learning-based classifier for identifying stars, quasars and compact galaxies using photometric parameters and images from the Sloan Digital Sky Survey (SDSS) Data Release 16 (DR16) catalogue. MargNet consists of a combination of Convolutional Neural Network (CNN) and Artificial Neural Network (ANN) architectures. Using a carefully curated dataset consisting of 240,000 compact objects and an additional 150,000 faint objects, the machine learns classification directly from the data, minimising the need for human intervention. MargNet is the first classifier focusing exclusively on compact galaxies and performs better than other methods to classify compact galaxies from stars and quasars, even at fainter magnitudes. This model and feature engineering in such deep learning architectures will provide greater success in identifying objects in the ongoing and upcoming surveys, such as Dark Energy Survey (DES) and images from the Vera C. Rubin Observatory. △ Less

Submitted 15 November, 2022; originally announced November 2022.

Comments: 14 pages, 10 figures, Accepted for publication in MNRAS

arXiv:2211.07335 [pdf, other]

doi 10.1103/PhysRevLett.130.186302

Composite fermion pairing induced by Landau level mixing

Authors: Tongzhou Zhao, Ajit C. Balram, J. K. Jain

Abstract: Pairing of composite fermions provides a possible mechanism for fractional quantum Hall effect at even denominator fractions and is believed to serve as a platform for realizing quasiparticles with non-Abelian braiding statistics. We present results from fixed-phase diffusion Monte Carlo calculations which predict that substantial Landau level mixing can induce a pairing of composite fermions at f… ▽ More Pairing of composite fermions provides a possible mechanism for fractional quantum Hall effect at even denominator fractions and is believed to serve as a platform for realizing quasiparticles with non-Abelian braiding statistics. We present results from fixed-phase diffusion Monte Carlo calculations which predict that substantial Landau level mixing can induce a pairing of composite fermions at filling factors $ν=1/2$ and $ν=1/4$ in the $l=-3$ relative angular momentum channel, thereby destabilizing the composite-fermion Fermi seas to produce non-Abelian fractional quantum Hall states. △ Less

Submitted 4 May, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

Journal ref: Phys. Rev. Lett. 130 (2023) 186302

arXiv:2211.03731 [pdf, other]

doi 10.1109/TSP.2023.3287671

Group Testing with Side Information via Generalized Approximate Message Passing

Authors: Shu-Jie Cao, Ritesh Goenka, Chau-Wai Wong, Ajit Rajwade, Dror Baron

Abstract: Group testing can help maintain a widespread testing program using fewer resources amid a pandemic. In a group testing setup, we are given n samples, one per individual. Each individual is either infected or uninfected. These samples are arranged into m < n pooled samples, where each pool is obtained by mixing a subset of the n individual samples. Infected individuals are then identified using a g… ▽ More Group testing can help maintain a widespread testing program using fewer resources amid a pandemic. In a group testing setup, we are given n samples, one per individual. Each individual is either infected or uninfected. These samples are arranged into m < n pooled samples, where each pool is obtained by mixing a subset of the n individual samples. Infected individuals are then identified using a group testing algorithm. In this paper, we incorporate side information (SI) collected from contact tracing (CT) into nonadaptive/single-stage group testing algorithms. We generate different types of possible CT SI data by incorporating different possible characteristics of the spread of disease. These data are fed into a group testing framework based on generalized approximate message passing (GAMP). Numerical results show that our GAMP-based algorithms provide improved accuracy. △ Less

Submitted 16 June, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

Comments: To appear in IEEE Trans. Signal Processing. arXiv admin note: substantial text overlap with arXiv:2106.02699, arXiv:2011.14186

arXiv:2210.17489 [pdf, other]

Clique factors in powers of graphs

Authors: Ajit Diwan, Aniruddha Joshi

Abstract: The $k$th power of a graph $G$, denoted $G^k$, has the same vertex set as $G$, and two vertices are adjacent in $G^k$ if and only if there exists a path between them in $G$ of length at most $k$. A $K_r$-factor in a graph is a spanning subgraph in which every component is a complete graph of order $r$. It is easy to show that for any connected graph $G$ of order divisible by $r$, $G^{2r-2}$ contai… ▽ More The $k$th power of a graph $G$, denoted $G^k$, has the same vertex set as $G$, and two vertices are adjacent in $G^k$ if and only if there exists a path between them in $G$ of length at most $k$. A $K_r$-factor in a graph is a spanning subgraph in which every component is a complete graph of order $r$. It is easy to show that for any connected graph $G$ of order divisible by $r$, $G^{2r-2}$ contains a $K_r$-factor. This is best possible as there exist connected graphs $G$ of order divisible by $r$ such that $G^{2r-3}$ does not contain a $K_r$-factor. We conjecture that for any 2-connected graph $G$ of order divisible by $r$, $G^r$ contains a $K_r$-factor. This was known for $r \le 3$ and we prove it for $r = 4$. We prove a stronger statement that the vertex set of any 2-connected graph $G$ of order $4k$ can be partitioned into $k$ parts of size $4$, such that the four vertices in any part are contained in a subtree of $G$ of order at most 5. More generally, we conjecture that for any partition of $n = n_1+n_2+\cdots+n_k$, the vertex set of any 2-connected graph $G$ of order $n$ can be partitioned into $k$ parts $V_1,V_2,\ldots,V_k$, such that $|V_i| = n_i$ and $V_i \subseteq V(T_i)$ for some subtree $T_i$ of $G$ of order at most $n_i+1$, for $1 \le i \le k$. △ Less

Submitted 27 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

Comments: This version has a slightly simplified proof

MSC Class: 05C12; 05C40; 05C69

arXiv:2210.13769 [pdf, other]

GlobalFlowNet: Video Stabilization using Deep Distilled Global Motion Estimates

Authors: Jerin Geo James, Devansh Jain, Ajit Rajwade

Abstract: Videos shot by laymen using hand-held cameras contain undesirable shaky motion. Estimating the global motion between successive frames, in a manner not influenced by moving objects, is central to many video stabilization techniques, but poses significant challenges. A large body of work uses 2D affine transformations or homography for the global motion. However, in this work, we introduce a more g… ▽ More Videos shot by laymen using hand-held cameras contain undesirable shaky motion. Estimating the global motion between successive frames, in a manner not influenced by moving objects, is central to many video stabilization techniques, but poses significant challenges. A large body of work uses 2D affine transformations or homography for the global motion. However, in this work, we introduce a more general representation scheme, which adapts any existing optical flow network to ignore the moving objects and obtain a spatially smooth approximation of the global motion between video frames. We achieve this by a knowledge distillation approach, where we first introduce a low pass filter module into the optical flow network to constrain the predicted optical flow to be spatially smooth. This becomes our student network, named as \textsc{GlobalFlowNet}. Then, using the original optical flow network as the teacher network, we train the student network using a robust loss function. Given a trained \textsc{GlobalFlowNet}, we stabilize videos using a two stage process. In the first stage, we correct the instability in affine parameters using a quadratic programming approach constrained by a user-specified cropping limit to control loss of field of view. In the second stage, we stabilize the video further by smoothing global motion parameters, expressed using a small number of discrete cosine transform coefficients. In extensive experiments on a variety of different videos, our technique outperforms state of the art techniques in terms of subjective quality and different quantitative measures of video stability. The source code is publicly available at \href{https://github.com/GlobalFlowNet/GlobalFlowNet}{https://github.com/GlobalFlowNet/GlobalFlowNet} △ Less

Submitted 4 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

Comments: Accepted in WACV 2023

arXiv:2210.07668 [pdf, other]

doi 10.1103/PhysRevB.106.245137

Anti-site disorder and Berry curvature driven anomalous Hall effect in spin gapless semiconducting Mn2CoAl Heusler compound

Authors: Nisha Shahi, Ajit K. Jena, Gaurav K. Shukla, Vishal Kumar, Shivani Rastogi, K. K. Dubey, Indu Rajput, Sonali Baral, Archana Lakhani, Seung-Cheol Lee, Satadeep Bhattacharjee, Sanjay Singh

Abstract: Spin gapless semiconductors exhibit a finite band gap for one spin channel and closed gap for other spin channel, emerged as a new state of magnetic materials with a great potential for spintronic applications. The first experimental evidence for the spin gapless semiconducting behavior was observed in an inverse Heusler compound Mn2CoAl. Here, we report a detailed investigation of the crystal str… ▽ More Spin gapless semiconductors exhibit a finite band gap for one spin channel and closed gap for other spin channel, emerged as a new state of magnetic materials with a great potential for spintronic applications. The first experimental evidence for the spin gapless semiconducting behavior was observed in an inverse Heusler compound Mn2CoAl. Here, we report a detailed investigation of the crystal structure and anomalous Hall effect in the Mn2CoAl using experimental and theoretical studies. The analysis of the high-resolution synchrotron x-ray diffraction data shows anti-site disorder between Mn and Al atoms within the inverse Heusler structure. The temperature-dependent resistivity shows semiconducting behavior and follows Mooijs criteria for disordered metal. Scaling behavior of the anomalous Hall resistivity suggests that the anomalous Hall effect in the Mn2CoAl is primarily governed by intrinsic mechanism due to the Berry curvature in momentum space. The experimental intrinsic anomalous Hall conductivity (AHC) is found to be 35 S/cm, which is considerably larger than the theoretically predicted value for ordered Mn2CoAl. Our first-principle calculations conclude that the anti-site disorder between Mn and Al atoms enhances the Berry curvature and hence the value of intrinsic AHC, which is in a very well agreement with the experiment. △ Less

Submitted 14 October, 2022; originally announced October 2022.

arXiv:2210.04662 [pdf]

doi 10.1088/2053-1583/acc7b6

Controlled defect production in monolayer MoS2 via electron irradiation at ultralow accelerating voltages

Authors: Ajit Kumar Dash, Hariharan Swaminathan, Ethan Berger, Mainak Mondal, Touko Lehenkari, Pushp Raj Prasad, Kenji Watanabe, Takashi Taniguchi, Hannu-Pekka Komsa, Akshay Singh

Abstract: Control on spatial location and density of defects in 2D materials can be achieved using electron beam irradiation. Conversely, ultralow accelerating voltages (less than or equal to 5kV) are used to measure surface morphology, with no expected defect creation. We find clear signatures of defect creation in monolayer (ML) MoS2 at these voltages. Evolution of E' and A1' Raman modes with electron dos… ▽ More Control on spatial location and density of defects in 2D materials can be achieved using electron beam irradiation. Conversely, ultralow accelerating voltages (less than or equal to 5kV) are used to measure surface morphology, with no expected defect creation. We find clear signatures of defect creation in monolayer (ML) MoS2 at these voltages. Evolution of E' and A1' Raman modes with electron dose, and appearance of defect activated peaks indicate defect formation. To simulate Raman spectra of MoS2 at realistic defect distributions, while retaining density-functional theory accuracy, we combine machine-learning force fields for phonons and eigenmode projection approach for Raman tensors. Simulated spectra agree with experiments, with sulphur vacancies as suggested defects. We decouple defects, doping and carbonaceous contamination using control (hBN covered and encapsulated MoS2) samples. We observe cryogenic PL quenching and defect peaks, and find that carbonaceous contamination does not affect defect creation. These studies have applications in photonics and quantum emitters. △ Less

Submitted 28 March, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

Comments: 49 pages, 24 figures, 4 tables

arXiv:2209.15392 [pdf, other]

Improving the Efficiency of Payments Systems Using Quantum Computing

Authors: Christopher McMahon, Donald McGillivray, Ajit Desai, Francisco Rivadeneyra, Jean-Paul Lam, Thomas Lo, Danica Marsden, Vladimir Skavysh

Abstract: High-value payment systems (HVPSs) are typically liquidity-intensive as the payment requests are indivisible and settled on a gross basis. Finding the right order in which payments should be processed to maximize the liquidity efficiency of these systems is an $NP$-hard combinatorial optimization problem, which quantum algorithms may be able to tackle at meaningful scales. We developed an algorith… ▽ More High-value payment systems (HVPSs) are typically liquidity-intensive as the payment requests are indivisible and settled on a gross basis. Finding the right order in which payments should be processed to maximize the liquidity efficiency of these systems is an $NP$-hard combinatorial optimization problem, which quantum algorithms may be able to tackle at meaningful scales. We developed an algorithm and ran it on a hybrid quantum annealing solver to find an ordering of payments that reduced the amount of system liquidity necessary without substantially increasing payment delays. Despite the limitations in size and speed of today's quantum computers, our algorithm provided quantifiable efficiency improvements when applied to the Canadian HVPS using a 30-day sample of transaction data. By reordering each batch of 70 payments as they entered the queue, we achieved an average of C\$240 million in daily liquidity savings, with a settlement delay of approximately 90 seconds. For a few days in the sample, the liquidity savings exceeded C\$1 billion. This algorithm could be incorporated as a centralized preprocessor into existing HVPS without entailing a fundamental change to their risk management models. △ Less

Submitted 17 January, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

arXiv:2209.07882 [pdf, other]

Solving Stochastic PDEs Using FEniCS and UQtk

Authors: Ajit Desai

Abstract: The intrusive (sample-free) spectral stochastic finite element method (SSFEM) is a powerful numerical tool for solving stochastic partial differential equations (PDEs). However, it is not widely adopted in academic and industrial applications because it demands intrusive adjustments in the PDE solver, which require substantial coding efforts compared to the non-intrusive (sampling) SSFEM. Using an… ▽ More The intrusive (sample-free) spectral stochastic finite element method (SSFEM) is a powerful numerical tool for solving stochastic partial differential equations (PDEs). However, it is not widely adopted in academic and industrial applications because it demands intrusive adjustments in the PDE solver, which require substantial coding efforts compared to the non-intrusive (sampling) SSFEM. Using an example of stochastic PDE, in this article, we demonstrate that the implementational challenges of the intrusive approach can be alleviated using FEniCS -- a general purpose finite element package and UQTk -- a collection of libraries and tools for the quantification of uncertainty. Furthermore, the algorithmic details and code snippets are provided to assist computational scientists in implementing these methods for their applications. This article is extracted from the author's thesis [1]. △ Less

Submitted 19 September, 2022; v1 submitted 16 September, 2022; originally announced September 2022.

arXiv:2209.01639 [pdf, ps, other]

Arithmetic properties of certain $t$-regular partitions

Authors: Rupam Barman, Ajit Singh, Gurinder Singh

Abstract: For a positive integer $t\geq 2$, let $b_{t}(n)$ denote the number of $t$-regular partitions of a nonnegative integer $n$. Motivated by some recent conjectures of Keith and Zanello, we establish infinite families of congruences modulo $2$ for $b_9(n)$ and $b_{19}(n)$. We prove some specific cases of two conjectures of Keith and Zanello on self-similarities of $b_9(n)$ and $b_{19}(n)$ modulo $2$. W… ▽ More For a positive integer $t\geq 2$, let $b_{t}(n)$ denote the number of $t$-regular partitions of a nonnegative integer $n$. Motivated by some recent conjectures of Keith and Zanello, we establish infinite families of congruences modulo $2$ for $b_9(n)$ and $b_{19}(n)$. We prove some specific cases of two conjectures of Keith and Zanello on self-similarities of $b_9(n)$ and $b_{19}(n)$ modulo $2$. We also relate $b_{t}(n)$ to the ordinary partition function, and prove that $b_{t}(n)$ satisfies the Ramanujan's famous congruences for some infinite families of $t$. For $t\in \{6,10,14,15,18,20,22,26,27,28\}$, Keith and Zanello conjectured that there are no integers $A>0$ and $B\geq 0$ for which $b_t(An+ B)\equiv 0\pmod 2$ for all $n\geq 0$. We prove that, for any $t\geq 2$ and prime $\ell$, there are infinitely many arithmetic progressions $An+B$ for which $\sum_{n=0}^{\infty}b_t(An+B)q^n\not\equiv0 \pmod{\ell}$. Next, we obtain quantitative estimates for the distributions of $b_{6}(n), b_{10}(n)$ and $b_{14}(n)$ modulo 2. We further study the odd densities of certain infinite families of eta-quotients related to the 7-regular and $13$-regular partition functions. △ Less

Submitted 4 September, 2022; originally announced September 2022.

Comments: 17 pages

arXiv:2209.00948 [pdf, other]

doi 10.3390/forecast5040036

Macroeconomic Predictions using Payments Data and Machine Learning

Authors: James T. E. Chapman, Ajit Desai

Abstract: Predicting the economy's short-term dynamics -- a vital input to economic agents' decision-making process -- often uses lagged indicators in linear models. This is typically sufficient during normal times but could prove inadequate during crisis periods. This paper aims to demonstrate that non-traditional and timely data such as retail and wholesale payments, with the aid of nonlinear machine lear… ▽ More Predicting the economy's short-term dynamics -- a vital input to economic agents' decision-making process -- often uses lagged indicators in linear models. This is typically sufficient during normal times but could prove inadequate during crisis periods. This paper aims to demonstrate that non-traditional and timely data such as retail and wholesale payments, with the aid of nonlinear machine learning approaches, can provide policymakers with sophisticated models to accurately estimate key macroeconomic indicators in near real-time. Moreover, we provide a set of econometric tools to mitigate overfitting and interpretability challenges in machine learning models to improve their effectiveness for policy use. Our models with payments data, nonlinear methods, and tailored cross-validation approaches help improve macroeconomic nowcasting accuracy up to 40\% -- with higher gains during the COVID-19 period. We observe that the contribution of payments data for economic predictions is small and linear during low and normal growth periods. However, the payments data contribution is large, asymmetrical, and nonlinear during strong negative or positive growth periods. △ Less

Submitted 2 September, 2022; originally announced September 2022.

Report number: 2023, 5(4)

Journal ref: Forecasting, 2023

arXiv:2208.10713 [pdf, ps, other]

Domain Decomposition of Stochastic PDEs: Development of Probabilistic Wirebasket-based Two-level Preconditioners

Authors: Ajit Desai, Mohammad Khalil, Chris L. Pettit, Dominique Poirel, Abhijit Sarkar

Abstract: Realistic physical phenomena exhibit random fluctuations across many scales in the input and output processes. Models of these phenomena require stochastic PDEs. For three-dimensional coupled (vector-valued) stochastic PDEs (SPDEs), for instance, arising in linear elasticity, the existing two-level domain decomposition solvers with the vertex-based coarse grid show poor numerical and parallel scal… ▽ More Realistic physical phenomena exhibit random fluctuations across many scales in the input and output processes. Models of these phenomena require stochastic PDEs. For three-dimensional coupled (vector-valued) stochastic PDEs (SPDEs), for instance, arising in linear elasticity, the existing two-level domain decomposition solvers with the vertex-based coarse grid show poor numerical and parallel scalabilities. Therefore, new algorithms with a better resolved coarse grid are needed. The probabilistic wirebasket-based coarse grid for a two-level solver is devised in three dimensions. This enriched coarse grid provides an efficient mechanism for global error propagation and thus improves the convergence. This development enhances the scalability of the two-level solver in handling stochastic PDEs in three dimensions. Numerical and parallel scalabilities of this algorithm are studied using MPI and PETSc libraries on high-performance computing (HPC) systems. Implementational challenges of the intrusive spectral stochastic finite element methods (SSFEM) are addressed by coupling domain decomposition solvers with FEniCS general purpose finite element package. This work generalizes the applications of intrusive SSFEM to tackle a variety of stochastic PDEs and emphasize the usefulness of the domain decomposition-based solvers and HPC for uncertainty quantification. △ Less

Submitted 22 August, 2022; originally announced August 2022.

arXiv:2208.08079 [pdf, other]

doi 10.1209/0295-5075/ac8d71

Hawking radiation from acoustic black holes in hydrodynamic flow of electrons

Authors: Shreyansh S. Dave, Oindrila Ganguly, Saumia P. S., Ajit M. Srivastava

Abstract: Acoustic black holes are formed when a fluid flowing with subsonic velocities, accelerates and becomes supersonic. When the flow is directed from the subsonic to supersonic region, the surface on which the normal component of fluid velocity equals the local speed of sound acts as an acoustic horizon. This is because no acoustic perturbation from the supersonic region can cross it to reach the subs… ▽ More Acoustic black holes are formed when a fluid flowing with subsonic velocities, accelerates and becomes supersonic. When the flow is directed from the subsonic to supersonic region, the surface on which the normal component of fluid velocity equals the local speed of sound acts as an acoustic horizon. This is because no acoustic perturbation from the supersonic region can cross it to reach the subsonic part of the fluid. One can show that if the fluid velocity is locally irrotational, the field equations for acoustic perturbations of the velocity potential are identical to that of a massless scalar field propagating in a black hole background. One, therefore, expects Hawking radiation in the form of a thermal spectrum of phonons. There have been numerous investigations of this possibility, theoretically, as well as experimentally, in systems ranging from cold atom systems to quark-gluon plasma formed in relativistic heavy-ion collisions. Here we investigate this possibility in the hydrodynamic flow of electrons. Resulting Hawking radiation in this case should be observable in terms of current fluctuations. Further, current fluctuations on both sides of the acoustic horizon should show correlations expected for pairs of Hawking particles. △ Less

Submitted 17 August, 2022; originally announced August 2022.

Comments: 7 pages, 2 figures

Journal ref: Europhysics Letters 139, 60003 (2022)

Showing 101–150 of 533 results for author: Ajit