Search | arXiv e-print repository

doi 10.1073/pnas.2320608121

Congruence modules in higher codimension and zeta lines in Galois cohomology

Authors: Srikanth B. Iyengar, Chandrashekhar B. Khare, Jeffrey Manning, Eric Urban

Abstract: This work builds on earlier work of the first three authors where a notion of congruence modules in higher codimension is introduced. The main new results are a criterion for detecting regularity of local rings in terms of congruence modules, and a more refined version of a result tracking the change of congruence modules under deformation is proved. Number theoretic applications include the const… ▽ More This work builds on earlier work of the first three authors where a notion of congruence modules in higher codimension is introduced. The main new results are a criterion for detecting regularity of local rings in terms of congruence modules, and a more refined version of a result tracking the change of congruence modules under deformation is proved. Number theoretic applications include the construction of canonical lines in certain Galois cohomology groups arising from adjoint motives of Hilbert modular forms. △ Less

Submitted 21 November, 2023; originally announced November 2023.

Comments: 22 pages

MSC Class: 11F80 (primary); 11F33; 13D02 (secondary)

arXiv:2311.12779 [pdf, other]

Finding Adversarial Inputs for Heuristics using Multi-level Optimization

Authors: Pooria Namyar, Behnaz Arzani, Ryan Beckett, Santiago Segarra, Himanshu Raj, Umesh Krishnaswamy, Ramesh Govindan, Srikanth Kandula

Abstract: Production systems use heuristics because they are faster or scale better than their optimal counterparts. Yet, practitioners are often unaware of the performance gap between a heuristic and the optimum or between two heuristics in realistic scenarios. We present MetaOpt, a system that helps analyze heuristics. Users specify the heuristic and the optimal (or another heuristic) as input, and MetaOp… ▽ More Production systems use heuristics because they are faster or scale better than their optimal counterparts. Yet, practitioners are often unaware of the performance gap between a heuristic and the optimum or between two heuristics in realistic scenarios. We present MetaOpt, a system that helps analyze heuristics. Users specify the heuristic and the optimal (or another heuristic) as input, and MetaOpt automatically encodes these efficiently for a solver to find performance gaps and their corresponding adversarial inputs. Its suite of built-in optimizations helps it scale its analysis to practical problem sizes. To show it is versatile, we used MetaOpt to analyze heuristics from three domains (traffic engineering, vector bin packing, and packet scheduling). We found a production traffic engineering heuristic can require 30% more capacity than the optimal to satisfy realistic demands. Based on the patterns in the adversarial inputs MetaOpt produced, we modified the heuristic to reduce its performance gap by 12.5$\times$. We examined adversarial inputs to a vector bin packing heuristic and proved a new lower bound on its performance. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.11141 [pdf]

doi 10.1016/j.vacuum.2022.111461

Effect of laser irradiation on the tribological properties of RFsputtered nickel oxide (NiO) thin films

Authors: Srikanth Itapu, Vamsi Borra, Frank X. Li, Pedro Cortes, Mohit Hemanth Kumar

Abstract: The present work aims at investigating the effect of laser irradiation on the tribological properties of RF-sputtered NiO thin films deposited on industrial grade aluminum substrate. A semiconductor laser based on Nd:YAG operating at its 4th harmonic wavelength, lambda = 266 nm with varying laser fluence and spot size of about 5 um is irradiated on the NiO film. The localized heating allows for sm… ▽ More The present work aims at investigating the effect of laser irradiation on the tribological properties of RF-sputtered NiO thin films deposited on industrial grade aluminum substrate. A semiconductor laser based on Nd:YAG operating at its 4th harmonic wavelength, lambda = 266 nm with varying laser fluence and spot size of about 5 um is irradiated on the NiO film. The localized heating allows for smoothening of the NiO film along with contributions to the changes in the stoichiometry of NiO (reduction of excess oxygen). In particular, the effects of tuning laser fluence and the subsequent tribology tests pertaining to the coefficient of friction variations for tribological tests are discussed. △ Less

Submitted 18 November, 2023; originally announced November 2023.

Journal ref: Vacuum 205 (2022): 111461

arXiv:2311.10660 [pdf]

doi 10.1109/TNANO.2020.2998955

Laser-Based Fabrication of Microstructures on Nickel Thin Films and Its Applications in On-Chip Thin Film Inductors

Authors: Srikanth Itapu, Vamsi Borra, Daniel G. Georgiev

Abstract: This work reports on the fabrication of microbump structures on Ni films by single-pulse, localized laser irradiation. Conditions for the reproducible formation of such microstructures have been identified in terms of laser-irradiation and film parameters after systematic studies involving a relevant parameter space. The cracks and voids morphology of the sputtered films was rendered undesirable a… ▽ More This work reports on the fabrication of microbump structures on Ni films by single-pulse, localized laser irradiation. Conditions for the reproducible formation of such microstructures have been identified in terms of laser-irradiation and film parameters after systematic studies involving a relevant parameter space. The cracks and voids morphology of the sputtered films was rendered undesirable and hence, smoother Ni thin film of same thickness (200nm) were deposited by vacuum evaporation. The continuous nature of the film resulted in radially symmetric thermal expansion and deformation, thus achieving a high yield of microstructures. An improvement in the inductance and the quality factor of on-chip spiral inductors incorporating such laser-microstructured ferromagnetic nickel thin films was observed, which demonstrates the potential of such a laser-based method for fabrication or fine tuning of various micro-/nanoelectric/electronic sensor and other components and systems. △ Less

Submitted 17 November, 2023; originally announced November 2023.

Comments: 15 pages, 10 figures

Journal ref: in IEEE Transactions on Nanotechnology, vol. 19, pp. 455-460, 2020

arXiv:2311.09542 [pdf, other]

Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering

Authors: Neha Srikanth, Rupak Sarkar, Heran Mane, Elizabeth M. Aparicio, Quynh C. Nguyen, Rachel Rudinger, Jordan Boyd-Graber

Abstract: Questions posed by information-seeking users often contain implicit false or potentially harmful assumptions. In a high-risk domain such as maternal and infant health, a question-answering system must recognize these pragmatic constraints and go beyond simply answering user questions, examining them in context to respond helpfully. To achieve this, we study assumptions and implications, or pragmat… ▽ More Questions posed by information-seeking users often contain implicit false or potentially harmful assumptions. In a high-risk domain such as maternal and infant health, a question-answering system must recognize these pragmatic constraints and go beyond simply answering user questions, examining them in context to respond helpfully. To achieve this, we study assumptions and implications, or pragmatic inferences, made when mothers ask questions about pregnancy and infant care by collecting a dataset of 2,727 inferences from 500 questions across three diverse sources. We study how health experts naturally address these inferences when writing answers, and illustrate that informing existing QA pipelines with pragmatic inferences produces responses that are more complete, mitigating the propagation of harmful beliefs. △ Less

Submitted 2 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

Comments: Accepted to NAACL 2024

arXiv:2311.08402 [pdf, other]

Retrieve and Copy: Scaling ASR Personalization to Large Catalogs

Authors: Sai Muralidhar Jayanthi, Devang Kulshreshtha, Saket Dingliwal, Srikanth Ronanki, Sravan Bodapati

Abstract: Personalization of automatic speech recognition (ASR) models is a widely studied topic because of its many practical applications. Most recently, attention-based contextual biasing techniques are used to improve the recognition of rare words and domain specific entities. However, due to performance constraints, the biasing is often limited to a few thousand entities, restricting real-world usabili… ▽ More Personalization of automatic speech recognition (ASR) models is a widely studied topic because of its many practical applications. Most recently, attention-based contextual biasing techniques are used to improve the recognition of rare words and domain specific entities. However, due to performance constraints, the biasing is often limited to a few thousand entities, restricting real-world usability. To address this, we first propose a "Retrieve and Copy" mechanism to improve latency while retaining the accuracy even when scaled to a large catalog. We also propose a training strategy to overcome the degradation in recall at such scale due to an increased number of confusing entities. Overall, our approach achieves up to 6% more Word Error Rate reduction (WERR) and 3.6% absolute improvement in F1 when compared to a strong baseline. Our method also allows for large catalog sizes of up to 20K without significantly affecting WER and F1-scores, while achieving at least 20% inference speedup per acoustic frame. △ Less

Submitted 14 November, 2023; originally announced November 2023.

Comments: EMNLP 2023

arXiv:2311.02482 [pdf, other]

Generalized zero-shot audio-to-intent classification

Authors: Veera Raghavendra Elluru, Devang Kulshreshtha, Rohit Paturi, Sravan Bodapati, Srikanth Ronanki

Abstract: Spoken language understanding systems using audio-only data are gaining popularity, yet their ability to handle unseen intents remains limited. In this study, we propose a generalized zero-shot audio-to-intent classification framework with only a few sample text sentences per intent. To achieve this, we first train a supervised audio-to-intent classifier by making use of a self-supervised pre-trai… ▽ More Spoken language understanding systems using audio-only data are gaining popularity, yet their ability to handle unseen intents remains limited. In this study, we propose a generalized zero-shot audio-to-intent classification framework with only a few sample text sentences per intent. To achieve this, we first train a supervised audio-to-intent classifier by making use of a self-supervised pre-trained model. We then leverage a neural audio synthesizer to create audio embeddings for sample text utterances and perform generalized zero-shot classification on unseen intents using cosine similarity. We also propose a multimodal training strategy that incorporates lexical information into the audio representation to improve zero-shot performance. Our multimodal training approach improves the accuracy of zero-shot intent classification on unseen intents of SLURP by 2.75% and 18.2% for the SLURP and internal goal-oriented dialog datasets, respectively, compared to audio-only training. △ Less

Submitted 4 November, 2023; originally announced November 2023.

arXiv:2311.00720 [pdf]

doi 10.1109/SeFeT57834.2023.10245108

Harmonic content analysis of a soft starting variable frequency motor drive based on FPGA

Authors: Yogesh Sapkota, Suman Devkota, Vamsi Borra, Pedro Cortes, Frank Li, Srikanth Itapu

Abstract: As the demands for electric vehicles, electric aircrafts, unmanned aircraft systems, and other motor-driven systems increase, high-performance motor drives employing variable frequency control with higher efficiency and reliability are becoming increasingly important parts of the ever-changing technological landscape. This study proposes a Field Programmable Gate Array (FPGA)-based variable freque… ▽ More As the demands for electric vehicles, electric aircrafts, unmanned aircraft systems, and other motor-driven systems increase, high-performance motor drives employing variable frequency control with higher efficiency and reliability are becoming increasingly important parts of the ever-changing technological landscape. This study proposes a Field Programmable Gate Array (FPGA)-based variable frequency soft-starting motor drive for a three-phase induction motor. The inverter output voltage and the load currents are analyzed for the harmonic contents using MATLAB. In the experimental realization, a four-pole squirrel cage delta-connected induction motor is utilized with a switching frequency of 4 kHz. The current and voltage characteristics of the induction motor are studied under different operating conditions to study harmonic contents and the effect of changing soft-start duration. The findings demonstrate a low-cost, flexible control of the induction motor with improved harmonic performance. △ Less

Submitted 28 October, 2023; originally announced November 2023.

Comments: 10 figures

Journal ref: Sapkota, Yogesh, et al. "Harmonic content analysis of a soft starting variable frequency motor drive based on FPGA." 2023 IEEE 3rd International Conference on Sustainable Energy and Future Electric Transportation (SEFET). IEEE, 2023

arXiv:2310.19827 [pdf]

doi 10.1088/1361-6463/aa8ec7

Sn whisker growth mitigation by using NiO sublayers

Authors: Vamsi Borra, Srikanth Itapu, Daniel G. Georgiev

Abstract: The potential of NiO sublayers for whisker growth mitigation has been examined. A thin NiO film was applied on a Cu-coated substrate before the deposition of a thicker Sn layer. The growth of Sn whiskers was then followed by optical and scanning electron microscopy and was compared with the whisker growth on a control sample without the NiO sublayer. No whiskers were observed on the sample with th… ▽ More The potential of NiO sublayers for whisker growth mitigation has been examined. A thin NiO film was applied on a Cu-coated substrate before the deposition of a thicker Sn layer. The growth of Sn whiskers was then followed by optical and scanning electron microscopy and was compared with the whisker growth on a control sample without the NiO sublayer. No whiskers were observed on the sample with the NiO layer even after 12 months, whereas the control sample developed whiskers of size and density that is generally expected, based on the vast amount of published work on the topic. The mechanisms of whisker growth and whisker growth suppression are briefly discussed as well. △ Less

Submitted 27 October, 2023; originally announced October 2023.

Comments: 7 figures

Journal ref: Journal of Physics D: Applied Physics, 50(47), 475309 (2017)

arXiv:2310.18495 [pdf]

doi 10.1016/j.scriptamat.2021.114357

Modification of Tin (Sn) Metal Surfaces by Surface Plasmon Polariton Excitation

Authors: Vamsi Borra, Srikanth Itapu, Victor G. Karpov, Daniel G. Georgiev

Abstract: We report on the modification of tin (Sn) film surfaces under a laser beam irradiation that triggers surface plasmon polariton (SPP) excitations. The observed surface features in the form of small raised grains, with well-defined rooting, look similar to tin whisker nodules. We attribute the appearance of those features to the field-induced nucleation caused by the SPP related strong electric fiel… ▽ More We report on the modification of tin (Sn) film surfaces under a laser beam irradiation that triggers surface plasmon polariton (SPP) excitations. The observed surface features in the form of small raised grains, with well-defined rooting, look similar to tin whisker nodules. We attribute the appearance of those features to the field-induced nucleation caused by the SPP related strong electric field. Possible implications of our findings include accelerated-life testing for tin whisker growth-related reliability as well as applications to nanoparticle nucleation. △ Less

Submitted 27 October, 2023; originally announced October 2023.

Comments: 5 figures

Journal ref: Scripta Materialia, 208, 114357 (2022)

arXiv:2310.17620 [pdf, other]

Radar-Only Off-Road Local Navigation

Authors: Timothy Overbye, Srikanth Saripalli

Abstract: Off-road robotics have traditionally utilized lidar for local navigation due to its accuracy and high resolution. However, the limitations of lidar, such as reduced performance in harsh environmental conditions and limited range, have prompted the exploration of alternative sensing technologies. This paper investigates the potential of radar for off-road local navigation, as it offers the advantag… ▽ More Off-road robotics have traditionally utilized lidar for local navigation due to its accuracy and high resolution. However, the limitations of lidar, such as reduced performance in harsh environmental conditions and limited range, have prompted the exploration of alternative sensing technologies. This paper investigates the potential of radar for off-road local navigation, as it offers the advantages of a longer range and the ability to penetrate dust and light vegetation. We adapt existing lidar-based methods for radar and evaluate the performance in comparison to lidar under various off-road conditions. We show that radar can provide a significant range advantage over lidar while maintaining accuracy for both ground plane estimation and obstacle detection. And finally, we demonstrate successful autonomous navigation at a speed of 2.5 m/s over a path length of 350 m using only radar for ground plane estimation and obstacle detection. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: 7 pages, 17 figures, ITSC 2023

arXiv:2310.13551 [pdf, other]

ROSS: Radar Off-road Semantic Segmentation

Authors: Peng Jiang, Srikanth Saripalli

Abstract: As the demand for autonomous navigation in off-road environments increases, the need for effective solutions to understand these surroundings becomes essential. In this study, we confront the inherent complexities of semantic segmentation in RADAR data for off-road scenarios. We present a novel pipeline that utilizes LIDAR data and an existing annotated off-road LIDAR dataset for generating RADAR… ▽ More As the demand for autonomous navigation in off-road environments increases, the need for effective solutions to understand these surroundings becomes essential. In this study, we confront the inherent complexities of semantic segmentation in RADAR data for off-road scenarios. We present a novel pipeline that utilizes LIDAR data and an existing annotated off-road LIDAR dataset for generating RADAR labels, in which the RADAR data are represented as images. Validated with real-world datasets, our pragmatic approach underscores the potential of RADAR technology for navigation applications in off-road environments. △ Less

Submitted 20 October, 2023; originally announced October 2023.

Comments: 10 pages, 6 figures, accepted by the 18th International Symposium on Experimental Robotics (ISER 2023)

arXiv:2310.12411 [pdf, other]

doi 10.1109/SDF-MFI59545.2023.10361310

Online Multi-IMU Calibration Using Visual-Inertial Odometry

Authors: Jacob Hartzer, Srikanth Saripalli

Abstract: This work presents a centralized multi-IMU filter framework with online intrinsic and extrinsic calibration for unsynchronized inertial measurement units that is robust against changes in calibration parameters. The novel EKF-based method estimates the positional and rotational offsets of the system of sensors as well as their intrinsic biases without the use of rigid body geometric constraints. A… ▽ More This work presents a centralized multi-IMU filter framework with online intrinsic and extrinsic calibration for unsynchronized inertial measurement units that is robust against changes in calibration parameters. The novel EKF-based method estimates the positional and rotational offsets of the system of sensors as well as their intrinsic biases without the use of rigid body geometric constraints. Additionally, the filter is flexible in the total number of sensors used while leveraging the commonly used MSCKF framework for camera measurements. The filter framework has been validated using Monte Carlo simulation as well as experimentally. In both simulations and experiments, using multiple IMU measurement streams within the proposed filter framework outperforms the use of a single IMU in a filter prediction step while also producing consistent and accurate estimates of initial calibration errors. Compared to current state-of-the-art optimizers, the filter produces similar intrinsic and extrinsic calibration parameters for each sensor. Finally, an open source repository has been provided at https://github.com/unmannedlab/ekf-cal containing both the online estimator and the simulation used for testing and evaluation. △ Less

Submitted 3 January, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

arXiv:2310.11308 [pdf, other]

doi 10.1103/PhysRevA.109.032435

Protocols for counterfactual and twin-field quantum digital signature

Authors: Vinod N. Rao, Shrikant Utagi, Anirban Pathak, R. Srikanth

Abstract: Quantum digital signature (QDS) is the quantum version of its classical counterpart, and can offer security against attacks of repudiation, signature forging and external eavesdropping, on the basis of quantum mechanical no-go principles. Here we propose a QDS scheme based on quantum counterfactuality, which leverages the concept of interaction-free measurement. Employing the idea behind twin-fiel… ▽ More Quantum digital signature (QDS) is the quantum version of its classical counterpart, and can offer security against attacks of repudiation, signature forging and external eavesdropping, on the basis of quantum mechanical no-go principles. Here we propose a QDS scheme based on quantum counterfactuality, which leverages the concept of interaction-free measurement. Employing the idea behind twin-field cryptography, we show how this two-way protocol can be turned into an equivalent non-counterfactual, one-way protocol, that is both more practical and also theoretically helpful in assessing the experimental feasibility of the first protocol. The proposed QDS protocol can be experimentally implemented with current quantum technology. △ Less

Submitted 19 June, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: 11 pages, 4 figures

Journal ref: Phys. Rev. A 109, 032435 (2024)

arXiv:2310.09699 [pdf, other]

Solving Max-Min Fair Resource Allocations Quickly on Large Graphs

Authors: Pooria Namyar, Behnaz Arzani, Srikanth Kandula, Santiago Segarra, Daniel Crankshaw, Umesh Krishnaswamy, Ramesh Govindan, Himanshu Raj

Abstract: We consider the max-min fair resource allocation problem. The best-known solutions use either a sequence of optimizations or waterfilling, which only applies to a narrow set of cases. These solutions have become a practical bottleneck in WAN traffic engineering and cluster scheduling, especially at larger problem sizes. We improve both approaches: (1) we show how to convert the optimization sequen… ▽ More We consider the max-min fair resource allocation problem. The best-known solutions use either a sequence of optimizations or waterfilling, which only applies to a narrow set of cases. These solutions have become a practical bottleneck in WAN traffic engineering and cluster scheduling, especially at larger problem sizes. We improve both approaches: (1) we show how to convert the optimization sequence into a single fast optimization, and (2) we generalize waterfilling to the multi-path case. We empirically show our new algorithms Pareto-dominate prior techniques: they produce faster, fairer, and more efficient allocations. Some of our allocators also have theoretical guarantees: they trade off a bounded amount of unfairness for faster allocation. We have deployed our allocators in Azure's WAN traffic engineering pipeline, where we preserve solution quality and achieve a roughly $3\times$ speedup. △ Less

Submitted 14 October, 2023; originally announced October 2023.

Comments: Accepted to USENIX NSDI 2024

arXiv:2310.01104 [pdf, other]

Multi-period static hedging of European options

Authors: Purba Banerjee, Srikanth Iyer, Shashi Jain

Abstract: We consider the hedging of European options when the price of the underlying asset follows a single-factor Markovian framework. By working in such a setting, Carr and Wu \cite{carr2014static} derived a spanning relation between a given option and a continuum of shorter-term options written on the same asset. In this paper, we have extended their approach to simultaneously include options over mult… ▽ More We consider the hedging of European options when the price of the underlying asset follows a single-factor Markovian framework. By working in such a setting, Carr and Wu \cite{carr2014static} derived a spanning relation between a given option and a continuum of shorter-term options written on the same asset. In this paper, we have extended their approach to simultaneously include options over multiple short maturities. We then show a practical implementation of this with a finite set of shorter-term options to determine the hedging error using a Gaussian Quadrature method. We perform a wide range of experiments for both the \textit{Black-Scholes} and \textit{Merton Jump Diffusion} models, illustrating the comparative performance of the two methods. △ Less

Submitted 18 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: 32 pages, 7 figures, 4 sub-figures

arXiv:2309.16057 [pdf, other]

WiDEVIEW: An UltraWideBand and Vision Dataset for Deciphering Pedestrian-Vehicle Interactions

Authors: Jia Huang, Alvika Gautam, Junghun Choi, Srikanth Saripalli

Abstract: Robust and accurate tracking and localization of road users like pedestrians and cyclists is crucial to ensure safe and effective navigation of Autonomous Vehicles (AVs), particularly so in urban driving scenarios with complex vehicle-pedestrian interactions. Existing datasets that are useful to investigate vehicle-pedestrian interactions are mostly image-centric and thus vulnerable to vision fail… ▽ More Robust and accurate tracking and localization of road users like pedestrians and cyclists is crucial to ensure safe and effective navigation of Autonomous Vehicles (AVs), particularly so in urban driving scenarios with complex vehicle-pedestrian interactions. Existing datasets that are useful to investigate vehicle-pedestrian interactions are mostly image-centric and thus vulnerable to vision failures. In this paper, we investigate Ultra-wideband (UWB) as an additional modality for road users' localization to enable a better understanding of vehicle-pedestrian interactions. We present WiDEVIEW, the first multimodal dataset that integrates LiDAR, three RGB cameras, GPS/IMU, and UWB sensors for capturing vehicle-pedestrian interactions in an urban autonomous driving scenario. Ground truth image annotations are provided in the form of 2D bounding boxes and the dataset is evaluated on standard 2D object detection and tracking algorithms. The feasibility of UWB is evaluated for typical traffic scenarios in both line-of-sight and non-line-of-sight conditions using LiDAR as ground truth. We establish that UWB range data has comparable accuracy with LiDAR with an error of 0.19 meters and reliable anchor-tag range data for up to 40 meters in line-of-sight conditions. UWB performance for non-line-of-sight conditions is subjective to the nature of the obstruction (trees vs. buildings). Further, we provide a qualitative analysis of UWB performance for scenarios susceptible to intermittent vision failures. The dataset can be downloaded via https://github.com/unmannedlab/UWB_Dataset. △ Less

Submitted 27 September, 2023; originally announced September 2023.

arXiv:2309.14578 [pdf, other]

Solvent Effects on Extractant Conformational Energetics in Liquid-Liquid Extraction: A Simulation Study of Molecular Solvents and Ionic Liquids

Authors: Xiaoyu Wang, Srikanth Nayak, Richard E. Wilson, L. Soderholm, Michael J. Servis

Abstract: Extractant design in liquid-liquid extraction (LLE) is a research frontier of metal ion separations that typically focuses on the direct extractant-metal interactions. However, a more detailed understanding of energetic drivers of separations beyond primary metal coordination is often lacking, including the role of solvent in the extractant phase. In this work, we propose a new mechanism for enhan… ▽ More Extractant design in liquid-liquid extraction (LLE) is a research frontier of metal ion separations that typically focuses on the direct extractant-metal interactions. However, a more detailed understanding of energetic drivers of separations beyond primary metal coordination is often lacking, including the role of solvent in the extractant phase. In this work, we propose a new mechanism for enhancing metal-complexant energetics with nanostructured solvents. Using molecular dynamics simulations with umbrella sampling, we find that the organic solvent can reshape the energetics of the extractant's intramolecular conformational landscape. We calculate free energy profiles of different conformations of a representative bidentate extractant, n-octyl(phenyl)-N,N-diisobutyl carbamoyl methyl phosphinoxide (CMPO), in four different solvents: dodecane, tributyl phosphate (TBP), and dry and wet ionic liquid (IL) 1-ethyl-3-methylimidazolium bis(trifluoromethylsulfonyl)imide ([EMIM][Tf_2N]). By promoting reorganization of the extractant molecule into its binding conformation, our findings reveal how particular solvents can ameliorate this unfavorable step of the metal separation process. In particular, the charge alternating nanodomains formed in ILs substantially reduce the free energy penalty associated with extractant reorganization. Importantly, using alchemical free energy calculations, we find that this stabilization persists even when we explicitly include the extracted cation. These findings provide insight into the energic drivers of metal ion separations and potentially suggest a new approach to designing effective separations using a molecular-level understanding of solvent effects. △ Less

Submitted 13 December, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

arXiv:2309.11801 [pdf, ps, other]

Bifractal behavior of Solar supergranulation and magnetic activity

Authors: G. Rajani, G. M. Sowmya, U. Paniveni, R. Srikanth

Abstract: We study the complexity and scale of the supergranular network across the 23rd solar cycle, using the Ca II K digitized intensitygrams from the Kodaikanal Solar Observatory (KSO). Enhancing our previous data and refining our data analysis, we study supergranular fractal dimension as a function of cell size. We find that across the cycle phases, the cells show a bifractal behavior, with approximate… ▽ More We study the complexity and scale of the supergranular network across the 23rd solar cycle, using the Ca II K digitized intensitygrams from the Kodaikanal Solar Observatory (KSO). Enhancing our previous data and refining our data analysis, we study supergranular fractal dimension as a function of cell size. We find that across the cycle phases, the cells show a bifractal behavior, with approximately half the larger cells in the studied scale range showing a slightly greater fractal dimension than the smaller cells. We also study the discrepancy between supergranular scale as determined by direct inspection methods (around 17 Mm) and autocorrelation (around 30 Mm), and attribute this to a preferential selection of well defined cells in the former case. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: 10 pages, 2 figures; Accepted for publication in Acta Astron

arXiv:2309.11787 [pdf, other]

Dependence of Solar supergranular lifetime on surface magnetic activity and rotation

Authors: Sowmya G. M., Rajani G., U. Paniveni, R. Srikanth

Abstract: The lifetimes and length-scales for supergranular cells in active and quiescent regions of the Solar chromosphere, and the relation between the two, were studied using a time series of Ca II K filtergrams. The lifetimes, in contrast to supergranular length scale and fractal dimension, show no significant dependence on Solar latitude, suggesting that cell lifetimes are independent of the differenti… ▽ More The lifetimes and length-scales for supergranular cells in active and quiescent regions of the Solar chromosphere, and the relation between the two, were studied using a time series of Ca II K filtergrams. The lifetimes, in contrast to supergranular length scale and fractal dimension, show no significant dependence on Solar latitude, suggesting that cell lifetimes are independent of the differential rotation and a possible supergranular super-rotation. The functional form of the relation was obtained guided by a comparison of the distributions of the two supergranular parameters. We infer a linear dependence of cell lifetime on area, which can be understood by the assumption of the network's evolution via a diffusion of the magnetic field. Our analysis suggests that the diffusion rate in quiet regions is about 10% greater than in active regions. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: 15 pages, 10 figures; Accepted for publication in the Astrophysical Bulletin

arXiv:2309.11414 [pdf, other]

doi 10.1109/ICRA57147.2024.10610519

EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning

Authors: Kallol Saha, Vishal Mandadi, Jayaram Reddy, Ajit Srikanth, Aditya Agarwal, Bipasha Sen, Arun Singh, Madhava Krishna

Abstract: Classical motion planning for robotic manipulation includes a set of general algorithms that aim to minimize a scene-specific cost of executing a given plan. This approach offers remarkable adaptability, as they can be directly used off-the-shelf for any new scene without needing specific training datasets. However, without a prior understanding of what diverse valid trajectories are and without s… ▽ More Classical motion planning for robotic manipulation includes a set of general algorithms that aim to minimize a scene-specific cost of executing a given plan. This approach offers remarkable adaptability, as they can be directly used off-the-shelf for any new scene without needing specific training datasets. However, without a prior understanding of what diverse valid trajectories are and without specially designed cost functions for a given scene, the overall solutions tend to have low success rates. While deep-learning-based algorithms tremendously improve success rates, they are much harder to adopt without specialized training datasets. We propose EDMP, an Ensemble-of-costs-guided Diffusion for Motion Planning that aims to combine the strengths of classical and deep-learning-based motion planning. Our diffusion-based network is trained on a set of diverse kinematically valid trajectories. Like classical planning, for any new scene at the time of inference, we compute scene-specific costs such as "collision cost" and guide the diffusion to generate valid trajectories that satisfy the scene-specific constraints. Further, instead of a single cost function that may be insufficient in capturing diversity across scenes, we use an ensemble of costs to guide the diffusion process, significantly improving the success rate compared to classical planners. EDMP performs comparably with SOTA deep-learning-based methods while retaining the generalization capabilities primarily associated with classical planners. △ Less

Submitted 20 September, 2023; originally announced September 2023.

Comments: 8 pages, 8 figures, submitted to ICRA 2024 (International Conference on Robotics and Automation)

Journal ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)

arXiv:2309.10047 [pdf, other]

A Modular Spatial Clustering Algorithm with Noise Specification

Authors: Akhil K, Srikanth H R

Abstract: Clustering techniques have been the key drivers of data mining, machine learning and pattern recognition for decades. One of the most popular clustering algorithms is DBSCAN due to its high accuracy and noise tolerance. Many superior algorithms such as DBSCAN have input parameters that are hard to estimate. Therefore, finding those parameters is a time consuming process. In this paper, we propose… ▽ More Clustering techniques have been the key drivers of data mining, machine learning and pattern recognition for decades. One of the most popular clustering algorithms is DBSCAN due to its high accuracy and noise tolerance. Many superior algorithms such as DBSCAN have input parameters that are hard to estimate. Therefore, finding those parameters is a time consuming process. In this paper, we propose a novel clustering algorithm Bacteria-Farm, which balances the performance and ease of finding the optimal parameters for clustering. Bacteria- Farm algorithm is inspired by the growth of bacteria in closed experimental farms - their ability to consume food and grow - which closely represents the ideal cluster growth desired in clustering algorithms. In addition, the algorithm features a modular design to allow the creation of versions of the algorithm for specific tasks / distributions of data. In contrast with other clustering algorithms, our algorithm also has a provision to specify the amount of noise to be excluded during clustering. △ Less

Submitted 18 September, 2023; originally announced September 2023.

Comments: Presented at International Conference for Machine Learning and Data Science 2018

arXiv:2309.08703 [pdf, ps, other]

The discrepancy of greater-than

Authors: Srikanth Srinivasan, Amir Yehudayoff

Abstract: The discrepancy of the $n \times n$ greater-than matrix is shown to be $\fracπ{2 \ln n}$ up to lower order terms. The discrepancy of the $n \times n$ greater-than matrix is shown to be $\fracπ{2 \ln n}$ up to lower order terms. △ Less

Submitted 15 September, 2023; originally announced September 2023.

arXiv:2308.16047 [pdf, other]

Mathematical Modeling of Blood Flow for a Diseased Model with Therapeutic Nanoparticles

Authors: Surabhi Rathore, Dasari Srikanth

Abstract: The use of nanoparticles for targeted drug delivery, especially in diseased arteries, is a novel procedure. We are incorporating nanoparticles into blood vessels using a catheter, which could potentially deliver drugs precisely to affected areas, reducing side effects and increasing treatment efficiency. Considering non-Newtonian fluid modeling because blood is a complex fluid with non-linear beha… ▽ More The use of nanoparticles for targeted drug delivery, especially in diseased arteries, is a novel procedure. We are incorporating nanoparticles into blood vessels using a catheter, which could potentially deliver drugs precisely to affected areas, reducing side effects and increasing treatment efficiency. Considering non-Newtonian fluid modeling because blood is a complex fluid with non-linear behavior. In this paper, we are using mathematical modeling to understand blood flow dynamics, temperature, and concentration dispersion, which can provide valuable insights into the behavior of therapeutic nanoparticles in the bloodstream. The perturbation method is used to solve the complex mathematical model with permeable flow boundary conditions. We are investigating flow field characteristics including wall shear stress, pressure, and impedance to understand how nanoparticles disperse and interact with different physiological aspects. In conclusion, the proposed study focuses on the use of nanotechnology and mathematical modeling to understand the effects of therapeutic nanoparticles in diseased arteries, which is an important and valuable contribution to the medical field. △ Less

Submitted 30 August, 2023; originally announced August 2023.

arXiv:2308.14670 [pdf, other]

Symmetric Models for Visual Force Policy Learning

Authors: Colin Kohler, Anuj Shrivatsav Srikanth, Eshan Arora, Robert Platt

Abstract: While it is generally acknowledged that force feedback is beneficial to robotic control, applications of policy learning to robotic manipulation typically only leverage visual feedback. Recently, symmetric neural models have been used to significantly improve the sample efficiency and performance of policy learning across a variety of robotic manipulation domains. This paper explores an applicatio… ▽ More While it is generally acknowledged that force feedback is beneficial to robotic control, applications of policy learning to robotic manipulation typically only leverage visual feedback. Recently, symmetric neural models have been used to significantly improve the sample efficiency and performance of policy learning across a variety of robotic manipulation domains. This paper explores an application of symmetric policy learning to visual-force problems. We present Symmetric Visual Force Learning (SVFL), a novel method for robotic control which leverages visual and force feedback. We demonstrate that SVFL can significantly outperform state of the art baselines for visual force learning and report several interesting empirical findings related to the utility of learning force feedback control policies in both general manipulation tasks and scenarios with low visual acuity. △ Less

Submitted 28 August, 2023; originally announced August 2023.

arXiv:2308.09843 [pdf]

Large thermo-spin effects in Heusler alloy based spin-gapless semiconductor thin films

Authors: Amit Chanda, Deepika Rani, Derick DeTellem, Noha Alzahrani, Dario A. Arena, Sarath Witanachchi, Ratnamala Chatterjee, Manh-Huong Phan, Hariharan Srikanth

Abstract: Recently, Heusler alloys-based spin gapless semiconductors (SGSs) with high Curie temperature (TC) and sizeable spin polarization have emerged as potential candidates for tunable spintronic applications. We report comprehensive investigation of the temperature dependent ANE and intrinsic longitudinal spin Seebeck effect (LSSE) in CoFeCrGa thin films grown on MgO substrates. Our findings show the a… ▽ More Recently, Heusler alloys-based spin gapless semiconductors (SGSs) with high Curie temperature (TC) and sizeable spin polarization have emerged as potential candidates for tunable spintronic applications. We report comprehensive investigation of the temperature dependent ANE and intrinsic longitudinal spin Seebeck effect (LSSE) in CoFeCrGa thin films grown on MgO substrates. Our findings show the anomalous Nernst coefficient for the MgO/CoFeCrGa (95 nm) film is $\cong 1.86$ micro V/K at room temperature which is nearly two orders of magnitude higher than that of the bulk polycrystalline sample of CoFeCrGa (= 0.018 micro V/K) but comparable to that of the magnetic Weyl semimetal Co2MnGa thin film (2-3 micro V/K). Furthermore, the LSSE coefficient for our MgO/CoFeCrGa(95nm)/Pt(5nm) heterostructure is $\cong 20.5$ $μ$V/K/$Ω$ at room temperature which is twice larger than that of the half-metallic ferromagnetic La$_{0.7}$Sr$_{0.3}$MnO$_3$ thin films ($\cong$ 20.5 $μ$V/K/$Ω$). We show that both ANE and LSSE coefficients follow identical temperature dependences and exhibit a maximum at $\cong$ 225 K which is understood as the combined effects of inelastic magnon scatterings and reduced magnon population at low temperatures. Our analyses not only indicate that the extrinsic skew scattering is the dominating mechanism for ANE in these films but also provide critical insights into the functional form of the observed temperature dependent LSSE at low temperatures. Furthermore, by employing radio frequency transverse susceptibility and broadband ferromagnetic resonance in combination with the LSSE measurements, we establish a correlation among the observed LSSE signal, magnetic anisotropy and Gilbert damping of the CoFeCrGa thin films, which will be beneficial for fabricating tunable and highly efficient Heusler alloys based spincaloritronic nanodevices. △ Less

Submitted 18 August, 2023; originally announced August 2023.

arXiv:2308.07236 [pdf]

Temperature Evolution of Magnon Propagation Length in Tm$_3$Fe$_5$O$_{12}$ Thin Films: Roles of Magnetic Anisotropy and Gilbert Damping

Authors: Amit Chanda, Christian Holzmann, Noah Schulz, Aladin Ullrich, Manfred Albrecht, Miela J. Gross, Caroline A. Ross, Dario. A. Arena, Manh-Huong Phan, Hariharan Srikanth

Abstract: The magnon propagation length ($\langleξ\rangle$) of a ferro/ferrimagnet (FM) is one of the key factors that controls the generation and propagation of thermally-driven spin current in FM/heavy metal (HM) bilayer based spincaloritronic devices. Theory predicts that for the FM layer, $\langleξ\rangle$ is inversely proportional to the Gilbert damping ($α$) and the square root of the effective magnet… ▽ More The magnon propagation length ($\langleξ\rangle$) of a ferro/ferrimagnet (FM) is one of the key factors that controls the generation and propagation of thermally-driven spin current in FM/heavy metal (HM) bilayer based spincaloritronic devices. Theory predicts that for the FM layer, $\langleξ\rangle$ is inversely proportional to the Gilbert damping ($α$) and the square root of the effective magnetic anisotropy constant ($K_{\rm eff}$). However, direct experimental evidence of this relationship is lacking. To experimentally confirm this prediction, we employ a combination of longitudinal spin Seebeck effect (LSSE), transverse susceptibility, and ferromagnetic resonance experiments to investigate the temperature evolution of $\langleξ\rangle$ and establish its correlation with the effective magnetic anisotropy field, $H_K^{\rm eff}$ ($\propto K_{\rm eff}$) and $α$ in Tm$_3$Fe$_5$O$_{12}$ (TmIG)/Pt bilayers. We observe concurrent drops in the LSSE voltage and $\langleξ\rangle$ below 200$^\circ$K in TmIG/Pt bilayers regardless of TmIG film thickness and substrate choice and attribute it to the noticeable increases in $H_K^{\rm eff}$ and $α$ that occur within the same temperature range. From the TmIG thickness dependence of the LSSE voltage, we determined the temperature dependence of $\langleξ\rangle$ and highlighted its correlation with the temperature-dependent $H_K^{\rm eff}$ and $α$ in TmIG/Pt bilayers, which will be beneficial for the development of rare-earth iron garnet-based efficient spincaloritronic nanodevices. △ Less

Submitted 13 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

Comments: 10 main text figures

arXiv:2308.06261 [pdf, other]

Enhancing Network Management Using Code Generated by Large Language Models

Authors: Sathiya Kumaran Mani, Yajie Zhou, Kevin Hsieh, Santiago Segarra, Ranveer Chandra, Srikanth Kandula

Abstract: Analyzing network topologies and communication graphs plays a crucial role in contemporary network management. However, the absence of a cohesive approach leads to a challenging learning curve, heightened errors, and inefficiencies. In this paper, we introduce a novel approach to facilitate a natural-language-based network management experience, utilizing large language models (LLMs) to generate t… ▽ More Analyzing network topologies and communication graphs plays a crucial role in contemporary network management. However, the absence of a cohesive approach leads to a challenging learning curve, heightened errors, and inefficiencies. In this paper, we introduce a novel approach to facilitate a natural-language-based network management experience, utilizing large language models (LLMs) to generate task-specific code from natural language queries. This method tackles the challenges of explainability, scalability, and privacy by allowing network operators to inspect the generated code, eliminating the need to share network data with LLMs, and concentrating on application-specific requests combined with general program synthesis techniques. We design and evaluate a prototype system using benchmark applications, showcasing high accuracy, cost-effectiveness, and the potential for further enhancements using complementary program synthesis techniques. △ Less

Submitted 11 August, 2023; originally announced August 2023.

arXiv:2308.01302 [pdf, other]

Handling Communication via APIs for Microservices

Authors: Vini Kanvar, Ridhi Jain, Srikanth Tamilselvam

Abstract: Enterprises in their journey to the cloud, want to decompose their monolith applications into microservices to maximize cloud benefits. Current research focuses a lot on how to partition the monolith into smaller clusters that perform well across standard metrics like coupling, cohesion, etc. However, there is little research done on taking the partitions, identifying their dependencies between th… ▽ More Enterprises in their journey to the cloud, want to decompose their monolith applications into microservices to maximize cloud benefits. Current research focuses a lot on how to partition the monolith into smaller clusters that perform well across standard metrics like coupling, cohesion, etc. However, there is little research done on taking the partitions, identifying their dependencies between the microservices, exploring ways to further reduce the dependencies, and making appropriate code changes to enable robust communication without modifying the application behaviour. In this work, we discuss the challenges with the conventional techniques of communication using JSON and propose an alternative way of ID-passing via APIs. We also devise an algorithm to reduce the number of APIs. For this, we construct subgraphs of methods and their associated variables in each class and relocate them to their more functionally aligned microservices. Our quantitative and qualitative studies on five public Java applications clearly demonstrate that our refactored microservices using ID have decidedly better time and memory complexities than JSON. Our automation reduces 40-60\% of the manual refactoring efforts. △ Less

Submitted 2 August, 2023; originally announced August 2023.

Comments: 15 pages

arXiv:2307.16271 [pdf, ps, other]

Lattices over finite group schemes and stratification

Authors: Tobias Barthel, Dave Benson, Srikanth B. Iyengar, Henning Krause, Julia Pevtsova

Abstract: This work concerns representations of a finite flat group scheme $G$, defined over a noetherian commutative ring $R$. The focus is on lattices, namely, finitely generated $G$-modules that are projective as $R$-modules, and on the full subcategory of all $G$-modules projective over $R$ generated by the lattices. The stable category of such $G$-modules is a rigidly-compactly generated, tensor triang… ▽ More This work concerns representations of a finite flat group scheme $G$, defined over a noetherian commutative ring $R$. The focus is on lattices, namely, finitely generated $G$-modules that are projective as $R$-modules, and on the full subcategory of all $G$-modules projective over $R$ generated by the lattices. The stable category of such $G$-modules is a rigidly-compactly generated, tensor triangulated category. The main result is that this stable category is stratified and costratified by the natural action of the cohomology ring of $G$. Applications include formulas for computing the support and cosupport of tensor products and the module of homomorphisms, and a classification of the thick ideals in the stable category of lattices. △ Less

Submitted 14 October, 2023; v1 submitted 30 July, 2023; originally announced July 2023.

Comments: 35 pages. The Introductions, and sections 2 and 5 have been rewritten significantly

Report number: MPIM-Bonn-2023 MSC Class: 16G30 (primary); 18G80; 20C10; 20J06 (secondary)

arXiv:2307.16113 [pdf, other]

doi 10.1007/s11071-024-09641-5

Dynamics of Minimal Networks of Limit Cycle Oscillators

Authors: Andrea Elizabeth Biju, Sneha Srikanth, Krishna Manoj, Samadhan A. Pawar, R. I. Sujith

Abstract: The framework of mutually coupled oscillators on a network has served as a convenient tool for investigating the impact of various parameters on the dynamics of real-world systems. Compared to large networks of oscillators, minimal networks are more susceptible to changes in coupling parameters, the number of oscillators, and network topologies. In this study, we systematically explore the influen… ▽ More The framework of mutually coupled oscillators on a network has served as a convenient tool for investigating the impact of various parameters on the dynamics of real-world systems. Compared to large networks of oscillators, minimal networks are more susceptible to changes in coupling parameters, the number of oscillators, and network topologies. In this study, we systematically explore the influence of these parameters on the dynamics of a minimal network comprising Stuart-Landau oscillators coupled with a distance-dependent time delay. We examine three network topologies: ring, chain, and star. Specifically, for ring networks, we study the effects of increasing nonlocality from local to global coupling on the overall dynamics of the system. Our findings reveal the existence of various synchronized states, including splay and cluster states, a partially synchronized state such as chimeric quasiperiodicity, and an oscillation quenching state such as amplitude death in these networks. Moreover, through an analysis of long-lived transients, we discover novel amplitude-modulated states within ring networks. Interestingly, we observe that increasing nonlocality diminishes the influence of the number of oscillators on the overall behavior in these networks. Furthermore, we note that chain networks, unlike ring networks, do not exhibit perfect synchrony among the coupled oscillators. In contrast, star networks demonstrate greater stability and are unaffected by the number of oscillators within the network. The insights from this study deepen our understanding of the dynamics of minimal networks and have implications for various fields, ranging from biology to engineering. △ Less

Submitted 29 July, 2023; originally announced July 2023.

Comments: 14 pages, 11 figures

arXiv:2307.10468 [pdf]

The Greenland Telescope: Construction, Commissioning, and Operations in Pituffik

Authors: Ming-Tang Chen, Keiichi Asada, Satoki Matsushita, Philippe Raffin, Makoto Inoue, Paul T. P. Ho, Chih-Chiang Han, Derek Kubo, Timothy Norton, Nimesh A. Patel, George Nystrom, Chih-Wei L. Huang, Pierre Martin-Cocher, Jun Yi Koay, Cristina Romero-Cañizales, Ching-Tang Liu, Teddy Huang, Kuan-Yu Liu, Tashun Wei, Shu-Hao Chang, Ryan Chilson, Peter Oshiro, Homin Jiang, Chao-Te Li, Geoffrey Bower , et al. (29 additional authors not shown)

Abstract: In 2018, the Greenland Telescope (GLT) started scientific observation in Greenland. Since then, we have completed several significant improvements and added new capabilities to the telescope system. This paper presents a full review of the GLT system, a summary of our observation activities since 2018, the lessons learned from the operations in the Arctic regions, and the prospect of the telescope… ▽ More In 2018, the Greenland Telescope (GLT) started scientific observation in Greenland. Since then, we have completed several significant improvements and added new capabilities to the telescope system. This paper presents a full review of the GLT system, a summary of our observation activities since 2018, the lessons learned from the operations in the Arctic regions, and the prospect of the telescope. △ Less

Submitted 19 July, 2023; originally announced July 2023.

Comments: 26 pages, 11 figures, and 8 tables. This is the version of the article before publication editing, as submitted by an author to Publications of the Astronomical Society of the Pacific. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The Version of Record will be added when it becomes available

arXiv:2307.09116 [pdf, other]

doi 10.1103/PhysRevA.108.042211

Asymmetric One-Sided Semi-Device-Independent Steerability of Quantum Discordant States

Authors: Chellasamy Jebarathinam, Debarshi Das, R. Srikanth

Abstract: Superlocality and superunsteerability provide operational characterization of quantum correlations in certain local and unsteerable states respectively. Such quantum correlated states have a nonzero quantum discord. A two-way nonzero quantum discord is necessary for quantum correlations pointed out by superlocality. On the other hand, in this work, we demonstrate that a two-way nonzero quantum dis… ▽ More Superlocality and superunsteerability provide operational characterization of quantum correlations in certain local and unsteerable states respectively. Such quantum correlated states have a nonzero quantum discord. A two-way nonzero quantum discord is necessary for quantum correlations pointed out by superlocality. On the other hand, in this work, we demonstrate that a two-way nonzero quantum discord is not necessary to demonstrate superunsteerability. To this end, we demonstrate superunsteerability for one-way quantum discordant states. This in turn implies the existence of one-way superunsteerability and also the presence of superunsteerability without superlocality. Superunsteerability for nonzero quantum discord states implies the occurence of steerability in a one-sided semi-device-independent way. Just like one-way steerability occurs for certain Bell-local states in a one-sided device-independent way, our result shows that one-way steerability can also occur for certain nonsuperlocal states but in a one-sided semi-device-independent way. △ Less

Submitted 17 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

Comments: v4: close to published version. v3: Revised version resubmitted to PRA. v2: Improved motivations in Introduction on v1. 9 pages, 1 figure

Journal ref: Phys. Rev. A 108, 042211 (2023)

arXiv:2307.04693 [pdf, other]

COMEX: A Tool for Generating Customized Source Code Representations

Authors: Debeshee Das, Noble Saji Mathews, Alex Mathai, Srikanth Tamilselvam, Kranthi Sedamaki, Sridhar Chimalakonda, Atul Kumar

Abstract: Learning effective representations of source code is critical for any Machine Learning for Software Engineering (ML4SE) system. Inspired by natural language processing, large language models (LLMs) like Codex and CodeGen treat code as generic sequences of text and are trained on huge corpora of code data, achieving state of the art performance on several software engineering (SE) tasks. However, v… ▽ More Learning effective representations of source code is critical for any Machine Learning for Software Engineering (ML4SE) system. Inspired by natural language processing, large language models (LLMs) like Codex and CodeGen treat code as generic sequences of text and are trained on huge corpora of code data, achieving state of the art performance on several software engineering (SE) tasks. However, valid source code, unlike natural language, follows a strict structure and pattern governed by the underlying grammar of the programming language. Current LLMs do not exploit this property of the source code as they treat code like a sequence of tokens and overlook key structural and semantic properties of code that can be extracted from code-views like the Control Flow Graph (CFG), Data Flow Graph (DFG), Abstract Syntax Tree (AST), etc. Unfortunately, the process of generating and integrating code-views for every programming language is cumbersome and time consuming. To overcome this barrier, we propose our tool COMEX - a framework that allows researchers and developers to create and combine multiple code-views which can be used by machine learning (ML) models for various SE tasks. Some salient features of our tool are: (i) it works directly on source code (which need not be compilable), (ii) it currently supports Java and C#, (iii) it can analyze both method-level snippets and program-level snippets by using both intra-procedural and inter-procedural analysis, and (iv) it is easily extendable to other languages as it is built on tree-sitter - a widely used incremental parser that supports over 40 languages. We believe this easy-to-use code-view generation and customization tool will give impetus to research in source code representation learning methods and ML4SE. Tool: https://pypi.org/project/comex - GitHub: https://github.com/IBM/tree-sitter-codeviews - Demo: https://youtu.be/GER6U87FVbU △ Less

Submitted 10 July, 2023; originally announced July 2023.

Comments: The paper has been accepted for publication at ASE 2023 (Tool Demonstrations Track)

arXiv:2307.03202 [pdf, other]

Do old globular clusters in low mass galaxies disprove modified gravity?

Authors: Michal Bílek, Hongsheng Zhao, Benoit Famaey, Srikanth T. Nagesh, Françoise Combes, Oliver Müller, Michael Hilker, Pavel Kroupa, Rodrigo Ibata

Abstract: The controversy "dark matter vs. modified gravity" constitutes a major topic of discussion. It was proposed that dynamical friction could be used to discriminate between the two alternatives. Analytic calculations indicate that, with modified gravity, globular clusters (GCs) of low-mass galaxies experience much stronger dynamical friction than in the equivalent system with Newtonian gravity and da… ▽ More The controversy "dark matter vs. modified gravity" constitutes a major topic of discussion. It was proposed that dynamical friction could be used to discriminate between the two alternatives. Analytic calculations indicate that, with modified gravity, globular clusters (GCs) of low-mass galaxies experience much stronger dynamical friction than in the equivalent system with Newtonian gravity and dark matter. As a result, in modified gravity the old GCs of low mass galaxies should have already settled in the centers of the galaxies. This is not observed. Here we report on our efforts to verify the analytic results by self-consistent simulations with the MOND-type (modified Newtonian dynamics) gravity. The core stalling mechanism, that was not considered in the analytic calculations, prevents GCs to settle in centers of ultra-diffuse galaxies. For isolated dwarf galaxies, which are gas-rich objects, supernova explosions prevent the GCs from settling. △ Less

Submitted 5 July, 2023; originally announced July 2023.

Comments: 4 pages, 2 figures, accepted for publication in the proceedings of IAU Symposium 379: Dynamical Masses of Local Group Galaxies

arXiv:2307.02899 [pdf, other]

doi 10.1103/PhysRevA.109.042419

Experimental realization of quantum non-Markovianity through the convex mixing of Pauli semigroups on an NMR quantum processor

Authors: Vaishali Gulati, Vinayak Jagadish, R. Srikanth, Kavita Dorai

Abstract: This experimental study aims to investigate the convex combinations of Pauli semigroups with arbitrary mixing parameters to determine whether the resulting dynamical map exhibits Markovian or non-Markovian behavior. Specifically, we consider the cases of equal as well as unequal mixing of two Pauli semigroups, and demonstrate that the resulting map is always non-Markovian. Additionally, we study t… ▽ More This experimental study aims to investigate the convex combinations of Pauli semigroups with arbitrary mixing parameters to determine whether the resulting dynamical map exhibits Markovian or non-Markovian behavior. Specifically, we consider the cases of equal as well as unequal mixing of two Pauli semigroups, and demonstrate that the resulting map is always non-Markovian. Additionally, we study three cases of three-way mixing of the three Pauli semigroups and determine the Markovianity or non-Markovianity of the resulting maps by experimentally determining the decay rates. To simulate the non-unitary dynamics of a single qubit system with different mixing combinations of Pauli semigroups on an NMR quantum processor, we use an algorithm involving two ancillary qubits. The experimental results align with the theoretical predictions. △ Less

Submitted 26 April, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

Comments: 9 pages, 8 figures

Journal ref: Phys. Rev. A 109, 042419 (2024)

arXiv:2307.00920 [pdf, other]

doi 10.21437/Interspeech.2023-1923

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews

Authors: Sergio Burdisso, Esaú Villatoro-Tello, Srikanth Madikeri, Petr Motlicek

Abstract: We propose a simple approach for weighting self-connecting edges in a Graph Convolutional Network (GCN) and show its impact on depression detection from transcribed clinical interviews. To this end, we use a GCN for modeling non-consecutive and long-distance semantics to classify the transcriptions into depressed or control subjects. The proposed method aims to mitigate the limiting assumptions of… ▽ More We propose a simple approach for weighting self-connecting edges in a Graph Convolutional Network (GCN) and show its impact on depression detection from transcribed clinical interviews. To this end, we use a GCN for modeling non-consecutive and long-distance semantics to classify the transcriptions into depressed or control subjects. The proposed method aims to mitigate the limiting assumptions of locality and the equal importance of self-connections vs. edges to neighboring nodes in GCNs, while preserving attractive features such as low computational cost, data agnostic, and interpretability capabilities. We perform an exhaustive evaluation in two benchmark datasets. Results show that our approach consistently outperforms the vanilla GCN model as well as previously reported results, achieving an F1=0.84 on both datasets. Finally, a qualitative analysis illustrates the interpretability capabilities of the proposed approach and its alignment with previous findings in psychology. △ Less

Submitted 11 March, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: Paper Accepted to Interspeech 2023

Journal ref: Interspeech 2023

arXiv:2306.15685 [pdf, other]

Implementing contextual biasing in GPU decoder for online ASR

Authors: Iuliia Nigmatulina, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motliček, Juan Zuluaga-Gomez, Karthik Pandia, Aravind Ganapathiraju

Abstract: GPU decoding significantly accelerates the output of ASR predictions. While GPUs are already being used for online ASR decoding, post-processing and rescoring on GPUs have not been properly investigated yet. Rescoring with available contextual information can considerably improve ASR predictions. Previous studies have proven the viability of lattice rescoring in decoding and biasing language model… ▽ More GPU decoding significantly accelerates the output of ASR predictions. While GPUs are already being used for online ASR decoding, post-processing and rescoring on GPUs have not been properly investigated yet. Rescoring with available contextual information can considerably improve ASR predictions. Previous studies have proven the viability of lattice rescoring in decoding and biasing language model (LM) weights in offline and online CPU scenarios. In real-time GPU decoding, partial recognition hypotheses are produced without lattice generation, which makes the implementation of biasing more complex. The paper proposes and describes an approach to integrate contextual biasing in real-time GPU decoding while exploiting the standard Kaldi GPU decoder. Besides the biasing of partial ASR predictions, our approach also permits dynamic context switching allowing a flexible rescoring per each speech segment directly on GPU. The code is publicly released and tested with open-sourced test sets. △ Less

Submitted 23 June, 2023; originally announced June 2023.

Comments: Accepted to Interspeech 2023

arXiv:2306.13678 [pdf]

Rigid3D: a hybrid multi-sphere DEM framework for simulation of non-spherical particles in multi-phase flow

Authors: Fei-Liang Yuan, Martin Sommerfeld, Pradeep Muramulla, Srikanth Gopireddy, Lars Pasternak, Nora Urbanetz, Thomas Profitlich

Abstract: This article presents the development and validation of a hybrid multi-sphere discrete element framework - Rigid3D, for the simulation of granular systems with arbitrarily shaped particles in 3D space. In this DEM framework, a non-spherical particle is approximated by three different geometric models: (1) multi-sphere model with overlapping spheres (MS model), (2) particle surface with triangle me… ▽ More This article presents the development and validation of a hybrid multi-sphere discrete element framework - Rigid3D, for the simulation of granular systems with arbitrarily shaped particles in 3D space. In this DEM framework, a non-spherical particle is approximated by three different geometric models: (1) multi-sphere model with overlapping spheres (MS model), (2) particle surface with triangle mesh (surface model), and (3) discretized particle body with polyhedral cells (cell model). The multi-sphere approach will be the "engine" for efficient DEM simulations, while the particle's mesh and cell models will be updated simultaneously according to the position and orientation of their associated MS model, for use in particle-related inter-phase couplings in a multi-phase flow. In this sense, Rigid3D tries to combine the best of both worlds in multi-sphere and polyhedral DEMs: multi-sphere method for the efficiency and acceptable accuracy in the DEM simulation of granular flows, while the surface and cell models for the couplings between particles and other phases (continuous or dispersed phases) without affecting the performance of DEM simulations. △ Less

Submitted 21 June, 2023; originally announced June 2023.

Comments: Manuscript for submission to Springer Journal - Computational Particle Mechanics

arXiv:2306.12773 [pdf, other]

doi 10.1103/PhysRevA.108.042202

Noninvertibility and non-Markovianity of quantum dynamical maps

Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

Abstract: We identify two broad types of noninvertibilities in quantum dynamical maps, one necessarily associated with CP indivisibility and one not so. We study the production of (non-)Markovian, invertible maps by the process of mixing noninvertible Pauli maps, and quantify the fraction of the same. The memory kernel perspective appears to be less transparent on the issue of invertibility than the approac… ▽ More We identify two broad types of noninvertibilities in quantum dynamical maps, one necessarily associated with CP indivisibility and one not so. We study the production of (non-)Markovian, invertible maps by the process of mixing noninvertible Pauli maps, and quantify the fraction of the same. The memory kernel perspective appears to be less transparent on the issue of invertibility than the approaches based on maps or master equations. Here we consider a related and potentially helpful issue: the identification of criteria of parameterized families of maps leading to the existence of a well-defined semigroup limit. △ Less

Submitted 14 October, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

Comments: 7 pages, 2 figures

Journal ref: Phys. Rev. A 108, 042202 (2023)

arXiv:2306.08175 [pdf, other]

DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer ASR

Authors: Goeric Huybrechts, Srikanth Ronanki, Xilai Li, Hadis Nosrati, Sravan Bodapati, Katrin Kirchhoff

Abstract: Conformer-based end-to-end models have become ubiquitous these days and are commonly used in both streaming and non-streaming automatic speech recognition (ASR). Techniques like dual-mode and dynamic chunk training helped unify streaming and non-streaming systems. However, there remains a performance gap between streaming with a full and limited past context. To address this issue, we propose the… ▽ More Conformer-based end-to-end models have become ubiquitous these days and are commonly used in both streaming and non-streaming automatic speech recognition (ASR). Techniques like dual-mode and dynamic chunk training helped unify streaming and non-streaming systems. However, there remains a performance gap between streaming with a full and limited past context. To address this issue, we propose the integration of a novel dynamic contextual carry-over mechanism in a state-of-the-art (SOTA) unified ASR system. Our proposed dynamic context Conformer (DCTX-Conformer) utilizes a non-overlapping contextual carry-over mechanism that takes into account both the left context of a chunk and one or more preceding context embeddings. We outperform the SOTA by a relative 25.0% word error rate, with a negligible latency impact due to the additional context embeddings. △ Less

Submitted 1 March, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

arXiv:2306.06493 [pdf, other]

RAMAN: A Re-configurable and Sparse tinyML Accelerator for Inference on Edge

Authors: Adithya Krishna, Srikanth Rohit Nudurupati, Chandana D G, Pritesh Dwivedi, André van Schaik, Mahesh Mehendale, Chetan Singh Thakur

Abstract: Deep Neural Network (DNN) based inference at the edge is challenging as these compute and data-intensive algorithms need to be implemented at low cost and low power while meeting the latency constraints of the target applications. Sparsity, in both activations and weights inherent to DNNs, is a key knob to leverage. In this paper, we present RAMAN, a Re-configurable and spArse tinyML Accelerator f… ▽ More Deep Neural Network (DNN) based inference at the edge is challenging as these compute and data-intensive algorithms need to be implemented at low cost and low power while meeting the latency constraints of the target applications. Sparsity, in both activations and weights inherent to DNNs, is a key knob to leverage. In this paper, we present RAMAN, a Re-configurable and spArse tinyML Accelerator for infereNce on edge, architected to exploit the sparsity to reduce area (storage), power as well as latency. RAMAN can be configured to support a wide range of DNN topologies - consisting of different convolution layer types and a range of layer parameters (feature-map size and the number of channels). RAMAN can also be configured to support accuracy vs power/latency tradeoffs using techniques deployed at compile-time and run-time. We present the salient features of the architecture, provide implementation results and compare the same with the state-of-the-art. RAMAN employs novel dataflow inspired by Gustavson's algorithm that has optimal input activation (IA) and output activation (OA) reuse to minimize memory access and the overall data movement cost. The dataflow allows RAMAN to locally reduce the partial sum (Psum) within a processing element array to eliminate the Psum writeback traffic. Additionally, we suggest a method to reduce peak activation memory by overlapping IA and OA on the same memory space, which can reduce storage requirements by up to 50%. RAMAN was implemented on a low-power and resource-constrained Efinix Ti60 FPGA with 37.2K LUTs and 8.6K register utilization. RAMAN processes all layers of the MobileNetV1 model at 98.47 GOp/s/W and the DS-CNN model at 79.68 GOp/s/W by leveraging both weight and activation sparsity. △ Less

Submitted 10 June, 2023; originally announced June 2023.

arXiv:2306.03315 [pdf, other]

Few Shot Rationale Generation using Self-Training with Dual Teachers

Authors: Aditya Srikanth Veerubhotla, Lahari Poddar, Jun Yin, György Szarvas, Sharanya Eswaran

Abstract: Self-rationalizing models that also generate a free-text explanation for their predicted labels are an important tool to build trustworthy AI applications. Since generating explanations for annotated labels is a laborious and costly pro cess, recent models rely on large pretrained language models (PLMs) as their backbone and few-shot learning. In this work we explore a self-training approach lever… ▽ More Self-rationalizing models that also generate a free-text explanation for their predicted labels are an important tool to build trustworthy AI applications. Since generating explanations for annotated labels is a laborious and costly pro cess, recent models rely on large pretrained language models (PLMs) as their backbone and few-shot learning. In this work we explore a self-training approach leveraging both labeled and unlabeled data to further improve few-shot models, under the assumption that neither human written rationales nor annotated task labels are available at scale. We introduce a novel dual-teacher learning framework, which learns two specialized teacher models for task prediction and rationalization using self-training and distills their knowledge into a multi-tasking student model that can jointly generate the task label and rationale. Furthermore, we formulate a new loss function, Masked Label Regularization (MLR) which promotes explanations to be strongly conditioned on predicted labels. Evaluation on three public datasets demonstrate that the proposed methods are effective in modeling task labels and generating faithful rationales. △ Less

Submitted 5 June, 2023; originally announced June 2023.

Comments: ACL Findings 2023

arXiv:2306.01654 [pdf, other]

GANs Settle Scores!

Authors: Siddarth Asokan, Nishanth Shetty, Aadithya Srikanth, Chandra Sekhar Seelamantula

Abstract: Generative adversarial networks (GANs) comprise a generator, trained to learn the underlying distribution of the desired data, and a discriminator, trained to distinguish real samples from those output by the generator. A majority of GAN literature focuses on understanding the optimality of the discriminator through integral probability metric (IPM) or divergence based analysis. In this paper, we… ▽ More Generative adversarial networks (GANs) comprise a generator, trained to learn the underlying distribution of the desired data, and a discriminator, trained to distinguish real samples from those output by the generator. A majority of GAN literature focuses on understanding the optimality of the discriminator through integral probability metric (IPM) or divergence based analysis. In this paper, we propose a unified approach to analyzing the generator optimization through variational approach. In $f$-divergence-minimizing GANs, we show that the optimal generator is the one that matches the score of its output distribution with that of the data distribution, while in IPM GANs, we show that this optimal generator matches score-like functions, involving the flow-field of the kernel associated with a chosen IPM constraint space. Further, the IPM-GAN optimization can be seen as one of smoothed score-matching, where the scores of the data and the generator distributions are convolved with the kernel associated with the constraint. The proposed approach serves to unify score-based training and existing GAN flavors, leveraging results from normalizing flows, while also providing explanations for empirical phenomena such as the stability of non-saturating GAN losses. Based on these results, we propose novel alternatives to $f$-GAN and IPM-GAN training based on score and flow matching, and discriminator-guided Langevin sampling. △ Less

Submitted 2 June, 2023; originally announced June 2023.

arXiv:2305.18422 [pdf]

Extended Pacejka Tire Model for Enhanced Vehicle Stability Control

Authors: Kanwar Bharat Singh, Srikanth Sivaramakrishnan

Abstract: Despite their widespread use, current tire models have demonstrated a certain level of inaccuracy, primarily due to uncertainties related to unaccounted nonlinearity and disturbance effects resulting from tire operating conditions. Noteworthy factors such as tread depth, inflation pressure, tire temperature, and road surface condition significantly impact tire force and moment characteristics. The… ▽ More Despite their widespread use, current tire models have demonstrated a certain level of inaccuracy, primarily due to uncertainties related to unaccounted nonlinearity and disturbance effects resulting from tire operating conditions. Noteworthy factors such as tread depth, inflation pressure, tire temperature, and road surface condition significantly impact tire force and moment characteristics. These factors can vary considerably during tire operation and significantly affect both tire and vehicle performance. The improvement of tire models is crucial to enhance the effectiveness of advanced vehicle control systems, as accurate tire force characteristics are required for maintaining vehicle stability during demanding maneuvers. This paper investigates the impact of varying tire temperature, inflation pressure, and tread depth on steady-state tire force characteristics by analyzing the coefficients of the Pacejka 'magic formula' (MF) tire model. Based on this analysis, adaptation equations are proposed to compensate for the influence of these variables on the tire force curve. The advantages of using an adapted tire model are then demonstrated through simulation studies of a classical vehicle stability control system that can adapt to diverse operating conditions. A comparison is made between the adapted tire model-based controller and a controller based on a fixed reference model. △ Less

Submitted 28 May, 2023; originally announced May 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:1811.09997

arXiv:2305.13792 [pdf, other]

Mitigating the Performance Impact of Network Failures in Public Clouds

Authors: Pooria Namyar, Behnaz Arzani, Daniel Crankshaw, Daniel S. Berger, Kevin Hsieh, Srikanth Kandula, Ramesh Govindan

Abstract: Some faults in data center networks require hours to days to repair because they may need reboots, re-imaging, or manual work by technicians. To reduce traffic impact, cloud providers \textit{mitigate} the effect of faults, for example, by steering traffic to alternate paths. The state-of-art in automatic network mitigations uses simple safety checks and proxy metrics to determine mitigations. SWA… ▽ More Some faults in data center networks require hours to days to repair because they may need reboots, re-imaging, or manual work by technicians. To reduce traffic impact, cloud providers \textit{mitigate} the effect of faults, for example, by steering traffic to alternate paths. The state-of-art in automatic network mitigations uses simple safety checks and proxy metrics to determine mitigations. SWARM, the approach described in this paper, can pick orders of magnitude better mitigations by estimating end-to-end connection-level performance (CLP) metrics. At its core is a scalable CLP estimator that quickly ranks mitigations with high fidelity and, on failures observed at a large cloud provider, outperforms the state-of-the-art by over 700$\times$ in some cases. △ Less

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.13479 [pdf, other]

Rethinking Machine Learning Collective Communication as a Multi-Commodity Flow Problem

Authors: Behnaz Arzani, Siva Kesava Reddy Kakarla, Miguel Castro, Srikanth Kandula, Saeed Maleki, Luke Marshall

Abstract: We show communication schedulers' recent work proposed for ML collectives does not scale to the increasing problem sizes that arise from training larger models. These works also often produce suboptimal schedules. We make a connection with similar problems in traffic engineering and propose a new method, TECCL, that finds better quality schedules (e.g., finishes collectives faster and/or while sen… ▽ More We show communication schedulers' recent work proposed for ML collectives does not scale to the increasing problem sizes that arise from training larger models. These works also often produce suboptimal schedules. We make a connection with similar problems in traffic engineering and propose a new method, TECCL, that finds better quality schedules (e.g., finishes collectives faster and/or while sending fewer bytes) and does so more quickly on larger topologies. We present results on many different GPU topologies that show substantial improvement over the state-of-the-art. △ Less

Submitted 22 May, 2023; originally announced May 2023.

arXiv:2305.13051 [pdf, other]

Learning Pedestrian Actions to Ensure Safe Autonomous Driving

Authors: Jia Huang, Alvika Gautam, Srikanth Saripalli

Abstract: To ensure safe autonomous driving in urban environments with complex vehicle-pedestrian interactions, it is critical for Autonomous Vehicles (AVs) to have the ability to predict pedestrians' short-term and immediate actions in real-time. In recent years, various methods have been developed to study estimating pedestrian behaviors for autonomous driving scenarios, but there is a lack of clear defin… ▽ More To ensure safe autonomous driving in urban environments with complex vehicle-pedestrian interactions, it is critical for Autonomous Vehicles (AVs) to have the ability to predict pedestrians' short-term and immediate actions in real-time. In recent years, various methods have been developed to study estimating pedestrian behaviors for autonomous driving scenarios, but there is a lack of clear definitions for pedestrian behaviors. In this work, the literature gaps are investigated and a taxonomy is presented for pedestrian behavior characterization. Further, a novel multi-task sequence to sequence Transformer encoders-decoders (TF-ed) architecture is proposed for pedestrian action and trajectory prediction using only ego vehicle camera observations as inputs. The proposed approach is compared against an existing LSTM encoders decoders (LSTM-ed) architecture for action and trajectory prediction. The performance of both models is evaluated on the publicly available Joint Attention Autonomous Driving (JAAD) dataset, CARLA simulation data as well as real-time self-driving shuttle data collected on university campus. Evaluation results illustrate that the proposed method reaches an accuracy of 81% on action prediction task on JAAD testing data and outperforms the LSTM-ed by 7.4%, while LSTM counterpart performs much better on trajectory prediction task for a prediction sequence length of 25 frames. △ Less

Submitted 22 May, 2023; originally announced May 2023.

Comments: 8 pages, 9 figures

arXiv:2305.12741 [pdf, other]

Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection

Authors: Debarpan Bhattacharya, Neeraj Kumar Sharma, Debottam Dutta, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K K, Sadhana Gonuguntla, Murali Alagesan

Abstract: This paper presents the Coswara dataset, a dataset containing diverse set of respiratory sounds and rich meta-data, recorded between April-2020 and February-2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects). The respiratory sounds contained nine sound categories associated with variants of breathing, cough and speech. The rich metadata contained demogr… ▽ More This paper presents the Coswara dataset, a dataset containing diverse set of respiratory sounds and rich meta-data, recorded between April-2020 and February-2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects). The respiratory sounds contained nine sound categories associated with variants of breathing, cough and speech. The rich metadata contained demographic information associated with age, gender and geographic location, as well as the health information relating to the symptoms, pre-existing respiratory ailments, comorbidity and SARS-CoV-2 test status. Our study is the first of its kind to manually annotate the audio quality of the entire dataset (amounting to 65~hours) through manual listening. The paper summarizes the data collection procedure, demographic, symptoms and audio data information. A COVID-19 classifier based on bi-directional long short-term (BLSTM) architecture, is trained and evaluated on the different population sub-groups contained in the dataset to understand the bias/fairness of the model. This enabled the analysis of the impact of gender, geographic location, date of recording, and language proficiency on the COVID-19 detection performance. △ Less

Submitted 22 May, 2023; originally announced May 2023.

Comments: Accepted for publiation in Nature Scientific Data

arXiv:2305.11790 [pdf, other]

Prompting with Pseudo-Code Instructions

Authors: Mayank Mishra, Prince Kumar, Riyaz Bhat, Rudra Murthy V, Danish Contractor, Srikanth Tamilselvam

Abstract: Prompting with natural language instructions has recently emerged as a popular method of harnessing the capabilities of large language models. Given the inherent ambiguity present in natural language, it is intuitive to consider the possible advantages of prompting with less ambiguous prompt styles, such as the use of pseudo-code. In this paper we explore if prompting via pseudo-code instruction… ▽ More Prompting with natural language instructions has recently emerged as a popular method of harnessing the capabilities of large language models. Given the inherent ambiguity present in natural language, it is intuitive to consider the possible advantages of prompting with less ambiguous prompt styles, such as the use of pseudo-code. In this paper we explore if prompting via pseudo-code instructions helps improve the performance of pre-trained language models. We manually create a dataset of pseudo-code prompts for 132 different tasks spanning classification, QA and generative language tasks, sourced from the Super-NaturalInstructions dataset. Using these prompts along with their counterparts in natural language, we study their performance on two LLM families - BLOOM and CodeGen. Our experiments show that using pseudo-code instructions leads to better results, with an average increase (absolute) of 7-16 points in F1 scores for classification tasks and an improvement (relative) of 12-38% in aggregate ROUGE-L scores across all tasks. We include detailed ablation studies which indicate that code comments, docstrings, and the structural clues encoded in pseudo-code all contribute towards the improvement in performance. To the best of our knowledge our work is the first to demonstrate how pseudo-code prompts can be helpful in improving the performance of pre-trained LMs. △ Less

Submitted 19 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

Comments: Published in EMNLP 2023 main track

Showing 151–200 of 950 results for author: Srikanth