Search | arXiv e-print repository

doi 10.3847/1538-4357/acd8be

The prevalence and influence of circumstellar material around hydrogen-rich supernova progenitors

Authors: Rachel J. Bruch, Avishay Gal-Yam, Ofer Yaron, Ping Chen, Nora L. Strotjohann, Ido Irani, Erez Zimmerman, Steve Schulze, Yi Yang, Young-Lo Kim, Mattia Bulla, Jesper Sollerman, Mickael Rigault, Eran Ofek, Maayane Soumagnac, Frank J. Masci, Christoffer Fremling, Daniel Perley, Jakob Nordin, S. Bradley Cenko, Anna Y. Q. Ho, S. Adams, Igor Adreoni, Eric C. Bellm, Nadia Blagorodnova , et al. (22 additional authors not shown)

Abstract: Narrow transient emission lines (flash-ionization features) in early supernova (SN) spectra trace the presence of circumstellar material (CSM) around the massive progenitor stars of core-collapse SNe. The lines disappear within days after the SN explosion, suggesting that this material is spatially confined, and originates from enhanced mass loss shortly (months to a few years) prior to explosion.… ▽ More Narrow transient emission lines (flash-ionization features) in early supernova (SN) spectra trace the presence of circumstellar material (CSM) around the massive progenitor stars of core-collapse SNe. The lines disappear within days after the SN explosion, suggesting that this material is spatially confined, and originates from enhanced mass loss shortly (months to a few years) prior to explosion. We performed a systematic survey of H-rich (Type II) SNe discovered within less than two days from explosion during the first phase of the Zwicky Transient Facility (ZTF) survey (2018-2020), finding thirty events for which a first spectrum was obtained within $< 2$ days from explosion. The measured fraction of events showing flash ionisation features ($>36\%$ at $95\%$ confidence level) confirms that elevated mass loss in massive stars prior to SN explosion is common. We find that SNe II showing flash ionisation features are not significantly brighter, nor bluer, nor more slowly rising than those without. This implies that CSM interaction does not contribute significantly to their early continuum emission, and that the CSM is likely optically thin. We measured the persistence duration of flash ionisation emission and find that most SNe show flash features for $\approx 5 $ days. Rarer events, with persistence timescales $>10$ days, are brighter and rise longer, suggesting these may be intermediate between regular SNe II and strongly-interacting SNe IIn. △ Less

Submitted 13 December, 2022; v1 submitted 6 December, 2022; originally announced December 2022.

arXiv:2211.15702 [pdf, other]

doi 10.3847/1538-4357/aca80a

Inferencing Progenitor and Explosion Properties of Evolving Core-collapse Supernovae from Zwicky Transient Facility Light Curves

Authors: Bhagya M. Subrayan, Danny Milisavljevic, Takashi J. Moriya, Kathryn E. Weil, Geoffrey Lentner, Mark Linvill, John Banovetz, Braden Garretson, Jack Reynolds, Niharika Sravan, Ryan Chornock, Rafaella Margutti

Abstract: We analyze a sample of 45 Type II supernovae from the Zwicky Transient Facility (ZTF) public survey using a grid of hydrodynamical models in order to assess whether theoretically-driven forecasts can intelligently guide follow up observations supporting all-sky survey alert streams. We estimate several progenitor properties and explosion physics parameters including zero-age-main-sequence (ZAMS) m… ▽ More We analyze a sample of 45 Type II supernovae from the Zwicky Transient Facility (ZTF) public survey using a grid of hydrodynamical models in order to assess whether theoretically-driven forecasts can intelligently guide follow up observations supporting all-sky survey alert streams. We estimate several progenitor properties and explosion physics parameters including zero-age-main-sequence (ZAMS) mass, mass-loss rate, kinetic energy, 56Ni mass synthesized, host extinction, and the time of explosion. Using complete light curves we obtain confident characterizations for 34 events in our sample, with the inferences of the remaining 11 events limited either by poorly constraining data or the boundaries of our model grid. We also simulate real-time characterization of alert stream data by comparing our model grid to various stages of incomplete light curves (t less than 25 days, t less than 50 days, all data), and find that some parameters are more reliable indicators of true values at early epochs than others. Specifically, ZAMS mass, time of explosion, steepness parameter beta, and host extinction are reasonably constrained with incomplete light curve data, whereas mass-loss rate, kinetic energy and 56Ni mass estimates generally require complete light curves spanning greater than 100 days. We conclude that real-time modeling of transients, supported by multi-band synthetic light curves tailored to survey passbands, can be used as a powerful tool to identify critical epochs of follow up observations. Our findings are relevant to identify, prioritize, and coordinate efficient follow up of transients discovered by Vera C. Rubin Observatory. △ Less

Submitted 28 November, 2022; originally announced November 2022.

Comments: 27 pages, 14 figures, Accepted to The Astrophysical Journal

arXiv:2211.13280 [pdf, other]

Device Directedness with Contextual Cues for Spoken Dialog Systems

Authors: Dhanush Bekal, Sundararajan Srinivasan, Sravan Bodapati, Srikanth Ronanki, Katrin Kirchhoff

Abstract: In this work, we define barge-in verification as a supervised learning task where audio-only information is used to classify user spoken dialogue into true and false barge-ins. Following the success of pre-trained models, we use low-level speech representations from a self-supervised representation learning model for our downstream classification task. Further, we propose a novel technique to infu… ▽ More In this work, we define barge-in verification as a supervised learning task where audio-only information is used to classify user spoken dialogue into true and false barge-ins. Following the success of pre-trained models, we use low-level speech representations from a self-supervised representation learning model for our downstream classification task. Further, we propose a novel technique to infuse lexical information directly into speech representations to improve the domain-specific language information implicitly learned during pre-training. Experiments conducted on spoken dialog data show that our proposed model trained to validate barge-in entirely from speech representations is faster by 38% relative and achieves 4.5% relative F1 score improvement over a baseline LSTM model that uses both audio and Automatic Speech Recognition (ASR) 1-best hypotheses. On top of this, our best proposed model with lexically infused representations along with contextual features provides a further relative improvement of 5.7% in the F1 score but only 22% faster than the baseline. △ Less

Submitted 23 November, 2022; originally announced November 2022.

arXiv:2211.04961 [pdf, other]

Parallel-Connected Battery Current Imbalance Dynamics

Authors: Andrew Weng, Sravan Pannala, Jason B. Siegel, Anna G. Stefanopoulou

Abstract: In this work, we derive analytical expressions governing state-of-charge and current imbalance dynamics for two parallel-connected batteries. The model, based on equivalent circuits and an affine open circuit voltage relation, describes the evolution of state-of-charge and current imbalance over the course of a complete charge and discharge cycle. Using this framework, we identify the conditions u… ▽ More In this work, we derive analytical expressions governing state-of-charge and current imbalance dynamics for two parallel-connected batteries. The model, based on equivalent circuits and an affine open circuit voltage relation, describes the evolution of state-of-charge and current imbalance over the course of a complete charge and discharge cycle. Using this framework, we identify the conditions under which an aged battery will experience a higher current magnitude and state-of-charge deviation towards the end of a charge or discharge cycle. This work enables a quantitative understanding of how mismatches in battery capacities and resistances influence imbalance dynamics in parallel-connected battery systems, helping to pave a path forward for battery degradation modeling in heterogeneous battery systems. △ Less

Submitted 9 November, 2022; originally announced November 2022.

Comments: 7 pages, 4 figures, conference paper (MECC 2022)

arXiv:2211.01951 [pdf]

Creating an Optimal Portfolio of Crops Using Price Forecasting to Increase ROI for Indian Farmers

Authors: Akshai Gaddam, Sravan Malla, Sandhya Dasari, Narayana Darapaneni, Mukesh Kumar Shukla

Abstract: The Indian agricultural sector being in a constant phase of upgradation, has been on the road to modernization for the last couple of years. The fundamental source of livelihood for over 70 percent of the population living in rural parts of the country is still agriculture. The average Indian farmer, although has access to raw and trend data pertaining to crop prices, yield and demand from Indian… ▽ More The Indian agricultural sector being in a constant phase of upgradation, has been on the road to modernization for the last couple of years. The fundamental source of livelihood for over 70 percent of the population living in rural parts of the country is still agriculture. The average Indian farmer, although has access to raw and trend data pertaining to crop prices, yield and demand from Indian government and private websites, still struggles to make the right choices. They are constantly faced with the dilemma of choosing the right crop to address market demand and fetch them a decent profit. Since the process of shortlisting crops amongst the many suitable ones isn't completely scientific and usually dictated by area tradition, this project has aimed to address that issue by forecasting the price of those crops and uses that to create an optimal portfolio that the farmers can obtain to arrive at a data-driven decision for crop selection with optimal estimated ROI. △ Less

Submitted 24 October, 2022; originally announced November 2022.

Comments: 14 pages

arXiv:2210.16459 [pdf, other]

doi 10.1007/JHEP07(2023)094

Non-Gaussianities in generalized non-local $R^2$-like inflation

Authors: Alexey S. Koshelev, K. Sravan Kumar, Alexei A. Starobinsky

Abstract: In [1], a most general higher curvature non-local gravity action was derived that admits a particular $R^2$-like inflationary solution predicting the spectral index of primordial scalar perturbations $n_s(N)\approx 1-\frac{2}{N}$, where $N$ is the number of e-folds before the end of inflation, $N\gg 1$, any value of the tensor-to-scalar ratio $r(N)<0.036$ and the tensor tilt $n_t(N)$ violating the… ▽ More In [1], a most general higher curvature non-local gravity action was derived that admits a particular $R^2$-like inflationary solution predicting the spectral index of primordial scalar perturbations $n_s(N)\approx 1-\frac{2}{N}$, where $N$ is the number of e-folds before the end of inflation, $N\gg 1$, any value of the tensor-to-scalar ratio $r(N)<0.036$ and the tensor tilt $n_t(N)$ violating the $r= -8n_t$ condition. In this paper, we compute scalar primordial non-Gaussianities (PNGs) in this theory and effectively demonstrate that higher curvature non-local terms lead to reduced bispectrum $f_{\rm NL}\left( k_1,\,k_2,\,k_3 \right)$ mimicking several classes of scalar field models of inflation known in the literature. We obtain $\vert f_{\rm NL}\vert \sim O(1-10)$ in the equilateral, orthogonal, and squeezed limits and the running of these PNGs measured by the quantity $\vert\frac{d\ln f_{\rm NL}}{d\ln k}\vert\lesssim 1$. Such PNGs are sufficiently large to be measurable by future CMB and Large Scale Structure observations, thus providing a possibility to probe the nature of quantum gravity. Furthermore, we demonstrate that the $R^2$-like inflation in non-local modification of gravity brings non-trivial predictions which go beyond the current status of effective field theories (EFTs) of single field, quasi-single field and multiple field inflation. A distinguishable feature of non-local $R^2$-like inflation compared to local EFTs is that we can have running of PNGs at least an order of magnitude higher. In summary, through our generalized non-local $R^2$-like inflation, we obtain a robust geometric framework of inflation that can explain any detection of observable quantities related to scalar PNGs. △ Less

Submitted 17 July, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

Comments: 31 pages, 7 figures, discussions are improved, the abstract is slightly extended, matches with the version published in JHEP

Journal ref: JHEP 07 (2023) 094

arXiv:2210.14743 [pdf, other]

DEEPFAKE CLI: Accelerated Deepfake Detection using FPGAs

Authors: Omkar Bhilare, Rahul Singh, Vedant Paranjape, Sravan Chittupalli, Shraddha Suratkar, Faruk Kazi

Abstract: Because of the availability of larger datasets and recent improvements in the generative model, more realistic Deepfake videos are being produced each day. People consume around one billion hours of video on social media platforms every day, and thats why it is very important to stop the spread of fake videos as they can be damaging, dangerous, and malicious. There has been a significant improveme… ▽ More Because of the availability of larger datasets and recent improvements in the generative model, more realistic Deepfake videos are being produced each day. People consume around one billion hours of video on social media platforms every day, and thats why it is very important to stop the spread of fake videos as they can be damaging, dangerous, and malicious. There has been a significant improvement in the field of deepfake classification, but deepfake detection and inference have remained a difficult task. To solve this problem in this paper, we propose a novel DEEPFAKE C-L-I (Classification-Localization-Inference) in which we have explored the idea of accelerating Quantized Deepfake Detection Models using FPGAs due to their ability of maximum parallelism and energy efficiency compared to generalized GPUs. In this paper, we have used light MesoNet with EFF-YNet structure and accelerated it on VCK5000 FPGA, powered by state-of-the-art VC1902 Versal Architecture which uses AI, DSP, and Adaptable Engines for acceleration. We have benchmarked our inference speed with other state-of-the-art inference nodes, got 316.8 FPS on VCK5000 while maintaining 93\% Accuracy. △ Less

Submitted 26 October, 2022; originally announced October 2022.

Comments: This preprint has not undergone peer review or any post-submission improvement or corrections. The Version of Record of this contribution is published in LNCS [13798], PDCAT 2022 , and is available online at [https://doi.org/10.1007/ISBN ]

arXiv:2210.09510 [pdf, other]

Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting

Authors: Saket Dingliwal, Monica Sunkara, Sravan Bodapati, Srikanth Ronanki, Jeff Farris, Katrin Kirchhoff

Abstract: End-to-end speech recognition models trained using joint Connectionist Temporal Classification (CTC)-Attention loss have gained popularity recently. In these models, a non-autoregressive CTC decoder is often used at inference time due to its speed and simplicity. However, such models are hard to personalize because of their conditional independence assumption that prevents output tokens from previ… ▽ More End-to-end speech recognition models trained using joint Connectionist Temporal Classification (CTC)-Attention loss have gained popularity recently. In these models, a non-autoregressive CTC decoder is often used at inference time due to its speed and simplicity. However, such models are hard to personalize because of their conditional independence assumption that prevents output tokens from previous time steps to influence future predictions. To tackle this, we propose a novel two-way approach that first biases the encoder with attention over a predefined list of rare long-tail and out-of-vocabulary (OOV) words and then uses dynamic boosting and phone alignment network during decoding to further bias the subword predictions. We evaluate our approach on open-source VoxPopuli and in-house medical datasets to showcase a 60% improvement in F1 score on domain-specific rare words over a strong CTC baseline. △ Less

Submitted 13 November, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

Comments: To appear in SLT 2022

arXiv:2209.15614 [pdf, other]

doi 10.1109/ISIT50566.2022.9834589

TinyTurbo: Efficient Turbo Decoders on Edge

Authors: S Ashwin Hebbar, Rajesh K Mishra, Sravan Kumar Ankireddy, Ashok V Makkuva, Hyeji Kim, Pramod Viswanath

Abstract: In this paper, we introduce a neural-augmented decoder for Turbo codes called TINYTURBO . TINYTURBO has complexity comparable to the classical max-log-MAP algorithm but has much better reliability than the max-log-MAP baseline and performs close to the MAP algorithm. We show that TINYTURBO exhibits strong robustness on a variety of practical channels of interest, such as EPA and EVA channels, whic… ▽ More In this paper, we introduce a neural-augmented decoder for Turbo codes called TINYTURBO . TINYTURBO has complexity comparable to the classical max-log-MAP algorithm but has much better reliability than the max-log-MAP baseline and performs close to the MAP algorithm. We show that TINYTURBO exhibits strong robustness on a variety of practical channels of interest, such as EPA and EVA channels, which are included in the LTE standards. We also show that TINYTURBO strongly generalizes across different rate, blocklengths, and trellises. We verify the reliability and efficiency of TINYTURBO via over-the-air experiments. △ Less

Submitted 30 September, 2022; originally announced September 2022.

Comments: 10 pages, 6 figures. Published at the 2022 IEEE International Symposium on Information Theory (ISIT)

Journal ref: "TinyTurbo: Efficient Turbo Decoders on Edge," 2022 IEEE International Symposium on Information Theory (ISIT), 2022, pp. 2797-2802

arXiv:2209.11908 [pdf, other]

Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations

Authors: Letian Chen, Sravan Jayanthi, Rohan Paleja, Daniel Martin, Viacheslav Zakharov, Matthew Gombolay

Abstract: Learning from Demonstration (LfD) approaches empower end-users to teach robots novel tasks via demonstrations of the desired behaviors, democratizing access to robotics. However, current LfD frameworks are not capable of fast adaptation to heterogeneous human demonstrations nor the large-scale deployment in ubiquitous robotics applications. In this paper, we propose a novel LfD framework, Fast Lif… ▽ More Learning from Demonstration (LfD) approaches empower end-users to teach robots novel tasks via demonstrations of the desired behaviors, democratizing access to robotics. However, current LfD frameworks are not capable of fast adaptation to heterogeneous human demonstrations nor the large-scale deployment in ubiquitous robotics applications. In this paper, we propose a novel LfD framework, Fast Lifelong Adaptive Inverse Reinforcement learning (FLAIR). Our approach (1) leverages learned strategies to construct policy mixtures for fast adaptation to new demonstrations, allowing for quick end-user personalization, (2) distills common knowledge across demonstrations, achieving accurate task inference; and (3) expands its model only when needed in lifelong deployments, maintaining a concise set of prototypical strategies that can approximate all behaviors via policy mixtures. We empirically validate that FLAIR achieves adaptability (i.e., the robot adapts to heterogeneous, user-specific task preferences), efficiency (i.e., the robot achieves sample-efficient adaptation), and scalability (i.e., the model grows sublinearly with the number of demonstrations while maintaining high performance). FLAIR surpasses benchmarks across three control tasks with an average 57% improvement in policy returns and an average 78% fewer episodes required for demonstration modeling using policy mixtures. Finally, we demonstrate the success of FLAIR in a table tennis task and find users rate FLAIR as having higher task (p<.05) and personalization (p<.05) performance. △ Less

Submitted 27 May, 2025; v1 submitted 23 September, 2022; originally announced September 2022.

Journal ref: Proceedings of Conference on Robot Learning (CoRL) 2022

arXiv:2209.05302 [pdf, other]

Unified State Representation Learning under Data Augmentation

Authors: Taylor Hearn, Sravan Jayanthi, Sehoon Ha

Abstract: The capacity for rapid domain adaptation is important to increasing the applicability of reinforcement learning (RL) to real world problems. Generalization of RL agents is critical to success in the real world, yet zero-shot policy transfer is a challenging problem since even minor visual changes could make the trained agent completely fail in the new task. We propose USRA: Unified State Represent… ▽ More The capacity for rapid domain adaptation is important to increasing the applicability of reinforcement learning (RL) to real world problems. Generalization of RL agents is critical to success in the real world, yet zero-shot policy transfer is a challenging problem since even minor visual changes could make the trained agent completely fail in the new task. We propose USRA: Unified State Representation Learning under Data Augmentation, a representation learning framework that learns a latent unified state representation by performing data augmentations on its observations to improve its ability to generalize to unseen target domains. We showcase the success of our approach on the DeepMind Control Generalization Benchmark for the Walker environment and find that USRA achieves higher sample efficiency and 14.3% better domain adaptation performance compared to the best baseline results. △ Less

Submitted 12 September, 2022; originally announced September 2022.

Comments: 5 pages, 3 figures, 1 table, Georgia Tech CS 8803: Deep Reinforcement Learning for Intelligent Control

arXiv:2209.03928 [pdf, other]

Parity asymmetry of primordial scalar and tensor power spectra

Authors: K. Sravan Kumar, João Marto

Abstract: Although the cosmic microwave background (CMB) is largely understood to be homogeneous and isotropic, the CMB angular power spectra present anomalies that seem to break down parity symmetry at large angular scales. We argue that the primordial scalar and tensor power spectra can be parity asymmetric in our new construction of inflationary quantum fluctuations. Our formulation stems from the founda… ▽ More Although the cosmic microwave background (CMB) is largely understood to be homogeneous and isotropic, the CMB angular power spectra present anomalies that seem to break down parity symmetry at large angular scales. We argue that the primordial scalar and tensor power spectra can be parity asymmetric in our new construction of inflationary quantum fluctuations. Our formulation stems from the foundational questions of quantum field theory in curved spacetime in which we impose geometric superselection rules to the vacuum structure for (single-field) inflationary quantum fluctuations based on discrete spacetime transformations ($\mathcal{P}\mathcal{T}$). As a result, we estimate the amplitude of power asymmetry in the scalar and tensor sectors at different scales of $ 10^{-4} {\rm Mpc^{-1}}\lesssim k\lesssim 10^{-3}{\rm Mpc^{-1}}$. In particular, we predict the parity asymmetry for the primordial gravitational waves (PGWs) and quantify it for different models, like Starobinsky and $α-$attractor single-field inflationary scenarios. △ Less

Submitted 10 November, 2024; v1 submitted 8 September, 2022; originally announced September 2022.

Comments: 14 pages, 3 figures. One new figure and two appendices are added, and additional discussions are included

arXiv:2209.02515 [pdf, other]

doi 10.1007/JHEP07(2023)146

Generalized non-local $R^2$-like inflation

Authors: Alexey S. Koshelev, K. Sravan Kumar, Alexei A. Starobinsky

Abstract: The $R^2$ inflation which is an extension of general relativity (GR) by quadratic scalar curvature introduces a quasi-de Sitter expansion of the early Universe governed by Ricci scalar being an eigenmode of d'Alembertian operator. In this paper, we derive a most general theory of gravity admitting $R^2$ inflationary solution which turned out to be higher curvature non-local extension of GR. We stu… ▽ More The $R^2$ inflation which is an extension of general relativity (GR) by quadratic scalar curvature introduces a quasi-de Sitter expansion of the early Universe governed by Ricci scalar being an eigenmode of d'Alembertian operator. In this paper, we derive a most general theory of gravity admitting $R^2$ inflationary solution which turned out to be higher curvature non-local extension of GR. We study in detail inflationary perturbations in this theory and analyse the structure of form-factors that leads to a massive scalar (scalaron) and massless tensor degrees of freedom. We argue that the theory contains only finite number of free parameters which can be fixed by cosmological observations. We derive predictions of our generalized non-local $R^2$-like inflation and obtain the scalar spectral index $n_s\approx 1-\frac{2}{N}$ and any value of the tensor-to-scalar ratio $r<0.036$. In this theory, tensor spectral index can be either positive or negative $n_t\lessgtr 0$ and the well-known consistency relation $r = -8n_t$ is violated in a non-trivial way. We also compute running of the tensor spectral index and discuss observational implications to distinguish this model from several classes of scalar field models of inflation. These predictions allow us to probe the nature of quantum gravity in the scope of future CMB and gravitational wave observations. Finally we comment on how the features of generalized non-local $R^2$-like inflation cannot be captured by established notions of the so-called effective field theory of single field inflation and how we must redefine the way we pursue inflationary cosmology. △ Less

Submitted 20 July, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

Comments: 37 pages, 3 figures, Discussions extended, typos corrected, matches with the version published in JHEP

Journal ref: JHEP 07(2023) 146

arXiv:2208.14436 [pdf, other]

doi 10.1093/mnras/stac2360

Hyperon bulk viscosity and $r$-modes of neutron stars

Authors: O. P. Jyothilakshmi, P. E. Sravan Krishnan, Prashant Thakur, V. Sreekanth, T. K. Jha

Abstract: We propose and apply a new parameterization of the modified chiral effective model to study rotating neutron stars with hyperon cores in the framework of the relativistic mean-field theory. The inclusion of mesonic cross couplings in the model has improved the density content of the symmetry energy slope parameters, which are in agreement with the findings from recent terrestrial experiments. The… ▽ More We propose and apply a new parameterization of the modified chiral effective model to study rotating neutron stars with hyperon cores in the framework of the relativistic mean-field theory. The inclusion of mesonic cross couplings in the model has improved the density content of the symmetry energy slope parameters, which are in agreement with the findings from recent terrestrial experiments. The bulk viscosity of the hyperonic medium is analyzed to investigate its role in the suppression of gravitationally driven $r$-modes. The hyperonic bulk viscosity coefficient caused by non-leptonic weak interactions and the corresponding damping timescales are calculated and the $r$-mode instability windows are obtained. The present model predicts a significant reduction of the unstable region due to a more effective damping of oscillations. We find that from $\sim 10^8$ K to $\sim 10^{9}$ K, hyperonic bulk viscosity completely suppresses the $r$-modes leading to a stable region between the instability windows. Our analysis indicates that the instability can reduce the angular velocity of the star up to $\sim$0.3~$Ω_K$, where $Ω_K$ is the Kepler frequency of the star. △ Less

Submitted 30 August, 2022; originally announced August 2022.

Comments: 9 pages, 9 figures; Accepted for publication in MNRAS

arXiv:2208.01254 [pdf, other]

A Robust Morphological Approach for Semantic Segmentation of Very High Resolution Images

Authors: Siddharth Saravanan, Aditya Challa, Sravan Danda

Abstract: State-of-the-art methods for semantic segmentation of images involve computationally intensive neural network architectures. Most of these methods are not adaptable to high-resolution image segmentation due to memory and other computational issues. Typical approaches in literature involve design of neural network architectures that can fuse global information from low-resolution images and local i… ▽ More State-of-the-art methods for semantic segmentation of images involve computationally intensive neural network architectures. Most of these methods are not adaptable to high-resolution image segmentation due to memory and other computational issues. Typical approaches in literature involve design of neural network architectures that can fuse global information from low-resolution images and local information from the high-resolution counterparts. However, architectures designed for processing high resolution images are unnecessarily complex and involve a lot of hyper parameters that can be difficult to tune. Also, most of these architectures require ground truth annotations of the high resolution images to train, which can be hard to obtain. In this article, we develop a robust pipeline based on mathematical morphological (MM) operators that can seamlessly extend any existing semantic segmentation algorithm to high resolution images. Our method does not require the ground truth annotations of the high resolution images. It is based on efficiently utilizing information from the low-resolution counterparts, and gradient information on the high-resolution images. We obtain high quality seeds from the inferred labels on low-resolution images using traditional morphological operators and propagate seed labels using a random walker to refine the semantic labels at the boundaries. We show that the semantic segmentation results obtained by our method beat the existing state-of-the-art algorithms on high-resolution images. We empirically prove the robustness of our approach to the hyper parameters used in our pipeline. Further, we characterize some necessary conditions under which our pipeline is applicable and provide an in-depth analysis of the proposed approach. △ Less

Submitted 26 October, 2023; v1 submitted 2 August, 2022; originally announced August 2022.

Comments: Under review at Computer Vision and Image Understanding

arXiv:2207.09809 [pdf, other]

Construction and analysis of surface phase diagrams to describe segregation and dissolution behavior of Al and Ca in Mg alloys

Authors: Jing Yang, K. B. Sravan Kumar, Mira Todorova, Jörg Neugebauer

Abstract: Segregation and dissolution behavior of Mg alloyed with Ca and Al are studied by performing density functional theory calculations considering an extensive set of surface structures and compositions. Combining ab initio surface science approaches with cluster expansion for ordered surface structures we construct surface phase diagrams for these alloys. We utilize these diagrams to study segregatio… ▽ More Segregation and dissolution behavior of Mg alloyed with Ca and Al are studied by performing density functional theory calculations considering an extensive set of surface structures and compositions. Combining ab initio surface science approaches with cluster expansion for ordered surface structures we construct surface phase diagrams for these alloys. We utilize these diagrams to study segregation phenomena and chemical trends for surfaces in contact with a dry environment or with an aqueous electrolyte. We show that the presence of water dramatically impacts the stability and chemical composition of the considered metallic surfaces. We furthermore find that the two alloying elements behave qualitatively different: whereas Ca strongly segregates to the surface and becomes dissolved upon exposure of the surface to water, Al shows an anti-segregation behavior, i.e., it remains in Mg bulk. These findings provide an explanation for the experimentally observed increase/decrease in corrosion rates when alloying Mg with Al/Ca. △ Less

Submitted 10 July, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

Comments: 12 pages, 9 figures, submitted to Phys. Rev. Materials

arXiv:2207.09054 [pdf, other]

doi 10.1109/TAP.2019.2938704

Towards a Low-SWaP 1024-beam Digital Array: A 32-beam Sub-system at 5.8 GHz

Authors: Arjuna Madanayake, Viduneth Ariyarathna, Suresh Madishetty, Sravan Pulipati, R. J. Cintra, Diego Coelho, Raíza Oliveira, Fábio M. Bayer, Leonid Belostotski, Soumyajit Mandal, Theodore S. Rappaport

Abstract: Millimeter wave communications require multibeam beamforming in order to utilize wireless channels that suffer from obstructions, path loss, and multi-path effects. Digital multibeam beamforming has maximum degrees of freedom compared to analog phased arrays. However, circuit complexity and power consumption are important constraints for digital multibeam systems. A low-complexity digital computin… ▽ More Millimeter wave communications require multibeam beamforming in order to utilize wireless channels that suffer from obstructions, path loss, and multi-path effects. Digital multibeam beamforming has maximum degrees of freedom compared to analog phased arrays. However, circuit complexity and power consumption are important constraints for digital multibeam systems. A low-complexity digital computing architecture is proposed for a multiplication-free 32-point linear transform that approximates multiple simultaneous RF beams similar to a discrete Fourier transform (DFT). Arithmetic complexity due to multiplication is reduced from the FFT complexity of $\mathcal{O}(N\: \log N)$ for DFT realizations, down to zero, thus yielding a 46% and 55% reduction in chip area and dynamic power consumption, respectively, for the $N=32$ case considered. The paper describes the proposed 32-point DFT approximation targeting a 1024-beams using a 2D array, and shows the multiplierless approximation and its mapping to a 32-beam sub-system consisting of 5.8 GHz antennas that can be used for generating 1024 digital beams without multiplications. Real-time beam computation is achieved using a Xilinx FPGA at 120 MHz bandwidth per beam. Theoretical beam performance is compared with measured RF patterns from both a fixed-point FFT as well as the proposed multiplier-free algorithm and are in good agreement. △ Less

Submitted 29 May, 2024; v1 submitted 18 July, 2022; originally announced July 2022.

Comments: 22 pages, 8 figures, 3 tables. Fixed typo in Table 1

Journal ref: IEEE Transactions on Antennas and Propagation, v. 68, n. 2, Feb. 2020

arXiv:2207.00532 [pdf, other]

QoE-Centric Multi-User mmWave Scheduling: A Beam Alignment and Buffer Predictive Approach

Authors: Babak Badnava, Sravan Reddy Chintareddy, Morteza Hashemi

Abstract: In this paper, we consider the multi-user scheduling problem in millimeter wave (mmWave) video streaming networks, which comprise a streaming server and several users, each requesting a video stream with a different resolution. The main objective is to optimize the long-term average quality of experience (QoE) for all users. We tackle this problem by considering the physical layer characteristics… ▽ More In this paper, we consider the multi-user scheduling problem in millimeter wave (mmWave) video streaming networks, which comprise a streaming server and several users, each requesting a video stream with a different resolution. The main objective is to optimize the long-term average quality of experience (QoE) for all users. We tackle this problem by considering the physical layer characteristics of the mmWave network, including the beam alignment overhead due to pencil-beams. To develop an efficient scheduling policy, we leverage the contextual multi-armed bandit (MAB) models to propose a beam alignment overhead and buffer predictive streaming solution, dubbed B2P-Stream. The proposed B2P-Stream algorithm optimally balances the trade-off between the overhead and users' buffer levels and improves the QoE by reducing the beam alignment overhead for users of higher resolutions. We also provide a theoretical guarantee for our proposed method and prove that it guarantees a sub-linear regret bound. Finally, we examine our proposed framework through extensive simulations. We provide a detailed comparison of the B2P-Stream against uniformly random and Round-robin (RR) policies and show that it outperforms both of them in providing a better QoE and fairness. We also analyze the scalability and robustness of the B2P-Stream algorithm with different network configurations. △ Less

Submitted 1 July, 2022; originally announced July 2022.

arXiv:2205.10684 [pdf, other]

Interpreting Neural Min-Sum Decoders

Authors: Sravan Kumar Ankireddy, Hyeji Kim

Abstract: In decoding linear block codes, it was shown that noticeable reliability gains can be achieved by introducing learnable parameters to the Belief Propagation (BP) decoder. Despite the success of these methods, there are two key open problems. The first is the lack of interpretation of the learned weights, and the other is the lack of analysis for non-AWGN channels. In this work, we aim to bridge th… ▽ More In decoding linear block codes, it was shown that noticeable reliability gains can be achieved by introducing learnable parameters to the Belief Propagation (BP) decoder. Despite the success of these methods, there are two key open problems. The first is the lack of interpretation of the learned weights, and the other is the lack of analysis for non-AWGN channels. In this work, we aim to bridge this gap by providing insights into the weights learned and their connection to the structure of the underlying code. We show that the weights are heavily influenced by the distribution of short cycles in the code. We next look at the performance of these decoders in non-AWGN channels, both synthetic and over-the-air channels, and study the complexity vs. performance trade-offs, demonstrating that increasing the number of parameters helps significantly in complex channels. Finally, we show that the decoders with learned weights achieve higher reliability than those with weights optimized analytically under the Gaussian approximation. △ Less

Submitted 11 April, 2023; v1 submitted 21 May, 2022; originally announced May 2022.

Comments: 7 pages, 8 figures, Accepted to IEEE International Conference on Communications (ICC) 2023

arXiv:2205.08513 [pdf, other]

doi 10.1038/s41467-023-43932-6

An updated nuclear-physics and multi-messenger astrophysics framework for binary neutron star mergers

Authors: Peter T. H. Pang, Tim Dietrich, Michael W. Coughlin, Mattia Bulla, Ingo Tews, Mouza Almualla, Tyler Barna, Weizmann Kiendrebeogo, Nina Kunert, Gargi Mansingh, Brandon Reed, Niharika Sravan, Andrew Toivonen, Sarah Antier, Robert O. VandenBerg, Jack Heinzel, Vsevolod Nedora, Pouyan Salehi, Ritwik Sharma, Rahul Somasundaram, Chris Van Den Broeck

Abstract: The multi-messenger detection of the gravitational-wave signal GW170817, the corresponding kilonova AT2017gfo and the short gamma-ray burst GRB170817A, as well as the observed afterglow has delivered a scientific breakthrough. For an accurate interpretation of all these different messengers, one requires robust theoretical models that describe the emitted gravitational-wave, the electromagnetic em… ▽ More The multi-messenger detection of the gravitational-wave signal GW170817, the corresponding kilonova AT2017gfo and the short gamma-ray burst GRB170817A, as well as the observed afterglow has delivered a scientific breakthrough. For an accurate interpretation of all these different messengers, one requires robust theoretical models that describe the emitted gravitational-wave, the electromagnetic emission, and dense matter reliably. In addition, one needs efficient and accurate computational tools to ensure a correct cross-correlation between the models and the observational data. For this purpose, we have developed the Nuclear-physics and Multi-Messenger Astrophysics framework NMMA. The code allows incorporation of nuclear-physics constraints at low densities as well as X-ray and radio observations of isolated neutron stars. In previous works, the NMMA code has allowed us to constrain the equation of state of supranuclear dense matter, to measure the Hubble constant, and to compare dense-matter physics probed in neutron-star mergers and in heavy-ion collisions, and to classify electromagnetic observations and perform model selection. Here, we show an extension of the NMMA code as a first attempt of analyzing the gravitational-wave signal, the kilonova, and the gamma-ray burst afterglow simultaneously. Incorporating all available information, we estimate the radius of a $1.4M_\odot$ neutron star to be $R=11.98^{+0.35}_{-0.40}$km. △ Less

Submitted 8 January, 2024; v1 submitted 17 May, 2022; originally announced May 2022.

Comments: code available at https://github.com/nuclear-multimessenger-astronomy

Report number: LA-UR-22-23872, LIGO-P2200150

Journal ref: Nature Commun. 14 (2023) 1, 8352

arXiv:2205.05967 [pdf, other]

Target Aware Network Architecture Search and Compression for Efficient Knowledge Transfer

Authors: S. H. Shabbeer Basha, Debapriya Tula, Sravan Kumar Vinakota, Shiv Ram Dubey

Abstract: Transfer Learning enables Convolutional Neural Networks (CNN) to acquire knowledge from a source domain and transfer it to a target domain, where collecting large-scale annotated examples is time-consuming and expensive. Conventionally, while transferring the knowledge learned from one task to another task, the deeper layers of a pre-trained CNN are finetuned over the target dataset. However, thes… ▽ More Transfer Learning enables Convolutional Neural Networks (CNN) to acquire knowledge from a source domain and transfer it to a target domain, where collecting large-scale annotated examples is time-consuming and expensive. Conventionally, while transferring the knowledge learned from one task to another task, the deeper layers of a pre-trained CNN are finetuned over the target dataset. However, these layers are originally designed for the source task which may be over-parameterized for the target task. Thus, finetuning these layers over the target dataset may affect the generalization ability of the CNN due to high network complexity. To tackle this problem, we propose a two-stage framework called TASCNet which enables efficient knowledge transfer. In the first stage, the configuration of the deeper layers is learned automatically and finetuned over the target dataset. Later, in the second stage, the redundant filters are pruned from the fine-tuned CNN to decrease the network's complexity for the target task while preserving the performance. This two-stage mechanism finds a compact version of the pre-trained CNN with optimal structure (number of filters in a convolutional layer, number of neurons in a dense layer, and so on) from the hypothesis space. The efficacy of the proposed method is evaluated using VGG-16, ResNet-50, and DenseNet-121 on CalTech-101, CalTech-256, and Stanford Dogs datasets. Similar to computer vision tasks, we have also conducted experiments on Movie Review Sentiment Analysis task. The proposed TASCNet reduces the computational complexity of pre-trained CNNs over the target task by reducing both trainable parameters and FLOPs which enables resource-efficient knowledge transfer. The source code is available at: https://github.com/Debapriya-Tula/TASCNet. △ Less

Submitted 24 January, 2024; v1 submitted 12 May, 2022; originally announced May 2022.

Comments: This paper is accepted for publication in Multimedia Systems Journal

arXiv:2205.02584 [pdf]

Study on the ERP Implementation Methodologies on SAP, Oracle NetSuite, and Microsoft Dynamics 365: A Review

Authors: Madabattula Archana, Dr VijayaKumar Varadarajan, Sai Sravan Medicherla

Abstract: There are Top three vendors in the ERP market: SAP, Oracle Net Suite and Microsoft dynamics 365 leading the Global ERP market.While analyzing the ERP selection and implementation trends, it is critical that any organization looking to implement an ERP system assesses the vendors through the lens of its own organization's specific requirements. When choosing the right ERP, a few things must be take… ▽ More There are Top three vendors in the ERP market: SAP, Oracle Net Suite and Microsoft dynamics 365 leading the Global ERP market.While analyzing the ERP selection and implementation trends, it is critical that any organization looking to implement an ERP system assesses the vendors through the lens of its own organization's specific requirements. When choosing the right ERP, a few things must be taken into consideration like the Time Budget and resources. The research paper analyses each phase and compares the methodologies of SAP, Oracle Net Suite and Microsoft Dynamics 365 and suggests the best methodologies to be practiced for any ERP projects. Like a poorly planned trip, if you don't have an effective methodology, you can expect a negative impact on your implementation, solution quality, and business satisfaction. Wrong choice of methodology may lead to poor decision-making, best practices may not be followed, and teams may be disjointed in the implementation, which can cause delays. Choosing the right ERP methodology is the key.Methodology is the lifeline for successful project implementation. △ Less

Submitted 26 May, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

arXiv:2204.09737 [pdf, other]

ARLIF-IDS -- Attention augmented Real-Time Isolation Forest Intrusion Detection System

Authors: Aman Priyanshu, Sarthak Shastri, Sai Sravan Medicherla

Abstract: Distributed Denial of Service (DDoS) attack is a malicious attempt to disrupt the normal traffic of a targeted server, service or network by overwhelming the target or its surrounding infrastructure with a flood of Internet traffic. Emerging technologies such as the Internet of Things and Software Defined Networking leverage lightweight strategies for the early detection of DDoS attacks. Previous… ▽ More Distributed Denial of Service (DDoS) attack is a malicious attempt to disrupt the normal traffic of a targeted server, service or network by overwhelming the target or its surrounding infrastructure with a flood of Internet traffic. Emerging technologies such as the Internet of Things and Software Defined Networking leverage lightweight strategies for the early detection of DDoS attacks. Previous literature demonstrates the utility of lower number of significant features for intrusion detection. Thus, it is essential to have a fast and effective security identification model based on low number of features. In this work, a novel Attention-based Isolation Forest Intrusion Detection System is proposed. The model considerably reduces training time and memory consumption of the generated model. For performance assessment, the model is assessed over two benchmark datasets, the NSL-KDD dataset & the KDDCUP'99 dataset. Experimental results demonstrate that the proposed attention augmented model achieves a significant reduction in execution time, by 91.78%, and an average detection F1-Score of 0.93 on the NSL-KDD and KDDCUP'99 dataset. The results of performance evaluation show that the proposed methodology has low complexity and requires less processing time and computational resources, outperforming other current IDS based on machine learning algorithms. △ Less

Submitted 20 April, 2022; originally announced April 2022.

Comments: Paper accepted at the Poster session at the 43rd IEEE Symposium on Security and Privacy

arXiv:2203.01357 [pdf, other]

doi 10.3847/2041-8213/ac5890

The Candidate Progenitor Companion Star of the Type Ib/c SN 2013ge

Authors: Ori D. Fox, Schuyler D. Van Dyk, Benjamin F. Williams, Maria Drout, Emmanouil Zapartas, Nathan Smith, Dan Milisavljevic, Jennifer E. Andrews, K. Azalee Bostroem, Alexei V. Filippenko, Sebastian Gomez, Patrick L. Kelly, S. E. de Mink, Justin Pierel, Armin Rest, Stuart Ryder, Niharika Sravan, Lou Strolger, Qinan Wang, Kathryn E. Weil

Abstract: This Letter presents the detection of a source at the position of the Type Ib/c supernova (SN) 2013ge more than four years after the radioactive component is expected to have faded. This source could mark the first post-SN direct detection of a surviving companion to a stripped-envelope Type Ib/c explosion. We test this hypothesis and find the shape of the source's spectral energy distribution is… ▽ More This Letter presents the detection of a source at the position of the Type Ib/c supernova (SN) 2013ge more than four years after the radioactive component is expected to have faded. This source could mark the first post-SN direct detection of a surviving companion to a stripped-envelope Type Ib/c explosion. We test this hypothesis and find the shape of the source's spectral energy distribution is most consistent with that of a B5 I supergiant. While binary models tend to predict OB-type stars for stripped-envelope companions, the location of the source on a color-magnitude diagram (CMD) places it redward of its more likely position on the main sequence (MS). The source may be temporarily out of thermal equilibrium, or a cool and inflated non-MS companion, which is similar to the suggested companion of Type Ib SN 2019yvr that was constrained from pre-SN imaging. We also consider other possible physical scenarios for the source, including a fading SN, circumstellar shock interaction, line of site coincidence, and an unresolved host star cluster, all of which will require future observations to more definitively rule out. Ultimately, the fraction of surviving companions ("binary fraction") will provide necessary constraints on binary evolution models and the underlying physics. △ Less

Submitted 2 March, 2022; originally announced March 2022.

Comments: Accepted to ApJL. 9 pages, 4 figures, 1 table

arXiv:2202.13502 [pdf, other]

doi 10.1109/LGRS.2022.3173793

ESW Edge-Weights : Ensemble Stochastic Watershed Edge-Weights for Hyperspectral Image Classification

Authors: Rohan Agarwal, Aman Aziz, Aditya Suraj Krishnan, Aditya Challa, Sravan Danda

Abstract: Hyperspectral image (HSI) classification is a topic of active research. One of the main challenges of HSI classification is the lack of reliable labelled samples. Various semi-supervised and unsupervised classification methods are proposed to handle the low number of labelled samples. Chief among them are graph convolution networks (GCN) and their variants. These approaches exploit the graph struc… ▽ More Hyperspectral image (HSI) classification is a topic of active research. One of the main challenges of HSI classification is the lack of reliable labelled samples. Various semi-supervised and unsupervised classification methods are proposed to handle the low number of labelled samples. Chief among them are graph convolution networks (GCN) and their variants. These approaches exploit the graph structure for semi-supervised and unsupervised classification. While several of these methods implicitly construct edge-weights, to our knowledge, not much work has been done to estimate the edge-weights explicitly. In this article, we estimate the edge-weights explicitly and use them for the downstream classification tasks - both semi-supervised and unsupervised. The proposed edge-weights are based on two key insights - (a) Ensembles reduce the variance and (b) Classes in HSI datasets and feature similarity have only one-sided implications. That is, while same classes would have similar features, similar features do not necessarily imply the same classes. Exploiting these, we estimate the edge-weights using an aggregate of ensembles of watersheds over subsamples of features. These edge weights are evaluated for both semi-supervised and unsupervised classification tasks. The evaluation for semi-supervised tasks uses Random-Walk based approach. For the unsupervised case, we use a simple filter using a graph convolution network (GCN). In both these cases, the proposed edge weights outperform the traditional approaches to compute edge-weights - Euclidean distances and cosine similarities. Fascinatingly, with the proposed edge-weights, the simplest GCN obtained results comparable to the recent state-of-the-art. △ Less

Submitted 27 February, 2022; originally announced February 2022.

Comments: This article is under review at Geoscience and Remote Sensing Letters. Copyright could be transferred at any time

arXiv:2202.07014 [pdf, other]

Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration

Authors: Sravan Jayanthi, Letian Chen, Matthew Gombolay

Abstract: Learning from Demonstration (LfD) approaches empower end-users to teach robots novel tasks via demonstrations of the desired behaviors, democratizing access to robotics. A key challenge in LfD research is that users tend to provide heterogeneous demonstrations for the same task due to various strategies and preferences. Therefore, it is essential to develop LfD algorithms that ensure \textit{flexi… ▽ More Learning from Demonstration (LfD) approaches empower end-users to teach robots novel tasks via demonstrations of the desired behaviors, democratizing access to robotics. A key challenge in LfD research is that users tend to provide heterogeneous demonstrations for the same task due to various strategies and preferences. Therefore, it is essential to develop LfD algorithms that ensure \textit{flexibility} (the robot adapts to personalized strategies), \textit{efficiency} (the robot achieves sample-efficient adaptation), and \textit{scalability} (robot reuses a concise set of strategies to represent a large amount of behaviors). In this paper, we propose a novel algorithm, Dynamic Multi-Strategy Reward Distillation (DMSRD), which distills common knowledge between heterogeneous demonstrations, leverages learned strategies to construct mixture policies, and continues to improve by learning from all available data. Our personalized, federated, and lifelong LfD architecture surpasses benchmarks in two continuous control problems with an average 77\% improvement in policy returns and 42\% improvement in log likelihood, alongside stronger task reward correlation and more precise strategy rewards. △ Less

Submitted 14 February, 2022; originally announced February 2022.

Comments: Accepted at the AAAI-22 Workshop on Interactive Machine Learning (IML@AAAI'22)

arXiv:2112.08718 [pdf, other]

Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems

Authors: Saket Dingliwal, Ashish Shenoy, Sravan Bodapati, Ankur Gandhe, Ravi Teja Gadde, Katrin Kirchhoff

Abstract: Automatic Speech Recognition (ASR) systems have found their use in numerous industrial applications in very diverse domains creating a need to adapt to new domains with small memory and deployment overhead. In this work, we introduce domain-prompts, a methodology that involves training a small number of domain embedding parameters to prime a Transformer-based Language Model (LM) to a particular do… ▽ More Automatic Speech Recognition (ASR) systems have found their use in numerous industrial applications in very diverse domains creating a need to adapt to new domains with small memory and deployment overhead. In this work, we introduce domain-prompts, a methodology that involves training a small number of domain embedding parameters to prime a Transformer-based Language Model (LM) to a particular domain. Using this domain-adapted LM for rescoring ASR hypotheses can achieve 7-13% WER reduction for a new domain with just 1000 unlabeled textual domain-specific sentences. This improvement is comparable or even better than fully fine-tuned models even though just 0.02% of the parameters of the base LM are updated. Additionally, our method is deployment-friendly as the learnt domain embeddings are prefixed to the input to the model rather than changing the base model architecture. Therefore, our method is an ideal choice for on-the-fly adaptation of LMs used in ASR systems to progressively scale it to new domains. △ Less

Submitted 21 July, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

Comments: Accepted at InterSpeech 2022

arXiv:2112.05897 [pdf, other]

Autonomous real-time science-driven follow-up of survey transients

Authors: Niharika Sravan, Matthew J. Graham, Christoffer Fremling, Michael W. Coughlin

Abstract: Astronomical surveys continue to provide unprecedented insights into the time-variable Universe and will remain the source of groundbreaking discoveries for years to come. However, their data throughput has overwhelmed the ability to manually synthesize alerts for devising and coordinating necessary follow-up with limited resources. The advent of Rubin Observatory, with alert volumes an order of m… ▽ More Astronomical surveys continue to provide unprecedented insights into the time-variable Universe and will remain the source of groundbreaking discoveries for years to come. However, their data throughput has overwhelmed the ability to manually synthesize alerts for devising and coordinating necessary follow-up with limited resources. The advent of Rubin Observatory, with alert volumes an order of magnitude higher at otherwise sparse cadence, presents an urgent need to overhaul existing human-centered protocols in favor of machine-directed infrastructure for conducting science inference and optimally planning expensive follow-up observations. We present the first implementation of autonomous real-time science-driven follow-up using value iteration to perform sequential experiment design. We demonstrate it for strategizing photometric augmentation of Zwicky Transient Facility Type Ia supernova light-curves given the goal of minimizing SALT2 parameter uncertainties. We find a median improvement of 2-6% for SALT2 parameters and 3-11% for photometric redshift with 2-7 additional data points in g, r and/or i compared to random augmentation. The augmentations are automatically strategized to complete gaps and for resolving phases with high constraining power (e.g. around peaks). We suggest that such a technique can deliver higher impact during the era of Rubin Observatory for precision cosmology at high redshift and can serve as the foundation for the development of general-purpose resource allocation systems. △ Less

Submitted 26 January, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

Comments: Accepted for publication in Ninth International Conference on Big Data Analytics in Astronomy, Science and Engineering

arXiv:2111.04291 [pdf, other]

Non-local $R^2$-like inflation, Gravitational Waves and Non-Gaussianities

Authors: K. Sravan Kumar

Abstract: The emergence of $R^2$ (Starobinsky) inflation from the semi-classical modification of gravity due to matter quantum fields (trace anomaly) clearly points out the importance of fundamental physics and the first principles in the construction of successful cosmological models. Along with the observational success, $R^2$ gravity is also an important step beyond general relativity (GR) towards quantu… ▽ More The emergence of $R^2$ (Starobinsky) inflation from the semi-classical modification of gravity due to matter quantum fields (trace anomaly) clearly points out the importance of fundamental physics and the first principles in the construction of successful cosmological models. Along with the observational success, $R^2$ gravity is also an important step beyond general relativity (GR) towards quantum gravity. Furthermore, several approaches of quantum gravity to date are strongly indicating the presence of non-locality at small time and length scales. In this regard, ultraviolet (UV) completion of $R^2$ inflation has been recently studied in a string theory-inspired ghost-free analytic non-local gravity. We discuss the promising theoretical predictions of non-local $R^2$-like inflation with respect to the key observables such as tensor-to-scalar ratio, tensor tilt which tell us about the spectrum of primordial gravitational waves, and scalar Non-Gaussianities which tell us about the three-point correlations in the CMB fluctuations. Any signature of non-local physics in the early Universe will significantly improve our understanding of fundamental physics at UV energy scales and quantum gravity. △ Less

Submitted 8 November, 2021; originally announced November 2021.

Comments: 12 pages, 6 figures, Contribution to the Proceedings of the Sixteenth Marcel Grossmann Meeting (MG16), July 5-10, 2021 based on a talk delivered at AT7 parallel session of MG16 on "Ghost-free models of modified gravity" chaired by Dmitry Gal'tsov and Michael Volkov. The article is based on the results obtained in arXiv:2005.09550 [hep-th], 2003.00629 [hep-th], 1711.08864 [hep-th]

arXiv:2110.06502 [pdf, other]

Prompt-tuning in ASR systems for efficient domain-adaptation

Authors: Saket Dingliwal, Ashish Shenoy, Sravan Bodapati, Ankur Gandhe, Ravi Teja Gadde, Katrin Kirchhoff

Abstract: Automatic Speech Recognition (ASR) systems have found their use in numerous industrial applications in very diverse domains. Since domain-specific systems perform better than their generic counterparts on in-domain evaluation, the need for memory and compute-efficient domain adaptation is obvious. Particularly, adapting parameter-heavy transformer-based language models used for rescoring ASR hypot… ▽ More Automatic Speech Recognition (ASR) systems have found their use in numerous industrial applications in very diverse domains. Since domain-specific systems perform better than their generic counterparts on in-domain evaluation, the need for memory and compute-efficient domain adaptation is obvious. Particularly, adapting parameter-heavy transformer-based language models used for rescoring ASR hypothesis is challenging. In this work, we overcome the problem using prompt-tuning, a methodology that trains a small number of domain token embedding parameters to prime a transformer-based LM to a particular domain. With just a handful of extra parameters per domain, we achieve much better perplexity scores over the baseline of using an unadapted LM. Despite being parameter-efficient, these improvements are comparable to those of fully-fine-tuned models with hundreds of millions of parameters. We replicate our findings in perplexity numbers to Word Error Rate in a domain-specific ASR system for one such domain. △ Less

Submitted 22 October, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

Comments: WeCNLP 2021 camera-ready

arXiv:2109.05092 [pdf, other]

Remember the context! ASR slot error correction through memorization

Authors: Dhanush Bekal, Ashish Shenoy, Monica Sunkara, Sravan Bodapati, Katrin Kirchhoff

Abstract: Accurate recognition of slot values such as domain specific words or named entities by automatic speech recognition (ASR) systems forms the core of the Goal-oriented Dialogue Systems. Although it is a critical step with direct impact on downstream tasks such as language understanding, many domain agnostic ASR systems tend to perform poorly on domain specific or long tail words. They are often supp… ▽ More Accurate recognition of slot values such as domain specific words or named entities by automatic speech recognition (ASR) systems forms the core of the Goal-oriented Dialogue Systems. Although it is a critical step with direct impact on downstream tasks such as language understanding, many domain agnostic ASR systems tend to perform poorly on domain specific or long tail words. They are often supplemented with slot error correcting systems but it is often hard for any neural model to directly output such rare entity words. To address this problem, we propose k-nearest neighbor (k-NN) search that outputs domain-specific entities from an explicit datastore. We improve error correction rate by conveniently augmenting a pretrained joint phoneme and text based transformer sequence to sequence model with k-NN search during inference. We evaluate our proposed approach on five different domains containing long tail slot entities such as full names, airports, street names, cities, states. Our best performing error correction model shows a relative improvement of 7.4% in word error rate (WER) on rare word entities over the baseline and also achieves a relative WER improvement of 9.8% on an out of vocabulary (OOV) test set. △ Less

Submitted 17 September, 2021; v1 submitted 10 September, 2021; originally announced September 2021.

Comments: 8 pages, 3 figures, 4 tables, Accepted to ASRU 2021

arXiv:2108.07833 [pdf, other]

An Algorithmic Safety VEST For Li-ion Batteries During Fast Charging

Authors: Peyman Mohtat, Sravan Pannala, Valentin Sulzer, Jason B. Siegel, Anna G. Stefanopoulou

Abstract: Fast charging of lithium-ion batteries is crucial to increase desirability for consumers and hence accelerate the adoption of electric vehicles. A major barrier to shorter charge times is the accelerated aging of the battery at higher charging rates, which can be driven by lithium plating, increased solid electrolyte interphase growth due to elevated temperatures, and particle cracking due to mech… ▽ More Fast charging of lithium-ion batteries is crucial to increase desirability for consumers and hence accelerate the adoption of electric vehicles. A major barrier to shorter charge times is the accelerated aging of the battery at higher charging rates, which can be driven by lithium plating, increased solid electrolyte interphase growth due to elevated temperatures, and particle cracking due to mechanical stress. Lithium plating depends on the overpotential of the negative electrode, and mechanical stress depends on the concentration gradient, both of which cannot be measured directly. Techniques based on physics-based models of the battery and optimal control algorithms have been developed to this end. While these methods show promise in reducing degradation, their optimization algorithms' complexity can limit their implementation. In this paper, we present a method based on the constant current constant voltage (CC-CV) charging scheme, called CC-CV$ησ$T (VEST). The new approach is simpler to implement and can be used with any model to impose varying levels of constraints on variables pertinent to degradation, such as plating potential and mechanical stress. We demonstrate the new CC-CV$ησ$T charging using an electrochemical model with mechanical and thermal effects included. Furthermore, we discuss how uncertainties can be accounted for by considering safety margins for the plating and stress constraints. △ Less

Submitted 17 August, 2021; originally announced August 2021.

Comments: In press; Modeling, Estimation and Control Conference 2021

arXiv:2107.07827 [pdf, other]

A Theoretical Analysis of Granulometry-based Roughness Measures on Cartosat DEMs

Authors: Nagajothi Kannan, Sravan Danda, Aditya Challa, Daya Sagar B S

Abstract: The study of water bodies such as rivers is an important problem in the remote sensing community. A meaningful set of quantitative features reflecting the geophysical properties help us better understand the formation and evolution of rivers. Typically, river sub-basins are analysed using Cartosat Digital Elevation Models (DEMs), obtained at regular time epochs. One of the useful geophysical featu… ▽ More The study of water bodies such as rivers is an important problem in the remote sensing community. A meaningful set of quantitative features reflecting the geophysical properties help us better understand the formation and evolution of rivers. Typically, river sub-basins are analysed using Cartosat Digital Elevation Models (DEMs), obtained at regular time epochs. One of the useful geophysical features of a river sub-basin is that of a roughness measure on DEMs. However, to the best of our knowledge, there is not much literature available on theoretical analysis of roughness measures. In this article, we revisit the roughness measure on DEM data adapted from multiscale granulometries in mathematical morphology, namely multiscale directional granulometric index (MDGI). This measure was classically used to obtain shape-size analysis in greyscale images. In earlier works, MDGIs were introduced to capture the characteristic surficial roughness of a river sub-basin along specific directions. Also, MDGIs can be efficiently computed and are known to be useful features for classification of river sub-basins. In this article, we provide a theoretical analysis of a MDGI. In particular, we characterize non-trivial sufficient conditions on the structure of DEMs under which MDGIs are invariant. These properties are illustrated with some fictitious DEMs. We also provide connections to a discrete derivative of volume of a DEM. Based on these connections, we provide intuition as to why a MDGI is considered a roughness measure. Further, we experimentally illustrate on Lower-Indus, Wardha, and Barmer river sub-basins that the proposed features capture the characteristics of the river sub-basin. △ Less

Submitted 16 July, 2021; originally announced July 2021.

Comments: Under review at IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

arXiv:2106.09532 [pdf, other]

doi 10.18653/v1/2021.ecnlp-1.3

ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling

Authors: Ashish Shenoy, Sravan Bodapati, Katrin Kirchhoff

Abstract: Automatic Speech Recognition (ASR) robustness toward slot entities are critical in e-commerce voice assistants that involve monetary transactions and purchases. Along with effective domain adaptation, it is intuitive that cross utterance contextual cues play an important role in disambiguating domain specific content words from speech. In this paper, we investigate various techniques to improve co… ▽ More Automatic Speech Recognition (ASR) robustness toward slot entities are critical in e-commerce voice assistants that involve monetary transactions and purchases. Along with effective domain adaptation, it is intuitive that cross utterance contextual cues play an important role in disambiguating domain specific content words from speech. In this paper, we investigate various techniques to improve contextualization, content word robustness and domain adaptation of a Transformer-XL neural language model (NLM) to rescore ASR N-best hypotheses. To improve contextualization, we utilize turn level dialogue acts along with cross utterance context carry over. Additionally, to adapt our domain-general NLM towards e-commerce on-the-fly, we use embeddings derived from a finetuned masked LM on in-domain data. Finally, to improve robustness towards in-domain content words, we propose a multi-task model that can jointly perform content word detection and language modeling tasks. Compared to a non-contextual LSTM LM baseline, our best performing NLM rescorer results in a content WER reduction of 19.2% on e-commerce audio test set and a slot labeling F1 improvement of 6.4%. △ Less

Submitted 15 June, 2021; originally announced June 2021.

Comments: Accepted at ACL-IJCNLP 2021 Workshop on e-Commerce and NLP (ECNLP)

arXiv:2106.08347 [pdf, other]

doi 10.1103/PhysRevApplied.17.014022

SENSEI: Characterization of Single-Electron Events Using a Skipper-CCD

Authors: Liron Barak, Itay M. Bloch, Ana Botti, Mariano Cababie, Gustavo Cancelo, Luke Chaplinsky, Fernando Chierchie, Michael Crisler, Alex Drlica-Wagner, Rouven Essig, Juan Estrada, Erez Etzion, Guillermo Fernandez Moroni, Daniel Gift, Stephen E. Holland, Sravan Munagavalasa, Aviv Orly, Dario Rodrigues, Aman Singal, Miguel Sofo Haro, Leandro Stefanazzi, Javier Tiffenberg, Sho Uemura, Tomer Volansky, Tien-Tien Yu

Abstract: We use a science-grade Skipper Charge Coupled Device (Skipper-CCD) operating in a low-radiation background environment to develop a semi-empirical model that characterizes the origin of single-electron events in CCDs. We identify, separate, and quantify three independent contributions to the single-electron events, which were previously bundled together and classified as "dark counts": dark curren… ▽ More We use a science-grade Skipper Charge Coupled Device (Skipper-CCD) operating in a low-radiation background environment to develop a semi-empirical model that characterizes the origin of single-electron events in CCDs. We identify, separate, and quantify three independent contributions to the single-electron events, which were previously bundled together and classified as "dark counts": dark current, amplifier light, and spurious charge. We measure a dark current, which depends on exposure, of (5.89+-0.77)x10^-4 e-/pix/day, and an unprecedentedly low spurious charge contribution of (1.52+-0.07)x10^-4 e-/pix, which is exposure-independent. In addition, we provide a technique to study events produced by light emitted from the amplifier, which allows the detector's operation to be optimized to minimize this effect to a level below the dark-current contribution. Our accurate characterization of the single-electron events allows one to greatly extend the sensitivity of experiments searching for dark matter or coherent neutrino scattering. Moreover, an accurate understanding of the origin of single-electron events is critical to further progress in ongoing R&D efforts of Skipper and conventional CCDs. △ Less

Submitted 26 January, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

Comments: 9 pages, 6 figures, 4 tables

Journal ref: Phys. Rev. Applied 17, 014022 (2022)

arXiv:2104.11070 [pdf, other]

doi 10.21437/Interspeech.2021-1849

Adapting Long Context NLM for ASR Rescoring in Conversational Agents

Authors: Ashish Shenoy, Sravan Bodapati, Monica Sunkara, Srikanth Ronanki, Katrin Kirchhoff

Abstract: Neural Language Models (NLM), when trained and evaluated with context spanning multiple utterances, have been shown to consistently outperform both conventional n-gram language models and NLMs that use limited context. In this paper, we investigate various techniques to incorporate turn based context history into both recurrent (LSTM) and Transformer-XL based NLMs. For recurrent based NLMs, we exp… ▽ More Neural Language Models (NLM), when trained and evaluated with context spanning multiple utterances, have been shown to consistently outperform both conventional n-gram language models and NLMs that use limited context. In this paper, we investigate various techniques to incorporate turn based context history into both recurrent (LSTM) and Transformer-XL based NLMs. For recurrent based NLMs, we explore context carry over mechanism and feature based augmentation, where we incorporate other forms of contextual information such as bot response and system dialogue acts as classified by a Natural Language Understanding (NLU) model. To mitigate the sharp nearby, fuzzy far away problem with contextual NLM, we propose the use of attention layer over lexical metadata to improve feature based augmentation. Additionally, we adapt our contextual NLM towards user provided on-the-fly speech patterns by leveraging encodings from a large pre-trained masked language model and performing fusion with a Transformer-XL based NLM. We test our proposed models using N-best rescoring of ASR hypotheses of task-oriented dialogues and also evaluate on downstream NLU tasks such as intent classification and slot labeling. The best performing model shows a relative WER between 1.6% and 9.1% and a slot labeling F1 score improvement of 4% over non-contextual baselines. △ Less

Submitted 4 June, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

Comments: Accepted to Interspeech 2021. arXiv admin note: text overlap with arXiv:2103.10325

arXiv:2103.13980 [pdf, other]

doi 10.1088/1475-7516/2021/07/025

An anisotropic bouncing universe in non-local gravity

Authors: K. Sravan Kumar, Shubham Maheshwari, Anupam Mazumdar, Jun Peng

Abstract: We show that it is possible to realize a cosmological bouncing solution in an anisotropic but homogeneous Bianchi-I background in a class of non-local, infinite derivative theories of gravity. We show that the anisotropic shear grows slower than in general relativity during the contraction phase, peaks to a finite value at the bounce point, and then decreases as the universe asymptotes towards iso… ▽ More We show that it is possible to realize a cosmological bouncing solution in an anisotropic but homogeneous Bianchi-I background in a class of non-local, infinite derivative theories of gravity. We show that the anisotropic shear grows slower than in general relativity during the contraction phase, peaks to a finite value at the bounce point, and then decreases as the universe asymptotes towards isotropy and homogeneity, and ultimately to de Sitter. Along with a cosmological constant, the matter sector required to drive such a bounce is found to consist of three components - radiation, stiff matter and $k$-matter (whose energy density decays like the inverse square of the average scale factor). Generically, $k$-matter exerts anisotropic pressures. We will test the bouncing solution in local and non-local gravity and show that in the latter case it is possible to simultaneously satisfy positivity of energy density and, at least in the late time de Sitter phase, avoid the introduction of propagating ghost/tachyonic modes. △ Less

Submitted 19 July, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

Comments: 18 pages, 1 figure, 1 table. We dedicate this work to the memory of John D. Barrow. v2 matches the one published in JCAP

Journal ref: JCAP07(2021)025

arXiv:2103.10325

Contextual Biasing of Language Models for Speech Recognition in Goal-Oriented Conversational Agents

Authors: Ashish Shenoy, Sravan Bodapati, Katrin Kirchhoff

Abstract: Goal-oriented conversational interfaces are designed to accomplish specific tasks and typically have interactions that tend to span multiple turns adhering to a pre-defined structure and a goal. However, conventional neural language models (NLM) in Automatic Speech Recognition (ASR) systems are mostly trained sentence-wise with limited context. In this paper, we explore different ways to incorpora… ▽ More Goal-oriented conversational interfaces are designed to accomplish specific tasks and typically have interactions that tend to span multiple turns adhering to a pre-defined structure and a goal. However, conventional neural language models (NLM) in Automatic Speech Recognition (ASR) systems are mostly trained sentence-wise with limited context. In this paper, we explore different ways to incorporate context into a LSTM based NLM in order to model long range dependencies and improve speech recognition. Specifically, we use context carry over across multiple turns and use lexical contextual cues such as system dialog act from Natural Language Understanding (NLU) models and the user provided structure of the chatbot. We also propose a new architecture that utilizes context embeddings derived from BERT on sample utterances provided during inference time. Our experiments show a word error rate (WER) relative reduction of 7% over non-contextual utterance-level NLM rescorers on goal-oriented audio datasets. △ Less

Submitted 4 June, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

Comments: Updated version with extensions are uploaded here arXiv:2104.11070

arXiv:2103.09384 [pdf, other]

doi 10.1109/TGRS.2021.3113721

Triplet-Watershed for Hyperspectral Image Classification

Authors: Aditya Challa, Sravan Danda, B. S. Daya Sagar, Laurent Najman

Abstract: Hyperspectral images (HSI) consist of rich spatial and spectral information, which can potentially be used for several applications. However, noise, band correlations and high dimensionality restrict the applicability of such data. This is recently addressed using creative deep learning network architectures such as ResNet, SSRN, and A2S2K. However, the last layer, i.e the classification layer, re… ▽ More Hyperspectral images (HSI) consist of rich spatial and spectral information, which can potentially be used for several applications. However, noise, band correlations and high dimensionality restrict the applicability of such data. This is recently addressed using creative deep learning network architectures such as ResNet, SSRN, and A2S2K. However, the last layer, i.e the classification layer, remains unchanged and is taken to be the softmax classifier. In this article, we propose to use a watershed classifier. Watershed classifier extends the watershed operator from Mathematical Morphology for classification. In its vanilla form, the watershed classifier does not have any trainable parameters. In this article, we propose a novel approach to train deep learning networks to obtain representations suitable for the watershed classifier. The watershed classifier exploits the connectivity patterns, a characteristic of HSI datasets, for better inference. We show that exploiting such characteristics allows the Triplet-Watershed to achieve state-of-art results in supervised and semi-supervised contexts. These results are validated on Indianpines (IP), University of Pavia (UP), Kennedy Space Center (KSC) and University of Houston (UH) datasets, relying on simple convnet architecture using a quarter of parameters compared to previous state-of-the-art networks. The source code for reproducing the experiments and supplementary material (high resolution images) is available at https://github.com/ac20/TripletWatershed Code. △ Less

Submitted 5 September, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

Comments: This work has been submitted to the IEEE for possible publication

Journal ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-14, 2022

arXiv:2103.05834 [pdf, other]

Best of Both Worlds: Robust Accented Speech Recognition with Adversarial Transfer Learning

Authors: Nilaksh Das, Sravan Bodapati, Monica Sunkara, Sundararajan Srinivasan, Duen Horng Chau

Abstract: Training deep neural networks for automatic speech recognition (ASR) requires large amounts of transcribed speech. This becomes a bottleneck for training robust models for accented speech which typically contains high variability in pronunciation and other semantics, since obtaining large amounts of annotated accented data is both tedious and costly. Often, we only have access to large amounts of… ▽ More Training deep neural networks for automatic speech recognition (ASR) requires large amounts of transcribed speech. This becomes a bottleneck for training robust models for accented speech which typically contains high variability in pronunciation and other semantics, since obtaining large amounts of annotated accented data is both tedious and costly. Often, we only have access to large amounts of unannotated speech from different accents. In this work, we leverage this unannotated data to provide semantic regularization to an ASR model that has been trained only on one accent, to improve its performance for multiple accents. We propose Accent Pre-Training (Acc-PT), a semi-supervised training strategy that combines transfer learning and adversarial training. Our approach improves the performance of a state-of-the-art ASR model by 33% on average over the baseline across multiple accents, training only on annotated samples from one standard accent, and as little as 105 minutes of unannotated speech from a target accent. △ Less

Submitted 9 March, 2021; originally announced March 2021.

arXiv:2102.06380 [pdf, ps, other]

Neural Inverse Text Normalization

Authors: Monica Sunkara, Chaitanya Shivade, Sravan Bodapati, Katrin Kirchhoff

Abstract: While there have been several contributions exploring state of the art techniques for text normalization, the problem of inverse text normalization (ITN) remains relatively unexplored. The best known approaches leverage finite state transducer (FST) based models which rely on manually curated rules and are hence not scalable. We propose an efficient and robust neural solution for ITN leveraging tr… ▽ More While there have been several contributions exploring state of the art techniques for text normalization, the problem of inverse text normalization (ITN) remains relatively unexplored. The best known approaches leverage finite state transducer (FST) based models which rely on manually curated rules and are hence not scalable. We propose an efficient and robust neural solution for ITN leveraging transformer based seq2seq models and FST-based text normalization techniques for data preparation. We show that this can be easily extended to other languages without the need for a linguistic expert to manually curate them. We then present a hybrid framework for integrating Neural ITN with an FST to overcome common recoverable errors in production environments. Our empirical evaluations show that the proposed solution minimizes incorrect perturbations (insertions, deletions and substitutions) to ASR output and maintains high quality even on out of domain data. A transformer based model infused with pretraining consistently achieves a lower WER across several datasets and is able to outperform baselines on English, Spanish, German and Italian datasets. △ Less

Submitted 12 February, 2021; originally announced February 2021.

Comments: 5 pages, accepted to ICASSP 2021

arXiv:2101.05288 [pdf, other]

doi 10.3847/1538-4357/abe2a7

The Center of Expansion and Age of the Oxygen-rich Supernova Remnant 1E 0102.2-7219

Authors: John Banovetz, Dan Milisavljevic, Niharika Sravan, Robert A. Fesen, Daniel J. Patnaude, Paul P. Plucinsky, William P. Blair, Kathryn E. Weil, Jon A. Morse, Raffaella Margutti, Maria R. Drout

Abstract: We present new proper motion measurements of optically emitting oxygen-rich knots of supernova remnant 1E 0102.2-7219 (E0102), which are used to estimate the remnant's center of expansion and age. Four epochs of high resolution Hubble Space Telescope images spanning 19 yr were retrieved and analyzed. We found a robust center of expansion of alpha=1:04:02.48 and delta=-72:01:53.92 (J2000) with 1-si… ▽ More We present new proper motion measurements of optically emitting oxygen-rich knots of supernova remnant 1E 0102.2-7219 (E0102), which are used to estimate the remnant's center of expansion and age. Four epochs of high resolution Hubble Space Telescope images spanning 19 yr were retrieved and analyzed. We found a robust center of expansion of alpha=1:04:02.48 and delta=-72:01:53.92 (J2000) with 1-sigma uncertainty of 1.77 arcseconds using 45 knots from images obtained with the Advanced Camera for Surveys using the F475W filter in 2003 and 2013 having the highest signal-to-noise ratio. We also estimate an upper limit explosion age of 1738 +/- 175 yr by selecting knots with the highest proper motions, that are assumed to be the least decelerated. We find evidence of an asymmetry in the proper motions of the knots as a function of position angle. We conclude that these asymmetries were most likely caused by interaction between E0102's original supernova blast wave and an inhomogeneous surrounding environment, as opposed to intrinsic explosion asymmetry. The observed non-homologous expansion suggests that the use of a free expansion model inaccurately offsets the center of expansion and leads to an overestimated explosion age. We discuss our findings as they compare to previous age and center of expansion estimates of E0102 and their relevance to a recently identified candidate central compact object. △ Less

Submitted 13 January, 2021; originally announced January 2021.

Comments: 14 pages, 11 figures, revised according to referee comments and resubmitted to ApJ

arXiv:2011.06195 [pdf, other]

Towards Semi-Supervised Semantics Understanding from Speech

Authors: Cheng-I Lai, Jin Cao, Sravan Bodapati, Shang-Wen Li

Abstract: Much recent work on Spoken Language Understanding (SLU) falls short in at least one of three ways: models were trained on oracle text input and neglected the Automatics Speech Recognition (ASR) outputs, models were trained to predict only intents without the slot values, or models were trained on a large amount of in-house data. We proposed a clean and general framework to learn semantics directly… ▽ More Much recent work on Spoken Language Understanding (SLU) falls short in at least one of three ways: models were trained on oracle text input and neglected the Automatics Speech Recognition (ASR) outputs, models were trained to predict only intents without the slot values, or models were trained on a large amount of in-house data. We proposed a clean and general framework to learn semantics directly from speech with semi-supervision from transcribed speech to address these. Our framework is built upon pretrained end-to-end (E2E) ASR and self-supervised language models, such as BERT, and fine-tuned on a limited amount of target SLU corpus. In parallel, we identified two inadequate settings under which SLU models have been tested: noise-robustness and E2E semantics evaluation. We tested the proposed framework under realistic environmental noises and with a new metric, the slots edit F1 score, on two public SLU corpora. Experiments show that our SLU framework with speech as input can perform on par with those with oracle text as input in semantics understanding, while environmental noises are present, and a limited amount of labeled semantics data is available. △ Less

Submitted 10 November, 2020; originally announced November 2020.

Comments: arXiv admin note: text overlap with arXiv:2010.13826

arXiv:2011.00828 [pdf, ps, other]

Millimeter-Wave Antenna Array Diagnosis with Partial Channel State Information

Authors: George Medina, Akashdeep Singh Jida, Sravan Pulipati, Rohith Talwar, Nancy Amala J, Tareq Y. Al-Naffouri, Arjuna Madanayake, Mohammed Eltayeb

Abstract: Large antenna arrays enable directional precoding for Millimeter-Wave (mmWave) systems and provide sufficient link budget to combat the high path-loss at these frequencies. Due to atmospheric conditions and hardware malfunction, outdoor mmWave antenna arrays are prone to blockages or complete failures. This results in a modified array geometry, distorted far-field radiation pattern, and system per… ▽ More Large antenna arrays enable directional precoding for Millimeter-Wave (mmWave) systems and provide sufficient link budget to combat the high path-loss at these frequencies. Due to atmospheric conditions and hardware malfunction, outdoor mmWave antenna arrays are prone to blockages or complete failures. This results in a modified array geometry, distorted far-field radiation pattern, and system performance degradation. Recent remote array diagnostic techniques have emerged as an effective way to detect defective antenna elements in an array with few diagnostic measurements. These techniques, however, require full and perfect channel state information (CSI), which can be challenging to acquire in the presence of antenna faults. This paper proposes a new remote array diagnosis technique that relaxes the need for full CSI and only requires knowledge of the incident angle-of-arrivals, i.e. partial channel knowledge. Numerical results demonstrate the effectiveness of the proposed technique and show that fault detection can be obtained with comparable number of diagnostic measurements required by diagnostic techniques based on full channel knowledge. In presence of channel estimation errors, the proposed technique is shown to out-perform recently proposed array diagnostic techniques. △ Less

Submitted 2 November, 2020; originally announced November 2020.

arXiv:2009.14270 [pdf, other]

Improved Battery State Estimation Under Parameter Uncertainty Caused by Aging Using Expansion Measurements

Authors: Sravan Pannala, Puneet Valecha, Peyman Mohtat, Jason B. Siegel, Anna G. Stefanopoulou

Abstract: Accurate tracking of the internal electrochemical states of lithium-ion battery during cycling enables advanced battery management systems to operate the battery safely and maintain high performance while minimizing battery degradation. To this end, techniques based on voltage measurement have shown promise for estimating the lithium surface concentration of active material particles, which is an… ▽ More Accurate tracking of the internal electrochemical states of lithium-ion battery during cycling enables advanced battery management systems to operate the battery safely and maintain high performance while minimizing battery degradation. To this end, techniques based on voltage measurement have shown promise for estimating the lithium surface concentration of active material particles, which is an important state for avoiding aging mechanisms such as lithium plating. However, methods relying on voltage often lead to large estimation errors when the model parameters change during aging. In this paper, we utilize the in-situ measurement of the battery expansion to augment the voltage and develop an observer to estimate the lithium surface concentration distribution in each electrode particle. We demonstrate that the addition of the expansion signal enables us to correct the negative electrode concentration states in addition to the positive electrode. As a result, compared to a voltage only observer, the proposed observer can successfully recover the surface concentration when the electrodes' stoichiometric window changes, which is a common occurrence under aging by loss of lithium inventory. With a 5% shift in the electrodes' stoichiometric window, the results indicate a reduction in state estimation error for the negative electrode surface concentration. Under this simulated aged condition, the voltage based observer had 9.3% error as compared to the proposed voltage and expansion observer which had 0.1% error in negative electrode surface concentration. △ Less

Submitted 29 September, 2020; originally announced September 2020.

Comments: 6 pages, 4 figures, Submitted to American Controls Conference 2021

arXiv:2009.06405 [pdf, other]

doi 10.3847/1538-4357/abb8d5

Progenitors of Type IIb Supernovae: II. Observable Properties

Authors: Niharika Sravan, Pablo Marchant, Vassiliki Kalogera, Dan Milisavljevic, Raffaella Margutti

Abstract: Type IIb supernovae (SNe IIb) present a unique opportunity for investigating the evolutionary channels and mechanisms governing the evolution of stripped-envelope SN progenitors due to a variety of observational constraints available. Comparison of these constraints with the full distribution of theoretical properties not only help ascertain the prevalence of observed properties in nature, but can… ▽ More Type IIb supernovae (SNe IIb) present a unique opportunity for investigating the evolutionary channels and mechanisms governing the evolution of stripped-envelope SN progenitors due to a variety of observational constraints available. Comparison of these constraints with the full distribution of theoretical properties not only help ascertain the prevalence of observed properties in nature, but can also reveal currently unobserved populations. In this follow-up paper, we use the large grid of models presented in Sravan et al. 2019 to derive distributions of single and binary SNe IIb progenitor properties and compare them to constraints from three independent observational probes: multi-band SN light-curves, direct progenitor detections, and X-ray/radio observations. Consistent with previous work, we find that while current observations exclude single stars as SN IIb progenitors, SN IIb progenitors in binaries can account for them. We also find that the distributions indicate the existence of an unobserved dominant population of binary SNe IIb at low metallicity that arise due to mass transfer initiated on the Hertzsprung Gap. In particular, our models indicate the existence of a group of highly stripped (envelope mass ~0.1-0.2 M_sun) progenitors that are compact (<50 R_sun) and blue (T_eff <~ 10^5K) with ~10^4.5-10^5.5 L_sun and low density circumstellar mediums. As discussed in Sravan et al. 2019, this group is necessary to account for SN IIb fractions and likely exist regardless of metallicity. The detection of the unobserved populations indicated by our models would support weak stellar winds and inefficient mass transfer in SN IIb progenitors. △ Less

Submitted 14 September, 2020; originally announced September 2020.

Comments: Resubmitted to the Astrophysical Journal after incorporating suggestions from the referee

arXiv:2008.01220 [pdf, other]

Xilinx RF-SoC-based Digital Multi-Beam Array Processors for 28/60~GHz Wireless Testbeds

Authors: Sravan Pulipati, Viduneth Ariyarathna, Aditya Dhananjay, Mohammed E. Eltayeb, Marco Mezzavilla, Josep M. Jornet, Soumyajit Mandal, Shubhendu Bhardwaj, Arjuna Madanayake

Abstract: Emerging wireless applications such as 5G cellular, large intelligent surfaces (LIS), and holographic massive MIMO require antenna array processing at mm-wave frequencies with large numbers of independent digital transceivers. This paper summarizes the authors' recent progress on the design and testing of 28 GHz and 60 GHz fully-digital array processing platforms based on wideband reconfigurable F… ▽ More Emerging wireless applications such as 5G cellular, large intelligent surfaces (LIS), and holographic massive MIMO require antenna array processing at mm-wave frequencies with large numbers of independent digital transceivers. This paper summarizes the authors' recent progress on the design and testing of 28 GHz and 60 GHz fully-digital array processing platforms based on wideband reconfigurable FPGA-based software-defined radios (SDRs). The digital baseband and microwave interfacing aspects of the SDRs are implemented on single-chip RF system-on-chip (RF-SoC) processors from Xilinx. Two versions of the RF-SoC technology (ZCU-111 and ZCU-1275) were used to implement fully-digital real-time array processors at 28~GHz (realizing 4 parallel beams with 0.8 GHz bandwidth per beam) and 60~GHz (realizing 4 parallel beams with 1.8~GHz bandwidth per beam). Dielectric lenslet arrays fed by a digital phased-array feed (PAF) located on the focal plane are proposed for further increasing antenna array gain. △ Less

Submitted 3 August, 2020; originally announced August 2020.

Comments: 6 pages

arXiv:2008.01203 [pdf, other]

doi 10.1109/MWSCAS48704.2020.9184595

A Passive STAR Microwave Circuit for 1-3 GHz Self-Interference Cancellation

Authors: Udara De Silva, Sravan Pulipati, Satheesh Bojja Venkatakrishnan, Shubhendu Bhardwaj, Arjuna Madanayake

Abstract: Simultaneous transmit and receive (STAR) allows full-duplex operation of a radio, which leads to doubled capacity for a given bandwidth. A circulator with high-isolation between transmit and receive ports, and low-loss from the antenna to receive port is typically required for achieving STAR. Conventional circulators do not offer wideband performance. Although wideband circulators have been propos… ▽ More Simultaneous transmit and receive (STAR) allows full-duplex operation of a radio, which leads to doubled capacity for a given bandwidth. A circulator with high-isolation between transmit and receive ports, and low-loss from the antenna to receive port is typically required for achieving STAR. Conventional circulators do not offer wideband performance. Although wideband circulators have been proposed using parametric, switched delay-line/capacitor, and N-path filter techniques using custom integrated circuits, these magnet-free devices have non-linearity, noise, aliasing, and switching noise injection issues. In this paper, a STAR front-end based on passive linear microwave circuit is proposed. Here, a dummy antenna located inside a miniature RF-silent absorption chamber allows circulator-free STAR using simple COTS components. The proposed approach is highly-linear, free from noise, does not require switching or parametric modulation circuits, and has virtually unlimited bandwidth only set by the performance of COTS passive microwave components. The trade-off is relatively large size of the miniature RF-shielded chamber, making this suitable for base-station side applications. Preliminary results show the measured performance of Tx/Rx isolation between 25-60 dB in the 1.0-3.0 GHz range, and 50-60 dB for the 2.4-2.7 GHz range. △ Less

Submitted 17 August, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

Comments: 4 figures, 4 pages

arXiv:2008.00702 [pdf, other]

Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech

Authors: Monica Sunkara, Srikanth Ronanki, Dhanush Bekal, Sravan Bodapati, Katrin Kirchhoff

Abstract: In this work, we explore a multimodal semi-supervised learning approach for punctuation prediction by learning representations from large amounts of unlabelled audio and text data. Conventional approaches in speech processing typically use forced alignment to encoder per frame acoustic features to word level features and perform multimodal fusion of the resulting acoustic and lexical representatio… ▽ More In this work, we explore a multimodal semi-supervised learning approach for punctuation prediction by learning representations from large amounts of unlabelled audio and text data. Conventional approaches in speech processing typically use forced alignment to encoder per frame acoustic features to word level features and perform multimodal fusion of the resulting acoustic and lexical representations. As an alternative, we explore attention based multimodal fusion and compare its performance with forced alignment based fusion. Experiments conducted on the Fisher corpus show that our proposed approach achieves ~6-9% and ~3-4% absolute improvement (F1 score) over the baseline BLSTM model on reference transcripts and ASR outputs respectively. We further improve the model robustness to ASR errors by performing data augmentation with N-best lists which achieves up to an additional ~2-6% improvement on ASR outputs. We also demonstrate the effectiveness of semi-supervised learning approach by performing ablation study on various sizes of the corpus. When trained on 1 hour of speech and text data, the proposed model achieved ~9-18% absolute improvement over baseline model. △ Less

Submitted 3 August, 2020; originally announced August 2020.

Comments: Accepted for Interspeech 2020

arXiv:2007.02025 [pdf, other]

Robust Prediction of Punctuation and Truecasing for Medical ASR

Authors: Monica Sunkara, Srikanth Ronanki, Kalpit Dixit, Sravan Bodapati, Katrin Kirchhoff

Abstract: Automatic speech recognition (ASR) systems in the medical domain that focus on transcribing clinical dictations and doctor-patient conversations often pose many challenges due to the complexity of the domain. ASR output typically undergoes automatic punctuation to enable users to speak naturally, without having to vocalise awkward and explicit punctuation commands, such as "period", "add comma" or… ▽ More Automatic speech recognition (ASR) systems in the medical domain that focus on transcribing clinical dictations and doctor-patient conversations often pose many challenges due to the complexity of the domain. ASR output typically undergoes automatic punctuation to enable users to speak naturally, without having to vocalise awkward and explicit punctuation commands, such as "period", "add comma" or "exclamation point", while truecasing enhances user readability and improves the performance of downstream NLP tasks. This paper proposes a conditional joint modeling framework for prediction of punctuation and truecasing using pretrained masked language models such as BERT, BioBERT and RoBERTa. We also present techniques for domain and task specific adaptation by fine-tuning masked language models with medical domain data. Finally, we improve the robustness of the model against common errors made in ASR by performing data augmentation. Experiments performed on dictation and conversational style corpora show that our proposed model achieves ~5% absolute improvement on ground truth text and ~10% improvement on ASR outputs over baseline models under F1 metric. △ Less

Submitted 11 July, 2020; v1 submitted 4 July, 2020; originally announced July 2020.

Comments: Accepted for ACL NLPMC workshop 2020

Showing 101–150 of 194 results for author: Sraavan