-
Analytic infinite derivative gravity, $R^2$-like inflation, quantum gravity and CMB
Authors:
Alexey S. Koshelev,
K. Sravan Kumar,
Alexei A. Starobinsky
Abstract:
The emergence of $R^2$ inflation, the best-fit framework for CMB observations to date, comes from attempts to attack the problem of quantizing gravity, which in turn led to the discovery of the trace anomaly. Further developments in the trace anomaly and in different frameworks aiming to construct quantum gravity indicate the inevitability of non-locality in fundamental physics at small time and length scales. A natural step is to employ $R^2$ inflation as a probe for signatures of non-locality in early-Universe physics. Recent advances in embedding $R^2$ inflation in a string-theory-inspired non-local modification of gravity provide very promising theoretical predictions connecting non-local physics in the early Universe with forthcoming CMB observations.
Submitted 27 December, 2020; v1 submitted 19 May, 2020;
originally announced May 2020.
-
Understanding Dynamic Scenes using Graph Convolution Networks
Authors:
Sravan Mylavarapu,
Mahtab Sandhu,
Priyesh Vijayan,
K Madhava Krishna,
Balaraman Ravindran,
Anoop Namboodiri
Abstract:
We present a novel Multi-Relational Graph Convolutional Network (MRGCN) based framework to model on-road vehicle behaviors from a sequence of temporally ordered frames captured by a moving monocular camera. The input to MRGCN is a multi-relational graph where the graph's nodes represent the active and passive agents/objects in the scene, and the bidirectional edges that connect every pair of nodes are encodings of their spatio-temporal relations. We show that this explicit encoding and use of an intermediate spatio-temporal interaction graph is better suited to our tasks than learning end-to-end directly on a set of temporally ordered spatial relations. We also propose an attention mechanism for MRGCNs that, conditioned on the scene, dynamically scores the importance of information from different interaction types. The proposed framework achieves a significant performance gain over prior methods on vehicle-behavior classification tasks on four datasets. We also show a seamless transfer of learning to multiple datasets without resorting to fine-tuning. Such behavior prediction methods find immediate relevance in a variety of navigation tasks such as behavior planning, state estimation, and applications relating to the detection of traffic violations over videos.
Submitted 14 August, 2020; v1 submitted 9 May, 2020;
originally announced May 2020.
-
AutoTune: Automatically Tuning Convolutional Neural Networks for Improved Transfer Learning
Authors:
S. H. Shabbeer Basha,
Sravan Kumar Vinakota,
Viswanath Pulabaigari,
Snehasis Mukherjee,
Shiv Ram Dubey
Abstract:
Transfer learning enables solving a specific task with limited data by using pre-trained deep networks trained on large-scale datasets. Typically, while transferring the learned knowledge from the source task to the target task, the last few layers are fine-tuned (re-trained) over the target dataset. However, these layers were originally designed for the source task and might not be suitable for the target task. In this paper, we introduce a mechanism for automatically tuning Convolutional Neural Networks (CNN) for improved transfer learning. The pre-trained CNN layers are tuned with the knowledge from target data using Bayesian Optimization. First, we train the final layer of the base CNN model by replacing the number of neurons in the softmax layer with the number of classes involved in the target task. Next, the pre-trained CNN is tuned automatically by observing the classification performance on the validation data (greedy criteria). To evaluate the performance of the proposed method, experiments are conducted on three benchmark datasets, namely CalTech-101, CalTech-256, and Stanford Dogs. The classification results obtained through the proposed AutoTune method outperform the standard baseline transfer learning methods over the three datasets by achieving $95.92\%$, $86.54\%$, and $84.67\%$ accuracy over CalTech-101, CalTech-256, and Stanford Dogs, respectively. The experimental results obtained in this study show that tuning the pre-trained CNN layers with the knowledge from the target dataset confers better transfer learning ability. The source code is available at https://github.com/JekyllAndHyde8999/AutoTune_CNN_TransferLearning.
Submitted 3 December, 2020; v1 submitted 25 April, 2020;
originally announced May 2020.
-
Stable, non-singular bouncing universe with only a scalar mode
Authors:
K. Sravan Kumar,
Shubham Maheshwari,
Anupam Mazumdar,
Jun Peng
Abstract:
In this paper, we study a class of higher derivative, non-local gravity which admits homogeneous and isotropic non-singular, bouncing universes in the absence of matter. At the linearized level, the theory propagates only a scalar degree of freedom, and no vector or tensor modes. The scalar can be made free from perturbative ghost instabilities, and has oscillatory and bounded evolution across the bounce.
Submitted 4 May, 2020;
originally announced May 2020.
-
SENSEI: Direct-Detection Results on sub-GeV Dark Matter from a New Skipper-CCD
Authors:
Liron Barak,
Itay M. Bloch,
Mariano Cababie,
Gustavo Cancelo,
Luke Chaplinsky,
Fernando Chierchie,
Michael Crisler,
Alex Drlica-Wagner,
Rouven Essig,
Juan Estrada,
Erez Etzion,
Guillermo Fernandez Moroni,
Daniel Gift,
Sravan Munagavalasa,
Aviv Orly,
Dario Rodrigues,
Aman Singal,
Miguel Sofo Haro,
Leandro Stefanazzi,
Javier Tiffenberg,
Sho Uemura,
Tomer Volansky,
Tien-Tien Yu
Abstract:
We present the first direct-detection search for eV-to-GeV dark matter using a new ~2-gram high-resistivity Skipper-CCD from a dedicated fabrication batch that was optimized for dark-matter searches. Using 24 days of data acquired in the MINOS cavern at the Fermi National Accelerator Laboratory, we measure the lowest rates in silicon detectors of events containing one, two, three, or four electrons, and achieve world-leading sensitivity for a large range of sub-GeV dark matter masses. Data taken with different thicknesses of the detector shield suggest a correlation between the rate of high-energy tracks and the rate of single-electron events previously classified as "dark current." We detail key characteristics of the new Skipper-CCDs, which augur well for the planned construction of the ~100-gram SENSEI experiment at SNOLAB.
Submitted 2 November, 2020; v1 submitted 23 April, 2020;
originally announced April 2020.
-
Dark matter and Standard Model reheating from conformal GUT inflation
Authors:
Simone Biondini,
K. Sravan Kumar
Abstract:
Spontaneous breaking of conformal symmetry has been widely exploited in successful model building of both inflationary cosmology and particle physics phenomenology. Conformal Grand Unified Theory (CGUT) inflation provides the same scalar tilt and tensor-to-scalar ratio as Starobinsky and Higgs inflation. Moreover, it predicts a proton lifetime compatible with the current experimental bound. In this paper, we extend CGUT to account for the production of dark matter and the reheating of the Standard Model. To this end, we introduce a hidden sector directly coupled to the inflaton, whereas the reheating of the visible sector is realized through a portal coupling between the dark particles and the Higgs boson. The masses and interactions of the dark particles and the Higgs boson are determined by the form of the conformal potential and the non-vanishing VEV of the inflaton. We provide benchmark points in the parameter space of the model that give the observed dark matter relic density and reheating temperatures compatible with Big Bang nucleosynthesis.
Submitted 9 July, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Three-Dimensional Kinematic Reconstruction of the Optically-Emitting, High-Velocity, Oxygen-Rich Ejecta of Supernova Remnant N132D
Authors:
Charles J. Law,
Dan Milisavljevic,
Daniel J. Patnaude,
Paul P. Plucinsky,
Michael D. Gladders,
Judy Schmidt,
Niharika Sravan,
John Banovetz,
Hidetoshi Sano,
Jordan M. McGraw,
George Takahashi,
Salvatore Orlando
Abstract:
We present a three-dimensional kinematic reconstruction of the optically-emitting, oxygen-rich ejecta of supernova remnant N132D in the Large Magellanic Cloud. Data were obtained with the 6.5 m Magellan telescope in combination with the IMACS+GISMO instrument and survey [O III] $λλ$4959,5007 line emission in a ${\sim}$3$^{\prime}~\times$ 3$^{\prime}$ region centered on N132D. The spatial and spectral resolution of our data enable detailed examination of the optical ejecta structure. The majority of N132D's optically bright oxygen ejecta are arranged in a torus-like geometry tilted approximately 28$^{\circ}$ with respect to the plane of the sky. The torus has a radius of 4.4 pc ($D_{\rm LMC}$/50 kpc), exhibits a blue-shifted radial velocity asymmetry of $-3000$ to $+2300$ km s$^{-1}$, and has a conspicuous break in its circumference. Assuming homologous expansion from the geometric center of O-rich filaments, the average expansion velocity of 1745 km s$^{-1}$ translates to an age since explosion of 2450 $\pm$ 195 yr. A faint, spatially-separated "runaway knot" (RK) with total space velocity of 3650 km s$^{-1}$ is nearly perpendicular to the torus plane and coincident with X-ray emission that is substantially enhanced in Si relative to the LMC and N132D's bulk ejecta. These kinematic and chemical signatures suggest that the RK may have had its origin deep within the progenitor star. Overall, the main shell morphology and high-velocity, Si-enriched components of N132D have remarkable similarity with that of Cassiopeia A, which was the result of a Type IIb supernova explosion. Our results underscore the need for further observations and simulations that can robustly reconcile whether the observed morphology is dominated by explosion dynamics or shaped by interaction with the environment.
Submitted 31 March, 2020;
originally announced April 2020.
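The age quoted in the N132D abstract above follows from the homologous (free) expansion assumption, age = distance travelled / velocity. The sketch below is only a rough consistency check using the rounded torus radius (4.4 pc) and the average expansion velocity (1745 km/s) from the abstract; whether the authors used exactly these inputs is an assumption, and the function name and unit constants are illustrative.

```python
# Rough check of an expansion age under homologous (constant-velocity) expansion.
# Inputs are the rounded values quoted in the abstract; the helper name is hypothetical.

PC_IN_KM = 3.0857e13          # kilometres per parsec
SECONDS_PER_YEAR = 3.15576e7  # Julian year in seconds


def expansion_age_yr(radius_pc: float, velocity_km_s: float) -> float:
    """Return the time (in years) needed to reach radius_pc at constant velocity."""
    travel_time_s = radius_pc * PC_IN_KM / velocity_km_s
    return travel_time_s / SECONDS_PER_YEAR


age = expansion_age_yr(4.4, 1745.0)
print(f"approximate expansion age: {age:.0f} yr")
```

With these rounded inputs the estimate lands within the quoted uncertainty of 2450 $\pm$ 195 yr; exact agreement is not expected because the radius is rounded and the velocity is an average over filaments.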
-
Real-Time Value-Driven Data Augmentation in the Era of LSST
Authors:
Niharika Sravan,
Dan Milisavljevic,
Jack M. Reynolds,
Geoffrey Lentner,
Mark Linvill
Abstract:
The deluge of data from time-domain surveys is rendering traditional human-guided data collection and inference techniques impractical. We propose a novel approach for conducting data collection for science inference in the era of massive large-scale surveys that uses value-based metrics to autonomously strategize and co-ordinate follow-up in real-time. We demonstrate the underlying principles in the Recommender Engine For Intelligent Transient Tracking (REFITT) that ingests live alerts from surveys and value-added inputs from data brokers to predict the future behavior of transients and design optimal data augmentation strategies given a set of scientific objectives. The prototype presented in this paper is tested to work given simulated Rubin Observatory Legacy Survey of Space and Time (LSST) core-collapse supernova (CC SN) light-curves from the PLAsTiCC dataset. CC SNe were selected for the initial development phase as they are known to be difficult to classify, with the expectation that any learning techniques for them should be at least as effective for other transients. We demonstrate the behavior of REFITT on a random LSST night given ~32000 live CC SNe of interest. The system makes good predictions for the photometric behavior of the events and uses them to plan follow-up using a simple data-driven metric. We argue that machine-directed follow-up maximizes the scientific potential of surveys and follow-up resources by reducing downtime and bias in data collection.
Submitted 24 July, 2020; v1 submitted 19 March, 2020;
originally announced March 2020.
-
Non-Gaussianities and tensor-to-scalar ratio in non-local $R^{2}$-like inflation
Authors:
Alexey S. Koshelev,
K. Sravan Kumar,
Anupam Mazumdar,
Alexei A. Starobinsky
Abstract:
In this paper we study $R^2$-like inflation in a non-local modification of gravity which contains terms quadratic in the Ricci scalar and the Weyl tensor, with analytic infinite derivative form-factors in the action. It is known that the inflationary solution of the local $R+R^2$ gravity remains a particular exact solution in this model. It was shown earlier that the power spectrum of scalar perturbations generated during inflation in the non-local setup remains the same as in the local $R+R^2$ inflation, whereas the power spectrum of tensor perturbations gets modified due to the non-local Weyl tensor squared term. In the present paper we go beyond 2-point correlators and compute the non-Gaussian parameter $f_{NL}$ related to 3-point correlations generated during inflation, which we find to be different from those in the original local inflationary model and in similar scenarios based on local gravity. We evaluate non-local corrections to the scalar bi-spectrum which give non-zero contributions to squeezed, equilateral and orthogonal configurations. We show that $f_{NL}\sim O(1)$ with an arbitrary sign is achievable in this model based on the choice of form-factors and the scale of non-locality. We present the predictions for the tensor-to-scalar ratio, $r$, and the tensor tilt, $n_t$. In contrast to standard inflation in local gravity, here the possibility $n_t>0$ is not excluded. Thus, future CMB data can probe non-local behaviour of gravity at high space-time curvatures.
Submitted 7 July, 2020; v1 submitted 1 March, 2020;
originally announced March 2020.
-
Towards Accurate Vehicle Behaviour Classification With Multi-Relational Graph Convolutional Networks
Authors:
Sravan Mylavarapu,
Mahtab Sandhu,
Priyesh Vijayan,
K Madhava Krishna,
Balaraman Ravindran,
Anoop Namboodiri
Abstract:
Understanding on-road vehicle behaviour from a temporal sequence of sensor data is gaining in popularity. In this paper, we propose a pipeline for understanding vehicle behaviour from a monocular image sequence or video. A monocular sequence, along with scene semantics, optical flow and object labels, is used to get spatial information about the object (vehicle) of interest and other objects (semantically contiguous sets of locations) in the scene. This spatial information is encoded by a Multi-Relational Graph Convolutional Network (MR-GCN), and a temporal sequence of such encodings is fed to a recurrent network to label vehicle behaviours. The proposed framework can classify a variety of vehicle behaviours with high fidelity on datasets that are diverse and include European, Chinese and Indian on-road scenes. The framework also provides for seamless transfer of models across datasets without entailing re-annotation, retraining or even fine-tuning. We show comparative performance gain over baseline spatio-temporal classifiers and detail a variety of ablations to showcase the efficacy of the framework.
Submitted 12 May, 2020; v1 submitted 3 February, 2020;
originally announced February 2020.
-
AutoFCL: Automatically Tuning Fully Connected Layers for Handling Small Dataset
Authors:
S. H. Shabbeer Basha,
Sravan Kumar Vinakota,
Shiv Ram Dubey,
Viswanath Pulabaigari,
Snehasis Mukherjee
Abstract:
Deep Convolutional Neural Networks (CNN) have evolved as popular machine learning models for image classification during the past few years, due to their ability to learn problem-specific features directly from the input images. The success of deep learning models calls for architecture engineering rather than hand-engineering of features. However, designing a state-of-the-art CNN for a given task remains non-trivial and challenging, especially when the training set is small. To address this problem, transfer learning has been widely adopted. While transferring the learned knowledge from one task to another, fine-tuning with target-dependent Fully Connected (FC) layers generally produces better results on the target task. In this paper, the proposed AutoFCL model attempts to learn the structure of the FC layers of a CNN automatically using Bayesian optimization. To evaluate the performance of the proposed AutoFCL, we utilize five pre-trained CNN models, namely VGG-16, ResNet, DenseNet, MobileNet, and NASNetMobile. The experiments are conducted on three benchmark datasets, namely CalTech-101, Oxford-102 Flowers, and UC Merced Land Use. Fine-tuning the newly learned (target-dependent) FC layers leads to state-of-the-art performance, according to the experiments carried out in this research. The proposed AutoFCL method outperforms the existing methods on the CalTech-101 and Oxford-102 Flowers datasets, achieving accuracies of 94.38% and 98.89%, respectively. On the UC Merced Land Use dataset, our method achieves comparable performance with 96.83% accuracy. The source code of this research is available at https://github.com/shabbeersh/AutoFCL.
Submitted 28 January, 2021; v1 submitted 22 January, 2020;
originally announced January 2020.
-
Zero-Shot Reinforcement Learning with Deep Attention Convolutional Neural Networks
Authors:
Sahika Genc,
Sunil Mallya,
Sravan Bodapati,
Tao Sun,
Yunzhe Tao
Abstract:
Simulation-to-simulation and simulation-to-real world transfer of neural network models has been a difficult problem. To close the reality gap, prior methods for simulation-to-real world transfer focused on domain adaptation, decoupling perception and dynamics and solving each problem separately, and randomization of agent parameters and environment conditions to expose the learning agent to a variety of conditions. While these methods provide acceptable performance, the computational complexity required to capture a large variation of parameters for comprehensive scenarios on a given task such as autonomous driving or robotic manipulation is high. Our key contribution is to theoretically prove and empirically demonstrate that a deep attention convolutional neural network (DACNN) with a specific visual sensor configuration performs as well as training on a dataset with high domain and parameter variation, at lower computational complexity. Specifically, the attention network weights are learned through policy optimization to focus on local dependencies that lead to optimal actions, and do not require tuning in the real world for generalization. Our new architecture adapts perception with respect to the control objective, resulting in zero-shot learning without pre-training a perception network. To measure the impact of our new deep network architecture on domain adaptation, we consider autonomous driving as a use case. We perform an extensive set of experiments in simulation-to-simulation and simulation-to-real scenarios to compare our approach to several baselines including the current state-of-the-art models.
Submitted 2 January, 2020;
originally announced January 2020.
-
A Direct-Conversion Digital Beamforming Array Receiver with 800 MHz Channel Bandwidth at 28 GHz using Xilinx RF SoC
Authors:
Sravan Pulipati,
Viduneth Ariyarathna,
Udara De Silva,
Najath Akram,
Elias Alwan,
Arjuna Madanayake,
Soumyajit Mandal,
Theodore S. Rappaport
Abstract:
This paper discusses early results associated with a fully-digital direct-conversion array receiver at 28 GHz. The proposed receiver makes use of commercial off-the-shelf (COTS) electronics, including the receiver chain. The design consists of a custom 28 GHz patch antenna sub-array providing gain in the elevation plane, with azimuthal plane beamforming provided by real-time digital signal processing (DSP) algorithms running on a Xilinx Radio Frequency System on Chip (RF SoC). The proposed array receiver employs element-wise fully-digital array processing that supports ADC sample rates up to 2 GS/second and up to 1 GHz of operating bandwidth per antenna. The RF mixed-signal data conversion circuits and DSP algorithms operate on a single-chip RF SoC solution installed on the Xilinx ZCU1275 prototyping platform.
Submitted 20 November, 2019;
originally announced November 2019.
-
SAVEHR: Self Attention Vector Representations for EHR based Personalized Chronic Disease Onset Prediction and Interpretability
Authors:
Sunil Mallya,
Marc Overhage,
Sravan Bodapati,
Navneet Srivastava,
Sahika Genc
Abstract:
Chronic disease progression is emerging as an important area of investment for healthcare providers. As the quantity and richness of available clinical data continue to increase along with advances in machine learning, there is great potential to advance our approaches to caring for patients. An ideal approach to this problem should perform well on at least three axes: a) performance across many clinical conditions without requiring deep clinical expertise or extensive data-scientist effort, b) generalization across populations, and c) explainability (model interpretability). We present SAVEHR, a self-attention based architecture on heterogeneous structured EHR data that achieves > 0.51 AUC-PR and > 0.87 AUC-ROC gains on predicting the onset of four clinical conditions (CHF, Kidney Failure, Diabetes and COPD) 15 months in advance, and transfers with high performance onto a new population. We demonstrate that the SAVEHR model outperforms ten baselines on all three axes stated above.
Submitted 13 November, 2019;
originally announced November 2019.
-
Robustness to Capitalization Errors in Named Entity Recognition
Authors:
Sravan Bodapati,
Hyokun Yun,
Yaser Al-Onaizan
Abstract:
Robustness to capitalization errors is a highly desirable characteristic of named entity recognizers, yet we find standard models for the task are surprisingly brittle to such noise. Existing methods to improve robustness to the noise completely discard given orthographic information, which significantly degrades their performance on well-formed text. We propose a simple alternative approach based on data augmentation, which allows the model to learn to utilize or ignore orthographic information depending on its usefulness in the context. It achieves competitive robustness to capitalization errors while making negligible compromise to its performance on well-formed text and significantly improving generalization power on noisy user-generated text. Our experiments clearly and consistently validate our claim across different types of machine learning models, languages, and dataset sizes.
Submitted 12 November, 2019;
originally announced November 2019.
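The data-augmentation idea in the abstract above can be illustrated with a minimal sketch. The abstract does not specify which case variants are generated, so the choice here (keeping the original sentence and adding all-lowercase and all-uppercase copies with unchanged tags) is an assumption, and the function name `augment_capitalization` is hypothetical.

```python
# Minimal sketch of capitalization augmentation for NER training data.
# Assumption: each sentence is a (tokens, tags) pair; the variants generated
# here (lowercase, uppercase) are illustrative, not taken from the paper.
from typing import List, Tuple

Sentence = Tuple[List[str], List[str]]


def augment_capitalization(sentences: List[Sentence]) -> List[Sentence]:
    """Return the original sentences plus case-perturbed copies with the same tags."""
    augmented: List[Sentence] = []
    for tokens, tags in sentences:
        augmented.append((tokens, tags))                       # well-formed original
        augmented.append(([t.lower() for t in tokens], tags))  # all-lowercase copy
        augmented.append(([t.upper() for t in tokens], tags))  # all-uppercase copy
    return augmented


train = [(["Alice", "visited", "Paris"], ["B-PER", "O", "B-LOC"])]
for tokens, tags in augment_capitalization(train):
    print(tokens, tags)
```

Because the original sentences stay in the training set, a model trained on the augmented data still sees orthographic cues where they are reliable, which matches the stated goal of learning when to use or ignore them.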
-
Neural Word Decomposition Models for Abusive Language Detection
Authors:
Sravan Babu Bodapati,
Spandana Gella,
Kasturi Bhattacharjee,
Yaser Al-Onaizan
Abstract:
User-generated text on social media often suffers from many undesired characteristics, including hate speech, abusive language, insults, etc., that are targeted to attack or abuse a specific group of people. Often such text is written differently compared to traditional text such as news, involving either explicit mention of abusive words, obfuscated words and typographical errors, or implicit abuse, i.e., indicating or targeting negative stereotypes. Thus, processing this text poses several robustness challenges when we apply natural language processing techniques developed for traditional text. For example, using word or token based models to process such text can treat two spelling variants of a word as two different words. Following recent work, we analyze how character, subword and byte pair encoding (BPE) models can help address some of the challenges posed by user-generated text. In our work, we analyze the effectiveness of each of the above techniques, and compare and contrast various word decomposition techniques when used in combination with others. We experiment with fine-tuning large pretrained language models, and demonstrate their robustness to domain shift by studying Wikipedia attack, toxicity and Twitter hate speech datasets.
Submitted 2 October, 2019;
originally announced October 2019.
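The robustness issue described in the abstract above (a word-level model treating two spelling variants as unrelated tokens) can be sketched with character n-grams. This is only an illustration of why word decomposition helps; the trigram size, the boundary markers, and the Jaccard overlap measure are assumptions and are not taken from the paper.

```python
# Sketch: spelling variants share no word-level identity but overlap heavily
# once decomposed into character trigrams. Helper names are hypothetical.
from typing import Set


def char_ngrams(word: str, n: int = 3) -> Set[str]:
    """Return the set of character n-grams of a word padded with boundary markers."""
    padded = f"<{word}>"
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap between two non-empty sets."""
    return len(a & b) / len(a | b)


standard, variant = "stupid", "stuupid"
word_level_match = standard == variant  # a word-level vocabulary sees two unrelated tokens
subword_overlap = jaccard(char_ngrams(standard), char_ngrams(variant))
print(word_level_match, round(subword_overlap, 3))
```

A character or subword model can therefore reuse most of what it learned about the standard spelling when it encounters the obfuscated one, whereas a word-level model would map the variant to an unknown token.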
-
Multi Sense Embeddings from Topic Models
Authors:
Shobhit Jain,
Sravan Babu Bodapati,
Ramesh Nallapati,
Anima Anandkumar
Abstract:
Distributed word embeddings have yielded state-of-the-art performance in many NLP tasks, mainly due to their success in capturing useful semantic information. These representations assign only a single vector to each word whereas a large number of words are polysemous (i.e., have multiple meanings). In this work, we approach this critical problem in lexical semantics, namely that of representing various senses of polysemous words in vector spaces. We propose a topic modeling based skip-gram approach for learning multi-prototype word embeddings. We also introduce a method to prune the embeddings determined by the probabilistic representation of the word in each topic. We use our embeddings to show that they can capture the context and word similarity strongly and outperform various state-of-the-art implementations.
Submitted 3 February, 2020; v1 submitted 17 September, 2019;
originally announced September 2019.
-
Perturbations in higher derivative gravity beyond maximally symmetric spacetimes
Authors:
K. Sravan Kumar,
Shubham Maheshwari,
Anupam Mazumdar
Abstract:
We study (covariant) scalar-vector-tensor (SVT) perturbations of infinite derivative gravity (IDG), at the quadratic level of the action, around conformally-flat, covariantly constant curvature backgrounds which are not maximally symmetric spacetimes (MSS). This extends a previous analysis of perturbations done around MSS, which were shown to be ghost-free. We motivate our choice of backgrounds which arise as solutions of IDG in the UV, avoiding big bang and black hole singularities. Contrary to MSS, in this paper we show that, generically, all SVT modes are coupled to each other at the quadratic level of the action. We consider simple examples of the full IDG action, and illustrate this mixing and also a case where the action can be diagonalized and ghost-free solutions constructed. Our study is widely applicable for both non-singular cosmology and black hole physics where backgrounds depart from MSS. In appendices, we provide SVT perturbations around conformally-flat and arbitrary backgrounds which can serve as a compendium of useful results when studying SVT perturbations of various higher derivative gravity models.
Submitted 8 May, 2019;
originally announced May 2019.
-
Astro2020 Science White Paper: Are Supernovae the Dust Producer in the Early Universe?
Authors:
Jeonghee Rho,
Danny Milisavljevic,
Arkaprabha Sarangi,
Raffaella Margutti,
Ryan Chornock,
Armin Rest,
Melissa Graham,
J. Craig Wheeler,
Darren DePoy,
Lifan Wang,
Jennifer Marshall,
Grant Williams,
Rachel Street,
Warren Skidmore,
Yan Haojing,
Joshua Bloom,
Sumner Starrfield,
Chien-Hsiu Lee,
Philip S. Cowperthwaite,
Guy S. Stringfellow,
Deanne Coppejans,
Giacomo Terreran,
Niharika Sravan,
Thomas R. Geballe,
Aneurin Evans
, et al. (1 additional authors not shown)
Abstract:
Whether supernovae are a significant source of dust has been a long-standing debate. The large quantities of dust observed in high-redshift galaxies raise a fundamental question as to the origin of dust in the Universe since stars cannot have evolved to the AGB dust-producing phase in high-redshift galaxies. In contrast, supernovae occur within several millions of years after the onset of star formation. This white paper focuses on dust formation in supernova ejecta with US-Extremely Large Telescope (ELT) perspective during the era of JWST and LSST.
Submitted 17 April, 2019;
originally announced April 2019.
-
Astro2020 Science White Paper: Discovery Frontiers of Explosive Transients - An ELT & LSST Perspective
Authors:
Melissa L. Graham,
Danny Milisavljevic,
Armin Rest,
J. Craig Wheeler,
Ryan Chornock,
Raffaella Margutti,
Jeonghee Rho,
Chien-Hsiu Lee,
Sung-Chul Yoon,
Charles D. Kilpatrick,
Gautham Narayan,
Nathan Smith,
G. Grant Williams,
Niharika Sravan,
Philip Cowperthwaite,
Deanne Coppejans,
Giacomo Terreran,
Adriano Baldeschi,
V. Zach Golkhou,
Sumner Starrfield
Abstract:
The Large Synoptic Survey Telescope (LSST) will open a discovery frontier for faint and fast transients with its ability to detect variable flux components down to $\sim$24.5 mag in a $\sim$30 second exposure. Spectroscopic follow-up of such phenomena - which is necessary for understanding the physics of stellar explosions - can require a rapid response and several hours with an 8-10m telescope, making it both expensive and difficult to acquire. The future Extremely Large Telescopes (ELTs) would be able to provide not only spectroscopy but capabilities such as spectropolarimetry and high-resolution diffraction-limited imaging that would contribute to future advances in our physical understanding of stellar explosions. In this white paper we focus on several specific scientific impacts in the field of explosive transient astrophysics that will be generated by the combination of LSST's discovery abilities and ELTs' follow-up capacities. First, we map the uncharted frontier of discovery phase-space in terms of intrinsic luminosity and timescales for explosive transients, where we expect the unexpected. We then focus on six areas with open science questions for known transients: the progenitors of thermonuclear supernovae (SNe), mass loss prior to core collapse, asymmetry in stellar explosions, light echoes, high-$z$ transients, and strongly lensed SNe. We conclude with a brief discussion of the practical aspects of ELT & LSST synergy.
Submitted 11 April, 2019;
originally announced April 2019.
-
Achieving Transformative Understanding of Extreme Stellar Explosions with ELT-enabled Late-time Spectroscopy
Authors:
D. Milisavljevic,
R. Margutti,
R. Chornock,
A. Rest,
M. Graham,
D. DePoy,
J. Marshall,
V. Z. Golkhou,
G. Williams,
J. Rho,
R. Street,
W. Skidmore,
Y. Haojing,
J. Bloom,
S. Starrfield,
C. -H. Lee,
P. S. Cowperthwaite,
G. Stringfellow,
D. Coppejans,
G. Terreran,
N. Sravan,
O. Fox,
J. Mauerhan,
K. S. Long,
W. P. Blair
, et al. (13 additional authors not shown)
Abstract:
Supernovae are among the most powerful and influential explosions in the universe. They are also ideal multi-messenger laboratories to study extreme astrophysics. However, many fundamental properties of supernovae related to their diverse progenitor systems and explosion mechanisms remain poorly constrained. Here we outline how late-time spectroscopic observations obtained during the nebular phase (several months to years after explosion), made possible with the next generation of Extremely Large Telescopes, will facilitate transformational science opportunities and rapidly accelerate the community towards our goal of achieving a complete understanding of supernova explosions. We highlight specific examples of how complementary GMT and TMT instrumentation will enable high fidelity spectroscopy from which the line profiles and luminosities of elements tracing mass loss and ejecta can be used to extract kinematic and chemical information with unprecedented detail, for hundreds of objects. This will provide uniquely powerful constraints on the evolutionary phases stars may experience approaching a supernova explosion; the subsequent explosion dynamics; their nucleosynthesis yields; and the formation of compact objects that may act as central engines.
Submitted 11 April, 2019;
originally announced April 2019.
-
Multi-Messenger Astronomy with Extremely Large Telescopes
Authors:
Ryan Chornock,
Philip S. Cowperthwaite,
Raffaella Margutti,
Dan Milisavljevic,
Kate D. Alexander,
Igor Andreoni,
Iair Arcavi,
Adriano Baldeschi,
Jennifer Barnes,
Eric Bellm,
Paz Beniamini,
Edo Berger,
Christopher P. L. Berry,
Federica Bianco,
Peter K. Blanchard,
Joshua S. Bloom,
Sarah Burke-Spolaor,
Eric Burns,
Dario Carbone,
S. Bradley Cenko,
Deanne Coppejans,
Alessandra Corsi,
Michael Coughlin,
Maria R. Drout,
Tarraneh Eftekhari
, et al. (60 additional authors not shown)
Abstract:
The field of time-domain astrophysics has entered the era of Multi-messenger Astronomy (MMA). One key science goal for the next decade (and beyond) will be to characterize gravitational wave (GW) and neutrino sources using the next generation of Extremely Large Telescopes (ELTs). These studies will have a broad impact across astrophysics, informing our knowledge of the production and enrichment history of the heaviest chemical elements, constraining the dense matter equation of state, providing independent constraints on cosmology, increasing our understanding of particle acceleration in shocks and jets, and studying the lives of black holes in the universe. Future GW detectors will greatly improve their sensitivity during the coming decade, as will near-infrared telescopes capable of independently finding kilonovae from neutron star mergers. However, the electromagnetic counterparts to high-frequency (LIGO/Virgo band) GW sources will be distant and faint and thus demand ELT capabilities for characterization. ELTs will be important and necessary contributors to an advanced and complete multi-messenger network.
Submitted 11 March, 2019;
originally announced March 2019.
-
Multiplayer Multi-armed Bandits for Optimal Assignment in Heterogeneous Networks
Authors:
Harshvardhan Tibrewal,
Sravan Patchala,
Manjesh K. Hanawal,
Sumit J. Darak
Abstract:
We consider an ad hoc network where multiple users access the same set of channels. The channel characteristics are unknown and could be different for each user (heterogeneous). No controller is available to coordinate channel selections by the users, and if multiple users select the same channel, they collide and none of them receive any rate (or reward). For such a completely decentralized network we develop algorithms that aim to achieve optimal network throughput. Due to lack of any direct communication between the users, we allow each user to exchange information by transmitting in a specific pattern and sense such transmissions from others. However, such transmissions and sensing for information exchange do not add to network throughput. For the wideband sensing and narrowband sensing scenarios, we first develop explore-and-commit algorithms that converge to near-optimal allocation with high probability in a small number of rounds. Building on this, we develop an algorithm that gives logarithmic regret, even when the number of users changes with time. We validate our claims through extensive experiments and show that our algorithms perform significantly better than the state-of-the-art CSM-MAB, dE3 and dE3-TS algorithms.
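The collision model stated in this abstract can be sketched in a few lines. This is only an illustration of the reward rule (colliding users receive nothing) together with a brute-force search for the best collision-free assignment; it is not the explore-and-commit algorithm of the paper. The function names and the rate matrix `MEAN_RATES` are hypothetical.

```python
from collections import Counter
from itertools import permutations

# Hypothetical expected rates: MEAN_RATES[u][c] is the rate of user u on channel c.
# Rows differ from each other, reflecting the heterogeneous setting.
MEAN_RATES = [
    [0.9, 0.5, 0.1],
    [0.8, 0.7, 0.2],
    [0.3, 0.4, 0.6],
]


def collision_rewards(choices, mean_rates):
    """Expected reward per user: zero whenever two or more users pick the same channel."""
    counts = Counter(choices)
    return [
        mean_rates[user][channel] if counts[channel] == 1 else 0.0
        for user, channel in enumerate(choices)
    ]


def best_assignment(mean_rates):
    """Brute-force the collision-free assignment with the highest total expected rate."""
    n_users = len(mean_rates)
    n_channels = len(mean_rates[0])
    return max(
        permutations(range(n_channels), n_users),
        key=lambda assignment: sum(collision_rewards(assignment, mean_rates)),
    )


collided = collision_rewards([0, 0, 2], MEAN_RATES)  # users 0 and 1 collide on channel 0
optimal = best_assignment(MEAN_RATES)
optimal_throughput = sum(collision_rewards(optimal, MEAN_RATES))
print(collided, optimal, optimal_throughput)
```

The brute-force search assumes the rates are known, which is exactly what the decentralized users in the abstract do not have; it only shows the target that a learning algorithm would try to reach.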
Submitted 29 August, 2019; v1 submitted 12 January, 2019;
originally announced January 2019.
-
Non-local Starobinsky inflation in the light of future CMB
Authors:
K. Sravan Kumar,
Leonardo Modesto
Abstract:
Analytic infinite derivative (AID) non-local quadratic curvature gravity in the Weyl basis is known to be ghost-free, superrenormalizable or finite, and perturbatively unitary, and as such it is ultraviolet (UV) complete. Recently $R+R^2$ ("Starobinsky") inflation was successfully embedded in AID non-local gravity and the corresponding observables were computed. In this paper, we derive the form factors compatible with the near-de Sitter approximation and prove that the theory must contain a scalaron that drives inflationary expansion. Furthermore, we consider the form factors (AID non-local operators) proposed by Tomboulis in hep-th/9702146 and compute the corresponding predictions for the tensor tilt and the tensor-to-scalar ratio $\left( n_t,\,r \right)$, where the scalar tilt remains the same as in the local Starobinsky model. Anticipating that future CMB probes will be able to test non-local Starobinsky inflation, we constrain the scale of non-locality to be $10^{14}\,GeV\lesssim\mathcal{M}\lesssim 5\times 10^{14}\,GeV$ and $10^{-7}\lesssim r \lesssim 0.07$ for different form factors. We find that it is possible to have a blue or red tensor tilt $\left( n_t\gtrless 0 \right)$ depending on the scale of non-locality and the form factor. We also comment on Higgs inflation in the non-local context.
Submitted 4 October, 2018;
originally announced October 2018.
-
Progenitors of Type IIb Supernovae: I. Evolutionary Pathways and Rates
Authors:
Niharika Sravan,
Pablo Marchant,
Vassiliki Kalogera
Abstract:
Type IIb supernovae (SNe) are important candidates to understand mechanisms that drive the stripping of stripped-envelope (SE) supernova (SN) progenitors. While binary interactions and their high incidence are generally cited to favor them as Type IIb SN progenitors, this idea has not been tested using models covering a broad parameter space. In this paper, we use non-rotating single- and binary-star models at solar and low metallicities spanning a wide parameter space in primary mass, mass ratio, orbital period, and mass transfer efficiencies. We find that our single- and binary-star models contribute to roughly equal, however small, numbers of Type IIb SNe at solar metallicity. Binaries only dominate as progenitors at low metallicity. We also find that our models can account for less than half the observationally inferred rate for Type IIb SNe at solar metallicity, with computed rates $\lesssim 4\%$ of core-collapse (CC) SNe. On the other hand, our models can account for the rates currently indicated by observations at low metallicity, with computed rates as high as 15% of CC SNe. However, this requires low mass transfer efficiencies ($\lesssim 0.1$) to prevent most progenitors from entering contact. We suggest that the stellar wind mass-loss rates at solar metallicity used in our models are too high. Lower mass-loss rates would widen the parameter space for binary Type IIb SNe at solar metallicity by allowing stars that initiate mass transfer earlier in their evolution to reach CC without getting fully stripped.
Submitted 2 October, 2019; v1 submitted 22 August, 2018;
originally announced August 2018.
-
Inflaton candidates: from string theory to particle physics
Authors:
K. Sravan Kumar
Abstract:
Cosmic inflation is the cornerstone of modern cosmology. In particular, following the Planck mission reports presented in 2015 regarding the cosmic microwave background (CMB), there is an increasing interest in searching for inflaton candidates within fundamental theories and in ultimately testing them with future CMB data. This thesis presents inflationary models using a methodology that can be described as venturing top-down or bottom-up along energy scales. In the top-down motivation, we study inflationary scenarios in string theory and supergravity (SUGRA), namely with (multiple) 3-forms, a Dirac-Born-Infeld Galileon model, a string field theory setup and $\mathcal{N}=1$ SUGRA $α-$attractor models. In the bottom-up motivation, we construct a grand unified theory based inflationary model with an additional conformal symmetry and study not only inflation but also provide predictions related to particle physics. Our research work includes various classes of inflation driven by scalar fields under canonical, non-canonical and induced gravity frameworks. All these models are consistent with Planck data, supported by key primordial cosmological parameters such as the scalar spectral index $n_{s}$, the tensor to scalar ratio $r$, together with the primordial non-Gaussianities. Future probes aiming to detect primordial gravitational waves and CMB non-Gaussianities can further help to distinguish between them.
Submitted 24 July, 2018;
originally announced August 2018.
-
Conformal GUT inflation, proton lifetime and non-thermal leptogenesis
Authors:
K. Sravan Kumar,
Paulo Vargas Moniz
Abstract:
In this paper, we generalize Coleman-Weinberg (CW) inflation in grand unified theories (GUTs) such as $\text{SU}(5)$ and $\text{SO}(10)$ by means of considering two complex singlet fields with conformal invariance. In this framework, inflation emerges from a spontaneously broken conformal symmetry. The GUT symmetry implies a potential with a CW form, as a consequence of radiative corrections. The conformal symmetry flattens the above VEV branch of the CW potential to a Starobinsky plateau. As a result, we obtain $n_{s}\sim 1-\frac{2}{N}$ and $r\sim \frac{12}{N^2}$ for $N\sim 50-60$ e-foldings. Furthermore, this framework allows us to estimate the proton lifetime as $τ_{p}\lesssim 10^{40}$ years, whose decay is mediated by the superheavy gauge bosons. Moreover, we implement a type I seesaw mechanism by weakly coupling the complex singlet, which carries two units of lepton number, to the three generations of singlet right handed neutrinos (RHNs). The spontaneous symmetry breaking of global lepton number amounts to the generation of neutrino masses. We also consider non-thermal leptogenesis in which the inflaton dominantly decays into heavy RHNs, which source the observed baryon asymmetry. We constrain the couplings of the inflaton field to the RHNs, which gives the reheating temperature as $10^{6}\text{ GeV}\lesssim T_{R}<10^{9}$ GeV.
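To make the quoted leading-order relations concrete, the short sketch below evaluates $n_s \sim 1 - 2/N$ and $r \sim 12/N^2$ at the two ends of the stated range of e-foldings. The function name `plateau_predictions` is an assumption, and the numbers are only the leading-order estimates from the formulas in the abstract, not the paper's full results.

```python
def plateau_predictions(n_efolds: float) -> tuple:
    """Leading-order scalar tilt and tensor-to-scalar ratio for N e-foldings."""
    n_s = 1.0 - 2.0 / n_efolds   # n_s ~ 1 - 2/N
    r = 12.0 / n_efolds ** 2     # r ~ 12/N^2
    return n_s, r


# Evaluate at the two ends of the quoted range N ~ 50-60.
predictions = {n: plateau_predictions(n) for n in (50, 60)}

for n, (n_s, r) in predictions.items():
    print(f"N = {n}: n_s ~ {n_s:.4f}, r ~ {r:.4f}")
```

For $N = 50$ this gives $n_s \approx 0.96$ and $r \approx 0.0048$; for $N = 60$ it gives $n_s \approx 0.967$ and $r \approx 0.0033$.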
Submitted 3 November, 2019; v1 submitted 23 June, 2018;
originally announced June 2018.
-
$R^2$ inflation to probe non-perturbative quantum gravity
Authors:
Alexey S. Koshelev,
K. Sravan Kumar,
Alexei A. Starobinsky
Abstract:
It is natural to expect a consistent inflationary model of the very early Universe to be an effective theory of quantum gravity, at least at energies much less than the Planck one. For the moment, $R+R^2$, or $R^2$ for short, inflation is the most successful in accounting for the latest CMB data from the PLANCK satellite and other experiments. Moreover, recently it was shown to be ultra-violet (UV) complete via an embedding into an analytic infinite derivative (AID) non-local gravity. In this paper, we derive the most general theory of gravity that contributes to perturbed linear equations of motion around maximally symmetric space-times. We show that such a theory is quadratic in the Ricci scalar and the Weyl tensor with AID operators along with the Einstein-Hilbert term and possibly a cosmological constant. We explicitly demonstrate that the introduction of the Ricci tensor squared term is redundant. Working in this quadratic AID gravity framework without a cosmological term we prove that, for a specified class of space homogeneous space-times, the space of solutions to the equations of motion is identical to the space of backgrounds in a local $R^2$ model. We further compute the full second order perturbed action around any background belonging to that class. We proceed by extracting the key inflationary parameters of our model such as the spectral index ($n_s$), the tensor-to-scalar ratio ($r$) and the tensor tilt ($n_t$). It appears that $n_s$ remains the same as in the local $R^2$ inflation in the leading slow-roll approximation, while $r$ and $n_t$ get modified due to modification of the tensor power spectrum. This class of models allows for any value of $r<0.07$ with a modified consistency relation which can be fixed by future observations of primordial $B$-modes of the CMB polarization. This makes the UV complete $R^2$ gravity a natural target for future CMB probes.
Submitted 23 November, 2017;
originally announced November 2017.
-
Finite quantum gravity in dS and AdS spacetimes
Authors:
Alexey S. Koshelev,
K. Sravan Kumar,
Leonardo Modesto,
Leslaw Rachwal
Abstract:
We study the properties of a large class of weakly nonlocal gravitational theories around the (anti-) de Sitter spacetime background. In particular, we explicitly prove that the kinetic operator for the graviton field has the same structure as the one in Einstein-Hilbert theory around any maximally symmetric spacetime. Therefore, the perturbative spectrum is the same as in standard general relativity, while the propagator on any maximally symmetric spacetime is a mere generalization of the one from Einstein's gravity derived and extensively studied in several previous papers. At the quantum level, the range of theories presented here is superrenormalizable or finite when proper terms (not affecting the propagator) cubic or higher in curvatures are added. Finally, it is proven that for a large class of nonlocal theories whose actions involve neither the Weyl nor the Riemann tensor, the theory is classically equivalent to the Einstein-Hilbert one with cosmological constant by means of a metric field redefinition at any perturbative order.
Submitted 14 August, 2018; v1 submitted 21 October, 2017;
originally announced October 2017.
-
Constraints on the Progenitor System of SN 2016gkg from a Comprehensive Statistical Analysis
Authors:
Niharika Sravan,
Pablo Marchant,
Vassiliki Kalogera,
Raffaella Margutti
Abstract:
Type IIb supernovae (SNe) present a unique opportunity for understanding the progenitors of stripped-envelope (SE) SNe, as the stellar progenitors of several Type IIb SNe have been identified in pre-explosion images. In this paper, we use Bayesian inference and a large grid of non-rotating solar-metallicity single and binary stellar models to derive the associated probability distributions of single and binary progenitors of the Type IIb SN 2016gkg using existing observational constraints. We find that potential binary star progenitors have smaller pre-SN hydrogen-envelope and helium-core masses than potential single-star progenitors, typically by 0.1 Msun and 2 Msun, respectively. We find that a binary companion, if present, is a main-sequence or red-giant star. Apart from this, we do not find strong constraints on the nature of the companion star. We demonstrate that the range of progenitor helium-core mass inferred from observations could help improve constraints on the progenitor. We find that the probability that the progenitor of SN 2016gkg was a binary is 22% when we use constraints only on the progenitor luminosity and effective temperature. When we impose the range of pre-SN progenitor hydrogen-envelope mass and radius inferred from SN light-curves, the probability that the progenitor is a binary increases to 44%. However, there is no clear preference for a binary progenitor. This is in contrast to binaries being the currently favored formation channel for Type IIb SNe. Our analysis demonstrates the importance of statistical inference methods to constrain progenitor channels.
Submitted 22 August, 2018; v1 submitted 14 August, 2017;
originally announced August 2017.
-
On signatures of spontaneous collapse dynamics modified single field inflation
Authors:
Shreya Banerjee,
Suratna Das,
K. Sravan Kumar,
T. P. Singh
Abstract:
The observed classicality of primordial perturbations, despite their quantum origin during inflation, calls for a mechanism for the quantum-to-classical transition of these initial fluctuations. As the literature suggests a number of plausible mechanisms which try to address this issue, it is important to seek concrete observational signatures of these approaches in order to have a better understanding of the early universe dynamics. Among these approaches, it is the spontaneous collapse dynamics of Quantum Mechanics which is most capable of leaving discrete observational signatures, as the collapse mechanism inherently changes the generic Quantum dynamics. We observe in this study that the observables from the scalar sector, i.e. scalar tilt $n_s$, running of scalar tilt $α_s$ and running of running of scalar tilt $β_s$, cannot distinguish a collapse modified inflationary dynamics in the realm of canonical scalar field and $k-$inflationary scenarios. The only distinguishable imprint of the collapse mechanism lies in the observables of the tensor sector, in the form of a modified consistency relation and a blue-tilted tensor spectrum, and only when the collapse parameter $δ$ is non-zero and positive.
Submitted 19 May, 2017; v1 submitted 29 December, 2016;
originally announced December 2016.
-
Interacting 3-form dark energy models: distinguishing interactions and avoiding the Little Sibling of the Big Rip
Authors:
João Morais,
Mariam Bouhmadi-López,
K. Sravan Kumar,
João Marto,
Yaser Tavakoli
Abstract:
In this paper we consider 3-form dark energy (DE) models with interactions in the dark sector. We aim to distinguish the phenomenological interactions that are defined through the dark matter (DM) and the DE energy densities. We do our analysis mainly in two stages. In the first stage, we identify the non-interacting 3-form DE model which generically leads to an abrupt late-time cosmological event which is known as the little sibling of the Big Rip (LSBR). We classify the interactions which can possibly avoid this late-time abrupt event. We also study the parameter space of the model that is consistent with the interaction between DM and DE energy densities at present as indicated by recent studies based on BAO and SDSS data. In the second stage, we observationally distinguish those interactions using the statefinder hierarchy parameters $\{ S_{3}^{(1)}\,,\, S_{4}^{(1)}\} \,,\,\{ S_{3}^{(1)}\,,\, S_{5}^{(1)}\} .$ We also compute the growth factor parameter $ε(z)$ for the various interactions we consider herein and use the composite null diagnostic (CND) $\{ S_{3}^{(1)}\,,\,ε(z)\} $ as a tool to characterise those interactions by measuring their departures from the concordance model. In addition, we make a preliminary analysis of our model in light of the recently released data by SDSS III on the measurement of the linear growth rate of structure.
Submitted 17 November, 2016; v1 submitted 4 August, 2016;
originally announced August 2016.
-
Non-Gaussianity in multiple three-form field inflation
Authors:
K. Sravan Kumar,
David J. Mulryne,
Nelson J. Nunes,
João Marto,
Paulo Vargas Moniz
Abstract:
In this work, we present a method for implementing the $δN$ formalism to study the primordial non-Gaussianity produced in multiple three-form field inflation. Using a dual description relating three-form fields to noncanonical scalar fields, and employing existing results, we produce expressions for the bispectrum of the curvature perturbation in terms of three-form quantities. We study the bispectrum generated in a two three-form field inflationary scenario for a particular potential that for suitable values of the parameters was found in earlier work to give values of the spectral index and ratio of tensor to scalar perturbations compatible with current bounds. We calculate the reduced bispectrum for this model, finding an amplitude in equilateral and orthogonal configurations of ${\cal O}(1)$ and in the squeezed limit of ${\cal O}(10^{-3})$. We confirm, therefore, that this three-form inflationary scenario is compatible with present observational constraints.
Submitted 8 November, 2016; v1 submitted 22 June, 2016;
originally announced June 2016.
-
$K$-essence model from the mechanical approach point of view: coupled scalar field and the late cosmic acceleration
Authors:
Mariam Bouhmadi-López,
K. Sravan Kumar,
João Marto,
João Morais,
Alexander Zhuk
Abstract:
In this paper, we consider the Universe at the late stage of its evolution and deep inside the cell of uniformity. At these scales, we can consider the Universe to be filled with dust-like matter in the form of discretely distributed galaxies, a $K$-essence scalar field, playing the role of dark energy, and radiation as matter sources. We investigate such a Universe in the mechanical approach. This means that the peculiar velocities of the inhomogeneities (in the form of galaxies) as well as the fluctuations of the other perfect fluids are non-relativistic. Such fluids are designated as coupled because they are concentrated around the inhomogeneities. In the present paper, we investigate the conditions under which the $K$-essence scalar field with the most general form for its action can become coupled. We investigate at the background level three particular examples of the $K$-essence models: (i) the pure kinetic $K$-essence field, (ii) a $K$-essence with a constant speed of sound and (iii) the $K$-essence model with the Lagrangian $bX+cX^2-V(φ)$. We demonstrate that if the $K$-essence is coupled, all these $K$-essence models take the form of multicomponent perfect fluids where one of the components is the cosmological constant. Therefore, they can provide the late-time cosmic acceleration and be simultaneously compatible with the mechanical approach.
Submitted 19 July, 2016; v1 submitted 10 May, 2016;
originally announced May 2016.
-
Effective models of inflation from a non-local framework
Authors:
Alexey S. Koshelev,
K. Sravan Kumar,
Paulo Vargas Moniz
Abstract:
The dilaton is a possible inflaton candidate following recent CMB data allowing a non-minimal coupling to the Ricci curvature scalar in the early Universe. In this paper, we introduce an approach that has seldom been used in the literature, namely dilaton inflation with non-local features. More concretely, employing non-local features expressed in J. High Energy Phys. 04 (2007) 029, we study quadratic variations around a de Sitter geometry of an effective action with a non-local dilaton. The non-locality refers to an infinite derivative kinetic term involving the operator $\mathcal{F}\left(\Box\right)$. Algebraic roots of the characteristic equation $\mathcal{F}(z)=0$ play a crucial role in determining the properties of the theory. We subsequently study the cases in which $\mathcal{F}\left(\Box\right)$ has one real root or one complex root, from which we retrieve two concrete effective models of inflation. In the first case we retrieve a class of single-field inflationary models with a universal prediction of $n_{s}\sim0.967$ and any value of the tensor-to-scalar ratio $r<0.1$, intrinsically controlled by the root of the characteristic equation. The second case involves a new class of two-field conformally invariant models with a peculiar quadratic cross-product of scalar fields. In this latter case, we obtain Starobinsky-like inflation through a spontaneously broken conformal invariance. Furthermore, an uplifted minimum of the potential, which accounts for the vacuum energy after inflation, is produced naturally through this mechanism within our approach.
Submitted 8 November, 2017; v1 submitted 5 April, 2016;
originally announced April 2016.
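As an aside on the quoted value $n_{s}\sim0.967$: it matches the familiar leading-order large-$N$ estimate $n_s \approx 1 - 2/N$ of Starobinsky-like models evaluated near $N = 60$ e-folds. The abstract does not say how the paper arrives at the number, so the formula and the function name below are assumptions used only for illustration.

```python
def spectral_index_large_n(n_efolds: float) -> float:
    """Leading-order large-N estimate n_s ~ 1 - 2/N.

    Assumption: this is the textbook Starobinsky-like estimate, not a
    formula taken from the listed paper.
    """
    if n_efolds <= 0:
        raise ValueError("number of e-folds must be positive")
    return 1.0 - 2.0 / n_efolds


# Scan a conventional range of e-folds before the end of inflation.
for n in (50, 55, 60):
    print(f"N = {n}: n_s ~ {spectral_index_large_n(n):.4f}")
```

At $N = 60$ the estimate rounds to 0.967, consistent with the figure quoted in the abstract.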
-
Gravitational waves in $α-$attractors
Authors:
K. Sravan Kumar,
João Marto,
Paulo Vargas Moniz,
Suratna Das
Abstract:
We study inflation in the $α-$attractor model under a non-slow-roll dynamics with an ansatz proposed by Gong \& Sasaki \cite{Gong:2015ypa} of assuming $N=N\left(φ\right)$. Under this approach, we construct a class of local shapes of inflaton potential that are different from the T-models. We find this type of inflationary scenario predicts an attractor at $n_{s}\sim0.967$ and $r\sim0.00055$. In our approach, the non-slow-roll inflaton dynamics are related to the $α-$parameter which is the curvature of Kähler geometry in the SUGRA embedding of this model.
Submitted 28 December, 2015;
originally announced December 2015.
-
Coupled scalar fields in the late Universe: The mechanical approach and the late cosmic acceleration
Authors:
Alvina Burgazli,
Alexander Zhuk,
João Morais,
Mariam Bouhmadi-López,
K. Sravan Kumar
Abstract:
In this paper, we consider the Universe at the late stage of its evolution and deep inside the cell of uniformity. At these scales, we consider the Universe to be filled with dust-like matter in the form of discretely distributed galaxies, a minimally coupled scalar field and radiation as matter sources. We investigate such a Universe in the mechanical approach. This means that the peculiar velocities of the inhomogeneities (in the form of galaxies) as well as fluctuations of other perfect fluids are non-relativistic. Such fluids are designated as coupled because they are concentrated around inhomogeneities. In the present paper we investigate the conditions under which a scalar field can become coupled, and show that, at the background level, such a coupled scalar field behaves as a two-component perfect fluid: a network of frustrated cosmic strings with EoS parameter $w=-1/3$ and a cosmological constant. The potential of this scalar field is very flat at the present time. Hence, the coupled scalar field can provide the late cosmic acceleration. The fluctuations of the energy density and pressure of this field are concentrated around the galaxies screening their gravitational potentials. Therefore, such scalar fields can be regarded as coupled to the inhomogeneities.
Submitted 4 October, 2016; v1 submitted 11 December, 2015;
originally announced December 2015.
-
Strongly Time-Variable Ultra-Violet Metal Line Emission from the Circum-Galactic Medium of High-Redshift Galaxies
Authors:
N. Sravan,
C. -A. Faucher-Giguere,
F. van de Voort,
D. Keres,
A. L. Muratov,
P. F. Hopkins,
R. Feldmann,
E. Quataert,
N. Murray
Abstract:
We use cosmological simulations from the Feedback In Realistic Environments (FIRE) project, which implement a comprehensive set of stellar feedback processes, to study ultra-violet (UV) metal line emission from the circum-galactic medium of high-redshift (z=2-4) galaxies. Our simulations cover the halo mass range Mh ~ 2x10^11 - 8.5x10^12 Msun at z=2, representative of Lyman break galaxies. Of the transitions we analyze, the low-ionization C III (977 A) and Si III (1207 A) emission lines are the most luminous, with C IV (1548 A) and Si IV (1394 A) also showing interesting spatially-extended structures. The more massive halos are on average more UV-luminous. The UV metal line emission from galactic halos in our simulations arises primarily from collisionally ionized gas and is strongly time variable, with peak-to-trough variations of up to ~2 dex. The peaks of UV metal line luminosity correspond closely to massive and energetic mass outflow events, which follow bursts of star formation and inject sufficient energy into galactic halos to power the metal line emission. The strong time variability implies that even some relatively low-mass halos may be detectable. Conversely, flux-limited samples will be biased toward halos whose central galaxy has recently experienced a strong burst of star formation. Spatially-extended UV metal line emission around high-redshift galaxies should be detectable by current and upcoming integral field spectrographs such as the Multi Unit Spectroscopic Explorer (MUSE) on the Very Large Telescope and Keck Cosmic Web Imager (KCWI).
Submitted 17 August, 2016; v1 submitted 21 October, 2015;
originally announced October 2015.
-
Non-slow-roll dynamics in $α-$attractors
Authors:
K. Sravan Kumar,
João Marto,
Paulo Vargas Moniz,
Suratna Das
Abstract:
In this paper we consider the $α-$attractor model and study inflation under non-slow-roll dynamics. More precisely, we follow the approach recently proposed by Gong and Sasaki \cite{Gong:2015ypa} by means of assuming $N=N\left(φ\right)$. Within this framework we obtain a family of functions describing the local shape of the potential during inflation. We study a specific model and find an inflationary scenario predicting an attractor at $n_{s}\approx0.967$ and $r\approx5.5\times10^{-4}$. We further show that, considering non-slow-roll dynamics, the $α-$attractor model can be broadened to a wider class of models that remain compatible with values of $r<0.1$. We further explore the model parameter space with respect to large and small field inflation and conclude that the inflaton dynamics is connected to the $ α- $ parameter, which is also related to the Kähler manifold curvature in the supergravity (SUGRA) embedding of this model. We also comment on the stabilization of the inflaton's trajectory.
Submitted 4 April, 2016; v1 submitted 17 June, 2015;
originally announced June 2015.
-
DBI Galileon inflation in the light of Planck 2015
Authors:
K. Sravan Kumar,
Juan C. Bueno Sanchez,
Celia Escamilla-Rivera,
Joao Marto,
Paulo Vargas Moniz
Abstract:
In this work we consider a DBI Galileon (DBIG) inflationary model and constrain its parameter space with the Planck 2015 and BICEP2/Keck Array and Planck (BKP) joint analysis data by means of a potential-independent analysis. We focus our attention on inflationary solutions characterized by a constant or varying sound speed as well as warp factor. We impose bounds on stringy aspects of the model, such as the warp factor $\left(f\right)$ and the induced gravity parameter $\left(\tilde{m}\right)$. We study the parameter space of the model and find that the tensor-to-scalar ratio can be as low as $r\simeq6\times10^{-4}$ and that inflation occurs at the GUT scale. In addition, we obtain the tilt of the tensor power spectrum and test the standard inflationary consistency relation $\left(r=-8n_{t}\right)$ against the latest bounds from the combined results of BKP+Laser Interferometer Gravitational-Wave Observatory (LIGO), and find that DBIG inflation predicts a red spectral index for the tensor power spectrum.
Submitted 27 February, 2016; v1 submitted 6 April, 2015;
originally announced April 2015.
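For orientation, the standard consistency relation $r=-8n_{t}$ mentioned in the abstract can be evaluated directly. The sketch below only inverts that relation for the lowest tensor-to-scalar ratio quoted; it assumes the standard relation holds exactly, which is precisely what the paper sets out to test, so the result is a reference value rather than the DBIG prediction. The function name is hypothetical.

```python
def tensor_tilt_from_consistency(r: float) -> float:
    """Tensor tilt n_t implied by the standard relation r = -8 n_t.

    Assumption: the standard single-field relation is taken at face value;
    the DBIG model in the abstract may deviate from it.
    """
    if r < 0:
        raise ValueError("tensor-to-scalar ratio must be non-negative")
    return -r / 8.0


r_low = 6e-4  # lowest tensor-to-scalar ratio quoted in the abstract
n_t = tensor_tilt_from_consistency(r_low)
print(f"r = {r_low:.1e} -> n_t = {n_t:.2e}")
```

A negative $n_t$ corresponds to a red-tilted tensor spectrum, which is the sign the abstract reports for DBIG inflation.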
-
Inflation in a two 3-form fields scenario
Authors:
K. Sravan Kumar,
J. Marto,
Nelson J. Nunes,
Paulo Vargas Moniz
Abstract:
A setting constituted by $\mathbb{N}$ 3-form fields, without any direct interaction between them, minimally coupled to gravity, is introduced in this paper as a framework to study the early evolution of the universe. We focus particularly on the two 3-forms case. An inflationary scenario is found, emerging from the coupling to gravity. More concretely, the fields coupled in this manner exhibit a complex interaction, mediated by the time derivative of the Hubble parameter. Our investigation is supported by means of a suitable choice of potentials, employing numerical methods and analytical approximations. In more detail, the oscillations in the small-field limit become correlated, and one field is intertwined with the other. In this type of solution, a varying sound speed is present, together with the generation of isocurvature perturbations. These features make for an interesting model to test against observation. It is subsequently shown how our results are consistent with current CMB data (viz. Planck and BICEP2).
Submitted 3 July, 2014; v1 submitted 1 April, 2014;
originally announced April 2014.
-
Importance of Tides for Periastron Precession in Eccentric Neutron Star - White Dwarf Binaries
Authors:
Niharika Sravan,
Francesca Valsecchi,
Vassiliki Kalogera,
Leandro G. Althaus
Abstract:
Although not nearly as numerous as binaries with two white dwarfs, eccentric neutron star-white dwarf (NS-WD) binaries are important gravitational-wave (GW) sources for the next generation of space-based detectors sensitive to low frequency waves. Here we investigate periastron precession in these sources as a result of general relativistic, tidal, and rotational effects; such precession is expected to be detectable for at least some of the detected binaries of this type. Currently, two eccentric NS-WD binaries are known in the galactic field, PSR J1141-6545 and PSR B2303+46, both of which have orbits too wide to be relevant in their current state to GW observations. However, population synthesis studies predict the existence of a significant Galactic population of such systems. Though small in most of these systems, we find that tidally induced periastron precession becomes important when tides contribute to more than 3% of the total precession rate. For these systems, accounting for tides when analyzing periastron precession rate measurements can improve estimates of the WD component mass inferred and, in some cases, will prevent us from misclassifying the object. However, such systems are rare due to rapid orbital decay. To aid the inclusion of tidal effects when using periastron precession as a mass measurement tool, we derive a function that relates the WD radius and periastron precession constant to the WD mass.
Submitted 18 September, 2014; v1 submitted 4 January, 2014;
originally announced January 2014.
-
The Mass Distribution of Stellar-Mass Black Holes
Authors:
Will M. Farr,
Niharika Sravan,
Andrew Cantrell,
Laura Kreidberg,
Charles D. Bailyn,
Ilya Mandel,
Vicky Kalogera
Abstract:
We perform a Bayesian analysis of the mass distribution of stellar-mass black holes using the observed masses of 15 low-mass X-ray binary systems undergoing Roche lobe overflow and five high-mass, wind-fed X-ray binary systems. Using Markov Chain Monte Carlo calculations, we model the mass distribution both parametrically---as a power law, exponential, gaussian, combination of two gaussians, or log-normal distribution---and non-parametrically---as histograms with varying numbers of bins. We provide confidence bounds on the shape of the mass distribution in the context of each model and compare the models with each other by calculating their relative Bayesian evidence as supported by the measurements, taking into account the number of degrees of freedom of each model. The mass distribution of the low-mass systems is best fit by a power-law, while the distribution of the combined sample is best fit by the exponential model. We examine the existence of a "gap" between the most massive neutron stars and the least massive black holes by considering the value, M_1%, of the 1% quantile from each black hole mass distribution as the lower bound of black hole masses. The best model (the power law) fitted to the low-mass systems has a distribution of lower-bounds with M_1% > 4.3 Msun with 90% confidence, while the best model (the exponential) fitted to all 20 systems has M_1% > 4.5 Msun with 90% confidence. We conclude that our sample of black hole masses provides strong evidence of a gap between the maximum neutron star mass and the lower bound on black hole masses. Our results on the low-mass sample are in qualitative agreement with those of Ozel, et al (2010).
Submitted 24 October, 2011; v1 submitted 5 November, 2010;
originally announced November 2010.
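To make the quantity M_1% concrete, here is a minimal sketch of the 1% quantile for a single exponential mass distribution with a hard lower cutoff. The parameterization (cutoff `m_min`, scale `m_scale`) and the numerical values are assumptions chosen for illustration; they are not the paper's fitted parameters, and the paper derives a posterior distribution of M_1% rather than a single number.

```python
import math


def exp_cdf(m: float, m_min: float, m_scale: float) -> float:
    """CDF of an exponential mass distribution that starts at m_min."""
    if m < m_min:
        return 0.0
    return 1.0 - math.exp(-(m - m_min) / m_scale)


def exp_quantile(p: float, m_min: float, m_scale: float) -> float:
    """Mass below which a fraction p of the distribution lies."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must lie in [0, 1)")
    return m_min - m_scale * math.log(1.0 - p)


# Hypothetical parameters in solar masses, for illustration only.
m_min, m_scale = 5.0, 2.0
m_1pct = exp_quantile(0.01, m_min, m_scale)
print(f"M_1% = {m_1pct:.3f} Msun, CDF check = {exp_cdf(m_1pct, m_min, m_scale):.4f}")
```

Repeating this calculation for each posterior sample of the model parameters would give a distribution of M_1% values, from which a 90% lower bound of the kind quoted in the abstract could be read off.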
-
Audio enabled information extraction system for cricket and hockey domains
Authors:
S. Saraswathi,
Narasimha Sravan. V,
Sai Vamsi Krishna. B. V,
Suresh Reddy. S
Abstract:
The proposed system aims to retrieve summarized information from documents collected from a web-based search engine in response to user queries related to the cricket and hockey domains. The system is designed to take voice commands as keywords for search. The parts of speech in the query are extracted using a natural language extractor for English. Based on the keywords, the search is categorized into two types: 1. Concept-wise search: information relevant to the query is retrieved based on the keywords and the concept words related to them. The retrieved information is summarized using the probabilistic approach and the weighted means algorithm. 2. Keyword search: extracts the result relevant to the query from the highly ranked document retrieved by the search engine. The relevant search results are retrieved and the keywords are then used for summarization. During summarization, the system follows the weighted and probabilistic approaches in order to identify the data comparable to the extracted keywords. The extracted information is then refined repeatedly through an aggregation process to reduce redundancy. Finally, the resulting data is presented to the user as audio output.
Submitted 26 April, 2010;
originally announced April 2010.