-
SUTRA: A Novel Approach to Modelling Pandemics with Applications to COVID-19
Authors:
Manindra Agrawal,
Madhuri Kanitkar,
Deepu Phillip,
Tanima Hajra,
Arti Singh,
Avaneesh Singh,
Prabal Pratap Singh,
Mathukumalli Vidyasagar
Abstract:
The Covid-19 pandemic has two key properties: (i) asymptomatic cases (both detected and undetected) that can result in new infections, and (ii) time-varying characteristics due to new variants, Non-Pharmaceutical Interventions etc. We develop a model called SUTRA (Susceptible, Undetected though infected, Tested positive, and Removed Analysis) that takes into account both of these two key propertie…
▽ More
The Covid-19 pandemic has two key properties: (i) asymptomatic cases (both detected and undetected) that can result in new infections, and (ii) time-varying characteristics due to new variants, Non-Pharmaceutical Interventions etc. We develop a model called SUTRA (Susceptible, Undetected though infected, Tested positive, and Removed Analysis) that takes into account both of these two key properties. While applying the model to a region, two parameters of the model can be learnt from the number of daily new cases found in the region. Using the learnt values of the parameters the model can predict the number of daily new cases so long as the learnt parameters do not change substantially. Whenever any of the two parameters changes due to the key property (ii) above, the SUTRA model can detect that the values of one or both of the parameters have changed. Further, the model has the capability to relearn the changed parameter values, and then use these to carry out the prediction of the trajectory of the pandemic for the region of concern. The SUTRA approach can be applied at various levels of granularity, from an entire country to a district, more specifically, to any large enough region for which the data of daily new cases are available.
We have applied the SUTRA model to thirty-two countries, covering more than half of the world's population. Our conclusions are: (i) The model is able to capture the past trajectories very well. Moreover, the parameter values, which we can estimate robustly, help quantify the impact of changes in the pandemic characteristics. (ii) Unless the pandemic characteristics change significantly, the model has good predictive capability. (iii) Natural immunity provides significantly better protection against infection than the currently available vaccines.
△ Less
Submitted 25 October, 2022; v1 submitted 22 January, 2021;
originally announced January 2021.
-
Estimating Hidden Asymptomatics, Herd Immunity Threshold and Lockdown Effects using a COVID-19 Specific Model
Authors:
Shaurya Kaushal,
Abhineet Singh Rajput,
Soumyadeep Bhattacharya,
M. Vidyasagar,
Aloke Kumar,
Meher K. Prakash,
Santosh Ansumali
Abstract:
A quantitative COVID-19 model that incorporates hidden asymptomatic patients is developed, and an analytic solution in parametric form is given. The model incorporates the impact of lockdown and resulting spatial migration of population due to announcement of lockdown. A method is presented for estimating the model parameters from real-world data. It is shown that increase of infections slows down…
▽ More
A quantitative COVID-19 model that incorporates hidden asymptomatic patients is developed, and an analytic solution in parametric form is given. The model incorporates the impact of lockdown and resulting spatial migration of population due to announcement of lockdown. A method is presented for estimating the model parameters from real-world data. It is shown that increase of infections slows down and herd immunity is achieved when symptomatic patients are 4-6\% of the population for the European countries we studied, when the total infected fraction is between 50-56 \%. Finally, a method for estimating the number of asymptomatic patients, who have been the key hidden link in the spread of the infections, is presented.
△ Less
Submitted 29 May, 2020;
originally announced June 2020.
-
Machine Learning Methods in the Computational Biology of Cancer
Authors:
Mathukumalli Vidyasagar
Abstract:
The objectives of this "perspective" paper are to review some recent advances in sparse feature selection for regression and classification, as well as compressed sensing, and to discuss how these might be used to develop tools to advance personalized cancer therapy. As an illustration of the possibilities, a new algorithm for sparse regression is presented, and is applied to predict the time to t…
▽ More
The objectives of this "perspective" paper are to review some recent advances in sparse feature selection for regression and classification, as well as compressed sensing, and to discuss how these might be used to develop tools to advance personalized cancer therapy. As an illustration of the possibilities, a new algorithm for sparse regression is presented, and is applied to predict the time to tumor recurrence in ovarian cancer. A new algorithm for sparse feature selection in classification problems is presented, and its validation in endometrial cancer is briefly discussed. Some open problems are also presented.
△ Less
Submitted 24 February, 2014;
originally announced February 2014.
-
Reverse Engineering Gene Interaction Networks Using the Phi-Mixing Coefficient
Authors:
Nitin Kumar Singh,
M. Eren Ahsen,
Shiva Mankala,
Hyun-Seok Kim,
Michael A. White,
M. Vidyasagar
Abstract:
Constructing gene interaction networks (GINs) from high-throughput gene expression data is an important and challenging problem in systems biology. Existing algorithms produce networks that either have undirected and unweighted edges, or else are constrained to contain no cycles, both of which are biologically unrealistic. In the present paper we propose a new algorithm, based on a concept from pr…
▽ More
Constructing gene interaction networks (GINs) from high-throughput gene expression data is an important and challenging problem in systems biology. Existing algorithms produce networks that either have undirected and unweighted edges, or else are constrained to contain no cycles, both of which are biologically unrealistic. In the present paper we propose a new algorithm, based on a concept from probability theory known as the phi-mixing coefficient, that produces networks whose edges are weighted and directed, and are permitted to contain cycles. Because there is no "ground truth" for genome-wide networks on a human scale, we analyzed the outcomes of several experiments on lung cancer, and matched the predictions from the inferred networks with experimental results. Specifically, we inferred three networks (NSCLC, Neuro-endocrine NSCLC plus SCLC, and normal) from the gene expression measurements of 157 lung cancer and 59 normal cell lines, compared with the outcomes of siRNA screening of 19,000+ genes on 11 NSCLC cell lines, and analyzed data from a ChIP-Seq experiment to determine putative downstream targets of the lineage specific oncogenic transcription factor ASCL1. The inferred networks displayed a scale-free or power law behavior between the degree of a node and the number of nodes with that degree. There was a strong correlation between the degree of a gene in the inferred NSCLC network and its essentiality for the survival of the cells. The inferred downstream neighborhood genes of ASCL1 in the SCLC network were significantly enriched by ChIP-Seq determined putative target genes, while no such enrichment was found in the inferred NSCLC network.
△ Less
Submitted 12 March, 2016; v1 submitted 20 August, 2012;
originally announced August 2012.
-
Mixing Coefficients Between Discrete and Real Random Variables: Computation and Properties
Authors:
Mehmet Eren Ahsen,
Mathukumalli Vidyasagar
Abstract:
In this paper we study the problem of estimating the alpha-, beta- and phi-mixing coefficients between two random variables, that can either assume values in a finite set or the set of real numbers. In either case, explicit closed-form formulas for the beta-mixing coefficient are already known. Therefore for random variables assuming values in a finite set, our contributions are two-fold: (i) In t…
▽ More
In this paper we study the problem of estimating the alpha-, beta- and phi-mixing coefficients between two random variables, that can either assume values in a finite set or the set of real numbers. In either case, explicit closed-form formulas for the beta-mixing coefficient are already known. Therefore for random variables assuming values in a finite set, our contributions are two-fold: (i) In the case of the alpha-mixing coefficient, we show that determining whether or not it exceeds a prespecified threshold is NP-complete, and provide efficiently computable upper and lower bounds. (ii) We derive an exact closed-form formula for the phi-mixing coefficient. Next, we prove analogs of the data-processing inequality from information theory for each of the three kinds of mixing coefficients. Then we move on to real-valued random variables, and show that by using percentile binning and allowing the number of bins to increase more slowly than the number of samples, we can generate empirical estimates that are consistent, i.e., converge to the true values as the number of samples approaches infinity.
△ Less
Submitted 3 July, 2013; v1 submitted 8 August, 2012;
originally announced August 2012.