-
Approaching allelic probabilities and Genome-Wide Association Studies from beta distributions
Authors:
José Santiago García-Cremades,
Angel del Río,
José A. García,
Javier Gayán,
Antonio González-Pérez,
Agustín Ruiz,
O. Sotolongo-Grau,
Manuel Ruiz-Marín
Abstract:
In this paper we have proposed a model for the distribution of allelic probabilities for generating populations as reliably as possible. Our objective was to develop such a model which would allow simulating allelic probabilities with different observed truncation and de- gree of noise. In addition, we have also introduced here a complete new approach to analyze a genome-wide association study (GW…
▽ More
In this paper we have proposed a model for the distribution of allelic probabilities for generating populations as reliably as possible. Our objective was to develop such a model which would allow simulating allelic probabilities with different observed truncation and de- gree of noise. In addition, we have also introduced here a complete new approach to analyze a genome-wide association study (GWAS) dataset, starting from a new test of association with a statistical distribution and two effect sizes of each genotype. The new methodologi- cal approach was applied to a real data set together with a Monte Carlo experiment which showed the power performance of our new method. Finally, we compared the new method based on beta distribution with the conventional method (based on Chi-Squared distribu- tion) using the agreement Kappa index and a principal component analysis (PCA). Both the analyses show found differences existed between both the approaches while selecting the single nucleotide polymorphisms (SNPs) in association.
△ Less
Submitted 25 February, 2014;
originally announced February 2014.
-
Statistical analysis of the distribution of amino acids in Borrelia burgdorferi genome under different genetic codes
Authors:
Jose A Garcia,
Samantha Alvarez,
Alejandro Flores,
Tzipe Govezensky,
Juan R. Bobadilla,
Marco V. Jose
Abstract:
The genetic code is considered to be universal. In order to test if some statistical properties of the coding bacterial genome were due to inherent properties of the genetic code, we compared the autocorrelation function, the scaling properties and the maximum entropy of the distribution of distances of amino acids in sequences obtained by translating protein-coding regions from the genome of Bo…
▽ More
The genetic code is considered to be universal. In order to test if some statistical properties of the coding bacterial genome were due to inherent properties of the genetic code, we compared the autocorrelation function, the scaling properties and the maximum entropy of the distribution of distances of amino acids in sequences obtained by translating protein-coding regions from the genome of Borrelia burgdorferi, under different genetic codes. Overall our results indicate that these properties are very stable to perturbations made by altering the genetic code. We also discuss the evolutionary likely implications of the present results.
△ Less
Submitted 22 March, 2004;
originally announced March 2004.
-
An ordinary differential equation model for the multistep transformation to cancer
Authors:
Sabrina L. Spencer,
Matthew J. Berryman,
Jose A. Garcia,
Derek Abbott
Abstract:
Cancer is viewed as a multistep process whereby a normal cell is transformed into a cancer cell through the acquisition of mutations. We reduce the complexities of cancer progression to a simple set of underlying rules that govern the transformation of normal cells to malignant cells. In doing so, we derive an ordinary differential equation model that explores how the balance of angiogenesis, ce…
▽ More
Cancer is viewed as a multistep process whereby a normal cell is transformed into a cancer cell through the acquisition of mutations. We reduce the complexities of cancer progression to a simple set of underlying rules that govern the transformation of normal cells to malignant cells. In doing so, we derive an ordinary differential equation model that explores how the balance of angiogenesis, cell death rates, genetic instability, and replication rates give rise to different kinetics in the development of cancer. The key predictions of the model are that cancer develops fastest through a particular ordering of mutations and that mutations in genes that maintain genomic integrity would be the most deleterious type of mutations to inherit. In addition, we perform a sensitivity analysis on the parameters included in the model to determine the probable contribution of each. This paper presents a novel approach to viewing the genetic basis of cancer from a systems biology perspective and provides the groundwork for other models that can be directly tied to clinical and molecular data.
△ Less
Submitted 3 March, 2004;
originally announced March 2004.