Skip to main content

Showing 1–10 of 10 results for author: Cole, C

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2008.00539  [pdf

    cs.LG cs.NE q-bio.QM

    An Investigation in Optimal Encoding of Protein Primary Sequence for Structure Prediction by Artificial Neural Networks

    Authors: Aaron Hein, Casey Cole, Homayoun Valafar

    Abstract: Machine learning and the use of neural networks has increased precipitously over the past few years primarily due to the ever-increasing accessibility to data and the growth of computation power. It has become increasingly easy to harness the power of machine learning for predictive tasks. Protein structure prediction is one area where neural networks are becoming increasingly popular and successf… ▽ More

    Submitted 2 August, 2020; originally announced August 2020.

  2. arXiv:2007.13469  [pdf

    q-bio.BM cs.CE q-bio.QM

    A Preliminary Investigation in the Molecular Basis of Host Shutoff Mechanism in SARS-CoV

    Authors: Niharika Pandala, Casey A. Cole, Devaun McFarland, Anita Nag, Homayoun Valafar

    Abstract: Recent events leading to the worldwide pandemic of COVID-19 have demonstrated the effective use of genomic sequencing technologies to establish the genetic sequence of this virus. In contrast, the COVID-19 pandemic has demonstrated the absence of computational approaches to understand the molecular basis of this infection rapidly. Here we present an integrated approach to the study of the nsp1 pro… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: Consists of 9 pages, 8 figures and 7 tables. 11th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics 2020

  3. arXiv:2001.03092  [pdf

    q-bio.GN

    De Novo Assembly of Uca minax Transcriptome from Next Generation Sequencing

    Authors: Hanin Omar, Casey A. Cole, Arjang Fahim, Giuliana Gusmaroli, Stephen Borgianini, Homayoun Valafar

    Abstract: High-throughput cDNA sequencing (RNA-seq) is a very powerful technique to quantify gene expression in an unbiased way. The Crustacean family is among the groups of organisms sparsely represented in current genomic databases. Here we present transcriptome data from Uca minax (red-jointed fiddler crab) as an opportunity to extend our knowledge. Next generation sequencing was performed on six tissue… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: 8 pages. BioComp 2015

  4. arXiv:2001.03088  [pdf

    q-bio.BM

    An Investigation of Minimum Data Requirement for Successful Structure Determination of Pf2048.1 with REDCRAFT

    Authors: Casey A. Cole, Daniela Ishimaru, Mirko Hennig, Homayoun Valafar

    Abstract: Traditional approaches to elucidation of protein structures by NMR spectroscopy rely on distance restraints also know as nuclear Overhauser effects (NOEs). The use of NOEs as the primary source of structure determination by NMR spectroscopy is time consuming and expensive. Residual Dipolar Couplings (RDCs) have become an alternate approach for structure calculation by NMR spectroscopy. In this wor… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: 8 pages. BioComp 2015

  5. arXiv:1911.08614  [pdf

    cs.DB q-bio.BM

    PDBMine: A Reformulation of the Protein Data Bank to Facilitate Structural Data Mining

    Authors: Casey A Cole, Christopher Ott, Diego Valdes, Homayoun Valafar

    Abstract: Large scale initiatives such as the Human Genome Project, Structural Genomics, and individual research teams have provided large deposits of genomic and proteomic data. The transfer of data to knowledge has become one of the existing challenges, which is a consequence of capturing data in databases that are optimally designed for archiving and not mining. In this research, we have targeted the Pro… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: 6 pages, 8 figures, IEEE Annual Conf. on Computational Science & Computational Intelligence (CSCI), December 2019

  6. arXiv:1911.08612  [pdf

    q-bio.BM cs.OH

    Improvements of the REDCRAFT Software Package

    Authors: Casey A Cole, Caleb Parks, Julian Rachele, Homayoun Valafar

    Abstract: Traditional approaches to elucidation of protein structures by NMR spectroscopy rely on distance restraints also known as nuclear Overhauser effects (NOEs). The use of NOEs as the primary source of structure determination by NMR spectroscopy is time consuming and expensive. Residual Dipolar Couplings (RDCs) have become an alternate approach for structure calculation by NMR spectroscopy. In previou… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: 7 pages, 5 figures, Int'l Conf. Bioinformatics and Computational Biology (BIOCOMP'19), Las Vegas, NV, August 2019

  7. Evaluation of tools for differential gene expression analysis by RNA-seq on a 48 biological replicate experiment

    Authors: Nicholas J. Schurch, Pieta Schofield, Marek Gierliński, Christian Cole, Alexander Sherstnev, Vijender Singh, Nicola Wrobel, Karim Gharbi, Gordon G. Simpson, Tom Owen-Hughes, Mark Blaxter, Geoffrey J. Barton

    Abstract: An RNA-seq experiment with 48 biological replicates in each of 2 conditions was performed to determine the number of biological replicates ($n_r$) required, and to identify the most effective statistical analysis tools for identifying differential gene expression (DGE). When $n_r=3$, seven of the nine tools evaluated give true positive rates (TPR) of only 20 to 40 percent. For high fold-change gen… ▽ More

    Submitted 8 June, 2015; v1 submitted 8 May, 2015; originally announced May 2015.

    Comments: 21 Pages and 4 Figures in main text. 9 Figures in Supplement attached to PDF. Revision to correct a minor error in the abstract

  8. Statistical models for RNA-seq data derived from a two-condition 48-replicate experiment

    Authors: Marek Gierliński, Christian Cole, Pietà Schofield, Nicholas J. Schurch, Alexander Sherstnev, Vijender Singh, Nicola Wrobel, Karim Gharbi, Gordon Simpson, Tom Owen-Hughes, Mark Blaxter, Geoffrey J. Barton

    Abstract: High-throughput RNA sequencing (RNA-seq) is now the standard method to determine differential gene expression. Identifying differentially expressed genes crucially depends on estimates of read count variability. These estimates are typically based on statistical models such as the negative binomial distribution, which is employed by the tools edgeR, DESeq and cuffdiff. Until now, the validity of t… ▽ More

    Submitted 4 May, 2015; originally announced May 2015.

    Comments: 15 pages 6 figures

  9. Ancient human genomes suggest three ancestral populations for present-day Europeans

    Authors: Iosif Lazaridis, Nick Patterson, Alissa Mittnik, Gabriel Renaud, Swapan Mallick, Karola Kirsanow, Peter H. Sudmant, Joshua G. Schraiber, Sergi Castellano, Mark Lipson, Bonnie Berger, Christos Economou, Ruth Bollongino, Qiaomei Fu, Kirsten I. Bos, Susanne Nordenfelt, Heng Li, Cesare de Filippo, Kay Prüfer, Susanna Sawyer, Cosimo Posth, Wolfgang Haak, Fredrik Hallgren, Elin Fornander, Nadin Rohland , et al. (95 additional authors not shown)

    Abstract: We sequenced genomes from a $\sim$7,000 year old early farmer from Stuttgart in Germany, an $\sim$8,000 year old hunter-gatherer from Luxembourg, and seven $\sim$8,000 year old hunter-gatherers from southern Sweden. We analyzed these data together with other ancient genomes and 2,345 contemporary humans to show that the great majority of present-day Europeans derive from at least three highly diff… ▽ More

    Submitted 1 April, 2014; v1 submitted 23 December, 2013; originally announced December 2013.

  10. Improved annotation of 3-prime untranslated regions and complex loci by combination of strand-specific Direct RNA Sequencing, RNA-seq and ESTs

    Authors: Nick Schurch, Christian Cole, Alexander Sherstnev, Junfang Song, Céline Duc, Kate G. Storey, W. H. Irwin McLean, Sara J. Brown, Gordon G. Simpson, Geoffrey J. Barton

    Abstract: The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct annotation is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental sy… ▽ More

    Submitted 11 November, 2013; originally announced November 2013.

    Comments: 44 pages, 9 figures

    Journal ref: PLoS ONE 9(4) (2014): e94270