Showing 1–13 of 13 results for author: Honkela, A

Searching in archive q-bio.
  1. arXiv:1901.10227  [pdf, other]

    q-bio.QM cs.CR cs.LG stat.ML

    Representation Transfer for Differentially Private Drug Sensitivity Prediction

    Authors: Teppo Niinimäki, Mikko Heikkilä, Antti Honkela, Samuel Kaski

    Abstract: Motivation: Human genomic datasets often contain sensitive information that limits use and sharing of the data. In particular, simple anonymisation strategies fail to provide sufficient level of protection for genomic data, because the data are inherently identifiable. Differentially private machine learning can help by guaranteeing that the published results do not leak too much information about…

    Submitted 29 January, 2019; originally announced January 2019.

    Comments: 12 pages, 5 figures

    Journal ref: Bioinformatics 35(14):i218-i224, 2019

  2. arXiv:1605.08001  [pdf, other]

    q-bio.QM

    Analysis of differential splicing suggests different modes of short-term splicing regulation

    Authors: Hande Topa, Antti Honkela

    Abstract: Motivation: Alternative splicing is an important mechanism in which the regions of pre-mRNAs are differentially joined in order to form different transcript isoforms. Alternative splicing is involved in the regulation of normal physiological functions but also linked to the development of diseases such as cancer. We analyse differential expression and splicing using RNA-seq time series in three di…

    Submitted 25 May, 2016; originally announced May 2016.

    Comments: 20 pages, 5 figures. To be published in the conference proceedings of Intelligent Systems for Molecular Biology (ISMB) 2016

  3. arXiv:1511.06546  [pdf, other]

    q-bio.GN q-bio.QM stat.AP

    Bayesian identification of bacterial strains from sequencing data

    Authors: Aravind Sankar, Brandon Malone, Sion Bayliss, Ben Pascoe, Guillaume Méric, Matthew D. Hitchings, Samuel K. Sheppard, Edward J. Feil, Jukka Corander, Antti Honkela

    Abstract: Rapidly assaying the diversity of a bacterial species present in a sample obtained from a hospital patient or an environmental source has become possible after recent technological advances in DNA sequencing. For several applications it is important to accurately identify the presence and estimate relative abundances of the target organisms from short sequence reads obtained from a sample. This tas…

    Submitted 17 February, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: 16 pages, 7 figures

  4. arXiv:1503.01081  [pdf, other]

    q-bio.GN q-bio.QM stat.AP

    Genome-wide modelling of transcription kinetics reveals patterns of RNA production delays

    Authors: Antti Honkela, Jaakko Peltonen, Hande Topa, Iryna Charapitsa, Filomena Matarese, Korbinian Grote, Hendrik G. Stunnenberg, George Reid, Neil D. Lawrence, Magnus Rattray

    Abstract: Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles due to differences in transcription time, degradation rate and RNA processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. We introduce a joint model of transcriptional activation and mRNA accumulation which can be used for inference of…

    Submitted 16 July, 2015; v1 submitted 3 March, 2015; originally announced March 2015.

    Comments: 42 pages, 17 figures

    Journal ref: PNAS 112(42):13115-13120, 2015

  5. arXiv:1412.5995  [pdf, other]

    q-bio.QM q-bio.GN

    Fast and accurate approximate inference of transcript expression from RNA-seq data

    Authors: James Hensman, Panagiotis Papastamoulis, Peter Glaus, Antti Honkela, Magnus Rattray

    Abstract: Motivation: Assigning RNA-seq reads to their transcript of origin is a fundamental task in transcript expression estimation. Where ambiguities in assignments exist due to transcripts sharing sequence, e.g. alternative isoforms or alleles, the problem can be solved through probabilistic inference. Bayesian methods have been shown to provide accurate transcript abundance estimates compared to compet…

    Submitted 30 June, 2015; v1 submitted 18 December, 2014; originally announced December 2014.

    Comments: Main changes: (a) shuffling of reads simulated from spanki and repeat the analysis for sailfish and eXpress. Now both methods yield better point estimates. (b) including the Markov chain Monte Carlo sampler of rsem (RSEM-PME). (c) including the Kallisto method (d) adding alternative measures of transcript expression (TPM) and filtering out low expressed transcripts (supplementary material). arXiv admin note: substantial text overlap with arXiv:1308.5953

  6. arXiv:1403.4086  [pdf, other]

    q-bio.PE q-bio.GN q-bio.QM stat.AP

    Gaussian process test for high-throughput sequencing time series: application to experimental evolution

    Authors: Hande Topa, Ágnes Jónás, Robert Kofler, Carolin Kosiol, Antti Honkela

    Abstract: Motivation: Recent advances in high-throughput sequencing (HTS) have made it possible to monitor genomes in great detail. New experiments not only use HTS to measure genomic features at one time point but to monitor them changing over time with the aim of identifying significant changes in their abundance. In population genetics, for example, allele frequencies are monitored over time to detect si…

    Submitted 18 September, 2014; v1 submitted 17 March, 2014; originally announced March 2014.

    Comments: 41 pages, 29 figures

  7. arXiv:1308.6074  [pdf, ps, other]

    q-bio.GN cs.CE cs.IR

    Exploration and retrieval of whole-metagenome sequencing samples

    Authors: Sohan Seth, Niko Välimäki, Samuel Kaski, Antti Honkela

    Abstract: Over the recent years, the field of whole metagenome shotgun sequencing has witnessed significant growth due to the high-throughput sequencing technologies that allow sequencing genomic samples cheaper, faster, and with better coverage than before. This technical advancement has initiated the trend of sequencing multiple samples in different conditions or environments to explore the similarities a…

    Submitted 3 April, 2014; v1 submitted 28 August, 2013; originally announced August 2013.

    Comments: 16 pages; additional results

  8. arXiv:1308.5953   

    q-bio.GN stat.AP stat.CO

    Fast Approximate Inference of Transcript Expression Levels from RNA-seq Data

    Authors: James Hensman, Peter Glaus, Antti Honkela, Magnus Rattray

    Abstract: Motivation: The mapping of RNA-seq reads to their transcripts of origin is a fundamental task in transcript expression estimation and differential expression scoring. Where ambiguities in mapping exist due to transcripts sharing sequence, e.g. alternative isoforms or alleles, the problem becomes an instance of non-trivial probabilistic inference. Bayesian inference in such a problem is intractable…

    Submitted 27 January, 2015; v1 submitted 27 August, 2013; originally announced August 2013.

    Comments: This paper has been withdrawn by the authors. Please see much revised edition arXiv:1412.5995

  9. arXiv:1304.1698  [pdf, other]

    q-bio.GN q-bio.QM

    Probe region expression estimation for RNA-seq data for improved microarray comparability

    Authors: Karolis Uziela, Antti Honkela

    Abstract: Rapidly growing public gene expression databases contain a wealth of data for building an unprecedentedly detailed picture of human biology and disease. This data comes from many diverse measurement platforms that make integrating it all difficult. Although RNA-sequencing (RNA-seq) is attracting the most attention, at present the rate of new microarray studies submitted to public databases far exc…

    Submitted 15 October, 2014; v1 submitted 5 April, 2013; originally announced April 2013.

  10. arXiv:1303.4926  [pdf, other]

    q-bio.QM q-bio.MN

    Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data

    Authors: Ciira wa Maina, Antti Honkela, Filomena Matarese, Korbinian Grote, Hendrik G. Stunnenberg, George Reid, Neil D. Lawrence, Magnus Rattray

    Abstract: Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work we present a probabilistic model of transcription dynamics which is fitted to pol-II occupancy time course data measured using ChIP-Seq. The model can be used to estimate transcription s…

    Submitted 5 March, 2014; v1 submitted 20 March, 2013; originally announced March 2013.

    Comments: 40 pages: 21 pages Main text, 19 pages supplementary material

  11. arXiv:1210.2850  [pdf, other]

    q-bio.GN q-bio.PE q-bio.QM

    A mixed model approach for joint genetic analysis of alternatively spliced transcript isoforms using RNA-Seq data

    Authors: Barbara Rakitsch, Christoph Lippert, Hande Topa, Karsten Borgwardt, Antti Honkela, Oliver Stegle

    Abstract: RNA-Seq technology allows for studying the transcriptional state of the cell at an unprecedented level of detail. Beyond quantification of whole-gene expression, it is now possible to disentangle the abundance of individual alternatively spliced transcript isoforms of a gene. A central question is to understand the regulatory processes that lead to differences in relative abundance variation due t…

    Submitted 10 October, 2012; originally announced October 2012.

  12. arXiv:1210.2503  [pdf, other]

    stat.ML q-bio.QM stat.ME

    Gaussian process modelling of multiple short time series

    Authors: Hande Topa, Antti Honkela

    Abstract: We present techniques for effective Gaussian process (GP) modelling of multiple short time series. These problems are common when applying GP models independently to each gene in a gene expression time series data set. Such sets typically contain very few time points. Naive application of common GP modelling techniques can lead to severe over-fitting or under-fitting in a significant fraction of t…

    Submitted 9 October, 2012; originally announced October 2012.

    Comments: 11 pages, 6 figures

  13. Identifying differentially expressed transcripts from RNA-seq data with biological variation

    Authors: Peter Glaus, Antti Honkela, Magnus Rattray

    Abstract: Motivation: High-throughput sequencing enables expression analysis at the level of individual transcripts. The analysis of transcriptome expression levels and differential expression estimation requires a probabilistic approach to properly account for ambiguity caused by shared exons and finite read sampling as well as the intrinsic biological variance of transcript expression. Results: We prese…

    Submitted 5 March, 2012; v1 submitted 5 September, 2011; originally announced September 2011.

    Comments: 12 pages, 6 figures in main text; 11 pages, 5 figures in supplementary information (included in the same file)

    Journal ref: Bioinformatics 28(13):1721-1728, 2012