Skip to main content

Showing 1–13 of 13 results for author: Airoldi, E

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:1909.07578  [pdf, other

    stat.ML cs.LG cs.SI physics.data-an q-bio.MN

    Stacking Models for Nearly Optimal Link Prediction in Complex Networks

    Authors: Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, Aaron Clauset

    Abstract: Most real-world networks are incompletely observed. Algorithms that can accurately predict which links are missing can dramatically speedup the collection of network data and improve the validity of network models. Many algorithms now exist for predicting missing links, given a partially observed network, but it has remained unknown whether a single best predictor exists, how link predictability v… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: 30 pages, 9 figures, 22 tables

    Journal ref: Proc. Natl. Acad. Sci. USA 117(38), 23393-23400 (2020)

  2. arXiv:1708.01772  [pdf, other

    q-bio.QM q-bio.GN stat.AP stat.ME stat.ML

    Quantifying homologous proteins and proteoforms

    Authors: Dmitry Malioutov, Tianchi Chen, Jacob Jaffe, Edoardo Airoldi, Steven Carr, Bogdan Budnik, Nikolai Slavov

    Abstract: Many proteoforms - arising from alternative splicing, post-translational modifications (PTMs), or paralogous genes - have distinct biological functions, such as histone PTM proteoforms. However, their quantification by existing bottom-up mass-spectrometry (MS) methods is undermined by peptide-specific biases. To avoid these biases, we developed and implemented a first-principles model (HIquant) fo… ▽ More

    Submitted 5 August, 2017; originally announced August 2017.

    Report number: mcp.TIR118.000947

    Journal ref: Molecular & Cellular Proteomics, 2018

  3. arXiv:1506.00219  [pdf, other

    q-bio.GN q-bio.QM q-bio.TO stat.AP stat.ME

    Post-transcriptional regulation across human tissues

    Authors: Alexander Franks, Edoardo Airoldi, Nikolai Slavov

    Abstract: Transcriptional and post-transcriptional regulation shape tissue-type-specific proteomes, but their relative contributions remain contested. Estimates of the factors determining protein levels in human tissues do not distinguish between (i) the factors determining the variability between the abundances of different proteins, i.e., mean-level-variability and, (ii) the factors determining the physio… ▽ More

    Submitted 2 May, 2017; v1 submitted 31 May, 2015; originally announced June 2015.

    Comments: 30 pages, 4 figures

    Journal ref: PLoS Comput Biol 13(5): e1005535 (2017)

  4. arXiv:1406.5799  [pdf, other

    q-bio.MN stat.AP stat.ME

    Estimating cellular pathways from an ensemble of heterogeneous data sources

    Authors: Alexander Franks, Florian Markowetz, Edoardo Airoldi

    Abstract: Building better models of cellular pathways is one of the major challenges of systems biology and functional genomics. There is a need for methods to build on established expert knowledge and reconcile it with results of high-throughput studies. Moreover, the available data sources are heterogeneous and need to be combined in a way specific for the part of the pathway in which they are most inform… ▽ More

    Submitted 22 June, 2014; originally announced June 2014.

  5. arXiv:1406.0399  [pdf, other

    q-bio.GN q-bio.BM q-bio.MN q-bio.SC

    Differential stoichiometry among core ribosomal proteins

    Authors: Nikolai Slavov, Sefan Semrau, Edoardo Airoldi, Bogdan Budnik, Alexander van Oudenaarden

    Abstract: Understanding the regulation and structure of ribosomes is essential to understanding protein synthesis and its deregulation in disease. While ribosomes are believed to have a fixed stoichiometry among their core ribosomal proteins (RPs), some experiments suggest a more variable composition. Testing such variability requires direct and precise quantification of RPs. We used mass-spectrometry to di… ▽ More

    Submitted 15 April, 2015; v1 submitted 2 June, 2014; originally announced June 2014.

    Comments: 31 pages, 8 figures

    Journal ref: Cell Reports 13: 865 - 873, 2015

  6. arXiv:1405.2566  [pdf, other

    stat.ML cs.SI physics.soc-ph q-bio.QM stat.AP

    Learning modular structures from network data and node variables

    Authors: Elham Azizi, James E. Galagan, Edoardo M. Airoldi

    Abstract: A standard technique for understanding underlying dependency structures among a set of variables posits a shared conditional probability distribution for the variables measured on individuals within a group. This approach is often referred to as module networks, where individuals are represented by nodes in a network, groups are termed modules, and the focus is on estimating the network structure… ▽ More

    Submitted 11 May, 2014; originally announced May 2014.

    Comments: 22 pages, 6 figures, 3 tables, 3 algorithms

  7. Sashimi plots: Quantitative visualization of RNA sequencing read alignments

    Authors: Yarden Katz, Eric T. Wang, Jacob Silterra, Schraga Schwartz, Bang Wong, Jill P. Mesirov, Edoardo M. Airoldi, Christopher B. Burge

    Abstract: We introduce Sashimi plots, a quantitative multi-sample visualization of mRNA sequencing reads aligned to gene annotations. Sashimi plots are made using alignments (stored in the SAM/BAM format) and gene model annotations (in GFF format), which can be custom-made by the user or obtained from databases such as Ensembl or UCSC. We describe two implementations of Sashimi plots: (1) a stand-alone comm… ▽ More

    Submitted 14 June, 2013; originally announced June 2013.

    Comments: 2 figures

  8. Mapping Dynamic Histone Acetylation Patterns to Gene Expression in Nanog-depleted Murine Embryonic Stem Cells

    Authors: Florian Markowetz, Klaas W Mulder, Edoardo M Airoldi, Ihor R Lemischka, Olga G Troyanskaya

    Abstract: Embryonic stem cells (ESC) have the potential to self-renew indefinitely and to differentiate into any of the three germ layers. The molecular mechanisms for self-renewal, maintenance of pluripotency and lineage specification are poorly understood, but recent results point to a key role for epigenetic mechanisms. In this study, we focus on quantifying the impact of histone 3 acetylation (H3K9,14ac… ▽ More

    Submitted 15 October, 2010; originally announced October 2010.

    Comments: accepted at PLoS Computational Biology

    Journal ref: PLoS Comp Bio, 2010 Dec 16;6(12):e1001034

  9. arXiv:0912.5410  [pdf, other

    stat.ME cs.LG physics.soc-ph q-bio.MN stat.ML

    A survey of statistical network models

    Authors: Anna Goldenberg, Alice X Zheng, Stephen E Fienberg, Edoardo M Airoldi

    Abstract: Networks are ubiquitous in science and have become a focal point for discussion in everyday life. Formal statistical models for the analysis of network data have emerged as a major topic of interest in diverse areas of study, and most of these involve a form of graphical representation. Probability models on graphs date back to 1959. Along with empirical studies in social psychology and sociolog… ▽ More

    Submitted 29 December, 2009; originally announced December 2009.

    Comments: 96 pages, 14 figures, 333 references

    Journal ref: Foundations and Trends in Machine Learning, 2(2):1-117, 2009

  10. arXiv:0912.5193  [pdf, ps, other

    stat.ME cs.LG physics.soc-ph q-bio.QM stat.AP

    Ranking relations using analogies in biological and information networks

    Authors: Ricardo Silva, Katherine Heller, Zoubin Ghahramani, Edoardo M. Airoldi

    Abstract: Analogical reasoning depends fundamentally on the ability to learn and generalize about relations between objects. We develop an approach to relational learning which, given a set of pairs of objects $\mathbf{S}=\{A^{(1)}:B^{(1)},A^{(2)}:B^{(2)},\ldots,A^{(N)}:B ^{(N)}\}$, measures how well other pairs A:B fit in with the set $\mathbf{S}$. Our work addresses the following question: is the relation… ▽ More

    Submitted 29 August, 2013; v1 submitted 28 December, 2009; originally announced December 2009.

    Comments: Published in at http://dx.doi.org/10.1214/09-AOAS321 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS321

    Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 2, 615-644

  11. arXiv:0711.2520  [pdf, other

    q-bio.QM q-bio.GN

    Mixed membership analysis of genome-wide expression data

    Authors: Edoardo M Airoldi, Stephen E Fienberg, Eric P Xing

    Abstract: Learning latent expression themes that best express complex patterns in a sample is a central problem in data mining and scientific research. For example, in computational biology we seek a set of salient gene expression themes that explain a biological process, extracting them from a large pool of gene expression profiles. In this paper, we introduce probabilistic models to learn such latent th… ▽ More

    Submitted 15 November, 2007; originally announced November 2007.

    Comments: 22 pages, 4 figures

  12. arXiv:0706.2040  [pdf, other

    q-bio.QM cs.LG physics.soc-ph stat.ME stat.ML

    Getting started in probabilistic graphical models

    Authors: Edoardo M Airoldi

    Abstract: Probabilistic graphical models (PGMs) have become a popular tool for computational analysis of biological data in a variety of domains. But, what exactly are they and how do they work? How can we use PGMs to discover patterns that are biologically relevant? And to what extent can PGMs help us formulate new hypotheses that are testable at the bench? This note sketches out some answers and illustr… ▽ More

    Submitted 10 November, 2007; v1 submitted 14 June, 2007; originally announced June 2007.

    Comments: 12 pages, 1 figure

    Journal ref: Airoldi EM (2007) Getting started in probabilistic graphical models. PLoS Comput Biol 3(12): e252

  13. arXiv:0706.0294  [pdf, other

    q-bio.MN q-bio.QM

    Mixed membership analysis of high-throughput interaction studies: Relational data

    Authors: Edoardo M Airoldi, David M Blei, Stephen E Fienberg, Eric P Xing

    Abstract: In this paper, we consider the statistical analysis of a protein interaction network. We propose a Bayesian model that uses a hierarchy of probabilistic assumptions about the way proteins interact with one another in order to: (i) identify the number of non-observable functional modules; (ii) estimate the degree of membership of proteins to modules; and (iii) estimate typical interaction pattern… ▽ More

    Submitted 15 November, 2007; v1 submitted 2 June, 2007; originally announced June 2007.

    Comments: 22 pages, 6 figures, 2 tables