Skip to main content

Showing 1–50 of 71 results for author: Weigt, M

Searching in archive cond-mat. Search in all archives.
.
  1. arXiv:2412.01969  [pdf, other

    q-bio.BM cond-mat.dis-nn q-bio.PE

    Fluctuations and the limit of predictability in protein evolution

    Authors: Saverio Rossi, Leonardo Di Bari, Martin Weigt, Francesco Zamponi

    Abstract: Protein evolution involves mutations occurring across a wide range of time scales. In analogy with other disordered systems, this dynamical heterogeneity suggests strong correlations between mutations happening at distinct sites and times. To quantify these correlations, we examine the role of various fluctuation sources in protein evolution, simulated using a data-driven epistatic landscape. By a… ▽ More

    Submitted 12 December, 2024; v1 submitted 2 December, 2024; originally announced December 2024.

  2. arXiv:2403.09436  [pdf, other

    q-bio.BM cond-mat.dis-nn q-bio.PE

    Emergent time scales of epistasis in protein evolution

    Authors: Leonardo Di Bari, Matteo Bisardi, Sabrina Cotogno, Martin Weigt, Francesco Zamponi

    Abstract: We introduce a data-driven epistatic model of protein evolution, capable of generating evolutionary trajectories spanning very different time scales reaching from individual mutations to diverged homologs. Our in silico evolution encompasses random nucleotide mutations, insertions and deletions, and models selection using a fitness landscape, which is inferred via a generative probabilistic model… ▽ More

    Submitted 27 September, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 31 pages, 16 figures - final version

    Journal ref: PNAS 121, e2406807121 (2024)

  3. arXiv:2310.12700  [pdf, other

    q-bio.BM cond-mat.stat-mech q-bio.GN q-bio.QM

    Towards Parsimonious Generative Modeling of RNA Families

    Authors: Francesco Calvanese, Camille N. Lambert, Philippe Nghe, Francesco Zamponi, Martin Weigt

    Abstract: Generative probabilistic models emerge as a new paradigm in data-driven, evolution-informed design of biomolecular sequences. This paper introduces a novel approach, called Edge Activation Direct Coupling Analysis (eaDCA), tailored to the characteristics of RNA sequences, with a strong emphasis on simplicity, efficiency, and interpretability. eaDCA explicitly constructs sparse coevolutionary model… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 33 pages (including SI)

    Journal ref: Nucleic Acids Research, gkae289 (2024)

  4. Machine-learning-assisted Monte Carlo fails at sampling computationally hard problems

    Authors: Simone Ciarella, Jeanne Trinquier, Martin Weigt, Francesco Zamponi

    Abstract: Several strategies have been recently proposed in order to improve Monte Carlo sampling efficiency using machine learning tools. Here, we challenge these methods by considering a class of problems that are known to be exponentially hard to sample using conventional local Monte Carlo at low enough temperatures. In particular, we study the antiferromagnetic Potts model on a random graph, which reduc… ▽ More

    Submitted 10 March, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Journal ref: Machine Learning: Science and Technology 4, 010501 (2023)

  5. arXiv:2208.11626  [pdf, other

    q-bio.BM cond-mat.stat-mech q-bio.MN

    Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins

    Authors: Carlos A. Gandarilla-Perez, Sergio Pinilla, Anne-Florence Bitbol, Martin Weigt

    Abstract: Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to imp… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 19 pages

  6. arXiv:2207.13402  [pdf, other

    cond-mat.stat-mech cond-mat.dis-nn physics.bio-ph q-bio.BM q-bio.MN

    Statistical-physics approaches to RNA molecules, families and networks

    Authors: Simona Cocco, Andrea De Martino, Andrea Pagnani, Martin Weigt

    Abstract: This contribution focuses on the fascinating RNA molecule, its sequence-dependent folding driven by base-pairing interactions, the interplay between these interactions and natural evolution, and its multiple regulatory roles. The four of us have dug into these topics using the tools and the spirit of the statistical physics of disordered systems, and in particular the concept of a disordered (ener… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 19 pages, 6 figures, to appear in "Spin Glass Theory and Far Beyond - Replica Symmetry Breaking after 40 years" (edited by P Charbonneau, E Marinari, G Parisi, F Ricci Tersenghi, G Sicuro and F Zamponi)

  7. arXiv:2109.04105  [pdf, other

    q-bio.QM cond-mat.dis-nn q-bio.BM

    adabmDCA: Adaptive Boltzmann machine learning for biological sequences

    Authors: Anna Paola Muntoni, Andrea Pagnani, Martin Weigt, Francesco Zamponi

    Abstract: Boltzmann machines are energy-based models that have been shown to provide an accurate statistical description of domains of evolutionary-related protein and RNA families. They are parametrized in terms of local biases accounting for residue conservation, and pairwise terms to model epistatic coevolution between residues. From the model parameters, it is possible to extract an accurate prediction… ▽ More

    Submitted 2 November, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Journal ref: BMC Bioinformatics 22, 528 (2021)

  8. arXiv:2106.02441  [pdf, other

    q-bio.BM cond-mat.dis-nn q-bio.PE q-bio.QM

    Modeling sequence-space exploration and emergence of epistatic signals in protein evolution

    Authors: Matteo Bisardi, Juan Rodriguez-Rivas, Francesco Zamponi, Martin Weigt

    Abstract: During their evolution, proteins explore sequence space via an interplay between random mutations and phenotypic selection. Here we build upon recent progress in reconstructing data-driven fitness landscapes for families of homologous proteins, to propose stochastic models of experimental protein evolution. These models predict quantitatively important features of experimentally evolved sequence l… ▽ More

    Submitted 27 January, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: 16 pages, 14 figures

    Journal ref: Molecular Biology and Evolution 39, msab321 (2022)

  9. arXiv:2103.03292  [pdf, other

    q-bio.BM cond-mat.dis-nn cond-mat.stat-mech q-bio.QM

    Efficient generative modeling of protein sequences using simple autoregressive models

    Authors: Jeanne Trinquier, Guido Uguzzoni, Andrea Pagnani, Francesco Zamponi, Martin Weigt

    Abstract: Generative models emerge as promising candidates for novel sequence-data driven approaches to protein design, and for the extraction of structural and functional information about proteins deeply hidden in rapidly growing sequence databases. Here we propose simple autoregressive models as highly accurate but computationally efficient generative sequence models. We show that they perform similarly… ▽ More

    Submitted 9 November, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 12 pages, 4 Figures + Supplementary Material

    Journal ref: Nature Communications 12, 5800 (2021)

  10. arXiv:2102.06036  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech q-bio.QM

    Global multivariate model learning from hierarchically correlated data

    Authors: Edwin Rodriguez Horta, Alejandro Lage, Martin Weigt, Pierre Barrat-Charlaix

    Abstract: Inverse statistical physics aims at inferring models compatible with a set of empirical averages estimated from a high-dimensional dataset of independently distributed equilibrium configurations of a given system. However, in several applications such as biology, data result from stochastic evolutionary processes, and configurations are related through a hierarchical structure, typically represent… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: 34 pages 10 figures

  11. arXiv:2011.11259  [pdf, other

    q-bio.BM cond-mat.stat-mech cs.LG

    Sparse generative modeling via parameter-reduction of Boltzmann machines: application to protein-sequence families

    Authors: Pierre Barrat-Charlaix, Anna Paola Muntoni, Kai Shimagaki, Martin Weigt, Francesco Zamponi

    Abstract: Boltzmann machines (BM) are widely used as generative models. For example, pairwise Potts models (PM), which are instances of the BM class, provide accurate statistical models of families of evolutionarily related protein sequences. Their parameters are the local fields, which describe site-specific patterns of amino-acid conservation, and the two-site couplings, which mirror the coevolution betwe… ▽ More

    Submitted 30 July, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: 7 pages, 5 figures, plus Appendix

    Journal ref: Phys. Rev. E 104, 024407 (2021)

  12. arXiv:2005.08500  [pdf, other

    q-bio.QM cond-mat.dis-nn physics.bio-ph q-bio.BM

    Aligning biological sequences by exploiting residue conservation and coevolution

    Authors: Anna Paola Muntoni, Andrea Pagnani, Martin Weigt, Francesco Zamponi

    Abstract: Sequences of nucleotides (for DNA and RNA) or amino acids (for proteins) are central objects in biology. Among the most important computational problems is that of sequence alignment, i.e. arranging sequences from different organisms in such a way to identify similar regions, to detect evolutionary relationships between sequences, and to predict biomolecular structure and function. This is typical… ▽ More

    Submitted 13 November, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 20 pages, 11 figures + Supplementary Information

    Journal ref: Phys. Rev. E 102, 062409 (2020)

  13. arXiv:1912.10956  [pdf, other

    q-bio.BM cond-mat.stat-mech physics.bio-ph

    Statistical physics of interacting proteins: impact of dataset size and quality assessed in synthetic sequences

    Authors: Carlos A. Gandarilla-Pérez, Pierre Mergny, Martin Weigt, Anne-Florence Bitbol

    Abstract: Identifying protein-protein interactions is crucial for a systems-level understanding of the cell. Recently, algorithms based on inverse statistical physics, e.g. Direct Coupling Analysis (DCA), have allowed to use evolutionarily related sequences to address two conceptually related inference tasks: finding pairs of interacting proteins, and identifying pairs of residues which form contacts betwee… ▽ More

    Submitted 16 March, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

    Comments: 18 pages, 16 figures

    Journal ref: Phys. Rev. E 101, 032413 (2020)

  14. arXiv:1906.04266  [pdf, ps, other

    q-bio.BM cond-mat.stat-mech physics.bio-ph q-bio.QM

    Phylogenetic correlations can suffice to infer protein partners from sequences

    Authors: Guillaume Marmier, Martin Weigt, Anne-Florence Bitbol

    Abstract: Determining which proteins interact together is crucial to a systems-level understanding of the cell. Recently, algorithms based on Direct Coupling Analysis (DCA) pairwise maximum-entropy models have allowed to identify interaction partners among paralogous proteins from sequence data. This success of DCA at predicting protein-protein interactions could be mainly based on its known ability to iden… ▽ More

    Submitted 4 September, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: 31 pages, 14 figures

    Journal ref: PLoS Comput. Biol. 15(10): e1007179 (2019)

  15. arXiv:1905.11848  [pdf, other

    q-bio.BM cond-mat.dis-nn cond-mat.stat-mech q-bio.QM

    Selection of sequence motifs and generative Hopfield-Potts models for protein familiesilies

    Authors: Kai Shimagaki, Martin Weigt

    Abstract: Statistical models for families of evolutionary related proteins have recently gained interest: in particular pairwise Potts models, as those inferred by the Direct-Coupling Analysis, have been able to extract information about the three-dimensional structure of folded proteins, and about the effect of amino-acid substitutions in proteins. These models are typically requested to reproduce the one-… ▽ More

    Submitted 5 September, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: 26 pages, 16 figures, to app. in PRE

    Journal ref: Phys. Rev. E 100, 032128 (2019)

  16. arXiv:1801.04184  [pdf, other

    q-bio.QM cond-mat.stat-mech q-bio.BM

    How pairwise coevolutionary models capture the collective residue variability in proteins

    Authors: Matteo Figliuzzi, Pierre Barrat-Charlaix, Martin Weigt

    Abstract: Global coevolutionary models of homologous protein families, as constructed by direct coupling analysis (DCA), have recently gained popularity in particular due to their capacity to accurately predict residue-residue contacts from sequence information alone, and thereby to facilitate tertiary and quaternary protein structure prediction. More recently, they have also been used to predict fitness ef… ▽ More

    Submitted 12 January, 2018; originally announced January 2018.

    Comments: 17 pages, 3 figures and one table

    Journal ref: Molecular Biology and Evolution 35, 1018 (2018)

  17. arXiv:1703.01222  [pdf, other

    q-bio.BM cond-mat.stat-mech q-bio.QM

    Inverse Statistical Physics of Protein Sequences: A Key Issues Review

    Authors: Simona Cocco, Christoph Feinauer, Matteo Figliuzzi, Remi Monasson, Martin Weigt

    Abstract: In the course of evolution, proteins undergo important changes in their amino acid sequences, while their three-dimensional folded structure and their biological function remain remarkably conserved. Thanks to modern sequencing techniques, sequence data accumulate at unprecedented pace. This provides large sets of so-called homologous, i.e.~evolutionarily related protein sequences, to which method… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: 18 pages, 7 figures

    Journal ref: Rep. Prog. Phys. 81, 032601 (2018)

  18. arXiv:1609.05692  [pdf, ps, other

    q-bio.QM cond-mat.dis-nn cond-mat.stat-mech

    Improving landscape inference by integrating heterogeneous data in the inverse Ising problem

    Authors: Pierre Barrat-Charlaix, Matteo Figliuzzi, Martin Weigt

    Abstract: The inverse Ising problem and its generalizations to Potts and continuous spin models have recently attracted much attention thanks to their successful applications in the statistical modeling of biological data. In the standard setting, the parameters of an Ising model (couplings and fields) are inferred using a sample of equilibrium configurations drawn from the Boltzmann distribution. However,… ▽ More

    Submitted 5 November, 2016; v1 submitted 19 September, 2016; originally announced September 2016.

    Comments: Accepted for publication in Scientific Reports. 11 pages, 4 figures

    Journal ref: Sci. Rep. 6, 37812 (2016)

  19. arXiv:1605.03745  [pdf, other

    q-bio.QM cond-mat.stat-mech q-bio.MN

    Simultaneous identification of specifically interacting paralogs and inter-protein contacts by Direct-Coupling Analysis

    Authors: Thomas Gueudré, Carlo Baldassi, Marco Zamparo, Martin Weigt, Andrea Pagnani

    Abstract: Understanding protein-protein interactions is central to our understanding of almost all complex biological processes. Computational tools exploiting rapidly growing genomic databases to characterize protein-protein interactions are urgently needed. Such methods should connect multiple scales from evolutionary conserved interactions between families of homologous proteins, over the identification… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.

    Comments: Main Text 19 pages Supp. Inf. 16 pages

    Journal ref: PNAS 2016 113 (43) 12186-12191

  20. arXiv:1510.03224  [pdf, other

    q-bio.QM cond-mat.stat-mech q-bio.BM

    Coevolutionary landscape inference and the context-dependence of mutations in beta-lactamase TEM-1

    Authors: Matteo Figliuzzi, Hervé Jacquier, Alexander Schug, Olivier Tenaillon, Martin Weigt

    Abstract: The quantitative characterization of mutational landscapes is a task of outstanding importance in evolutionary and medical biology: It is, e.g., of central importance for our understanding of the phenotypic effect of mutations related to disease and antibiotic drug resistance. Here we develop a novel inference scheme for mutational landscapes, which is based on the statistical analysis of large al… ▽ More

    Submitted 12 October, 2015; originally announced October 2015.

    Comments: 14 pages, 5 figures. Supplementary files on the publisher's website: http://mbe.oxfordjournals.org/content/early/2015/10/06/molbev.msv211.short?rss=1

    Journal ref: Mol Biol Evol (2015) doi: 10.1093/molbev/msv211

  21. arXiv:1404.1240  [pdf, other

    q-bio.QM cond-mat.dis-nn stat.ME

    Fast and accurate multivariate Gaussian modeling of protein families: Predicting residue contacts and protein-interaction partners

    Authors: Carlo Baldassi, Marco Zamparo, Christoph Feinauer, Andrea Procaccini, Riccardo Zecchina, Martin Weigt, Andrea Pagnani

    Abstract: In the course of evolution, proteins show a remarkable conservation of their three-dimensional structure and their biological function, leading to strong evolutionary constraints on the sequence variability between homologous proteins. Our method aims at extracting such constraints from rapidly accumulating sequence data, and thereby at inferring protein structure and function from sequence inform… ▽ More

    Submitted 4 April, 2014; originally announced April 2014.

    Comments: 24 pages, 7 pdf figures, 2 tables, plus supporting informations. Published on PLOS ONE

    Journal ref: PLoS ONE 9(3): e92721

  22. arXiv:1212.3281  [pdf, ps, other

    q-bio.BM cond-mat.stat-mech q-bio.QM

    From principal component to direct coupling analysis of coevolution in proteins: Low-eigenvalue modes are needed for structure prediction

    Authors: Simona Cocco, Remi Monasson, Martin Weigt

    Abstract: Various approaches have explored the covariation of residues in multiple-sequence alignments of homologous proteins to extract functional and structural information. Among those are principal component analysis (PCA), which identifies the most correlated groups of residues, and direct coupling analysis (DCA), a global inference method based on the maximum entropy principle, which aims at predictin… ▽ More

    Submitted 27 August, 2013; v1 submitted 13 December, 2012; originally announced December 2012.

    Comments: Supporting information can be downloaded from: http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1003176

    Journal ref: PLoS Computational Biology 9, 8 (2013) e1003176

  23. arXiv:1211.1281  [pdf, ps, other

    q-bio.QM cond-mat.dis-nn cond-mat.stat-mech physics.data-an

    Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models

    Authors: Magnus Ekeberg, Cecilia Lövkvist, Yueheng Lan, Martin Weigt, Erik Aurell

    Abstract: Spatially proximate amino acids in a protein tend to coevolve. A protein's three-dimensional (3D) structure hence leaves an echo of correlations in the evolutionary record. Reverse engineering 3D structures from such correlations is an open problem in structural biology, pursued with increasing vigor as more and more protein sequences continue to fill the data banks. Within this task lies a statis… ▽ More

    Submitted 12 January, 2013; v1 submitted 6 November, 2012; originally announced November 2012.

    Comments: 19 pages, 16 figures, published version

    Journal ref: M. Ekeberg, C. Lövkvist, Y. Lan, M. Weigt, E. Aurell, Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models, Phys. Rev. E 87, 012707 (2013)

  24. arXiv:1110.5223  [pdf

    q-bio.QM cond-mat.stat-mech q-bio.BM

    Direct-coupling analysis of residue co-evolution captures native contacts across many protein families

    Authors: Faruck Morcos, Andrea Pagnani, Bryan Lunt, Arianna Bertolino, Debora S. Marks, Chris Sander, Riccardo Zecchina, Jose' N. Onuchic, Terence Hwa, Martin Weigt

    Abstract: The similarity in the three-dimensional structures of homologous proteins imposes strong constraints on their sequence variability. It has long been suggested that the resulting correlations among amino acid compositions at different sequence positions can be exploited to infer spatial contacts within the tertiary protein structure. Crucial to this inference is the ability to disentangle direct an… ▽ More

    Submitted 25 October, 2011; v1 submitted 24 October, 2011; originally announced October 2011.

    Comments: 28 pages, 7 figures, to appear in PNAS

    Journal ref: PNAS December 6, 2011 vol. 108 no. 49 E1293-E1301

  25. arXiv:1105.3295  [pdf

    q-bio.MN cond-mat.dis-nn q-bio.QM

    Dissecting the Specificity of Protein-Protein Interaction in Bacterial Two-Component Signaling: Orphans and Crosstalks

    Authors: Andrea Procaccini, Bryan Lunt, Hendrik Szurmant, Terence Hwa, Martin Weigt

    Abstract: Predictive understanding of the myriads of signal transduction pathways in a cell is an outstanding challenge of systems biology. Such pathways are primarily mediated by specific but transient protein-protein interactions, which are difficult to study experimentally. In this study, we dissect the specificity of protein-protein interactions governing two-component signaling (TCS) systems ubiquitous… ▽ More

    Submitted 17 May, 2011; originally announced May 2011.

    Comments: Supplementary information available on http://www.plosone.org/article/info:doi/10.1371/journal.pone.0019729

    Journal ref: PLoS ONE 6(5): e19729 (2011)

  26. arXiv:0907.3687  [pdf, ps, other

    cond-mat.stat-mech q-bio.GN q-bio.QM

    Classification and sparse-signature extraction from gene-expression data

    Authors: Andrea Pagnani, Francesca Tria, Martin Weigt

    Abstract: In this work we suggest a statistical mechanics approach to the classification of high-dimensional data according to a binary label. We propose an algorithm whose aim is twofold: First it learns a classifier from a relatively small number of data, second it extracts a sparse signature, {\it i.e.} a lower-dimensional subspace carrying the information needed for the classification. In particular t… ▽ More

    Submitted 21 July, 2009; originally announced July 2009.

    Comments: 15 pages, 13 eps figures

    Journal ref: J. Stat. Mech. (2009) P05001

  27. arXiv:0907.3241  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech physics.data-an

    Statistical mechanics of sparse generalization and model selection

    Authors: Alejandro Lage-Castellanos, Andrea Pagnani, Martin Weigt

    Abstract: One of the crucial tasks in many inference problems is the extraction of sparse information out of a given number of high-dimensional measurements. In machine learning, this is frequently achieved using, as a penality term, the $L_p$ norm of the model parameters, with $p\leq 1$ for efficient dilution. Here we propose a statistical-mechanics analysis of the problem in the setting of perceptron me… ▽ More

    Submitted 18 July, 2009; originally announced July 2009.

    Comments: 18 pages, 9 eps figures

    Journal ref: J. Stat. Mech. (2009) P10009

  28. arXiv:0905.1893  [pdf, ps, other

    q-bio.QM cond-mat.stat-mech cs.DS

    Aligning graphs and finding substructures by a cavity approach

    Authors: S. Bradde, A. Braunstein, H. Mahmoudi, F. Tria, M. Weigt, R. Zecchina

    Abstract: We introduce a new distributed algorithm for aligning graphs or finding substructures within a given graph. It is based on the cavity method and is used to study the maximum-clique and the graph-alignment problems in random graphs. The algorithm allows to analyze large graphs and may find applications in fields such as computational biology. As a proof of concept we use our algorithm to align the… ▽ More

    Submitted 1 April, 2010; v1 submitted 12 May, 2009; originally announced May 2009.

    Comments: 5 pages, 4 figures

    Journal ref: 2010 Europhys. Lett. 89 37009

  29. arXiv:0901.1248  [pdf

    q-bio.BM cond-mat.stat-mech q-bio.QM

    Identification of direct residue contacts in protein-protein interaction by message passing

    Authors: M. Weigt, R. A. White, H. Szurmant, J. A. Hoch, T. Hwa

    Abstract: Understanding the molecular determinants of specificity in protein-protein interaction is an outstanding challenge of postgenome biology. The availability of large protein databases generated from sequences of hundreds of bacterial genomes enables various statistical approaches to this problem. In this context covariance-based methods have been used to identify correlation between amino acid pos… ▽ More

    Submitted 9 January, 2009; originally announced January 2009.

    Comments: Supplementary information available on http://www.pnas.org/content/106/1/67.abstract

    Journal ref: Proc. Natl. Acad. Sci. 106(1), 67-72 (2009)

  30. arXiv:0812.0940  [pdf, ps, other

    q-bio.QM cond-mat.dis-nn

    Inference algorithms for gene networks: a statistical mechanics analysis

    Authors: A. Braunstein, A. Pagnani, M. Weigt, R. Zecchina

    Abstract: The inference of gene regulatory networks from high throughput gene expression data is one of the major challenges in systems biology. This paper aims at analysing and comparing two different algorithmic approaches. The first approach uses pairwise correlations between regulated and regulating genes; the second one uses message-passing techniques for inferring activating and inhibiting regulator… ▽ More

    Submitted 4 December, 2008; originally announced December 2008.

    Journal ref: J. Stat. Mech. (2008) P12001

  31. arXiv:0812.0936  [pdf, ps, other

    q-bio.QM cond-mat.dis-nn

    Gene-network inference by message passing

    Authors: A. Braunstein, A. Pagnani, M. Weigt, R. Zecchina

    Abstract: The inference of gene-regulatory processes from gene-expression data belongs to the major challenges of computational systems biology. Here we address the problem from a statistical-physics perspective and develop a message-passing algorithm which is able to infer sparse, directed and combinatorial regulatory mechanisms. Using the replica technique, the algorithmic performance can be characteriz… ▽ More

    Submitted 4 December, 2008; originally announced December 2008.

    Comments: Proc. of International Workshop on Statistical-Mechanical Informatics 2007, Kyoto

    Journal ref: Journal of Physics: Conference Series 95 (2008) 012016

  32. arXiv:0801.1480  [pdf, ps, other

    q-bio.SC cond-mat.soft cond-mat.stat-mech physics.bio-ph q-bio.GN

    A thermodynamic model for agglomeration of DNA-looping proteins

    Authors: Sumedha, Martin Weigt

    Abstract: In this paper, we propose a thermodynamic mechanism for the formation of transcriptional foci via the joint agglomeration of DNA-looping proteins and protein-binding domains on DNA: The competition between the gain in protein-DNA binding free energy and the entropy loss due to DNA looping is argued to result in an effective attraction between loops. A mean-field approximation can be described an… ▽ More

    Submitted 20 October, 2008; v1 submitted 9 January, 2008; originally announced January 2008.

    Comments: 12 pages, 5 figures, to app. in JSTAT

    Journal ref: J. Stat. Mech. (2008) P11005

  33. arXiv:0712.1165  [pdf, other

    physics.data-an cond-mat.stat-mech q-bio.QM

    Unsupervised and semi-supervised clustering by message passing: Soft-constraint affinity propagation

    Authors: Michele Leone, Sumedha, Martin Weigt

    Abstract: Soft-constraint affinity propagation (SCAP) is a new statistical-physics based clustering technique. First we give the derivation of a simplified version of the algorithm and discuss possibilities of time- and memory-efficient implementations. Later we give a detailed analysis of the performance of SCAP on artificial data, showing that the algorithm efficiently unveils clustered and hierarchical… ▽ More

    Submitted 15 September, 2008; v1 submitted 7 December, 2007; originally announced December 2007.

    Comments: 11 pages, 13 pdf figures, to app. in EPJB

    Journal ref: Eur. Phys. J. B (2008), published online 8 Oct. 2008

  34. arXiv:0705.2646  [pdf, ps, other

    q-bio.QM cond-mat.stat-mech physics.data-an

    Clustering by soft-constraint affinity propagation: Applications to gene-expression data

    Authors: Michele Leone, Sumedha, Martin Weigt

    Abstract: Motivation: Similarity-measure based clustering is a crucial problem appearing throughout scientific data analysis. Recently, a powerful new algorithm called Affinity Propagation (AP) based on message-passing techniques was proposed by Frey and Dueck \cite{Frey07}. In AP, each cluster is identified by a common exemplar all other data points of the same cluster refer to, and exemplars have to ref… ▽ More

    Submitted 29 November, 2007; v1 submitted 18 May, 2007; originally announced May 2007.

    Comments: 11 pages, supplementary material: http://isiosf.isi.it/~weigt/scap_supplement.pdf

    Journal ref: Bioinformatics 23, 2708 (2007)

  35. arXiv:0704.3406  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn q-bio.MN

    Propagation of external regulation and asynchronous dynamics in random Boolean networks

    Authors: Hamed Mahmoudi, Andrea Pagnani, Martin Weigt, Riccardo Zecchina

    Abstract: Boolean Networks and their dynamics are of great interest as abstract modeling schemes in various disciplines, ranging from biology to computer science. Whereas parallel update schemes have been studied extensively in past years, the level of understanding of asynchronous updates schemes is still very poor. In this paper we study the propagation of external information given by regulatory input… ▽ More

    Submitted 25 April, 2007; originally announced April 2007.

    Comments: 19 pages, 14 figures, to appear in Chaos

    Journal ref: Chaos 17, 026109 (2007)

  36. arXiv:cond-mat/0703534  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech

    Finitely coordinated models for low-temperature phases of amorphous systems

    Authors: Reimer Kuehn, Jort van Mourik, Martin Weigt, Annette Zippelius

    Abstract: We introduce models of heterogeneous systems with finite connectivity defined on random graphs to capture finite-coordination effects on the low-temperature behavior of finite dimensional systems. Our models use a description in terms of small deviations of particle coordinates from a set of reference positions, particularly appropriate for the description of low-temperature phenomena. A Born-vo… ▽ More

    Submitted 20 March, 2007; originally announced March 2007.

    Comments: 35 pages, 8 Figures

  37. arXiv:cond-mat/0605623  [pdf, ps, other

    cond-mat.stat-mech q-fin.ST

    Statistical mechanics of combinatorial auctions

    Authors: Tobias Galla, Michele Leone, Matteo Marsili, Mauro Sellitto, Martin Weigt, Riccardo Zecchina

    Abstract: Combinatorial auctions are formulated as frustrated lattice gases on sparse random graphs, allowing the determination of the optimal revenue by methods of statistical physics. Transitions between computationally easy and hard regimes are found and interpreted in terms of the geometric structure of the space of solutions. We introduce an iterative algorithm to solve intermediate and large instanc… ▽ More

    Submitted 24 August, 2006; v1 submitted 25 May, 2006; originally announced May 2006.

    Comments: 4 pages, 4 figures, minor changes, references added. To appear on PRL

    Journal ref: Phys. Rev. Lett. 97, 128701 (2006)

  38. arXiv:cond-mat/0605190  [pdf, ps, other

    cond-mat.stat-mech cs.DS

    Message passing for vertex covers

    Authors: Martin Weigt, Haijun Zhou

    Abstract: Constructing a minimal vertex cover of a graph can be seen as a prototype for a combinatorial optimization problem under hard constraints. In this paper, we develop and analyze message passing techniques, namely warning and survey propagation, which serve as efficient heuristic algorithms for solving these computational hard problems. We show also, how previously obtained results on the typical-… ▽ More

    Submitted 8 September, 2006; v1 submitted 8 May, 2006; originally announced May 2006.

    Comments: 25 pages, 9 figures - version accepted for publication in PRE

    Journal ref: Phys. Rev. E 74, 046110 (2006)

  39. arXiv:cond-mat/0603819  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn

    Sudden emergence of q-regular subgraphs in random graphs

    Authors: Marco Pretti, Martin Weigt

    Abstract: We investigate the computationally hard problem whether a random graph of finite average vertex degree has an extensively large $q$-regular subgraph, i.e., a subgraph with all vertices having degree equal to $q$. We reformulate this problem as a constraint-satisfaction problem, and solve it using the cavity method of statistical physics at zero temperature. For $q=3$, we find that the first larg… ▽ More

    Submitted 30 March, 2006; originally announced March 2006.

    Comments: 7 pages, 5 figures

    Journal ref: Europhys. Lett. 75, 8 (2006)

  40. arXiv:cond-mat/0602129  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech physics.soc-ph

    Introduction to graphs

    Authors: Alexander K. Hartmann, Martin Weigt

    Abstract: Graph theory provides fundamental concepts for many fields of science like statistical physics, network analysis and theoretical computer science. Here we give a pedagogical introduction to graph theory, divided into three sections. In the first, we introduce some basic notations and graph theoretical problems, e.g. Eulerian circuits, vertex covers, and graph colorings. The second section descri… ▽ More

    Submitted 6 February, 2006; originally announced February 2006.

    Comments: 45 pages, with permission of Wiley-VCH, see http://www.wiley.com

    Journal ref: A.K. Hartmann and M. Weigt, Phase Transitions in Combinatorial Optimization Problems, (Wiley-VCH, Berlin, Weinheim 2005), ISBN 3527404732

  41. arXiv:cond-mat/0512089  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn q-bio.MN

    Computational core and fixed-point organisation in Boolean networks

    Authors: L. Correale, M. Leone, A. Pagnani, M. Weigt, R. Zecchina

    Abstract: In this paper, we analyse large random Boolean networks in terms of a constraint satisfaction problem. We first develop an algorithmic scheme which allows to prune simple logical cascades and under-determined variables, returning thereby the computational core of the network. Second we apply the cavity method to analyse number and organisation of fixed points. We find in particular a phase trans… ▽ More

    Submitted 6 March, 2006; v1 submitted 5 December, 2005; originally announced December 2005.

    Comments: 29 pages, 18 figures, version accepted for publication in JSTAT

    Journal ref: J. Stat. Mech. (2006) P03002

  42. arXiv:cond-mat/0506194  [pdf, ps, other

    cond-mat.soft cond-mat.dis-nn cond-mat.stat-mech

    Cavity Approach to the Random Solid State

    Authors: Xiaoming Mao, Paul M. Goldbart, Marc Mezard, Martin Weigt

    Abstract: The cavity approach is used to address the physical properties of random solids in equilibrium. Particular attention is paid to the fraction of localized particles and the distribution of localization lengths characterizing their thermal motion. This approach is of relevance to a wide class of random solids, including rubbery media (formed via the vulcanization of polymer fluids) and chemical ge… ▽ More

    Submitted 8 June, 2005; originally announced June 2005.

    Comments: 4 pages, 2 figures

    Journal ref: Phys. Rev. Lett. 95, 148302 (2005)

  43. arXiv:cond-mat/0505202  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn

    A hard-sphere model on generalised Bethe lattices: Dynamics

    Authors: Hendrik Hansen-Goos, Martin Weigt

    Abstract: We analyse the dynamics of a hard-sphere lattice gas on generalised Bethe lattices using a projective approximation scheme (PAS). The latter consists in mapping the system's dynamics to a finite set of global observables, closure of the resulting equations is obtained by approximating the true non-equilibrium state by a pseudo-equilibrium based only on the value of the observables under consider… ▽ More

    Submitted 9 May, 2005; originally announced May 2005.

    Comments: 23 pages, 12 figures

    Journal ref: J. Stat. Mech. P08001 (2005)

  44. arXiv:cond-mat/0501571  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn

    A hard-sphere model on generalized Bethe lattices: Statics

    Authors: Hendrik Hansen-Goos, Martin Weigt

    Abstract: We analyze the phase diagram of a model of hard spheres of chemical radius one, which is defined over a generalized Bethe lattice containing short loops. We find a liquid, two different crystalline, a glassy and an unusual crystalline glassy phase. Special attention is also paid to the close-packing limit in the glassy phase. All analytical results are cross-checked by numerical Monte-Carlo simu… ▽ More

    Submitted 10 May, 2005; v1 submitted 24 January, 2005; originally announced January 2005.

    Comments: 24 pages, revised version

    Journal ref: J. Stat. Mech. (2005) P04006

  45. arXiv:cond-mat/0412443  [pdf, ps, other

    cond-mat.dis-nn cond-mat.other q-bio.MN q-bio.OT

    Core percolation and onset of complexity in Boolean networks

    Authors: L. Correale, M. Leone, A. Pagnani, M. Weigt, R. Zecchina

    Abstract: The determination and classification of fixed points of large Boolean networks is addressed in terms of constraint satisfaction problem. We develop a general simplification scheme that, removing all those variables and functions belonging to trivial logical cascades, returns the computational core of the network. The onset of an easy-to-complex regulatory phase is introduced as a function of the… ▽ More

    Submitted 22 November, 2005; v1 submitted 16 December, 2004; originally announced December 2004.

    Comments: major revisions, extended results, version accepted for publication in PRL

    Journal ref: Phys. Rev. Lett. 96, 018101 (2006)

  46. arXiv:cond-mat/0403725  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.CC

    Threshold values, stability analysis and high-q asymptotics for the coloring problem on random graphs

    Authors: Florent Krzakala, Andrea Pagnani, Martin Weigt

    Abstract: We consider the problem of coloring Erdos-Renyi and regular random graphs of finite connectivity using q colors. It has been studied so far using the cavity approach within the so-called one-step replica symmetry breaking (1RSB) ansatz. We derive a general criterion for the validity of this ansatz and, applying it to the ground state, we provide evidence that the 1RSB solution gives exact thresh… ▽ More

    Submitted 28 July, 2004; v1 submitted 30 March, 2004; originally announced March 2004.

    Comments: 23 pages, 10 figures. Replaced with accepted version

    Journal ref: Phys. Rev. E 70, 046705 (2004)

  47. arXiv:cond-mat/0402451  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn

    Approximation schemes for the dynamics of diluted spin models: the Ising ferromagnet on a Bethe lattice

    Authors: Guilhem Semerjian, Martin Weigt

    Abstract: We discuss analytical approximation schemes for the dynamics of diluted spin models. The original dynamics of the complete set of degrees of freedom is replaced by a hierarchy of equations including an increasing number of global observables, which can be closed approximately at different levels of the hierarchy. We illustrate this method on the simple example of the Ising ferromagnet on a Bethe… ▽ More

    Submitted 17 February, 2004; originally announced February 2004.

    Comments: 21 pages, 5 figures

    Journal ref: J. Phys. A 37, 5525 (2004)

  48. Statistical mechanics of the vertex-cover problem

    Authors: Alexander K. Hartmann, Martin Weigt

    Abstract: We review recent progress in the study of the vertex-cover problem (VC). VC belongs to the class of NP-complete graph theoretical problems, which plays a central role in theoretical computer science. On ensembles of random graphs, VC exhibits an coverable-uncoverable phase transition. Very close to this transition, depending on the solution algorithm, easy-hard transitions in the typical running… ▽ More

    Submitted 10 July, 2003; originally announced July 2003.

    Comments: review article, 26 pages, 9 figures, to appear in J. Phys. A: Math. Gen

    Journal ref: J. Phys. A: Math. Gen. 36, 11069 (2003)

  49. arXiv:cond-mat/0304558  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech

    Polynomial iterative algorithms for coloring and analyzing random graphs

    Authors: A. Braunstein, R. Mulet, A. Pagnani, M. Weigt, R. Zecchina

    Abstract: We study the graph coloring problem over random graphs of finite average connectivity $c$. Given a number $q$ of available colors, we find that graphs with low connectivity admit almost always a proper coloring whereas graphs with high connectivity are uncolorable. Depending on $q$, we find the precise value of the critical average connectivity $c_q$. Moreover, we show that below $c_q$ there exi… ▽ More

    Submitted 24 April, 2003; originally announced April 2003.

    Comments: 23 pages, 10 eps figures

    Journal ref: Phys. Rev. E 68, 036702 (2003)

  50. arXiv:cond-mat/0301271  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn cs.CC

    Solving satisfiability problems by fluctuations: The dynamics of stochastic local search algorithms

    Authors: Wolfgang Barthel, Alexander K. Hartmann, Martin Weigt

    Abstract: Stochastic local search algorithms are frequently used to numerically solve hard combinatorial optimization or decision problems. We give numerical and approximate analytical descriptions of the dynamics of such algorithms applied to random satisfiability problems. We find two different dynamical regimes, depending on the number of constraints per variable: For low constraintness, the problems a… ▽ More

    Submitted 7 May, 2003; v1 submitted 15 January, 2003; originally announced January 2003.

    Comments: 21 pages, 18 figures, revised version, to app. in PRE (2003)

    Journal ref: Phys. Rev. E 67, 066104 (2003)