Towards a robust criterion of anomalous diffusion

Authors: Vittoria Sposini, Diego Krapf, Enzo Marinari, Raimon Sunyer, Felix Ritort, Fereydoon Taheri, Christine Selhuber-Unkel, Rebecca Benelli, Matthias Weiss, Ralf Metzler, Gleb Oshanin

Abstract: Anomalous-diffusion, the departure of the spreading dynamics of diffusing particles from the traditional law of Brownian-motion, is a signature feature of a large number of complex soft-matter and biological systems. Anomalous-diffusion emerges due to a variety of physical mechanisms, e.g., trapping interactions or the viscoelasticity of the environment. However, sometimes system dynamics are erroneously claimed to be anomalous, despite the fact that the true motion is Brownian -- or vice versa. This ambiguity in establishing whether the dynamics are normal or anomalous can have far-reaching consequences, e.g., in predictions for reaction- or relaxation-laws. Demonstrating that a system exhibits normal- or anomalous-diffusion is highly desirable for a vast host of applications. Here, we present a criterion for anomalous-diffusion based on the method of power-spectral analysis of single trajectories. The robustness of this criterion is studied for trajectories of fractional-Brownian-motion, a ubiquitous stochastic process for the description of anomalous-diffusion, in the presence of two types of measurement errors. In particular, we find that our criterion is very robust for subdiffusion. Various tests on surrogate data in the absence or presence of additional positional noise demonstrate the efficacy of this method in practical contexts. Finally, we provide a proof-of-concept based on diverse experiments exhibiting both normal and anomalous-diffusion.

Submitted 9 November, 2022; originally announced November 2022.

Comments: 13 pages, 6 figures, RevTeX

arXiv:2109.04309

Unravelling the origins of anomalous diffusion: from molecules to migrating storks

Authors: Ohad Vilk, Erez Aghion, Tal Avgar, Carsten Beta, Oliver Nagel, Adal Sabri, Raphael Sarfati, Daniel K. Schwartz, Matthias Weiss, Diego Krapf, Ran Nathan, Ralf Metzler, Michael Assaf

Abstract: Anomalous diffusion or, more generally, anomalous transport, with nonlinear dependence of the mean-squared displacement on the measurement time, is ubiquitous in nature. It has been observed in processes ranging from microscopic movement of molecules to macroscopic, large-scale paths of migrating birds. Using data from multiple empirical systems, spanning 12 orders of magnitude in length and 8 orders of magnitude in time, we employ a method to detect the individual underlying origins of anomalous diffusion and transport in the data. This method decomposes anomalous transport into three primary effects: long-range correlations ("Joseph effect"), fat-tailed probability density of increments ("Noah effect"), and non-stationarity ("Moses effect"). We show that such a decomposition of real-life data allows us to infer nontrivial behavioral predictions and to resolve open questions in the fields of single particle tracking in living cells and movement ecology.

Submitted 27 June, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

Comments: 17 pages, 6 figures + Supplemental Material. To appear in Physical Review Research (2022)

arXiv:2002.00363

doi 10.1016/j.plrev.2021.03.004

From genotypes to organisms: State-of-the-art and perspectives of a cornerstone in evolutionary dynamics

Authors: Susanna Manrubia, José A. Cuesta, Jacobo Aguirre, Sebastian E. Ahnert, Lee Altenberg, Alejandro V. Cano, Pablo Catalán, Ramon Diaz-Uriarte, Santiago F. Elena, Juan Antonio García-Martín, Paulien Hogeweg, Bhavin S. Khatri, Joachim Krug, Ard A. Louis, Nora S. Martin, Joshua L. Payne, Matthew J. Tarnowski, Marcel Weiß

Abstract: Understanding how genotypes map onto phenotypes, fitness, and eventually organisms is arguably the next major missing piece in a fully predictive theory of evolution. We refer to this generally as the problem of the genotype-phenotype map. Though we are still far from achieving a complete picture of these relationships, our current understanding of simpler questions, such as the structure induced in the space of genotypes by sequences mapped to molecular structures, has revealed important facts that deeply affect the dynamical description of evolutionary processes. Empirical evidence supporting the fundamental relevance of features such as phenotypic bias is mounting as well, while the synthesis of conceptual and experimental progress leads to questioning current assumptions on the nature of evolutionary dynamics, with cancer progression models or synthetic biology approaches being notable examples. This work takes a critical and constructive look at our current knowledge of how genotypes map onto molecular phenotypes and organismal functions, and discusses theoretical and empirical avenues to broaden and improve this comprehension. As a final goal, this community should aim at deriving an updated picture of evolutionary processes soundly relying on the structural properties of genotype spaces, as revealed by modern techniques of molecular and functional analysis.

Submitted 17 March, 2021; v1 submitted 2 February, 2020; originally announced February 2020.

Comments: 111 pages, 11 figures, uses elsarticle LaTeX class

Journal ref: Physics of Life Reviews 38, 55-106 (2021)

arXiv:1910.09600

Is graph-based feature selection of genes better than random?

Authors: Mohammad Hashir, Paul Bertin, Martin Weiss, Vincent Frappier, Theodore J. Perkins, Geneviève Boucher, Joseph Paul Cohen

Abstract: Gene interaction graphs aim to capture various relationships between genes and represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing whether those graphs capture dependencies seen in gene expression data better than random. We formulate a condition that graphs should satisfy to provide good prior knowledge and propose to test it using a `Single Gene Inference' (SGI) task. We compare random graphs with seven major gene interaction graphs published by different research groups, aiming to measure the true benefit of using biologically relevant graphs in this context. Our analysis finds that dependencies can be captured almost as well at random, which suggests that, in terms of gene expression levels, the relevant information about the state of the cell is spread across many genes.

Submitted 27 December, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

Comments: Accepted to the Machine Learning in Computational Biology (MLCB) meeting 2019. 7 pages. 4 figures. arXiv admin note: substantial text overlap with arXiv:1905.02295

arXiv:1910.08636

The TCGA Meta-Dataset Clinical Benchmark

Authors: Mandana Samiei, Tobias Würfl, Tristan Deleu, Martin Weiss, Francis Dutil, Thomas Fevens, Geneviève Boucher, Sebastien Lemieux, Joseph Paul Cohen

Abstract: Machine learning is bringing a paradigm shift to healthcare by changing the process of disease diagnosis and prognosis in clinics and hospitals. This development equips doctors and medical staff with tools to evaluate their hypotheses and hence make more precise decisions. Although most current research in the literature seeks to develop techniques and methods for predicting one particular clinical outcome, this approach is far from the reality of clinical decision making in which you have to consider several factors simultaneously. In addition, it is difficult to follow the recent progress concretely as there is a lack of consistency in benchmark datasets and task definitions in the field of Genomics. To address the aforementioned issues, we provide a clinical Meta-Dataset derived from the publicly available data hub called The Cancer Genome Atlas Program (TCGA) that contains 174 tasks. We believe those tasks could be good proxy tasks to develop methods which can work on a few samples of gene expression data. Also, learning to predict multiple clinical variables using gene-expression data is an important task due to the variety of phenotypes in clinical problems and lack of samples for some of the rare variables. The defined tasks cover a wide range of clinical problems including predicting tumor tissue site, white cell count, histological type, family history of cancer, gender, and many others which we explain later in the paper. Each task represents an independent dataset. We use regression and neural network baselines for all the tasks using only 150 samples and compare their performance.

Submitted 18 October, 2019; originally announced October 2019.

Comments: 5 Pages, Submitted to MLCB 2019

arXiv:1905.02295

Analysis of Gene Interaction Graphs as Prior Knowledge for Machine Learning Models

Authors: Paul Bertin, Mohammad Hashir, Martin Weiss, Vincent Frappier, Theodore J. Perkins, Geneviève Boucher, Joseph Paul Cohen

Abstract: Gene interaction graphs aim to capture various relationships between genes and can represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing how well those graphs capture dependencies seen in gene expression data to evaluate the adequacy of the prior knowledge provided by those graphs. We propose a condition graphs should satisfy to provide good prior knowledge and test it using `Single Gene Inference' tasks. We also compare with randomly generated graphs, aiming to measure the true benefit of using biologically relevant graphs in this context, and validate our findings with five clinical tasks. We find some graphs capture relevant dependencies for most genes while being very sparse. Our analysis with random graphs finds that dependencies can be captured almost as well at random, which suggests that, in terms of gene expression levels, the relevant information about the state of the cell is spread across many genes.

Submitted 13 January, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

Comments: Preprint. Under review

arXiv:1806.06975

Towards Gene Expression Convolutions using Gene Interaction Graphs

Authors: Francis Dutil, Joseph Paul Cohen, Martin Weiss, Georgy Derevyanko, Yoshua Bengio

Abstract: We study the challenges of applying deep learning to gene expression data. We find experimentally that there exists non-linear signal in the data; however, it is not discovered automatically given the noise and low numbers of samples used in most research. We discuss how gene interaction graphs (same pathway, protein-protein, co-expression, or research paper text association) can be used to impose a bias on a deep model similar to the spatial bias imposed by convolutions on an image. We explore the usage of Graph Convolutional Neural Networks coupled with dropout and gene embeddings to utilize the graph information. We find this approach provides an advantage for particular tasks in a low data regime but is very dependent on the quality of the graph used. We conclude that more work should be done in this direction. We design experiments that show why existing methods fail to capture signal that is present in the data when features are added, which clearly isolates the problem that needs to be addressed.

Submitted 18 June, 2018; originally announced June 2018.

Comments: 4 pages + 1 page of references. To appear in the International Conference on Machine Learning Workshop on Computational Biology, 2018

Showing 1–7 of 7 results for author: Weiß, M