-
Towards a robust criterion of anomalous diffusion
Authors:
Vittoria Sposini,
Diego Krapf,
Enzo Marinari,
Raimon Sunyer,
Felix Ritort,
Fereydoon Taheri,
Christine Selhuber-Unkel,
Rebecca Benelli,
Matthias Weiss,
Ralf Metzler,
Gleb Oshanin
Abstract:
Anomalous-diffusion, the departure of the spreading dynamics of diffusing particles from the traditional law of Brownian-motion, is a signature feature of a large number of complex soft-matter and biological systems. Anomalous-diffusion emerges due to a variety of physical mechanisms, e.g., trapping interactions or the viscoelasticity of the environment. However, sometimes systems dynamics are err…
▽ More
Anomalous-diffusion, the departure of the spreading dynamics of diffusing particles from the traditional law of Brownian-motion, is a signature feature of a large number of complex soft-matter and biological systems. Anomalous-diffusion emerges due to a variety of physical mechanisms, e.g., trapping interactions or the viscoelasticity of the environment. However, sometimes systems dynamics are erroneously claimed to be anomalous, despite the fact that the true motion is Brownian -- or vice versa. This ambiguity in establishing whether the dynamics as normal or anomalous can have far-reaching consequences, e.g., in predictions for reaction- or relaxation-laws. Demonstrating that a system exhibits normal- or anomalous-diffusion is highly desirable for a vast host of applications. Here, we present a criterion for anomalous-diffusion based on the method of power-spectral analysis of single trajectories. The robustness of this criterion is studied for trajectories of fractional-Brownian-motion, a ubiquitous stochastic process for the description of anomalous-diffusion, in the presence of two types of measurement errors. In particular, we find that our criterion is very robust for subdiffusion. Various tests on surrogate data in absence or presence of additional positional noise demonstrate the efficacy of this method in practical contexts. Finally, we provide a proof-of-concept based on diverse experiments exhibiting both normal and anomalous-diffusion.
△ Less
Submitted 9 November, 2022;
originally announced November 2022.
-
Unravelling the origins of anomalous diffusion: from molecules to migrating storks
Authors:
Ohad Vilk,
Erez Aghion,
Tal Avgar,
Carsten Beta,
Oliver Nagel,
Adal Sabri,
Raphael Sarfati,
Daniel K. Schwartz,
Matthias Weiss,
Diego Krapf,
Ran Nathan,
Ralf Metzler,
Michael Assaf
Abstract:
Anomalous diffusion or, more generally, anomalous transport, with nonlinear dependence of the mean-squared displacement on the measurement time, is ubiquitous in nature. It has been observed in processes ranging from microscopic movement of molecules to macroscopic, large-scale paths of migrating birds. Using data from multiple empirical systems, spanning 12 orders of magnitude in length and 8 ord…
▽ More
Anomalous diffusion or, more generally, anomalous transport, with nonlinear dependence of the mean-squared displacement on the measurement time, is ubiquitous in nature. It has been observed in processes ranging from microscopic movement of molecules to macroscopic, large-scale paths of migrating birds. Using data from multiple empirical systems, spanning 12 orders of magnitude in length and 8 orders of magnitude in time, we employ a method to detect the individual underlying origins of anomalous diffusion and transport in the data. This method decomposes anomalous transport into three primary effects: long-range correlations ("Joseph effect"), fat-tailed probability density of increments ("Noah effect"), and non-stationarity ("Moses effect"). We show that such a decomposition of real-life data allows to infer nontrivial behavioral predictions, and to resolve open questions in the fields of single particle tracking in living cells and movement ecology.
△ Less
Submitted 27 June, 2022; v1 submitted 9 September, 2021;
originally announced September 2021.
-
From genotypes to organisms: State-of-the-art and perspectives of a cornerstone in evolutionary dynamics
Authors:
Susanna Manrubia,
José A. Cuesta,
Jacobo Aguirre,
Sebastian E. Ahnert,
Lee Altenberg,
Alejandro V. Cano,
Pablo Catalán,
Ramon Diaz-Uriarte,
Santiago F. Elena,
Juan Antonio García-Martín,
Paulien Hogeweg,
Bhavin S. Khatri,
Joachim Krug,
Ard A. Louis,
Nora S. Martin,
Joshua L. Payne,
Matthew J. Tarnowski,
Marcel Weiß
Abstract:
Understanding how genotypes map onto phenotypes, fitness, and eventually organisms is arguably the next major missing piece in a fully predictive theory of evolution. We refer to this generally as the problem of the genotype-phenotype map. Though we are still far from achieving a complete picture of these relationships, our current understanding of simpler questions, such as the structure induced…
▽ More
Understanding how genotypes map onto phenotypes, fitness, and eventually organisms is arguably the next major missing piece in a fully predictive theory of evolution. We refer to this generally as the problem of the genotype-phenotype map. Though we are still far from achieving a complete picture of these relationships, our current understanding of simpler questions, such as the structure induced in the space of genotypes by sequences mapped to molecular structures, has revealed important facts that deeply affect the dynamical description of evolutionary processes. Empirical evidence supporting the fundamental relevance of features such as phenotypic bias is mounting as well, while the synthesis of conceptual and experimental progress leads to questioning current assumptions on the nature of evolutionary dynamics-cancer progression models or synthetic biology approaches being notable examples. This work delves into a critical and constructive attitude in our current knowledge of how genotypes map onto molecular phenotypes and organismal functions, and discusses theoretical and empirical avenues to broaden and improve this comprehension. As a final goal, this community should aim at deriving an updated picture of evolutionary processes soundly relying on the structural properties of genotype spaces, as revealed by modern techniques of molecular and functional analysis.
△ Less
Submitted 17 March, 2021; v1 submitted 2 February, 2020;
originally announced February 2020.
-
Is graph-based feature selection of genes better than random?
Authors:
Mohammad Hashir,
Paul Bertin,
Martin Weiss,
Vincent Frappier,
Theodore J. Perkins,
Geneviève Boucher,
Joseph Paul Cohen
Abstract:
Gene interaction graphs aim to capture various relationships between genes and represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing whether those graphs capture dep…
▽ More
Gene interaction graphs aim to capture various relationships between genes and represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing whether those graphs capture dependencies seen in gene expression data better than random. We formulate a condition that graphs should satisfy to provide a good prior knowledge and propose to test it using a `Single Gene Inference' (SGI) task. We compare random graphs with seven major gene interaction graphs published by different research groups, aiming to measure the true benefit of using biologically relevant graphs in this context. Our analysis finds that dependencies can be captured almost as well at random which suggests that, in terms of gene expression levels, the relevant information about the state of the cell is spread across many genes.
△ Less
Submitted 27 December, 2019; v1 submitted 21 October, 2019;
originally announced October 2019.
-
The TCGA Meta-Dataset Clinical Benchmark
Authors:
Mandana Samiei,
Tobias Würfl,
Tristan Deleu,
Martin Weiss,
Francis Dutil,
Thomas Fevens,
Geneviève Boucher,
Sebastien Lemieux,
Joseph Paul Cohen
Abstract:
Machine learning is bringing a paradigm shift to healthcare by changing the process of disease diagnosis and prognosis in clinics and hospitals. This development equips doctors and medical staff with tools to evaluate their hypotheses and hence make more precise decisions. Although most current research in the literature seeks to develop techniques and methods for predicting one particular clinica…
▽ More
Machine learning is bringing a paradigm shift to healthcare by changing the process of disease diagnosis and prognosis in clinics and hospitals. This development equips doctors and medical staff with tools to evaluate their hypotheses and hence make more precise decisions. Although most current research in the literature seeks to develop techniques and methods for predicting one particular clinical outcome, this approach is far from the reality of clinical decision making in which you have to consider several factors simultaneously. In addition, it is difficult to follow the recent progress concretely as there is a lack of consistency in benchmark datasets and task definitions in the field of Genomics. To address the aforementioned issues, we provide a clinical Meta-Dataset derived from the publicly available data hub called The Cancer Genome Atlas Program (TCGA) that contains 174 tasks. We believe those tasks could be good proxy tasks to develop methods which can work on a few samples of gene expression data. Also, learning to predict multiple clinical variables using gene-expression data is an important task due to the variety of phenotypes in clinical problems and lack of samples for some of the rare variables. The defined tasks cover a wide range of clinical problems including predicting tumor tissue site, white cell count, histological type, family history of cancer, gender, and many others which we explain later in the paper. Each task represents an independent dataset. We use regression and neural network baselines for all the tasks using only 150 samples and compare their performance.
△ Less
Submitted 18 October, 2019;
originally announced October 2019.
-
Analysis of Gene Interaction Graphs as Prior Knowledge for Machine Learning Models
Authors:
Paul Bertin,
Mohammad Hashir,
Martin Weiss,
Vincent Frappier,
Theodore J. Perkins,
Geneviève Boucher,
Joseph Paul Cohen
Abstract:
Gene interaction graphs aim to capture various relationships between genes and can represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing how well those graphs captur…
▽ More
Gene interaction graphs aim to capture various relationships between genes and can represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing how well those graphs capture dependencies seen in gene expression data to evaluate the adequacy of the prior knowledge provided by those graphs. We propose a condition graphs should satisfy to provide good prior knowledge and test it using `Single Gene Inference' tasks. We also compare with randomly generated graphs, aiming to measure the true benefit of using biologically relevant graphs in this context, and validate our findings with five clinical tasks. We find some graphs capture relevant dependencies for most genes while being very sparse. Our analysis with random graphs finds that dependencies can be captured almost as well at random which suggests that, in terms of gene expression levels, the relevant information about the state of the cell is spread across many genes.
△ Less
Submitted 13 January, 2020; v1 submitted 6 May, 2019;
originally announced May 2019.
-
Towards Gene Expression Convolutions using Gene Interaction Graphs
Authors:
Francis Dutil,
Joseph Paul Cohen,
Martin Weiss,
Georgy Derevyanko,
Yoshua Bengio
Abstract:
We study the challenges of applying deep learning to gene expression data. We find experimentally that there exists non-linear signal in the data, however is it not discovered automatically given the noise and low numbers of samples used in most research. We discuss how gene interaction graphs (same pathway, protein-protein, co-expression, or research paper text association) can be used to impose…
▽ More
We study the challenges of applying deep learning to gene expression data. We find experimentally that there exists non-linear signal in the data, however is it not discovered automatically given the noise and low numbers of samples used in most research. We discuss how gene interaction graphs (same pathway, protein-protein, co-expression, or research paper text association) can be used to impose a bias on a deep model similar to the spatial bias imposed by convolutions on an image. We explore the usage of Graph Convolutional Neural Networks coupled with dropout and gene embeddings to utilize the graph information. We find this approach provides an advantage for particular tasks in a low data regime but is very dependent on the quality of the graph used. We conclude that more work should be done in this direction. We design experiments that show why existing methods fail to capture signal that is present in the data when features are added which clearly isolates the problem that needs to be addressed.
△ Less
Submitted 18 June, 2018;
originally announced June 2018.