-
Curriculum Based Multi-Task Learning for Parkinson's Disease Detection
Authors:
Nikhil J. Dhinagar,
Conor Owens-Walton,
Emily Laltoo,
Christina P. Boyle,
Yao-Liang Chen,
Philip Cook,
Corey McMillan,
Chih-Chien Tsai,
J-J Wang,
Yih-Ru Wu,
Ysbrand van der Werf,
Paul M. Thompson
Abstract:
There is great interest in developing radiological classifiers for diagnosis, staging, and predictive modeling in progressive diseases such as Parkinson's disease (PD), a neurodegenerative disease that is difficult to detect in its early stages. Here we leverage severity-based meta-data on the stages of disease to define a curriculum for training a deep convolutional neural network (CNN). Typicall…
▽ More
There is great interest in developing radiological classifiers for diagnosis, staging, and predictive modeling in progressive diseases such as Parkinson's disease (PD), a neurodegenerative disease that is difficult to detect in its early stages. Here we leverage severity-based meta-data on the stages of disease to define a curriculum for training a deep convolutional neural network (CNN). Typically, deep learning networks are trained by randomly selecting samples in each mini-batch. By contrast, curriculum learning is a training strategy that aims to boost classifier performance by starting with examples that are easier to classify. Here we define a curriculum to progressively increase the difficulty of the training data corresponding to the Hoehn and Yahr (H&Y) staging system for PD (total N=1,012; 653 PD patients, 359 controls; age range: 20.0-84.9 years). Even with our multi-task setting using pre-trained CNNs and transfer learning, PD classification based on T1-weighted (T1-w) MRI was challenging (ROC AUC: 0.59-0.65), but curriculum training boosted performance (by 3.9%) compared to our baseline model. Future work with multimodal imaging may further boost performance.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
Weakly-supervised learning for image-based classification of primary melanomas into genomic immune subgroups
Authors:
Lucy Godson,
Navid Alemi,
Jeremie Nsengimana,
Graham P. Cook,
Emily L. Clarke,
Darren Treanor,
D. Timothy Bishop,
Julia Newton-Bishop,
Ali Gooya
Abstract:
Determining early-stage prognostic markers and stratifying patients for effective treatment are two key challenges for improving outcomes for melanoma patients. Previous studies have used tumour transcriptome data to stratify patients into immune subgroups, which were associated with differential melanoma specific survival and potential treatment strategies. However, acquiring transcriptome data i…
▽ More
Determining early-stage prognostic markers and stratifying patients for effective treatment are two key challenges for improving outcomes for melanoma patients. Previous studies have used tumour transcriptome data to stratify patients into immune subgroups, which were associated with differential melanoma specific survival and potential treatment strategies. However, acquiring transcriptome data is a time-consuming and costly process. Moreover, it is not routinely used in the current clinical workflow. Here we attempt to overcome this by developing deep learning models to classify gigapixel H&E stained pathology slides, which are well established in clinical workflows, into these immune subgroups. Previous subtyping approaches have employed supervised learning which requires fully annotated data, or have only examined single genetic mutations in melanoma patients. We leverage a multiple-instance learning approach, which only requires slide-level labels and uses an attention mechanism to highlight regions of high importance to the classification. Moreover, we show that pathology-specific self-supervised models generate better representations compared to pathology-agnostic models for improving our model performance, achieving a mean AUC of 0.76 for classifying histopathology images as high or low immune subgroups. We anticipate that this method may allow us to find new biomarkers of high importance and could act as a tool for clinicians to infer the immune landscape of tumours and stratify patients, without needing to carry out additional expensive genetic tests.
△ Less
Submitted 23 February, 2022;
originally announced February 2022.
-
Simulating topological domains in human chromosomes with a fitting-free model
Authors:
C. A. Brackley,
D. Michieletto,
F. Mouvet,
J. Johnson,
S. Kelly,
P. R. Cook,
D. Marenduzzo
Abstract:
We discuss a polymer model for the 3D organization of human chromosomes. A chromosome is represented by a string of beads, with each bead being "colored" according to 1D bioinformatic data (e.g., chromatin state, histone modification, GC content). Individual spheres (representing bi- and multi-valent transcription factors) can bind reversibly and selectively to beads with the appropriate color. Du…
▽ More
We discuss a polymer model for the 3D organization of human chromosomes. A chromosome is represented by a string of beads, with each bead being "colored" according to 1D bioinformatic data (e.g., chromatin state, histone modification, GC content). Individual spheres (representing bi- and multi-valent transcription factors) can bind reversibly and selectively to beads with the appropriate color. During molecular dynamics simulations, the factors bind, and the string spontaneously folds into loops, rosettes, and topologically-associating domains (TADs). This organization occurs in the absence of any specified interactions between distant DNA segments, or between transcription factors. A comparison with Hi-C data shows that simulations predict the location of most boundaries between TADs correctly. The model is "fitting-free" in the sense that it does not use Hi-C data as an input; consequently, one of its strengths is that it can -- in principle -- be used to predict the 3D organization of any region of interest, or whole chromosome, in a given organism, or cell line, in the absence of existing Hi-C data. We discuss how this simple model might be refined to include more transcription factors and binding sites, and to correctly predict contacts between convergent CTCF binding sites.
△ Less
Submitted 14 October, 2020;
originally announced October 2020.
-
Extrusion without a motor: a new take on the loop extrusion model of genome organization
Authors:
C. A. Brackley,
J. Johnson,
D. Michieletto,
A. N. Morozov,
M. Nicodemi,
P. R. Cook,
D. Marenduzzo
Abstract:
Chromatin loop extrusion is a popular model for the formation of CTCF loops and topological domains. Recent HiC data have revealed a strong bias in favour of a particular arrangement of the CTCF binding motifs that stabilize loops, and extrusion is the only model to date which can explain this. However, the model requires a motor to generate the loops, and although cohesin is a strong candidate fo…
▽ More
Chromatin loop extrusion is a popular model for the formation of CTCF loops and topological domains. Recent HiC data have revealed a strong bias in favour of a particular arrangement of the CTCF binding motifs that stabilize loops, and extrusion is the only model to date which can explain this. However, the model requires a motor to generate the loops, and although cohesin is a strong candidate for the extruding factor, a suitable motor protein (or a motor activity in cohesin itself) has yet to be found. Here we explore a new hypothesis: that there is no motor, and thermal motion within the nucleus drives extrusion. Using theoretical modelling and computer simulations we ask whether such diffusive extrusion could feasibly generate loops. Our simulations uncover an interesting ratchet effect (where an osmotic pressure promotes loop growth), and suggest, by comparison to recent in vitro and in vivo measurements, that diffusive extrusion can in principle generate loops of the size observed in the data.
Extra View on : C. A. Brackley, J. Johnson, D. Michieletto, A. N. Morozov, M. Nicodemi, P. R. Cook, and D. Marenduzzo "Non-equilibrium chromosome looping via molecular slip-links", Physical Review Letters 119, 138101 (2017)
△ Less
Submitted 6 October, 2020;
originally announced October 2020.
-
Transcription-driven genome organization: a model for chromosome structure and the regulation of gene expression tested through simulations
Authors:
Peter R. Cook,
Davide Marenduzzo
Abstract:
Current models for the folding of the human genome see a hierarchy stretching down from chromosome territories, through A/B compartments and TADs (topologically-associating domains), to contact domains stabilized by cohesin and CTCF. However, molecular mechanisms underlying this folding, and the way folding affects transcriptional activity, remain obscure. Here we review physical principles drivin…
▽ More
Current models for the folding of the human genome see a hierarchy stretching down from chromosome territories, through A/B compartments and TADs (topologically-associating domains), to contact domains stabilized by cohesin and CTCF. However, molecular mechanisms underlying this folding, and the way folding affects transcriptional activity, remain obscure. Here we review physical principles driving proteins bound to long polymers into clusters surrounded by loops, and present a parsimonious yet comprehensive model for the way the organization determines function. We argue that clusters of active RNA polymerases and their transcription factors are major architectural features; then, contact domains, TADs, and compartments just reflect one or more loops and clusters. We suggest tethering a gene close to a cluster containing appropriate factors -- a transcription factory -- increases the firing frequency, and offer solutions to many current puzzles concerning the actions of enhancers, super-enhancers, boundaries, and eQTLs (expression quantitative trait loci). As a result, the activity of any gene is directly influenced by the activity of other transcription units around it in 3D space, and this is supported by Brownian-dynamics simulations of transcription factors binding to cognate sites on long polymers.
△ Less
Submitted 1 October, 2020;
originally announced October 2020.
-
Shaping Epigenetic Memory via Genomic Bookmarking
Authors:
Davide Michieletto,
Michael Chiang,
Davide Coli,
Argyris Papantonis,
Enzo Orlandini,
Peter R. Cook,
Davide Marenduzzo
Abstract:
Reconciling the stability of epigenetic patterns with the rapid turnover of histone modifications and their adaptability to external stimuli is an outstanding challenge. Here, we propose a new biophysical mechanism that can establish and maintain robust yet plastic epigenetic domains via genomic bookmarking (GBM). We model chromatin as a recolourable polymer whose segments bear non-permanent histo…
▽ More
Reconciling the stability of epigenetic patterns with the rapid turnover of histone modifications and their adaptability to external stimuli is an outstanding challenge. Here, we propose a new biophysical mechanism that can establish and maintain robust yet plastic epigenetic domains via genomic bookmarking (GBM). We model chromatin as a recolourable polymer whose segments bear non-permanent histone marks (or colours) which can be modified by "writer" proteins. The three-dimensional chromatin organisation is mediated by protein bridges, or "readers", such as Polycomb Repressive Complexes and Transcription Factors. The coupling between readers and writers drives spreading of biochemical marks and sustains the memory of local chromatin states across replication and mitosis. In contrast, GBM-targeted perturbations destabilise the epigenetic patterns. Strikingly, we demonstrate that GBM alone can explain the full distribution of Polycomb marks in a whole Drosophila chromosome. We finally suggest that our model provides a starting point for an understanding of the biophysics of cellular differentiation and reprogramming.
△ Less
Submitted 28 November, 2017; v1 submitted 5 September, 2017;
originally announced September 2017.
-
Non-equilibrium chromosome looping via molecular slip-links
Authors:
C. A. Brackley,
J. Johnson,
D. Michieletto,
A. N. Morozov,
M. Nicodemi,
P. R. Cook,
D. Marenduzzo
Abstract:
We propose a model for the formation of chromatin loops based on the diffusive sliding of a DNA-bound factor which can dimerise to form a molecular slip-link. Our slip-links mimic the behaviour of cohesin-like molecules, which, along with the CTCF protein, stabilize loops which organize the genome. By combining 3D Brownian dynamics simulations and 1D exactly solvable non-equilibrium models, we sho…
▽ More
We propose a model for the formation of chromatin loops based on the diffusive sliding of a DNA-bound factor which can dimerise to form a molecular slip-link. Our slip-links mimic the behaviour of cohesin-like molecules, which, along with the CTCF protein, stabilize loops which organize the genome. By combining 3D Brownian dynamics simulations and 1D exactly solvable non-equilibrium models, we show that diffusive sliding is sufficient to account for the strong bias in favour of convergent CTCF-mediated chromosome loops observed experimentally. Importantly, our model does not require any underlying, and energetically costly, motor activity of cohesin. We also find that the diffusive motion of multiple slip-links along chromatin may be rectified by an intriguing ratchet effect that arises if slip-links bind to the chromatin at a preferred "loading site". This emergent collective behaviour is driven by a 1D osmotic pressure which is set up near the loading point, and favours the extrusion of loops which are much larger than the ones formed by single slip-links.
△ Less
Submitted 21 December, 2016;
originally announced December 2016.
-
Modular Segregation of Structural Brain Networks Supports the Development of Executive Function in Youth
Authors:
Graham L. Baum,
Rastko Ciric,
David R. Roalf,
Richard F. Betzel,
Tyler M. Moore,
Russel T. Shinohara,
Ari E. Kahn,
Megan Quarmley,
Philip A. Cook,
Mark A. Elliot,
Kosha Ruparel,
Raquel E. Gur,
Ruben C. Gur,
Danielle S. Bassett,
Theodore D. Satterthwaite
Abstract:
The human brain is organized into large-scale functional modules that have been shown to evolve in childhood and adolescence. However, it remains unknown whether structural brain networks are similarly refined during development, potentially allowing for improvements in executive function. In a sample of 882 participants (ages 8-22) who underwent diffusion imaging as part of the Philadelphia Neuro…
▽ More
The human brain is organized into large-scale functional modules that have been shown to evolve in childhood and adolescence. However, it remains unknown whether structural brain networks are similarly refined during development, potentially allowing for improvements in executive function. In a sample of 882 participants (ages 8-22) who underwent diffusion imaging as part of the Philadelphia Neurodevelopmental Cohort, we demonstrate that structural network modules become more segregated with age, with weaker connections between modules and stronger connections within modules. Evolving modular topology facilitated network integration, driven by age-related strengthening of hub edges that were present both within and between modules. Critically, both modular segregation and network integration were associated with enhanced executive performance, and mediated the improvement of executive functioning with age. Together, results delineate a process of structural network maturation that supports executive function in youth.
△ Less
Submitted 11 August, 2016;
originally announced August 2016.
-
Ephemeral protein binding to DNA shapes stable nuclear bodies and chromatin domains
Authors:
C. A. Brackley,
B. Liebchen,
D. Michieletto,
F. Mouvet,
P. R. Cook,
D. Marenduzzo
Abstract:
Fluorescence microscopy reveals that the contents of many (membrane-free) nuclear "bodies" exchange rapidly with the soluble pool whilst the underlying structure persists; such observations await a satisfactory biophysical explanation. To shed light on this, we perform large-scale Brownian dynamics simulations of a chromatin fiber interacting with an ensemble of (multivalent) DNA-binding proteins;…
▽ More
Fluorescence microscopy reveals that the contents of many (membrane-free) nuclear "bodies" exchange rapidly with the soluble pool whilst the underlying structure persists; such observations await a satisfactory biophysical explanation. To shed light on this, we perform large-scale Brownian dynamics simulations of a chromatin fiber interacting with an ensemble of (multivalent) DNA-binding proteins; these proteins switch between two states -- active (binding) and inactive (non-binding). This system provides a model for any DNA-binding protein that can be modified post-translationally to change its affinity for DNA (e.g., like the phosphorylation of a transcription factor). Due to this out-of-equilibrium process, proteins spontaneously assemble into clusters of self-limiting size, as individual proteins in a cluster exchange with the soluble pool with kinetics like those seen in photo-bleaching experiments. This behavior contrasts sharply with that exhibited by "equilibrium", or non-switching, proteins that exist only in the binding state; when these bind to DNA non-specifically, they form clusters that grow indefinitely in size. Our results point to post-translational modification of chromatin-bridging proteins as a generic mechanism driving the self-assembly of highly dynamic, non-equilibrium, protein clusters with the properties of nuclear bodies. Such active modification also reshapes intra-chromatin contacts to give networks resembling those seen in topologically-associating domains, as switching markedly favors local (short-range) contacts over distant ones.
△ Less
Submitted 22 July, 2016;
originally announced July 2016.
-
Binding of bivalent transcription factors to active and inactive regions folds human chromosomes into loops, rosettes and domains
Authors:
C. A. Brackley,
J. Johnson,
S. Kelly,
P. R. Cook,
D. Marenduzzo
Abstract:
Biophysicists are modeling conformations of interphase chromosomes, often basing the strengths of interactions between segments distant on the genetic map on contact frequencies determined experimentally. Here, instead, we develop a fitting-free, minimal model: bivalent red and green "transcription factors" bind to cognate sites in runs of beads ("chromatin") to form molecular bridges stabilizing…
▽ More
Biophysicists are modeling conformations of interphase chromosomes, often basing the strengths of interactions between segments distant on the genetic map on contact frequencies determined experimentally. Here, instead, we develop a fitting-free, minimal model: bivalent red and green "transcription factors" bind to cognate sites in runs of beads ("chromatin") to form molecular bridges stabilizing loops. In the absence of additional explicit forces, molecular dynamic simulations reveal that bound "factors' spontaneously cluster -- red with red, green with green, but rarely red with green -- to give structures reminiscent of transcription factories. Binding of just two transcription factors (or proteins) to active and inactive regions of human chromosomes yields rosettes, topological domains, and contact maps much like those seen experimentally. This emergent "bridging-induced attraction" proves to be a robust, simple, and generic force able to organize interphase chromosomes at all scales.
△ Less
Submitted 5 November, 2015;
originally announced November 2015.