Showing 1–2 of 2 results for author: Schwarz, R F

Search v0.5.6 released 2020-02-24

arXiv:1306.1685 [pdf, other]

q-bio.QM q-bio.PE

doi 10.1371/journal.pcbi.1003535

Phylogenetic quantification of intra-tumour heterogeneity

Authors: Roland F Schwarz, Anne Trinh, Botond Sipos, James D Brenton, Nick Goldman, Florian Markowetz

Abstract: Background: Intra-tumour heterogeneity (ITH) is the result of ongoing evolutionary change within each cancer. The expansion of genetically distinct sub-clonal populations may explain the emergence of drug resistance and if so would have prognostic and predictive utility. However, methods for objectively quantifying ITH have been missing and are particularly difficult to establish in cancers where predominant copy number variation prevents accurate phylogenetic reconstruction owing to horizontal dependencies caused by long and cascading genomic rearrangements. Results: To address these challenges we present MEDICC, a method for phylogenetic reconstruction and ITH quantification based on a Minimum Event Distance for Intra-tumour Copynumber Comparisons. Using a transducer-based pairwise comparison function we determine optimal phasing of major and minor alleles, as well as evolutionary distances between samples, and are able to reconstruct ancestral genomes. Rigorous simulations and an extensive clinical study show the power of our method, which outperforms state-of-the-art competitors in reconstruction accuracy and additionally allows unbiased numerical quantification of ITH. Conclusions: Accurate quantification and evolutionary inference are essential to understand the functional consequences of ITH. The MEDICC algorithms are independent of the experimental techniques used and are applicable to both next-generation sequencing and array CGH data.

Submitted 7 June, 2013; originally announced June 2013.

arXiv:1011.5096 [pdf, other]

q-bio.PE stat.ML

doi 10.1371/journal.pone.0015788

Evolutionary distances in the twilight zone -- a rational kernel approach

Authors: Roland F. Schwarz, William Fletcher, Frank Förster, Benjamin Merget, Matthias Wolf, Jörg Schultz, Florian Markowetz

Abstract: Phylogenetic tree reconstruction is traditionally based on multiple sequence alignments (MSAs) and heavily depends on the validity of this information bottleneck. With increasing sequence divergence, the quality of MSAs decays quickly. Alignment-free methods, on the other hand, are based on abstract string comparisons and avoid potential alignment problems. However, in general they are not biologically motivated and ignore our knowledge about the evolution of sequences. Thus, it is still a major open question how to define an evolutionary distance metric between divergent sequences that makes use of indel information and known substitution models without the need for a multiple alignment. Here we propose a new evolutionary distance metric to close this gap. It uses finite-state transducers to create a biologically motivated similarity score which models substitutions and indels, and does not depend on a multiple sequence alignment. The sequence similarity score is defined in analogy to pairwise alignments and additionally has the positive semi-definite property. We describe its derivation and show in simulation studies and real-world examples that it is more accurate in reconstructing phylogenies than competing methods. The result is a new and accurate way of determining evolutionary distances in and beyond the twilight zone of sequence alignments that is suitable for large datasets.

Submitted 23 November, 2010; originally announced November 2010.

Comments: to appear in PLoS ONE

Journal ref: PLoS One. 2010 Dec 31;5(12):e15788
