-
A graph-based approach for modification site assignment in proteomics
Authors:
Dafni Skiadopoulou,
Lukas Käll,
Harald Barsnes,
Veit Schwämmle,
Marc Vaudel
Abstract:
Background: In proteomics, the most probable localizations of post-translational modifications are assessed by localization scores evaluating the likelihood of a given modification to occupy a site on a peptide sequence. When identifying highly modified peptides, localization scores for different modifications can return conflicting results, stacking modifications on the same amino acid. Here, we propose a graph-based approach that assigns modifications to sites in a way that maximizes localization scores while avoiding conflicting assignments. Results: The algorithm is implemented as both a standalone Python program and in the compomics-utilities Java library. Our graph-based approach showed the ability to match complex combinations of modifications and acceptor sites, allowing the processing of thousands of peptides in a few seconds. Conclusions: Our graph-based approach to modification site assignment allows distributing multiple modifications in a way that maximizes individual localization scores. Having an optimal modification site assignment is important for spectrum annotation and biological interpretation.
Submitted 23 May, 2025;
originally announced May 2025.
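The objective described in the abstract above (place each modification on its own site so that the summed localization scores are as high as possible) can be illustrated with a small sketch. This is not the authors' graph-based algorithm and not the compomics-utilities API: it is a brute-force search that only scales to a handful of modifications, and the function name `assign_sites`, the modification names, and the integer scores are all hypothetical.

```python
from itertools import permutations


def assign_sites(scores):
    """Pick one distinct site per modification, maximizing the summed score.

    scores: dict mapping a modification name to a dict {site index: score}.
    Only sites listed for a modification are allowed for it.
    Returns (assignment, total); assignment is None if no conflict-free
    assignment exists (for example, more modifications than sites).
    Hypothetical illustration: exhaustive search, not the published method.
    """
    mods = sorted(scores)
    sites = sorted({site for per_mod in scores.values() for site in per_mod})
    best, best_total = None, float("-inf")
    # Each permutation gives every modification a different site,
    # so two modifications can never be stacked on the same residue.
    for chosen in permutations(sites, len(mods)):
        if any(site not in scores[mod] for mod, site in zip(mods, chosen)):
            continue  # at least one modification cannot occupy its site
        total = sum(scores[mod][site] for mod, site in zip(mods, chosen))
        if total > best_total:
            best, best_total = dict(zip(mods, chosen)), total
    return best, best_total


# Both modifications score highest on site 3, so picking the best site
# for each one independently would stack them on the same residue.
example_scores = {
    "mod_a": {3: 90, 5: 60},
    "mod_b": {3: 80, 7: 10},
}
best_assignment, best_score = assign_sites(example_scores)
print(best_assignment, best_score)
```

In this toy case the conflict is resolved by moving mod_a to site 5 and keeping mod_b on site 3, since that pairing has the highest combined score among the conflict-free options.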
-
ProHap Explorer: Visualizing Haplotypes in Proteogenomic Datasets
Authors:
Jakub Vašíček,
Dafni Skiadopoulou,
Ksenia G. Kuznetsova,
Lukas Käll,
Marc Vaudel,
Stefan Bruckner
Abstract:
In mass spectrometry-based proteomics, experts usually project data onto a single set of reference sequences, overlooking the influence of common haplotypes (combinations of genetic variants inherited together from a parent). We recently introduced ProHap, a tool for generating customized protein haplotype databases. Here, we present ProHap Explorer, a visualization interface designed to investigate the influence of common haplotypes on the human proteome. It enables users to explore haplotypes, their effects on protein sequences, and the identification of non-canonical peptides in public mass spectrometry datasets. The design builds on well-established representations in biological sequence analysis, ensuring familiarity for domain experts while integrating novel interactive elements tailored to proteogenomic data exploration. User interviews with proteomics experts confirmed the tool's utility, highlighting its ability to reveal whether haplotypes affect proteins of interest. By facilitating the intuitive exploration of proteogenomic variation, ProHap Explorer supports research in personalized medicine and the development of targeted therapies.
Submitted 25 March, 2025;
originally announced April 2025.
-
On the importance of block randomisation when designing proteomics experiments
Authors:
Bram Burger,
Marc Vaudel,
Harald Barsnes
Abstract:
Randomisation is used in experimental design to reduce the prevalence of unanticipated confounders. Complete randomisation can, however, create unbalanced designs, for example, grouping all samples of the same condition in the same batch. Block randomisation is an approach that can prevent severe imbalances in sample allocation with respect to both known and unknown confounders. This feature provides the reader with an introduction to blocking and randomisation and with insights into how to effectively organise samples during experimental design, with special consideration given to proteomics.
Submitted 13 July, 2020;
originally announced July 2020.
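The contrast drawn in the abstract above, between complete randomisation and block randomisation, can be sketched in a few lines. This is one simple way to do it under stated assumptions, not the procedure from the article: batches are treated as blocks, samples are shuffled within each condition and dealt round-robin across batches, and the run order inside each batch is then shuffled. The function name `block_randomise` and the sample labels are hypothetical.

```python
import random
from collections import defaultdict


def block_randomise(samples, n_batches, seed=None):
    """Spread every condition evenly over batches, in random order.

    samples: list of (sample_id, condition) pairs.
    Returns a list of n_batches lists of sample ids.
    Hypothetical sketch: batches act as blocks; per-condition counts
    differ by at most one between any two batches.
    """
    rng = random.Random(seed)
    by_condition = defaultdict(list)
    for sample_id, condition in samples:
        by_condition[condition].append(sample_id)

    batches = [[] for _ in range(n_batches)]
    offset = 0  # carried over so that batch sizes also stay balanced
    for condition in sorted(by_condition):
        ids = by_condition[condition]
        rng.shuffle(ids)  # random choice of which sample goes where
        for i, sample_id in enumerate(ids):
            batches[(offset + i) % n_batches].append(sample_id)
        offset += len(ids)

    for batch in batches:
        rng.shuffle(batch)  # randomise the run order within each batch
    return batches


# Four control and four treated samples split over two batches.
example_samples = [(f"c{i}", "control") for i in range(4)]
example_samples += [(f"t{i}", "treated") for i in range(4)]
for batch in block_randomise(example_samples, n_batches=2, seed=1):
    print(batch)
```

Whatever the seed, each batch here receives two control and two treated samples, whereas a plain shuffle of all eight samples could place all four controls in the same batch.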