Search | arXiv e-print repository

A graph-based approach for modification site assignment in proteomics

Authors: Dafni Skiadopoulou, Lukas Käll, Harald Barsnes, Veit Schwämmle, Marc Vaudel

Abstract: Background In proteomics, the most probable localizations of post-translational modifications are assessed by localization scores evaluating the likelihood of a given modification to occupy a site on a peptide sequence. When identifying highly modified peptides, localization scores for different modifications can return conflicting results, stacking modifications on the same amino acid. Here, we p… ▽ More Background In proteomics, the most probable localizations of post-translational modifications are assessed by localization scores evaluating the likelihood of a given modification to occupy a site on a peptide sequence. When identifying highly modified peptides, localization scores for different modifications can return conflicting results, stacking modifications on the same amino acid. Here, we propose a graph-based approach that assigns modifications to sites in a way that maximizes localization scores while avoiding conflicting assignments. Results The algorithm is implemented as both a standalone Python program and in the compomics-utilities Java library. Our graph-based approach showed the ability to match complex combinations of modifications and acceptor sites, allowing the processing of thousands of peptides in a few seconds. Conclusions Our graph-based approach to modification site assignment allows distributing multiple modifications in a way that maximizes individual localization scores. Having an optimal modification site assignment is important for spectrum annotation and biological interpretation. △ Less

Submitted 23 May, 2025; originally announced May 2025.

arXiv:2504.06282 [pdf]

ProHap Explorer: Visualizing Haplotypes in Proteogenomic Datasets

Authors: Jakub Vašíček, Dafni Skiadopoulou, Ksenia G. Kuznetsova, Lukas Käll, Marc Vaudel, Stefan Bruckner

Abstract: In mass spectrometry-based proteomics, experts usually project data onto a single set of reference sequences, overlooking the influence of common haplotypes (combinations of genetic variants inherited together from a parent). We recently introduced ProHap, a tool for generating customized protein haplotype databases. Here, we present ProHap Explorer, a visualization interface designed to investiga… ▽ More In mass spectrometry-based proteomics, experts usually project data onto a single set of reference sequences, overlooking the influence of common haplotypes (combinations of genetic variants inherited together from a parent). We recently introduced ProHap, a tool for generating customized protein haplotype databases. Here, we present ProHap Explorer, a visualization interface designed to investigate the influence of common haplotypes on the human proteome. It enables users to explore haplotypes, their effects on protein sequences, and the identification of non-canonical peptides in public mass spectrometry datasets. The design builds on well-established representations in biological sequence analysis, ensuring familiarity for domain experts while integrating novel interactive elements tailored to proteogenomic data exploration. User interviews with proteomics experts confirmed the tool's utility, highlighting its ability to reveal whether haplotypes affect proteins of interest. By facilitating the intuitive exploration of proteogenomic variation, ProHap Explorer supports research in personalized medicine and the development of targeted therapies. △ Less

Submitted 25 March, 2025; originally announced April 2025.

arXiv:2007.06336 [pdf, other]

On the importance of block randomisation when designing proteomics experiments

Authors: Bram Burger, Marc Vaudel, Harald Barsnes

Abstract: Randomisation is used in experimental design to reduce the prevalence of unanticipated confounders. Complete randomisation can however create unbalanced designs, for example, grouping all samples of the same condition in the same batch. Block randomisation is an approach that can prevent severe imbalances in sample allocation with respect to both known and unknown confounders. This feature provide… ▽ More Randomisation is used in experimental design to reduce the prevalence of unanticipated confounders. Complete randomisation can however create unbalanced designs, for example, grouping all samples of the same condition in the same batch. Block randomisation is an approach that can prevent severe imbalances in sample allocation with respect to both known and unknown confounders. This feature provides the reader with an introduction to blocking and randomisation, insights into how to effectively organise samples during experimental design, with special considerations with respect to proteomics. △ Less

Submitted 13 July, 2020; originally announced July 2020.

Comments: 9 pages, 4 figures

Showing 1–3 of 3 results for author: Vaudel, M