Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence
Authors:
Yingying Sun,
Jun A,
Zhiwei Liu,
Rui Sun,
Liujia Qian,
Samuel H. Payne,
Wout Bittremieux,
Markus Ralser,
Chen Li,
Yi Chen,
Zhen Dong,
Yasset Perez-Riverol,
Asif Khan,
Chris Sander,
Ruedi Aebersold,
Juan Antonio Vizcaíno,
Jonathan R Krieger,
Jianhua Yao,
Han Wen,
Linfeng Zhang,
Yunping Zhu,
Yue Xuan,
Benjamin Boyang Sun,
Liang Qiao,
Henning Hermjakob,
et al. (37 additional authors not shown)
Abstract:
Artificial intelligence (AI) is transforming scientific research, including proteomics. Advances in mass spectrometry (MS)-based proteomics data quality, diversity, and scale, combined with groundbreaking AI techniques, are unlocking new challenges and opportunities in biological discovery. Here, we highlight key areas where AI is driving innovation, from data analysis to new biological insights. These include developing an AI-friendly ecosystem for proteomics data generation, sharing, and analysis; improving peptide and protein identification and quantification; characterizing protein-protein interactions and protein complexes; advancing spatial and perturbation proteomics; integrating multi-omics data; and ultimately enabling AI-empowered virtual cells.
Submitted 21 February, 2025;
originally announced February 2025.
Extraction and integration of genetic networks from short-profile omic datasets
Authors:
Jacopo Iacovacci,
Alina Peluso,
Timothy Ebbels,
Markus Ralser,
Robert Charles Glen
Abstract:
Mass-spectrometry technologies are widely used in the fields of ionomics and metabolomics to simultaneously profile at the genome scale intracellular concentrations of e.g. amino acids or elements. Short profiles of molecular or sub-molecular features are intrinsically non-Gaussian and may reveal patterns of correlations that reflect the system nature of the cell biochemistry and biology. Here we introduce two profile similarity measures that enforce information from the empirical covariance matrix of the data, the Mahalanobis cosine and the hybrid-Mahalanobis cosine. We evaluate the performance of these similarity measures in the task of inferring and integrating genetic networks from omics data by analysing experimental datasets derived from the ionome and the metabolome of the model organism S. cerevisiae, and several large curated databases of genetic annotations. The proposed covariance-based similarity measures can in general recover known and predicted associations between genes better than the commonly used Pearson's correlation and the standard cosine similarity. The choice of which of the two measures to recommend depends upon whether the focus is on extracting genetic associations at a global or local genetic network scale.
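As a rough illustration of a covariance-aware similarity of the kind this abstract names, the sketch below computes a Mahalanobis cosine under one common definition: the cosine of the angle between two profiles after weighting by the inverse empirical covariance. This is an assumption, not the authors' exact formulation, and the hybrid-Mahalanobis cosine is not reproduced because the abstract does not define it. The function names and the `ridge` regularisation term are hypothetical choices made here for numerical stability.

```python
import numpy as np


def mahalanobis_cosine(x, y, cov_inv):
    """Cosine similarity under the inner product <x, y> = x^T cov_inv y.

    Assumed definition; the paper's own measure may differ in detail.
    """
    num = x @ cov_inv @ y
    den = np.sqrt(x @ cov_inv @ x) * np.sqrt(y @ cov_inv @ y)
    return float(num / den)


def pairwise_mahalanobis_cosine(profiles, ridge=1e-6):
    """All-against-all similarity for a (genes x features) matrix.

    `ridge` is a hypothetical regulariser added to the diagonal so that
    the empirical covariance is invertible for short profiles.
    """
    X = np.asarray(profiles, dtype=float)
    # Empirical covariance between features (columns).
    cov = np.atleast_2d(np.cov(X, rowvar=False))
    cov_inv = np.linalg.inv(cov + ridge * np.eye(X.shape[1]))
    # Gram matrix under the covariance-weighted inner product.
    gram = X @ cov_inv @ X.T
    norms = np.sqrt(np.diag(gram))
    return gram / np.outer(norms, norms)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    toy_profiles = rng.normal(size=(5, 3))  # 5 genes, 3 features (toy data)
    print(np.round(pairwise_mahalanobis_cosine(toy_profiles), 3))
```

With the identity matrix as `cov_inv`, `mahalanobis_cosine` reduces to the standard cosine similarity, which is one of the baselines the abstract compares against.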
Submitted 10 November, 2020; v1 submitted 30 October, 2019;
originally announced October 2019.