-
Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence
Authors:
Yingying Sun,
Jun A,
Zhiwei Liu,
Rui Sun,
Liujia Qian,
Samuel H. Payne,
Wout Bittremieux,
Markus Ralser,
Chen Li,
Yi Chen,
Zhen Dong,
Yasset Perez-Riverol,
Asif Khan,
Chris Sander,
Ruedi Aebersold,
Juan Antonio VizcaĆno,
Jonathan R Krieger,
Jianhua Yao,
Han Wen,
Linfeng Zhang,
Yunping Zhu,
Yue Xuan,
Benjamin Boyang Sun,
Liang Qiao,
Henning Hermjakob
, et al. (37 additional authors not shown)
Abstract:
Artificial intelligence (AI) is transforming scientific research, including proteomics. Advances in mass spectrometry (MS)-based proteomics data quality, diversity, and scale, combined with groundbreaking AI techniques, are unlocking new challenges and opportunities in biological discovery. Here, we highlight key areas where AI is driving innovation, from data analysis to new biological insights.…
▽ More
Artificial intelligence (AI) is transforming scientific research, including proteomics. Advances in mass spectrometry (MS)-based proteomics data quality, diversity, and scale, combined with groundbreaking AI techniques, are unlocking new challenges and opportunities in biological discovery. Here, we highlight key areas where AI is driving innovation, from data analysis to new biological insights. These include developing an AI-friendly ecosystem for proteomics data generation, sharing, and analysis; improving peptide and protein identification and quantification; characterizing protein-protein interactions and protein complexes; advancing spatial and perturbation proteomics; integrating multi-omics data; and ultimately enabling AI-empowered virtual cells.
△ Less
Submitted 21 February, 2025;
originally announced February 2025.
-
Proteomics Standards Initiatives ProForma 2.0 Unifying the encoding of Proteoforms and Peptidoforms
Authors:
Richard D. LeDuc,
Eric W. Deutsch,
Pierre-Alain Binz,
Ryan T. Fellers,
Anthony J. Cesnik,
Joshua A. Klein,
Tim Van Den Bossche,
Ralf Gabriels,
Arshika Yalavarthi,
Yasset Perez-Riverol,
Jeremy Carver,
Wout Bittremieux,
Shin Kawano,
Benjamin Pullman,
Nuno Bandeira,
Neil L. Kelleher,
Paul M. Thomas,
Juan Antonio VizcaĆno
Abstract:
There is the need to represent in a standard manner all the possible variations of a protein or peptide primary sequence, including both artefactual and post-translational modifications of peptides and proteins. With that overall aim, here, the Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has developed a notation, called ProForma 2.0, which is a substantial extension of…
▽ More
There is the need to represent in a standard manner all the possible variations of a protein or peptide primary sequence, including both artefactual and post-translational modifications of peptides and proteins. With that overall aim, here, the Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has developed a notation, called ProForma 2.0, which is a substantial extension of the original ProForma notation, developed by the Consortium for Top-Down Proteomics (CTDP). ProForma 2.0 aims to unify the representation of proteoforms and peptidoforms. Therefore, this notation supports use cases needed for bottom-up and middle/topdown proteomics approaches and allows the encoding of highly modified proteins and peptides using a human and machine-readable string. ProForma 2.0 covers encoding protein modification names and accessions, cross-linking reagents including disulfides, glycans, modifications encoded using mass shifts and/or via chemical formulas, labile and C or N-terminal modifications, ambiguity in the modification position and representation of atomic isotopes, among other use cases. Notational conventions are based on public controlled vocabularies and ontologies. Detailed information about the notation and existing implementations are available at http://www.psidev.info/proforma and at the corresponding GitHub repository (https://github.com/HUPO-PSI/proforma).
△ Less
Submitted 21 March, 2022; v1 submitted 23 September, 2021;
originally announced September 2021.
-
Trends in Cuban research output: publications and patents
Authors:
Yasset Perez-Riverol
Abstract:
Cuban science and technology are known for important achievements, particularly in human healthcare and biotechnology. During the second half of XX century, the country developed a system of scientific institutions to address and solve major economical, cultural, social and health problems. However, the economic crisis faced by the island during the last three decades has had a major impact in Cub…
▽ More
Cuban science and technology are known for important achievements, particularly in human healthcare and biotechnology. During the second half of XX century, the country developed a system of scientific institutions to address and solve major economical, cultural, social and health problems. However, the economic crisis faced by the island during the last three decades has had a major impact in Cuban scientific research. In addition to decreased investment, the emigration of thousands of young as well as senior scientists to other countries have had a major impact in Cuban research output. To date, no systematic analysis regarding scientific publications, citations, or patents granted to Cuban authors during this period, are available. Here, an analysis of Cuban scientific production since 1970, with an especial focus on the last three decades (1990 - 2019), is provided. All national metrics are compared with other countries, emphasizing those from Latin America. Preliminary results show that Cuban scientific publications are increased at a lower rate (two-fold) compare with several Latin American countries (five-fold average). In addition, since 2014 the annual number of Cuban scientific publications is decreasing. Finally, the analysis shows that most young Cuban authors with the higher index of citations (1990-2019) are working abroad. All the data and the code related to this study are open and can be found in GitHub (https://github.com/ypriverol/cubascience).
△ Less
Submitted 19 July, 2020;
originally announced July 2020.
-
A UML-based Approach to Design Parallel and Distributed Applications
Authors:
Yasset Perez-Riverol,
Roberto Vera Alvarez
Abstract:
Parallel and distributed application design is a major area of interest in the domain of high performance scientific and industrial computing. Over the years, various approaches have been proposed to aid parallel program developers to modeling their applications. In this paper it will be used some concepts from agile development methodologies and Unified Modeling Language (UML) to modeling paralle…
▽ More
Parallel and distributed application design is a major area of interest in the domain of high performance scientific and industrial computing. Over the years, various approaches have been proposed to aid parallel program developers to modeling their applications. In this paper it will be used some concepts from agile development methodologies and Unified Modeling Language (UML) to modeling parallel and distributed applications. The UML-based approach of this paper describes through different artifacts and graphs the main flows of events in the development of parallel and high performance applications. Here, we presented three work flows to describe and to model our parallel program, Domain Model, Design and Modeling and Test. All these phases of the development software allow to programmers convert the requirements of the problem in a good and efficient solution.
△ Less
Submitted 27 November, 2013;
originally announced November 2013.