-
Open and Sustainable AI: challenges, opportunities and the road ahead in the life sciences
Authors:
Gavin Farrell,
Eleni Adamidi,
Rafael Andrade Buono,
Mihail Anton,
Omar Abdelghani Attafi,
Salvador Capella Gutierrez,
Emidio Capriotti,
Leyla Jael Castro,
Davide Cirillo,
Lisa Crossman,
Christophe Dessimoz,
Alexandros Dimopoulos,
Raul Fernandez-Diaz,
Styliani-Christina Fragkouli,
Carole Goble,
Wei Gu,
John M. Hancock,
Alireza Khanteymoori,
Tom Lenaerts,
Fabio G. Liberante,
Peter Maccallum,
Alexander Miguel Monzon,
Magnus Palmblad,
Lucy Poveda,
Ovidiu Radulescu, et al. (5 additional authors not shown)
Abstract:
Artificial intelligence (AI) has recently seen transformative breakthroughs in the life sciences, expanding possibilities for researchers to interpret biological information at an unprecedented capacity, with novel applications and advances being made almost daily. In order to maximise return on the growing investments in AI-based life science research and accelerate this progress, it has become urgent to address the exacerbation of long-standing research challenges arising from the rapid adoption of AI methods. We review the increased erosion of trust in AI research outputs, driven by the issues of poor reusability and reproducibility, and highlight their consequent impact on environmental sustainability. Furthermore, we discuss the fragmented components of the AI ecosystem and lack of guiding pathways to best support Open and Sustainable AI (OSAI) model development. In response, this perspective introduces a practical set of OSAI recommendations directly mapped to over 300 components of the AI ecosystem. Our work connects researchers with relevant AI resources, facilitating the implementation of sustainable, reusable and transparent AI. Built upon life science community consensus and aligned to existing efforts, the outputs of this perspective are designed to aid the future development of policy and structured pathways for guiding AI implementation.
Submitted 22 May, 2025;
originally announced May 2025.
-
DOME Registry: Implementing community-wide recommendations for reporting supervised machine learning in biology
Authors:
Omar Abdelghani Attafi,
Damiano Clementel,
Konstantinos Kyritsis,
Emidio Capriotti,
Gavin Farrell,
Styliani-Christina Fragkouli,
Leyla Jael Castro,
András Hatos,
Tom Lenaerts,
Stanislav Mazurenko,
Soroush Mozaffari,
Franco Pradelli,
Patrick Ruch,
Castrense Savojardo,
Paola Turina,
Federico Zambelli,
Damiano Piovesan,
Alexander Miguel Monzon,
Fotis Psomopoulos,
Silvio C. E. Tosatto
Abstract:
Supervised machine learning (ML) is used extensively in biology and deserves closer scrutiny. The DOME recommendations aim to enhance the validation and reproducibility of ML research by establishing standards for key aspects such as data handling and processing, optimization, evaluation, and model interpretability. The recommendations help to ensure that key details are reported transparently by providing a structured set of questions. Here, we introduce the DOME Registry (URL: registry.dome-ml.org), a database that allows scientists to manage and access comprehensive DOME-related information on published ML studies. The registry uses external resources like ORCID, APICURON and the Data Stewardship Wizard to streamline the annotation process and ensure comprehensive documentation. By assigning unique identifiers and DOME scores to publications, the registry fosters a standardized evaluation of ML methods. Future plans include continuing to grow the registry through community curation, improving the DOME score definition and encouraging publishers to adopt DOME standards, promoting transparency and reproducibility of ML in the life sciences.
Submitted 16 August, 2024; v1 submitted 14 August, 2024;
originally announced August 2024.
-
Mediating Artificial Intelligence Developments through Negative and Positive Incentives
Authors:
The Anh Han,
Luis Moniz Pereira,
Tom Lenaerts,
Francisco C. Santos
Abstract:
The field of Artificial Intelligence (AI) is going through a period of great expectations, introducing a certain level of anxiety in research, business and also policy. This anxiety is further energised by an AI race narrative that makes people believe they might be missing out. Whether real or not, a belief in this narrative may be detrimental as some stake-holders will feel obliged to cut corners on safety precautions, or ignore societal consequences just to "win". Starting from a baseline model that describes a broad class of technology races where winners draw a significant benefit compared to others (such as AI advances, patent race, pharmaceutical technologies), we investigate here how positive (rewards) and negative (punishments) incentives may beneficially influence the outcomes. We uncover conditions in which punishment is either capable of reducing the development speed of unsafe participants or has the capacity to reduce innovation through over-regulation. Alternatively, we show that, in several scenarios, rewarding those that follow safety measures may increase the development speed while ensuring safe choices. Moreover, in the latter regimes, rewards do not suffer from the issue of over-regulation as is the case for punishment. Overall, our findings provide valuable insights into the nature and kinds of regulatory actions most suitable to improve safety compliance in the contexts of both smooth and sudden technological shifts.
Submitted 1 October, 2020;
originally announced October 2020.
-
Towards a phylogenetic measure to quantify HIV incidence
Authors:
Pieter Libin,
Nassim Versbraegen,
Ana B. Abecasis,
Perpetua Gomes,
Tom Lenaerts,
Ann Nowé
Abstract:
One of the cornerstones in combating the HIV pandemic is being able to assess the current state and evolution of local HIV epidemics. This remains a complex problem, as many HIV infected individuals remain unaware of their infection status, leading to parts of HIV epidemics being undiagnosed and under-reported. To that end, we firstly present a method to learn epidemiological parameters from phylogenetic trees, using approximate Bayesian computation (ABC). The epidemiological parameters learned as a result of applying ABC are subsequently used in epidemiological models that aim to simulate a specific epidemic. Secondly, we continue by describing the development of a tree statistic, rooted in coalescent theory, which we use to relate epidemiological parameters to a phylogenetic tree, by using the simulated epidemics. We show that the presented tree statistic enables differentiation of epidemiological parameters, while only relying on phylogenetic trees, thus enabling the construction of new methods to ascertain the epidemiological state of an HIV epidemic. By using genetic data to infer epidemic sizes, we expect to enhance understanding of the portions of the infected population in which diagnosis rates are low.
Submitted 23 October, 2019; v1 submitted 10 October, 2019;
originally announced October 2019.
-
Modeling contact networks of patients and MRSA spread in Swedish hospitals
Authors:
Luis E C Rocha,
Vikramjit Singh,
Markus Esch,
Tom Lenaerts,
Mikael Stenhem,
Fredrik Liljeros,
Anna Thorson
Abstract:
Methicillin-resistant Staphylococcus aureus (MRSA) is a difficult-to-treat infection that, in the European Union alone, affects about 150,000 patients and causes extra costs of 380 million Euros annually to the health-care systems. Increasing efforts have been made to mitigate the epidemics and to avoid potential outbreaks in low endemic settings. Understanding the population dynamics of MRSA through modeling is essential to identify the causal mechanisms driving the epidemics and to generalize conclusions to different contexts. We develop an innovative high-resolution spatiotemporal contact network model of interactions between patients to reproduce the hospital population in the context of the Stockholm County in Sweden and simulate the spread of MRSA within this population. Our model captures the spatial and temporal heterogeneity caused by human behavior and by the dynamics of mobility within wards and hospitals. We estimate that in this population the epidemic threshold is at about 0.008. We also find that these heterogeneous contact patterns cause the emergence of super-spreader patients and a polynomial growth of the epidemic curve. We finally study the effect of standard intervention control strategies and find that screening is more effective than improved hygiene at producing smaller or null outbreaks.
Submitted 21 November, 2016;
originally announced November 2016.
-
Evolution of Complexity
Authors:
Carlos Gershenson,
Tom Lenaerts
Abstract:
The evolution of complexity has been a central theme for Biology [2] and Artificial Life research [1]. It is generally agreed that complexity has increased in our universe, giving way to life, multi-cellularity, societies, and systems of higher complexities. However, the mechanisms behind the complexification and its relation to evolution are not well understood. Moreover complexification can be used to mean different things in different contexts. For example, complexification has been interpreted as a process of diversification between evolving units [2] or as a scaling process related to the idea of transitions between different levels of complexity [7]. Understanding the difference or overlap between the mechanisms involved in both situations is mandatory to create acceptable synthetic models of the process, as is required in Artificial Life research. (...)
Submitted 19 October, 2007;
originally announced October 2007.
-
Evolution of Complexity: Introduction to the Workshop
Authors:
Carlos Gershenson,
Tom Lenaerts
Abstract:
The evolution of complexity has been a central theme for Biology and Artificial Life (Bonner, 1988; Bedau et al., 2000). Complexification has been interpreted in different ways: as a process of diversification between evolving units (Bonner, 1988) or as a scaling process that is related to the idea of transitions between different levels of complexity (Smith and Szathmary, 1995). There have been previous workshops on this topic, e.g. (Heylighen et al., 1999; Lenaerts et al., 2002), but many open questions still remain. Trying to answer these questions from a general perspective might not immediately produce concrete answers for the different fields that are interested in this topic. Consequently, this workshop is organised with a particular focus in mind: The emergence of complexity through evolutionary mechanisms. Primarily we want to have a discussion on evolutionary and related dynamics as mechanisms for producing complexity. Furthermore, we want to bring together historical and novel research in this context.
Submitted 26 April, 2006;
originally announced April 2006.