-
DeepProphet2 -- A Deep Learning Gene Recommendation Engine
Authors:
Daniele Brambilla,
Davide Maria Giacomini,
Luca Muscarnera,
Andrea Mazzoleni
Abstract:
New powerful tools for tackling life science problems have been created by recent advances in machine learning. The purpose of the paper is to discuss the potential advantages of gene recommendation performed by artificial intelligence (AI). Indeed, gene recommendation engines try to solve this problem: if the user is interested in a set of genes, which other genes are likely to be related to the starting set and should be investigated? This task was solved with a custom deep learning recommendation engine, DeepProphet2 (DP2), which is freely available to researchers worldwide via https://www.generecommender.com?utm_source=DeepProphet2_paper&utm_medium=pdf. Hereafter, insights behind the algorithm and its practical applications are illustrated.
The gene recommendation problem can be addressed by mapping the genes to a metric space where a distance can be defined to represent the real semantic distance between them. To achieve this objective, a transformer-based model has been trained on a well-curated, freely available paper corpus, PubMed. The paper describes multiple optimization procedures that were employed to obtain the best bias-variance trade-off, focusing on embedding size and network depth. In this context, the model's ability to discover sets of genes implicated in diseases and pathways was assessed through cross-validation. A simple assumption guided the procedure: the network had no direct knowledge of pathways and diseases but learned genes' similarities and the interactions among them. Moreover, to further investigate the space where the neural network represents genes, the dimensionality of the embedding was reduced, and the results were projected onto a human-comprehensible space. In conclusion, a set of use cases illustrates the algorithm's potential applications in a real-world setting.
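The idea of recommending genes by their distance in an embedding space can be illustrated with a minimal sketch. It is not the DP2 implementation: the gene names, the three-dimensional vectors, the `recommend` helper, and the choice of cosine similarity to the centroid of the query set are all assumptions made for illustration; the abstract does not specify how DP2 scores candidates.

```python
import numpy as np

def recommend(embeddings, query, k=2):
    """Rank genes outside `query` by cosine similarity to the query centroid.

    `embeddings` maps a gene name to its vector. This scoring rule is an
    illustrative assumption, not the method used by DeepProphet2.
    """
    names = list(embeddings)
    mat = np.array([embeddings[n] for n in names], dtype=float)
    # Normalise rows so that a dot product equals cosine similarity.
    mat /= np.linalg.norm(mat, axis=1, keepdims=True)
    centroid = mat[[names.index(g) for g in query]].mean(axis=0)
    centroid /= np.linalg.norm(centroid)
    scores = mat @ centroid
    candidates = [n for n in names if n not in query]
    ranked = sorted(candidates, key=lambda n: -scores[names.index(n)])
    return ranked[:k]

# Hypothetical toy embeddings (real embeddings would come from a trained model).
toy = {
    "GENE_A": [1.0, 0.1, 0.0],
    "GENE_B": [0.9, 0.2, 0.0],
    "GENE_C": [0.8, 0.3, 0.1],
    "GENE_D": [0.0, 1.0, 0.2],
    "GENE_E": [0.0, 0.1, 1.0],
}

print(recommend(toy, ["GENE_A", "GENE_B"], k=2))
```

In this toy space, the gene whose vector points in nearly the same direction as the query genes is ranked first, which is the behaviour a learned metric space is meant to provide at scale.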
Submitted 22 March, 2023; v1 submitted 3 August, 2022;
originally announced August 2022.
-
A sensitivity study of VBS and diboson WW to dimension-6 EFT operators at the LHC
Authors:
Riccardo Bellan,
Giacomo Boldrini,
Daniele Brambilla,
Ilaria Brivio,
Riccardo Brusa,
Flavia Cetorelli,
Marco Chiusi,
Roberto Covarelli,
Vittorio Del Tatto,
Pietro Govoni,
Andrea Massironi,
Leonardo Olivi,
Giacomo Ortona,
Giorgio Pizzati,
Alessandro Tarabini,
Antonio Vagnerini,
Elena Vernazza,
Jie Xiao
Abstract:
We present a parton-level study of electro-weak production of vector-boson pairs at the Large Hadron Collider, establishing the sensitivity to a set of dimension-six operators in the Standard Model Effective Field Theory (SMEFT). Different final states are statistically combined, and we discuss how the orthogonality and interdependence of different analyses must be considered to obtain the most stringent constraints. The main novelties of our study are the inclusion of SMEFT effects in non-resonant diagrams and in irreducible QCD backgrounds, and an exhaustive template analysis of optimal observables for each operator and process considered. We also assess for the first time the sensitivity of vector-boson-scattering searches in semileptonic final states.
Submitted 11 April, 2022; v1 submitted 6 August, 2021;
originally announced August 2021.
-
A novel downscaling procedure for compositional data in the Aitchison geometry with application to soil texture data
Authors:
Federico Gatti,
Alessandra Menafoglio,
Niccolò Togni,
Luca Bonaventura,
Davide Brambilla,
Monica Papini,
Laura Longoni
Abstract:
In this work, we present a novel downscaling procedure for compositional quantities based on the Aitchison geometry. The method naturally respects compositional constraints, i.e. unit-sum and positivity. We show that the method can be used in a block sequential Gaussian simulation framework in order to assess the variability of downscaled quantities. Finally, to validate the method, we test it first in an idealized scenario and then apply it to the downscaling of digital soil maps in a more realistic case study. The digital soil maps for the realistic case study are obtained from SoilGrids, a system for automated soil mapping based on state-of-the-art spatial prediction methods.
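Why working in the Aitchison geometry respects unit-sum and positivity can be seen from a small sketch using the centred log-ratio (clr) transform, a standard tool of that geometry. This is not the authors' downscaling procedure: the soil fractions, the adjustment vector, and the helper names `clr` and `clr_inv` are hypothetical and chosen only for illustration.

```python
import numpy as np

def clr(x):
    """Centred log-ratio transform of a strictly positive composition."""
    logx = np.log(x)
    return logx - logx.mean(axis=-1, keepdims=True)

def clr_inv(z):
    """Map log-ratio coordinates back to a composition summing to one."""
    e = np.exp(z - z.max(axis=-1, keepdims=True))  # shift for numerical stability
    return e / e.sum(axis=-1, keepdims=True)

# Hypothetical coarse-scale soil texture: sand, silt, clay fractions.
coarse = np.array([0.2, 0.3, 0.5])

# Any unconstrained adjustment can be applied in log-ratio coordinates.
# The numbers below are an arbitrary illustrative perturbation.
adjusted = clr(coarse) + np.array([0.4, -0.1, -0.3])

# Back-transforming always yields positive parts that sum to one.
fine = clr_inv(adjusted)
print(fine, fine.sum())
```

Because the back-transform exponentiates and renormalises, any real-valued operation carried out in the transformed coordinates returns a valid composition, which is the property the abstract refers to.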
Submitted 14 July, 2020;
originally announced July 2020.