Skip to main content

Showing 1–14 of 14 results for author: Visani, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.00648  [pdf, other

    q-bio.QM cs.LG

    T-cell receptor specificity landscape revealed through de novo peptide design

    Authors: Gian Marco Visani, Michael N. Pun, Anastasia A. Minervina, Philip Bradley, Paul Thomas, Armita Nourmohammad

    Abstract: T-cells play a key role in adaptive immunity by mounting specific responses against diverse pathogens. An effective binding between T-cell receptors (TCRs) and pathogen-derived peptides presented on Major Histocompatibility Complexes (MHCs) mediate an immune response. However, predicting these interactions remains challenging due to limited functional data on T-cell reactivities. Here, we introduc… ▽ More

    Submitted 1 March, 2025; originally announced March 2025.

  2. arXiv:2409.18201  [pdf, other

    physics.bio-ph cs.LG q-bio.QM

    Loop-Diffusion: an equivariant diffusion model for designing and scoring protein loops

    Authors: Kevin Borisiak, Gian Marco Visani, Armita Nourmohammad

    Abstract: Predicting protein functional characteristics from structure remains a central problem in protein science, with broad implications from understanding the mechanisms of disease to designing novel therapeutics. Unfortunately, current machine learning methods are limited by scarce and biased experimental data, and physics-based methods are either too slow to be useful, or too simplified to be accurat… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  3. arXiv:2408.05060  [pdf, other

    cs.LG cs.AI

    GLEAMS: Bridging the Gap Between Local and Global Explanations

    Authors: Giorgio Visani, Vincenzo Stanzione, Damien Garreau

    Abstract: The explainability of machine learning algorithms is crucial, and numerous methods have emerged recently. Local, post-hoc methods assign an attribution score to each feature, indicating its importance for the prediction. However, these methods require recalculating explanations for each example. On the other side, while there exist global approaches they often produce explanations that are either… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

  4. arXiv:2407.06703  [pdf, other

    q-bio.BM cs.LG

    HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability prediction

    Authors: Gian Marco Visani, Michael N. Pun, William Galvin, Eric Daniel, Kevin Borisiak, Utheri Wagura, Armita Nourmohammad

    Abstract: Predicting the stability and fitness effects of amino acid mutations in proteins is a cornerstone of biological discovery and engineering. Various experimental techniques have been developed to measure mutational effects, providing us with extensive datasets across a diverse range of proteins. By training on these data, traditional computational modeling and more recent machine learning approaches… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 16 pages, 8 figures

    ACM Class: J.3

  5. arXiv:2311.09312  [pdf, other

    q-bio.BM cs.AI cs.LG

    H-Packer: Holographic Rotationally Equivariant Convolutional Neural Network for Protein Side-Chain Packing

    Authors: Gian Marco Visani, William Galvin, Michael Neal Pun, Armita Nourmohammad

    Abstract: Accurately modeling protein 3D structure is essential for the design of functional proteins. An important sub-task of structure modeling is protein side-chain packing: predicting the conformation of side-chains (rotamers) given the protein's backbone structure and amino-acid sequence. Conventional approaches for this task rely on expensive sampling procedures over hand-crafted energy functions and… ▽ More

    Submitted 28 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted as a conference paper at MLCB 2023. 8 pages main body, 20 pages with appendix. 10 figures

    ACM Class: J.3; I.2.0

  6. arXiv:2209.15567  [pdf, other

    cs.LG physics.bio-ph

    Holographic-(V)AE: an end-to-end SO(3)-Equivariant (Variational) Autoencoder in Fourier Space

    Authors: Gian Marco Visani, Michael N. Pun, Arman Angaji, Armita Nourmohammad

    Abstract: Group-equivariant neural networks have emerged as a data-efficient approach to solve classification and regression tasks, while respecting the relevant symmetries of the data. However, little work has been done to extend this paradigm to the unsupervised and generative domains. Here, we present Holographic-(Variational) Auto Encoder (H-(V)AE), a fully end-to-end SO(3)-equivariant (variational) aut… ▽ More

    Submitted 11 June, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

  7. arXiv:2204.06297  [pdf, other

    cs.LG cs.AI stat.ML

    Enabling Synthetic Data adoption in regulated domains

    Authors: Giorgio Visani, Giacomo Graffi, Mattia Alfero, Enrico Bagli, Davide Capuzzo, Federico Chesani

    Abstract: The switch from a Model-Centric to a Data-Centric mindset is putting emphasis on data and its quality rather than algorithms, bringing forward new challenges. In particular, the sensitive nature of the information in highly regulated scenarios needs to be accounted for. Specific approaches to address the privacy issue have been developed, as Privacy Enhancing Technologies. However, they frequently… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

  8. arXiv:2105.00773  [pdf, other

    stat.AP cs.LG stat.ML

    Approximate Bayesian Computation for an Explicit-Duration Hidden Markov Model of COVID-19 Hospital Trajectories

    Authors: Gian Marco Visani, Alexandra Hope Lee, Cuong Nguyen, David M. Kent, John B. Wong, Joshua T. Cohen, Michael C. Hughes

    Abstract: We address the problem of modeling constrained hospital resources in the midst of the COVID-19 pandemic in order to inform decision-makers of future demand and assess the societal value of possible interventions. For broad applicability, we focus on the common yet challenging scenario where patient-level data for a region of interest are not available. Instead, given daily admissions counts, we mo… ▽ More

    Submitted 28 July, 2021; v1 submitted 28 April, 2021; originally announced May 2021.

    Comments: To appear in the Proceedings of the Machine Learning for Healthcare (MLHC) conference, 2021. 20 pages, 7 figures and 1 table. 26 additional pages of supplementary material

  9. arXiv:2012.15103  [pdf, other

    cs.LG stat.ML

    Explanations of Machine Learning predictions: a mandatory step for its application to Operational Processes

    Authors: Giorgio Visani, Federico Chesani, Enrico Bagli, Davide Capuzzo, Alessandro Poluzzi

    Abstract: In the global economy, credit companies play a central role in economic development, through their activity as money lenders. This important task comes with some drawbacks, mainly the risk of the debtors not being able to repay the provided credit. Therefore, Credit Risk Modelling (CRM), namely the evaluation of the probability that a debtor will not repay the due amount, plays a paramount role. S… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

  10. arXiv:2011.10367  [pdf, other

    cs.LG cs.AI

    PSD2 Explainable AI Model for Credit Scoring

    Authors: Neus Llop Torrent, Giorgio Visani, Enrico Bagli

    Abstract: The aim of this project is to develop and test advanced analytical methods to improve the prediction accuracy of Credit Risk Models, preserving at the same time the model interpretability. In particular, the project focuses on applying an explainable machine learning model to bank-related databases. The input data were obtained from open data. Over the total proven models, CatBoost has shown the h… ▽ More

    Submitted 6 August, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

  11. arXiv:2008.05756  [pdf, other

    stat.ML cs.LG

    Metrics for Multi-Class Classification: an Overview

    Authors: Margherita Grandini, Enrico Bagli, Giorgio Visani

    Abstract: Classification tasks in machine learning involving more than two classes are known by the name of "multi-class classification". Performance indicators are very useful when the aim is to evaluate and compare different classification models or machine learning techniques. Many metrics come in handy to test the ability of a multi-class classifier. Those metrics turn out to be useful at different stag… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

  12. arXiv:2006.05714  [pdf, other

    cs.LG cs.AI stat.ML

    OptiLIME: Optimized LIME Explanations for Diagnostic Computer Algorithms

    Authors: Giorgio Visani, Enrico Bagli, Federico Chesani

    Abstract: Local Interpretable Model-Agnostic Explanations (LIME) is a popular method to perform interpretability of any kind of Machine Learning (ML) model. It explains one ML prediction at a time, by learning a simple linear model around the prediction. The model is trained on randomly generated data points, sampled from the training dataset distribution and weighted according to the distance from the refe… ▽ More

    Submitted 7 February, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

  13. arXiv:2002.07327  [pdf

    q-bio.CB cs.LG

    Enzyme promiscuity prediction using hierarchy-informed multi-label classification

    Authors: Gian Marco Visani, Michael C. Hughes, Soha Hassoun

    Abstract: As experimental efforts are costly and time consuming, computational characterization of enzyme capabilities is an attractive alternative. We present and evaluate several machine-learning models to predict which of 983 distinct enzymes, as defined via the Enzyme Commission, EC, numbers, are likely to interact with a given query molecule. Our data consists of enzyme-substrate interactions from the… ▽ More

    Submitted 25 January, 2021; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: Presented as a poster at the 2019 Machine Learning for Computational Biology Symposium, Vancouver, CA Accepted for publication, Bioinformatics, Jan 22, 2021

  14. Statistical stability indices for LIME: obtaining reliable explanations for Machine Learning models

    Authors: Giorgio Visani, Enrico Bagli, Federico Chesani, Alessandro Poluzzi, Davide Capuzzo

    Abstract: Nowadays we are witnessing a transformation of the business processes towards a more computation driven approach. The ever increasing usage of Machine Learning techniques is the clearest example of such trend. This sort of revolution is often providing advantages, such as an increase in prediction accuracy and a reduced time to obtain the results. However, these methods present a major drawback:… ▽ More

    Submitted 12 November, 2020; v1 submitted 31 January, 2020; originally announced January 2020.