Skip to main content

Showing 1–5 of 5 results for author: Bennion, J

.
  1. arXiv:2505.17636  [pdf

    cs.LG cs.AI cs.CL

    Surfacing Semantic Orthogonality Across Model Safety Benchmarks: A Multi-Dimensional Analysis

    Authors: Jonathan Bennion, Shaona Ghosh, Mantek Singh, Nouha Dziri

    Abstract: Various AI safety datasets have been developed to measure LLMs against evolving interpretations of harm. Our evaluation of five recently published open-source safety benchmarks reveals distinct semantic clusters using UMAP dimensionality reduction and kmeans clustering (silhouette score: 0.470). We identify six primary harm categories with varying benchmark representation. GretelAI, for example, f… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 6th International Conference on Advanced Natural Language Processing (AdNLP 2025), May 17 ~ 18, 2025, Zurich, Switzerland

    Journal ref: Computer Science & Information Technology 15 (2025) 27 - 39

  2. arXiv:2503.05731  [pdf, other

    cs.CY cs.AI

    AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

    Authors: Shaona Ghosh, Heather Frase, Adina Williams, Sarah Luger, Paul Röttger, Fazl Barez, Sean McGregor, Kenneth Fricklas, Mala Kumar, Quentin Feuillade--Montixi, Kurt Bollacker, Felix Friedrich, Ryan Tsang, Bertie Vidgen, Alicia Parrish, Chris Knotz, Eleonora Presani, Jonathan Bennion, Marisa Ferrara Boston, Mike Kuniavsky, Wiebke Hutiri, James Ezick, Malek Ben Salem, Rajat Sahay, Sujata Goswami , et al. (77 additional authors not shown)

    Abstract: The rapid advancement and deployment of AI systems have created an urgent need for standard safety-evaluation frameworks. This paper introduces AILuminate v1.0, the first comprehensive industry-standard benchmark for assessing AI-product risk and reliability. Its development employed an open process that included participants from multiple fields. The benchmark evaluates an AI system's resistance… ▽ More

    Submitted 18 April, 2025; v1 submitted 19 February, 2025; originally announced March 2025.

    Comments: 51 pages, 8 figures and an appendix

  3. arXiv:2210.17043  [pdf, other

    cs.LG stat.AP

    Evaluating Point-Prediction Uncertainties in Neural Networks for Drug Discovery

    Authors: Ya Ju Fan, Jonathan E. Allen, Kevin S. McLoughlin, Da Shi, Brian J. Bennion, Xiaohua Zhang, Felice C. Lightstone

    Abstract: Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models require uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncertainty information. Methods that combine Bayesian models with NN models address this issue, but are d… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  4. arXiv:2104.04547  [pdf, other

    cs.LG q-bio.BM

    High-Throughput Virtual Screening of Small Molecule Inhibitors for SARS-CoV-2 Protein Targets with Deep Fusion Models

    Authors: Garrett A. Stevenson, Derek Jones, Hyojin Kim, W. F. Drew Bennett, Brian J. Bennion, Monica Borucki, Feliza Bourguet, Aidan Epstein, Magdalena Franco, Brooke Harmon, Stewart He, Max P. Katz, Daniel Kirshner, Victoria Lao, Edmond Y. Lau, Jacky Lo, Kevin McLoughlin, Richard Mosesso, Deepa K. Murugesh, Oscar A. Negrete, Edwin A. Saada, Brent Segelke, Maxwell Stefan, Marisa W. Torres, Dina Weilhammer , et al. (7 additional authors not shown)

    Abstract: Structure-based Deep Fusion models were recently shown to outperform several physics- and machine learning-based protein-ligand binding affinity prediction methods. As part of a multi-institutional COVID-19 pandemic response, over 500 million small molecules were computationally screened against four protein structures from the novel coronavirus (SARS-CoV-2), which causes COVID-19. Three enhanceme… ▽ More

    Submitted 31 May, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

  5. arXiv:2002.12541  [pdf, other

    q-bio.QM

    Machine Learning Models to Predict Inhibition of the Bile Salt Export Pump

    Authors: Kevin S. McLoughlin, Claire G. Jeong, Thomas D. Sweitzer, Amanda J. Minnich, Margaret J. Tse, Brian J. Bennion, Jonathan E. Allen, Stacie Calad-Thomson, Thomas S. Rush, James M. Brase

    Abstract: Drug-induced liver injury (DILI) is the most common cause of acute liver failure and a frequent reason for withdrawal of candidate drugs during preclinical and clinical testing. An important type of DILI is cholestatic liver injury, caused by buildup of bile salts within hepatocytes; it is frequently associated with inhibition of bile salt transporters, such as the bile salt export pump (BSEP). Re… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.