Showing 1–6 of 6 results for author: Eng, R

  1. arXiv:2503.05731  [pdf, other]

    cs.CY cs.AI

    AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

    Authors: Shaona Ghosh, Heather Frase, Adina Williams, Sarah Luger, Paul Röttger, Fazl Barez, Sean McGregor, Kenneth Fricklas, Mala Kumar, Quentin Feuillade--Montixi, Kurt Bollacker, Felix Friedrich, Ryan Tsang, Bertie Vidgen, Alicia Parrish, Chris Knotz, Eleonora Presani, Jonathan Bennion, Marisa Ferrara Boston, Mike Kuniavsky, Wiebke Hutiri, James Ezick, Malek Ben Salem, Rajat Sahay, Sujata Goswami, et al. (77 additional authors not shown)

    Abstract: The rapid advancement and deployment of AI systems have created an urgent need for standard safety-evaluation frameworks. This paper introduces AILuminate v1.0, the first comprehensive industry-standard benchmark for assessing AI-product risk and reliability. Its development employed an open process that included participants from multiple fields. The benchmark evaluates an AI system's resistance…

    Submitted 18 April, 2025; v1 submitted 19 February, 2025; originally announced March 2025.

    Comments: 51 pages, 8 figures and an appendix

  2. arXiv:2501.10057  [pdf, other]

    cs.CL

    MSTS: A Multimodal Safety Test Suite for Vision-Language Models

    Authors: Paul Röttger, Giuseppe Attanasio, Felix Friedrich, Janis Goldzycher, Alicia Parrish, Rishabh Bhardwaj, Chiara Di Bonaventura, Roman Eng, Gaia El Khoury Geagea, Sujata Goswami, Jieun Han, Dirk Hovy, Seogyeong Jeong, Paloma Jeretič, Flor Miriam Plaza-del-Arco, Donya Rooein, Patrick Schramowski, Anastassia Shaitarova, Xudong Shen, Richard Willats, Andrea Zugarini, Bertie Vidgen

    Abstract: Vision-language models (VLMs), which process image and text inputs, are increasingly integrated into chat assistants and other consumer AI applications. Without proper safeguards, however, VLMs may give harmful advice (e.g. how to self-harm) or encourage unsafe behaviours (e.g. to consume drugs). Despite these clear hazards, little work so far has evaluated VLM safety and the novel risks created b…

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: under review

  3. arXiv:2212.14105  [pdf, other]

    econ.EM

    Supercompliers

    Authors: Matthew L. Comey, Amanda R. Eng, Pauline Leung, Zhuan Pei

    Abstract: In a binary-treatment instrumental variable framework, we define supercompliers as the subpopulation whose treatment take-up positively responds to eligibility and whose outcome positively responds to take-up. Supercompliers are the only subpopulation to benefit from treatment eligibility and, hence, are important for policy. We provide tools to characterize supercompliers under a set of jointly t…

    Submitted 20 December, 2024; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: This version substantially revises v2. Pauline Leung has made significant contributions and is now a coauthor. We expand the non-binary outcome case, essential in the new connection to MVPF (Section 3). We replace the original empirical application with two job training experiments (Section 4) and add new theoretical results in Remark 5 and Appendices A.3 and A.7. References are updated.

  4. arXiv:2001.06683  [pdf]

    astro-ph.IM

    The Habitable Exoplanet Observatory (HabEx) Mission Concept Study Final Report

    Authors: B. Scott Gaudi, Sara Seager, Bertrand Mennesson, Alina Kiessling, Keith Warfield, Kerri Cahoy, John T. Clarke, Shawn Domagal-Goldman, Lee Feinberg, Olivier Guyon, Jeremy Kasdin, Dimitri Mawet, Peter Plavchan, Tyler Robinson, Leslie Rogers, Paul Scowen, Rachel Somerville, Karl Stapelfeldt, Christopher Stark, Daniel Stern, Margaret Turnbull, Rashied Amini, Gary Kuan, Stefan Martin, Rhonda Morgan, et al. (161 additional authors not shown)

    Abstract: The Habitable Exoplanet Observatory, or HabEx, has been designed to be the Great Observatory of the 2030s. For the first time in human history, technologies have matured sufficiently to enable an affordable space-based telescope mission capable of discovering and characterizing Earthlike planets orbiting nearby bright sunlike stars in order to search for signs of habitability and biosignatures. Su…

    Submitted 26 January, 2020; v1 submitted 18 January, 2020; originally announced January 2020.

    Comments: Full report: 498 pages. Executive Summary: 14 pages. More information about HabEx can be found here: https://www.jpl.nasa.gov/habex/

  5. arXiv:1809.09674  [pdf]

    astro-ph.IM

    The Habitable Exoplanet Observatory (HabEx) Mission Concept Study Interim Report

    Authors: B. Scott Gaudi, Sara Seager, Bertrand Mennesson, Alina Kiessling, Keith Warfield, Gary Kuan, Kerri Cahoy, John T. Clarke, Shawn Domagal-Goldman, Lee Feinberg, Olivier Guyon, Jeremy Kasdin, Dimitri Mawet, Tyler Robinson, Leslie Rogers, Paul Scowen, Rachel Somerville, Karl Stapelfeldt, Christopher Stark, Daniel Stern, Margaret Turnbull, Stefan Martin, Oscar Alvarez-Salazar, Rashied Amini, William Arnold, et al. (57 additional authors not shown)

    Abstract: For the first time in human history, technologies have matured sufficiently to enable a mission capable of discovering and characterizing habitable planets like Earth orbiting sunlike stars other than the Sun. At the same time, such a platform would enable unique science not possible from ground-based facilities. This science is broad and exciting, ranging from new investigations of our own solar…

    Submitted 25 September, 2018; originally announced September 2018.

    Comments: 212 pages

  6. arXiv:1805.06880  [pdf, other]

    cs.CV

    It's all Relative: Monocular 3D Human Pose Estimation from Weakly Supervised Data

    Authors: Matteo Ruggero Ronchi, Oisin Mac Aodha, Robert Eng, Pietro Perona

    Abstract: We address the problem of 3D human pose estimation from 2D input images using only weakly supervised training data. Despite showing considerable success for 2D pose estimation, the application of supervised machine learning to 3D pose estimation in real world images is currently hampered by the lack of varied training images with corresponding 3D poses. Most existing 3D pose estimation algorithms…

    Submitted 27 July, 2018; v1 submitted 17 May, 2018; originally announced May 2018.

    Comments: BMVC 2018. Project page available at http://www.vision.caltech.edu/~mronchi/projects/RelativePose