Showing 1–10 of 10 results for author: Schumann, C

  1. arXiv:2408.08307  [pdf, other]

    cs.LG cs.CV

    What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models

    Authors: Ahmed Imtiaz Humayun, Ibtihel Amara, Cristina Vasconcelos, Deepak Ramachandran, Candice Schumann, Junfeng He, Katherine Heller, Golnoosh Farnadi, Negar Rostamzadeh, Mohammad Havaei

    Abstract: Deep Generative Models are frequently used to learn continuous representations of complex data distributions using a finite number of samples. For any generative model, including pre-trained foundation models with Diffusion or Transformer architectures, generation performance can significantly vary across the learned data manifold. In this paper we study the local geometry of the learned manifold…

    Submitted 6 February, 2025; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: Accepted for publication at ICLR 2025

  2. arXiv:2408.07009  [pdf, other]

    cs.CV

    Imagen 3

    Authors: Imagen-Team-Google: Jason Baldridge, Jakob Bauer, Mukul Bhutani, Nicole Brichtova, Andrew Bunner, Lluis Castrejon, Kelvin Chan, Yichang Chen, Sander Dieleman, Yuqing Du, Zach Eaton-Rosen, Hongliang Fei, Nando de Freitas, Yilin Gao, Evgeny Gladchenko, Sergio Gómez Colmenarejo, Mandy Guo, Alex Haig, Will Hawkins, Hexiang Hu, Huilian Huang, Tobenna Peter Igwe, Christos Kaplanis, et al. (237 additional authors not shown)

    Abstract: We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models.

    Submitted 21 December, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  3. arXiv:2401.14322  [pdf, other]

    cs.CV cs.CY

    Generalized People Diversity: Learning a Human Perception-Aligned Diversity Representation for People Images

    Authors: Hansa Srinivasan, Candice Schumann, Aradhana Sinha, David Madras, Gbolahan Oluwafemi Olanubi, Alex Beutel, Susanna Ricco, Jilin Chen

    Abstract: Capturing the diversity of people in images is challenging: recent literature tends to focus on diversifying one or two attributes, requiring expensive attribute labels or building classifiers. We introduce a diverse people image ranking method which more flexibly aligns with human notions of people diversity in a less prescriptive, label-free manner. The Perception-Aligned Text-derived Human repr…

    Submitted 25 January, 2024; originally announced January 2024.

  4. arXiv:2305.09073  [pdf, other]

    cs.CV cs.CY

    Consensus and Subjectivity of Skin Tone Annotation for ML Fairness

    Authors: Candice Schumann, Gbolahan O. Olanubi, Auriel Wright, Ellis Monk Jr., Courtney Heldreth, Susanna Ricco

    Abstract: Understanding different human attributes and how they affect model behavior may become a standard need for all model creation and usage, from traditional computer vision tasks to the newest multimodal generative AI systems. In computer vision specifically, we have relied on datasets augmented with perceived attribute signals (e.g., gender presentation, skin tone, and age) and benchmarks enabled by…

    Submitted 2 January, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

  5. A Step Toward More Inclusive People Annotations for Fairness

    Authors: Candice Schumann, Susanna Ricco, Utsav Prabhu, Vittorio Ferrari, Caroline Pantofaru

    Abstract: The Open Images Dataset contains approximately 9 million images and is a widely accepted dataset for computer vision research. As is common practice for large datasets, the annotations are not exhaustive, with bounding boxes and attribute labels for only a subset of the classes in each image. In this paper, we present a new set of annotations on a subset of the Open Images dataset called the MIAP…

    Submitted 5 May, 2021; originally announced May 2021.

    Journal ref: AIES (2021)

  6. arXiv:2001.00089  [pdf, other]

    cs.CY cs.AI cs.LG

    Measuring Non-Expert Comprehension of Machine Learning Fairness Metrics

    Authors: Debjani Saha, Candice Schumann, Duncan C. McElfresh, John P. Dickerson, Michelle L. Mazurek, Michael Carl Tschantz

    Abstract: Bias in machine learning has manifested injustice in several areas, such as medicine, hiring, and criminal justice. In response, computer scientists have developed myriad definitions of fairness to correct this bias in fielded algorithms. While some definitions are based on established legal and ethical norms, others are largely mathematical. It is unclear whether the general public agrees with th…

    Submitted 2 July, 2020; v1 submitted 16 December, 2019; originally announced January 2020.

  7. arXiv:1912.03802  [pdf, other]

    cs.LG cs.AI stat.ML

    Group Fairness in Bandit Arm Selection

    Authors: Candice Schumann, Zhi Lang, Nicholas Mattei, John P. Dickerson

    Abstract: We propose a novel formulation of group fairness with biased feedback in the contextual multi-armed bandit (CMAB) setting. In the CMAB setting, a sequential decision maker must, at each time step, choose an arm to pull from a finite set of arms after observing some context for each of the potential arm pulls. In our model, arms are partitioned into two or more sensitive groups based on some protec…

    Submitted 15 February, 2022; v1 submitted 8 December, 2019; originally announced December 2019.

    Comments: Accepted to AAMAS 2022

  8. arXiv:1906.09688  [pdf, other]

    cs.LG stat.ML

    Transfer of Machine Learning Fairness across Domains

    Authors: Candice Schumann, Xuezhi Wang, Alex Beutel, Jilin Chen, Hai Qian, Ed H. Chi

    Abstract: If our models are used in new or unexpected cases, do we know if they will make fair predictions? Previously, researchers developed ways to debias a model for a single problem domain. However, this is often not how models are trained and used in practice. For example, labels and demographics (sensitive attributes) are often hard to observe, resulting in auxiliary or synthetic data to be used for t…

    Submitted 14 November, 2019; v1 submitted 23 June, 2019; originally announced June 2019.

  9. arXiv:1906.09621  [pdf, other]

    cs.LG cs.AI stat.ML

    Making the Cut: A Bandit-based Approach to Tiered Interviewing

    Authors: Candice Schumann, Zhi Lang, Jeffrey S. Foster, John P. Dickerson

    Abstract: Given a huge set of applicants, how should a firm allocate sequential resume screenings, phone interviews, and in-person site visits? In a tiered interview process, later stages (e.g., in-person visits) are more informative, but also more expensive than earlier stages (e.g., resume screenings). Using accepted hiring models and the concept of structured interviews, a best practice in human resource…

    Submitted 14 November, 2019; v1 submitted 23 June, 2019; originally announced June 2019.

  10. arXiv:1709.03441  [pdf, ps, other]

    cs.LG

    The Diverse Cohort Selection Problem

    Authors: Candice Schumann, Samsara N. Counts, Jeffrey S. Foster, John P. Dickerson

    Abstract: How should a firm allocate its limited interviewing resources to select the optimal cohort of new employees from a large set of job applicants? How should that firm allocate cheap but noisy resume screenings and expensive but in-depth in-person interviews? We view this problem through the lens of combinatorial pure exploration (CPE) in the multi-armed bandit setting, where a central learning agent…

    Submitted 14 March, 2019; v1 submitted 11 September, 2017; originally announced September 2017.