Skip to main content

Showing 1–5 of 5 results for author: Halevy, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  2. arXiv:2412.10575  [pdf, other

    cs.LG cs.AI stat.ML

    Who's the (Multi-)Fairest of Them All: Rethinking Interpolation-Based Data Augmentation Through the Lens of Multicalibration

    Authors: Karina Halevy, Karly Hou, Charumathi Badrinath

    Abstract: Data augmentation methods, especially SoTA interpolation-based methods such as Fair Mixup, have been widely shown to increase model fairness. However, this fairness is evaluated on metrics that do not capture model uncertainty and on datasets with only one, relatively large, minority group. As a remedy, multicalibration has been introduced to measure fairness while accommodating uncertainty and ac… ▽ More

    Submitted 14 April, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

    Comments: Expanded version of AAAI 2025 main track paper. 8 pages, 2 figures

  3. arXiv:2403.00180  [pdf, other

    cs.CL

    "Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models

    Authors: Karina Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut

    Abstract: Model editing has emerged as a cost-effective strategy to update knowledge stored in language models. However, model editing can have unintended consequences after edits are applied: information unrelated to the edits can also be changed, and other general behaviors of the model can be wrongly altered. In this work, we investigate how model editing methods unexpectedly amplify model biases post-ed… ▽ More

    Submitted 3 October, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted to EMNLP 2024 Main. 9 pages, 4 figures

  4. arXiv:2304.11223  [pdf, other

    cs.CL

    A Group-Specific Approach to NLP for Hate Speech Detection

    Authors: Karina Halevy

    Abstract: Automatic hate speech detection is an important yet complex task, requiring knowledge of common sense, stereotypes of protected groups, and histories of discrimination, each of which may constantly evolve. In this paper, we propose a group-specific approach to NLP for online hate speech detection. The approach consists of creating and infusing historical and linguistic knowledge about a particular… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 11 pages, 0 figures

  5. arXiv:2106.00877  [pdf, other

    cs.CL

    Evaluating Word Embeddings with Categorical Modularity

    Authors: Sílvia Casacuberta, Karina Halevy, Damián E. Blasi

    Abstract: We introduce categorical modularity, a novel low-resource intrinsic metric to evaluate word embedding quality. Categorical modularity is a graph modularity metric based on the $k$-nearest neighbor graph constructed with embedding vectors of words from a fixed set of semantic categories, in which the goal is to measure the proportion of words that have nearest neighbors within the same categories.… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted to Findings of ACL 2021 (Long Paper)