Showing 1–14 of 14 results for author: Mai, K

  1. arXiv:2505.18003

    cs.LG cs.AI

    An Example Safety Case for Safeguards Against Misuse

    Authors: Joshua Clymer, Jonah Weinbaum, Robert Kirk, Kimberly Mai, Selena Zhang, Xander Davies

    Abstract: Existing evaluations of AI misuse safeguards provide a patchwork of evidence that is often difficult to connect to real-world decisions. To bridge this gap, we describe an end-to-end argument (a "safety case") that misuse safeguards reduce the risk posed by an AI assistant to low levels. We first describe how a hypothetical developer red teams safeguards, estimating the effort required to evade th…

    Submitted 23 May, 2025; originally announced May 2025.

  2. Understanding the limitations of self-supervised learning for tabular anomaly detection

    Authors: Kimberly T. Mai, Toby Davies, Lewis D. Griffin

    Abstract: While self-supervised learning has improved anomaly detection in computer vision and natural language processing, it is unclear whether tabular data can benefit from it. This paper explores the limitations of self-supervision for tabular anomaly detection. We conduct several experiments spanning various pretext tasks on 26 benchmark datasets to understand why this is the case. Our results confirm…

    Submitted 14 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  3. arXiv:2306.14437

    cs.RO cs.AI

    A Self-supervised Contrastive Learning Method for Grasp Outcomes Prediction

    Authors: Chengliang Liu, Binhua Huang, Yiwen Liu, Yuanzhe Su, Ke Mai, Yupo Zhang, Zhengkun Yi, Xinyu Wu

    Abstract: In this paper, we investigate the effectiveness of contrastive learning methods for predicting grasp outcomes in an unsupervised manner. By utilizing a publicly available dataset, we demonstrate that contrastive learning methods perform well on the task of grasp outcomes prediction. Specifically, the dynamic-dictionary-based method with the momentum updating technique achieves a satisfactory accur…

    Submitted 21 September, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Manuscript accepted to RCAR 2023

  4. arXiv:2305.13877

    cs.CL cs.AI

    NarrativeXL: A Large-scale Dataset For Long-Term Memory Models

    Authors: Arseny Moskvichev, Ky-Vinh Mai

    Abstract: We propose a new large-scale (nearly a million questions) ultra-long-context (more than 50,000 words average document length) reading comprehension dataset. Using GPT 3.5, we summarized each scene in 1,500 hand-curated fiction books from Project Gutenberg, which resulted in approximately 150 scene-level summaries per book. After that, we created a number of reading comprehension questions based on…

    Submitted 7 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    ACM Class: I.2.7; I.2.6

  5. arXiv:2303.06074

    cs.CL

    Susceptibility to Influence of Large Language Models

    Authors: Lewis D Griffin, Bennett Kleinberg, Maximilian Mozes, Kimberly T Mai, Maria Vau, Matthew Caldwell, Augustine Marvor-Parker

    Abstract: Two studies tested the hypothesis that a Large Language Model (LLM) can be used to model psychological change following exposure to influential input. The first study tested a generic mode of influence - the Illusory Truth Effect (ITE) - where earlier exposure to a statement (through, for example, rating its interest) boosts a later truthfulness test rating. Data was collected from 1000 human part…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 24 pages, 6 figures, 7 tables, 53 references

    ACM Class: J.4; I.2.m; I.2.7

  6. Warning: Humans Cannot Reliably Detect Speech Deepfakes

    Authors: Kimberly T. Mai, Sergi D. Bray, Toby Davies, Lewis D. Griffin

    Abstract: Speech deepfakes are artificial voices generated by machine learning models. Previous literature has highlighted deepfakes as one of the biggest security threats arising from progress in artificial intelligence due to their potential for misuse. However, studies investigating human detection capabilities are limited. We presented genuine and deepfake audio to n = 529 individuals and asked them to…

    Submitted 2 August, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Journal ref: PLoS ONE 18(8) (2023): e0285333

  7. arXiv:2204.05695

    cs.CL

    Self-Supervised Losses for One-Class Textual Anomaly Detection

    Authors: Kimberly T. Mai, Toby Davies, Lewis D. Griffin

    Abstract: Current deep learning methods for anomaly detection in text rely on supervisory signals in inliers that may be unobtainable or bespoke architectures that are difficult to tune. We study a simpler alternative: fine-tuning Transformers on the inlier data with self-supervised objectives and using the losses as an anomaly score. Overall, the self-supervision approach outperforms other methods under va…

    Submitted 12 April, 2022; originally announced April 2022.

  8. arXiv:2104.10453

    cs.LG cs.CV

    Brittle Features May Help Anomaly Detection

    Authors: Kimberly T. Mai, Toby Davies, Lewis D. Griffin

    Abstract: One-class anomaly detection is challenging. A representation that clearly distinguishes anomalies from normal data is ideal, but arriving at this representation is difficult since only normal data is available at training time. We examine the performance of representations, transferred from auxiliary tasks, for anomaly detection. Our results suggest that the choice of representation is more import…

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: Accepted to Women in Computer Vision workshop at CVPR (2021)

  9. arXiv:2009.09659

    cs.CY

    Identifying synergies in private and public transportation

    Authors: Iva Bojic, Dániel Kondor, Wei Tu, Ke Mai, Paolo Santi, Carlo Ratti

    Abstract: In this paper, we explore existing synergies between private and public transportation as provided by taxi and bus services on the level of individual trips. While these modes are typically separated for economic reasons, in a future with shared Autonomous Vehicles (AVs) providing cheap and efficient transportation services, such distinctions will blur. Consequently, optimization based on real-tim…

    Submitted 21 September, 2020; originally announced September 2020.

  10. arXiv:2005.03066

    cs.CL cs.AI cs.LG

    Weakly-Supervised Neural Response Selection from an Ensemble of Task-Specialised Dialogue Agents

    Authors: Asir Saeed, Khai Mai, Pham Minh, Nguyen Tuan Duc, Danushka Bollegala

    Abstract: Dialogue engines that incorporate different types of agents to converse with humans are popular. However, conversations are dynamic in the sense that a selected response will change the conversation on-the-fly, influencing the subsequent utterances in the conversation, which makes the response selection a challenging problem. We model the problem of selecting the best response from a set of re…

    Submitted 6 May, 2020; originally announced May 2020.

  11. arXiv:1902.10118

    cs.CL

    Multi-Task Learning with Contextualized Word Representations for Extented Named Entity Recognition

    Authors: Thai-Hoang Pham, Khai Mai, Nguyen Minh Trung, Nguyen Tuan Duc, Danushka Bollegala, Ryohei Sasano, Satoshi Sekine

    Abstract: Fine-Grained Named Entity Recognition (FG-NER) is critical for many NLP applications. While classical named entity recognition (NER) has attracted a substantial amount of research, FG-NER is still an open research domain. The current state-of-the-art (SOTA) model for FG-NER relies heavily on manual efforts for building a dictionary and designing hand-crafted features. The end-to-end framework whic…

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: 7 pages, 2 figures, 4 tables

  12. arXiv:1805.03291

    cs.AR

    Characterizing, Exploiting, and Mitigating Vulnerabilities in MLC NAND Flash Memory Programming

    Authors: Yu Cai, Saugata Ghose, Yixin Luo, Ken Mai, Onur Mutlu, Erich F. Haratsch

    Abstract: This paper summarizes our work on experimentally analyzing, exploiting, and addressing vulnerabilities in multi-level cell NAND flash memory programming, which was published in the industrial session of HPCA 2017, and examines the work's significance and future potential. Modern NAND flash memory chips use multi-level cells (MLC), which store two bits of data in each cell, to improve chip density.…

    Submitted 8 May, 2018; originally announced May 2018.

  13. arXiv:1805.03283

    cs.AR

    Read Disturb Errors in MLC NAND Flash Memory

    Authors: Yu Cai, Yixin Luo, Saugata Ghose, Erich F. Haratsch, Ken Mai, Onur Mutlu

    Abstract: This paper summarizes our work on experimentally characterizing, mitigating, and recovering read disturb errors in multi-level cell (MLC) NAND flash memory, which was published in DSN 2015, and examines the work's significance and future potential. NAND flash memory reliability continues to degrade as the memory is scaled down and more bits are programmed per cell. A key contributor to this reduce…

    Submitted 8 May, 2018; originally announced May 2018.

  14. arXiv:1805.02819

    cs.AR

    Experimental Characterization, Optimization, and Recovery of Data Retention Errors in MLC NAND Flash Memory

    Authors: Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, Saugata Ghose, Onur Mutlu

    Abstract: This paper summarizes our work on experimentally characterizing, mitigating, and recovering data retention errors in multi-level cell (MLC) NAND flash memory, which was published in HPCA 2015, and examines the work's significance and future potential. Retention errors, caused by charge leakage over time, are the dominant source of flash memory errors. Understanding, characterizing, and reducing re…

    Submitted 7 May, 2018; originally announced May 2018.