Skip to main content

Showing 1–2 of 2 results for author: Lu, H Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.14161  [pdf, other

    cs.CL

    TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

    Authors: Frank F. Xu, Yufan Song, Boxuan Li, Yuxuan Tang, Kritanjali Jain, Mengxue Bao, Zora Z. Wang, Xuhui Zhou, Zhitong Guo, Murong Cao, Mingyang Yang, Hao Yang Lu, Amaad Martin, Zhe Su, Leander Maben, Raj Mehta, Wayne Chi, Lawrence Jang, Yiqing Xie, Shuyan Zhou, Graham Neubig

    Abstract: We interact with computers on an everyday basis, be it in everyday life or work, and many aspects of work can be done entirely with access to a computer and the Internet. At the same time, thanks to improvements in large language models (LLMs), there has also been a rapid development in AI agents that interact with and affect change in their surrounding environments. But how performant are AI agen… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Preprint

  2. arXiv:2405.07841  [pdf, other

    cs.LG

    Sample Selection Bias in Machine Learning for Healthcare

    Authors: Vinod Kumar Chauhan, Lei Clifton, Achille Salaün, Huiqi Yvonne Lu, Kim Branson, Patrick Schwab, Gaurav Nigam, David A. Clifton

    Abstract: While machine learning algorithms hold promise for personalised medicine, their clinical adoption remains limited, partly due to biases that can compromise the reliability of predictions. In this paper, we focus on sample selection bias (SSB), a specific type of bias where the study population is less representative of the target population, leading to biased and potentially harmful decisions. Des… ▽ More

    Submitted 26 November, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 21 pages and 11 figures (under review)