Skip to main content

Showing 1–3 of 3 results for author: González-Bustamante, B

Searching in archive cs. Search in all archives.
.
  1. Emulating Public Opinion: A Proof-of-Concept of AI-Generated Synthetic Survey Responses for the Chilean Case

    Authors: Bastián González-Bustamante, Nando Verelst, Carla Cisternas

    Abstract: Large Language Models (LLMs) offer promising avenues for methodological and applied innovations in survey research by using synthetic respondents to emulate human answers and behaviour, potentially mitigating measurement and representation errors. However, the extent to which LLMs recover aggregate item distributions remains uncertain and downstream applications risk reproducing social stereotypes… ▽ More

    Submitted 11 September, 2025; originally announced September 2025.

    Comments: Working paper: 18 pages, 4 tables, 2 figures

    MSC Class: 68T50 (Primary) 91F10 (Secondary)

    Journal ref: Empiria Lab Method Series (2025)

  2. arXiv:2412.00539  [pdf, other

    cs.CL cs.AI

    TextClass Benchmark: A Continuous Elo Rating of LLMs in Social Sciences

    Authors: Bastián González-Bustamante

    Abstract: The TextClass Benchmark project is an ongoing, continuous benchmarking process that aims to provide a comprehensive, fair, and dynamic evaluation of LLMs and transformers for text classification tasks. This evaluation spans various domains and languages in social sciences disciplines engaged in NLP and text-as-data approach. The leaderboards present performance metrics and relative ranking using a… ▽ More

    Submitted 6 December, 2024; v1 submitted 30 November, 2024; originally announced December 2024.

    Comments: Working paper: 6 pages, 2 figures

    MSC Class: 68T50 (Primary) 91F10; 91F20 (Secondary)

  3. arXiv:2409.09741  [pdf, other

    cs.CL cs.AI

    Benchmarking LLMs in Political Content Text-Annotation: Proof-of-Concept with Toxicity and Incivility Data

    Authors: Bastián González-Bustamante

    Abstract: This article benchmarked the ability of OpenAI's GPTs and a number of open-source LLMs to perform annotation tasks on political content. We used a novel protest event dataset comprising more than three million digital interactions and created a gold standard that includes ground-truth labels annotated by human coders about toxicity and incivility on social media. We included in our benchmark Googl… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

    Comments: Paper prepared for delivery at the 8th Monash-Warwick-Zurich Text-as-Data Workshop, September 16-17, 2024: 11 pages, 3 tables, 3 figures

    MSC Class: 68T50 (Primary) 91F10; 91F20 (Secondary)