Skip to main content

Showing 1–3 of 3 results for author: Burtenshaw, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.02737  [pdf, other

    cs.CL

    SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

    Authors: Loubna Ben Allal, Anton Lozhkov, Elie Bakouch, Gabriel Martín Blázquez, Guilherme Penedo, Lewis Tunstall, Andrés Marafioti, Hynek Kydlíček, Agustín Piqueres Lajarín, Vaibhav Srivastav, Joshua Lochner, Caleb Fahlgren, Xuan-Son Nguyen, Clémentine Fourrier, Ben Burtenshaw, Hugo Larcher, Haojun Zhao, Cyril Zakka, Mathieu Morlon, Colin Raffel, Leandro von Werra, Thomas Wolf

    Abstract: While large language models have facilitated breakthroughs in many applications of artificial intelligence, their inherent largeness makes them computationally expensive and challenging to deploy in resource-constrained settings. In this paper, we document the development of SmolLM2, a state-of-the-art "small" (1.7 billion parameter) language model (LM). To attain strong performance, we overtrain… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

  2. arXiv:2408.16961  [pdf, other

    cs.HC cs.AI

    The Future of Open Human Feedback

    Authors: Shachar Don-Yehiya, Ben Burtenshaw, Ramon Fernandez Astudillo, Cailean Osborne, Mimansa Jaiswal, Tzu-Sheng Kuo, Wenting Zhao, Idan Shenfeld, Andi Peng, Mikhail Yurochkin, Atoosa Kasirzadeh, Yangsibo Huang, Tatsunori Hashimoto, Yacine Jernite, Daniel Vila-Suero, Omri Abend, Jennifer Ding, Sara Hooker, Hannah Rose Kirk, Leshem Choshen

    Abstract: Human feedback on conversations with language language models (LLMs) is central to how these systems learn about the world, improve their capabilities, and are steered toward desirable and safe behaviors. However, this feedback is mostly collected by frontier AI labs and kept behind closed doors. In this work, we bring together interdisciplinary experts to assess the opportunities and challenges t… ▽ More

    Submitted 4 September, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  3. arXiv:2011.04242  [pdf

    cs.CL cs.AI

    AI Stories: An Interactive Narrative System for Children

    Authors: Ben Burtenshaw

    Abstract: AI Stories is a proposed interactive dialogue system, that lets children co-create narrative worlds through conversation. Over the next three years this system will be developed and tested within pediatric wards, where it offers a useful resource between the gap of education and play. Telling and making stories is a fundamental part of language play, and its chatty and nonsensical qualities are im… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Originally submitted to the ICCC 2017 Doctoral Consortium [https://computationalcreativity.net/iccc2017/doctoralconsortium/]