Skip to main content

Showing 1–8 of 8 results for author: Rooein, D

.
  1. arXiv:2506.08702  [pdf, ps, other

    cs.ET

    Educators' Perceptions of Large Language Models as Tutors: Comparing Human and AI Tutors in a Blind Text-only Setting

    Authors: Sankalan Pal Chowdhury, Terry Jingchen Zhang, Donya Rooein, Dirk Hovy, Tanja Käser, Mrinmaya Sachan

    Abstract: The rapid development of Large Language Models (LLMs) opens up the possibility of using them as personal tutors. This has led to the development of several intelligent tutoring systems and learning assistants that use LLMs as back-ends with various degrees of engineering. In this study, we seek to compare human tutors with LLM tutors in terms of engagement, empathy, scaffolding, and conciseness. W… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: Accepted to BEA@ACL 2025

  2. arXiv:2504.17720  [pdf, other

    cs.CL cs.AI

    Multilingual Performance Biases of Large Language Models in Education

    Authors: Vansh Gupta, Sankalan Pal Chowdhury, Vilém Zouhar, Donya Rooein, Mrinmaya Sachan

    Abstract: Large language models (LLMs) are increasingly being adopted in educational settings. These applications expand beyond English, though current LLMs remain primarily English-centric. In this work, we ascertain if their use in education settings in non-English languages is warranted. We evaluated the performance of popular LLMs on four educational tasks: identifying student misconceptions, providing… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  3. arXiv:2501.10057  [pdf, other

    cs.CL

    MSTS: A Multimodal Safety Test Suite for Vision-Language Models

    Authors: Paul Röttger, Giuseppe Attanasio, Felix Friedrich, Janis Goldzycher, Alicia Parrish, Rishabh Bhardwaj, Chiara Di Bonaventura, Roman Eng, Gaia El Khoury Geagea, Sujata Goswami, Jieun Han, Dirk Hovy, Seogyeong Jeong, Paloma Jeretič, Flor Miriam Plaza-del-Arco, Donya Rooein, Patrick Schramowski, Anastassia Shaitarova, Xudong Shen, Richard Willats, Andrea Zugarini, Bertie Vidgen

    Abstract: Vision-language models (VLMs), which process image and text inputs, are increasingly integrated into chat assistants and other consumer AI applications. Without proper safeguards, however, VLMs may give harmful advice (e.g. how to self-harm) or encourage unsafe behaviours (e.g. to consume drugs). Despite these clear hazards, little work so far has evaluated VLM safety and the novel risks created b… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: under review

  4. arXiv:2406.09123  [pdf

    cs.SI

    Can I introduce my boyfriend to my grandmother? Evaluating Large Language Models Capabilities on Iranian Social Norm Classification

    Authors: Hamidreza Saffari, Mohammadamin Shafiei, Donya Rooein, Francesco Pierri, Debora Nozza

    Abstract: Creating globally inclusive AI systems demands datasets reflecting diverse social norms. Iran, with its unique cultural blend, offers an ideal case study, with Farsi adding linguistic complexity. In this work, we introduce the Iranian Social Norms (ISN) dataset, a novel collection of 1,699 Iranian social norms, including environments, demographic features, and scope annotation, alongside English t… ▽ More

    Submitted 18 March, 2025; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 15 pages, 1 figure, 9 tables

  5. arXiv:2405.09482  [pdf, other

    cs.CL

    Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts

    Authors: Donya Rooein, Paul Rottger, Anastassia Shaitarova, Dirk Hovy

    Abstract: Using large language models (LLMs) for educational applications like dialogue-based teaching is a hot topic. Effective teaching, however, requires teachers to adapt the difficulty of content and explanations to the education level of their students. Even the best LLMs today struggle to do this well. If we want to improve LLMs on this adaptation task, we need to be able to measure adaptation succes… ▽ More

    Submitted 6 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  6. arXiv:2404.10475  [pdf, other

    cs.CL

    Conversations as a Source for Teaching Scientific Concepts at Different Education Levels

    Authors: Donya Rooein, Dirk Hovy

    Abstract: Open conversations are one of the most engaging forms of teaching. However, creating those conversations in educational software is a complex endeavor, especially if we want to address the needs of different audiences. While language models hold great promise for educational applications, there are substantial challenges in training them to engage in meaningful and effective conversational teachin… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  7. arXiv:2312.02065  [pdf, other

    cs.CL cs.AI

    Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?

    Authors: Donya Rooein, Amanda Cercas Curry, Dirk Hovy

    Abstract: Large language models (LLMs) offer a range of new possibilities, including adapting the text to different audiences and their reading needs. But how well do they adapt? We evaluate the readability of answers generated by four state-of-the-art LLMs (commercial and open-source) to science questions when prompted to target different age groups and education levels. To assess the adaptability of LLMs… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  8. arXiv:2010.03021  [pdf, other

    cs.CY cs.SI

    Image-based Social Sensing: Combining AI and the Crowd to Mine Policy-Adherence Indicators from Twitter

    Authors: Virginia Negri, Dario Scuratti, Stefano Agresti, Donya Rooein, Gabriele Scalia, Amudha Ravi Shankar, Jose Luis Fernandez Marquez, Mark James Carman, Barbara Pernici

    Abstract: Social Media provides a trove of information that, if aggregated and analysed appropriately can provide important statistical indicators to policy makers. In some situations these indicators are not available through other mechanisms. For example, given the ongoing COVID-19 outbreak, it is essential for governments to have access to reliable data on policy-adherence with regards to mask wearing, s… ▽ More

    Submitted 5 March, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 10 pages, 9 figures, to be published in Proceedings of ICSE Software Engineering in Society, May 2021