Skip to main content

Showing 1–15 of 15 results for author: Brusilovsky, P

.
  1. arXiv:2502.18632  [pdf, other

    cs.AI cs.CL cs.CY cs.LG cs.SE

    Automated Knowledge Component Generation and Knowledge Tracing for Coding Problems

    Authors: Zhangqi Duan, Nigel Fernandez, Arun Balajiee Lekshmi Narayanan, Mohammad Hassany, Rafaella Sampaio de Alencar, Peter Brusilovsky, Bita Akram, Andrew Lan

    Abstract: Knowledge components (KCs) mapped to problems help model student learning, tracking their mastery levels on fine-grained skills thereby facilitating personalized learning and feedback in online learning platforms. However, crafting and tagging KCs to problems, traditionally performed by human domain experts, is highly labor-intensive. We present a fully automated, LLM-based pipeline for KC generat… ▽ More

    Submitted 23 May, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

  2. arXiv:2410.10756  [pdf, other

    cs.CL

    Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification

    Authors: Jan Cegin, Branislav Pecher, Jakub Simko, Ivan Srba, Maria Bielikova, Peter Brusilovsky

    Abstract: The generative large language models (LLMs) are increasingly used for data augmentation tasks, where text samples are paraphrased (or generated anew) and then used for classifier fine-tuning. Existing works on augmentation leverage the few-shot scenarios, where samples are given to LLMs as part of prompts, leading to better augmentations. Yet, the samples are mostly selected randomly and a compreh… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  3. arXiv:2408.16502  [pdf, other

    cs.CL

    LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?

    Authors: Jan Cegin, Jakub Simko, Peter Brusilovsky

    Abstract: The generative large language models (LLMs) are increasingly being used for data augmentation tasks, where text samples are LLM-paraphrased and then used for classifier fine-tuning. However, a research that would confirm a clear cost-benefit advantage of LLMs over more established augmentation methods is largely missing. To study if (and when) is the LLM-based augmentation advantageous, we compare… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 20 pages

  4. arXiv:2403.05538  [pdf, other

    cs.CY cs.HC cs.SE

    Explaining Code Examples in Introductory Programming Courses: LLM vs Humans

    Authors: Arun-Balajiee Lekshmi-Narayanan, Priti Oli, Jeevan Chapagain, Mohammad Hassany, Rabin Banjade, Peter Brusilovsky, Vasile Rus

    Abstract: Worked examples, which present an explained code for solving typical programming problems are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarely have time to provide explanations for many examples typically used in a progr… ▽ More

    Submitted 11 March, 2024; v1 submitted 8 December, 2023; originally announced March 2024.

    Comments: 3 tables; 1 figure

  5. arXiv:2402.16235  [pdf, other

    cs.HC cs.AI

    Human-AI Co-Creation of Worked Examples for Programming Classes

    Authors: Mohammad Hassany, Peter Brusilovsky, Jiaze Ke, Kamil Akhuseyinoglu, Arun Balajiee Lekshmi Narayanan

    Abstract: Worked examples (solutions to typical programming problems presented as a source code in a certain language and are used to explain the topics from a programming class) are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarel… ▽ More

    Submitted 29 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.02105

  6. Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation

    Authors: Jan Cegin, Branislav Pecher, Jakub Simko, Ivan Srba, Maria Bielikova, Peter Brusilovsky

    Abstract: The latest generative large language models (LLMs) have found their application in data augmentation tasks, where small numbers of text samples are LLM-paraphrased and then used to fine-tune downstream models. However, more research is needed to assess how different prompts, seed data selection strategies, filtering methods, or model settings affect the quality of paraphrased data (and downstream… ▽ More

    Submitted 18 August, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: ACL'24 version, 24 pages

    Journal ref: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024

  7. arXiv:2312.02105  [pdf, other

    cs.HC cs.AI

    Authoring Worked Examples for Java Programming with Human-AI Collaboration

    Authors: Mohammad Hassany, Peter Brusilovsky, Jiaze Ke, Kamil Akhuseyinoglu, Arun Balajiee Lekshmi Narayanan

    Abstract: Worked examples (solutions to typical programming problems presented as a source code in a certain language and are used to explain the topics from a programming class) are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarel… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 2 papers both same content

  8. arXiv:2305.12947  [pdf, other

    cs.CL

    ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness

    Authors: Jan Cegin, Jakub Simko, Peter Brusilovsky

    Abstract: The emergence of generative large language models (LLMs) raises the question: what will be its impact on crowdsourcing? Traditionally, crowdsourcing has been used for acquiring solutions to a wide variety of human-intelligence tasks, including ones involving text generation, modification or evaluation. For some of these tasks, models like ChatGPT can potentially substitute human workers. In this s… ▽ More

    Submitted 19 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Long paper accepted to EMNLP 2023 conference main track, 17 pages, 9 figures

  9. arXiv:2210.09013  [pdf, other

    cs.CY cs.AI cs.LG

    Knowledge Tracing for Complex Problem Solving: Granular Rank-Based Tensor Factorization

    Authors: Chunpai Wang, Shaghayegh Sahebi, Siqian Zhao, Peter Brusilovsky, Laura O. Moraes

    Abstract: Knowledge Tracing (KT), which aims to model student knowledge level and predict their performance, is one of the most important applications of user modeling. Modern KT approaches model and maintain an up-to-date state of student knowledge over a set of course concepts according to students' historical performance in attempting the problems. However, KT approaches were designed to model knowledge… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted by UMAP-2021

  10. arXiv:2209.13426  [pdf, other

    cs.IR cs.HC cs.IT

    From Ranked Lists to Carousels: A Carousel Click Model

    Authors: Behnam Rahdari, Branislav Kveton, Peter Brusilovsky

    Abstract: Carousel-based recommendation interfaces allow users to explore recommended items in a structured, efficient, and visually-appealing way. This made them a de-facto standard approach to recommending items to end users in many real-life recommenders. In this work, we try to explain the efficiency of carousel recommenders using a \emph{carousel click model}, a generative model of user interaction wit… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  11. arXiv:2005.11422  [pdf, other

    cs.IR

    Concept Annotation for Intelligent Textbooks

    Authors: Mengdi Wang, Hung Chau, Khushboo Thaker, Peter Brusilovsky, Daqing He

    Abstract: With the increased popularity of electronic textbooks, there is a growing interests in developing a new generation of "intelligent textbooks", which have the ability to guide the readers according to their learning goals and current knowledge. The intelligent textbooks extend regular textbooks by integrating machine-manipulatable knowledge such as a knowledge map or a prerequisite-outcome relation… ▽ More

    Submitted 9 June, 2020; v1 submitted 22 May, 2020; originally announced May 2020.

  12. arXiv:1909.03590  [pdf, ps, other

    cs.CL

    Does Order Matter? An Empirical Study on Generating Multiple Keyphrases as a Sequence

    Authors: Rui Meng, Xingdi Yuan, Tong Wang, Peter Brusilovsky, Adam Trischler, Daqing He

    Abstract: Recently, concatenating multiple keyphrases as a target sequence has been proposed as a new learning paradigm for keyphrase generation. Existing studies concatenate target keyphrases in different orders but no study has examined the effects of ordering on models' behavior. In this paper, we propose several orderings for concatenation and inspect the important factors for training a successful keyp… ▽ More

    Submitted 28 February, 2022; v1 submitted 8 September, 2019; originally announced September 2019.

  13. arXiv:1904.02634  [pdf

    cs.SI

    Sequence Analysis of Learning Behavior in Different Consecutive Activities

    Authors: Abdulelah Abuabat, Peter Brusilovsky

    Abstract: The purpose of this research is to study the possibility of identifying students, statistically, by analyzing their behavior in different consecutive activities. In this project, there are three different sorts of activities: animated example, basic example, and parameterized exercises. We extracted the behavior of each student from the log activities of the Mastery Grids platform. Additionally, w… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: The Association for the Advancement of Computing in Education, E-Learn 2018

  14. arXiv:1810.05241  [pdf, other

    cs.CL cs.LG

    One Size Does Not Fit All: Generating and Evaluating Variable Number of Keyphrases

    Authors: Xingdi Yuan, Tong Wang, Rui Meng, Khushboo Thaker, Peter Brusilovsky, Daqing He, Adam Trischler

    Abstract: Different texts shall by nature correspond to different number of keyphrases. This desideratum is largely missing from existing neural keyphrase generation models. In this study, we address this problem from both modeling and evaluation perspectives. We first propose a recurrent generative model that generates multiple keyphrases as delimiter-separated sequences. Generation diversity is further… ▽ More

    Submitted 12 May, 2020; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: ACL 2020

  15. arXiv:1704.06879  [pdf, other

    cs.CL

    Deep Keyphrase Generation

    Authors: Rui Meng, Sanqiang Zhao, Shuguang Han, Daqing He, Peter Brusilovsky, Yu Chi

    Abstract: Keyphrase provides highly-condensed information that can be effectively used for understanding, organizing and retrieving text content. Though previous studies have provided many workable solutions for automated keyphrase extraction, they commonly divided the to-be-summarized content into multiple text chunks, then ranked and selected the most meaningful ones. These approaches could neither identi… ▽ More

    Submitted 31 May, 2021; v1 submitted 23 April, 2017; originally announced April 2017.

    Comments: Accepted by ACL2017