Skip to main content

Showing 1–8 of 8 results for author: Weitekamp, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.10422  [pdf, other

    cs.LG

    Decomposed Inductive Procedure Learning: Learning Academic Tasks with Human-Like Data Efficiency

    Authors: Daniel Weitekamp, Christopher MacLellan, Erik Harpstead, Kenneth Koedinger

    Abstract: Human learning relies on specialization -- distinct cognitive mechanisms working together to enable rapid learning. In contrast, most modern neural networks rely on a single mechanism: gradient descent over an objective function. This raises the question: might human learners' relatively rapid learning from just tens of examples instead of tens of thousands in data-driven deep learning arise from… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: To appear in CogSci 2025

  2. arXiv:2505.01563  [pdf, other

    cs.AI

    TutorGym: A Testbed for Evaluating AI Agents as Tutors and Students

    Authors: Daniel Weitekamp, Momin N. Siddiqui, Christopher J. MacLellan

    Abstract: Recent improvements in large language model (LLM) performance on academic benchmarks, such as MATH and GSM8K, have emboldened their use as standalone tutors and as simulations of human learning. However, these new applications require more than evaluations of final solution generation. We introduce TutorGym to evaluate these applications more directly. TutorGym is a standard interface for testing… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    ACM Class: I.2

  3. arXiv:2503.16460  [pdf, other

    cs.HC cs.AI

    Beyond Final Answers: Evaluating Large Language Models for Math Tutoring

    Authors: Adit Gupta, Jennifer Reddig, Tommaso Calo, Daniel Weitekamp, Christopher J. MacLellan

    Abstract: Researchers have made notable progress in applying Large Language Models (LLMs) to solve math problems, as demonstrated through efforts like GSM8k, ProofNet, AlphaGeometry, and MathOdyssey. This progress has sparked interest in their potential use for tutoring students in mathematics. However, the reliability of LLMs in tutoring contexts -- where correctness and instructional quality are crucial -… ▽ More

    Submitted 23 February, 2025; originally announced March 2025.

  4. arXiv:2411.17924  [pdf

    cs.HC cs.AI cs.LG

    AI2T: Building Trustable AI Tutors by Interactively Teaching a Self-Aware Learning Agent

    Authors: Daniel Weitekamp, Erik Harpstead, Kenneth Koedinger

    Abstract: AI2T is an interactively teachable AI for authoring intelligent tutoring systems (ITSs). Authors tutor AI2T by providing a few step-by-step solutions and then grading AI2T's own problem-solving attempts. From just 20-30 minutes of interactive training, AI2T can induce robust rules for step-by-step solution tracking (i.e., model-tracing). As AI2T learns it can accurately estimate its certainty of p… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

    ACM Class: I.2.6; I.2.2

  5. arXiv:2409.15631  [pdf, other

    cs.LG cs.AI

    Data Augmentation for Sparse Multidimensional Learning Performance Data Using Generative AI

    Authors: Liang Zhang, Jionghao Lin, John Sabatini, Conrad Borchers, Daniel Weitekamp, Meng Cao, John Hollander, Xiangen Hu, Arthur C. Graesser

    Abstract: Learning performance data describe correct and incorrect answers or problem-solving attempts in adaptive learning, such as in intelligent tutoring systems (ITSs). Learning performance data tend to be highly sparse (80\%\(\sim\)90\% missing observations) in most real-world applications due to adaptive item selection. This data sparsity presents challenges to using learner models to effectively pred… ▽ More

    Submitted 3 January, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

  6. arXiv:2409.07653  [pdf, other

    cs.LG

    STAND: Data-Efficient and Self-Aware Precondition Induction for Interactive Task Learning

    Authors: Daniel Weitekamp, Kenneth Koedinger

    Abstract: STAND is a data-efficient and computationally efficient machine learning approach that produces better classification accuracy than popular approaches like XGBoost on small-data tabular classification problems like learning rule preconditions from interactive training. STAND accounts for a complete set of good candidate generalizations instead of selecting a single generalization by breaking ties… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

  7. arXiv:2110.13233  [pdf, other

    cs.LG cs.AI

    Decomposed Inductive Procedure Learning

    Authors: Daniel Weitekamp, Christopher MacLellan, Erik Harpstead, Kenneth Koedinger

    Abstract: Recent advances in machine learning have made it possible to train artificially intelligent agents that perform with super-human accuracy on a great diversity of complex tasks. However, the process of training these capabilities often necessitates millions of annotated examples -- far more than humans typically need in order to achieve a passing level of mastery on similar tasks. Thus, while conte… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 38 pages, 7 figures, submitted to Journal of Artificial Intelligence

  8. arXiv:1807.00083  [pdf, other

    hep-ex cs.LG hep-ph physics.data-an

    Topology classification with deep learning to improve real-time event selection at the LHC

    Authors: Thong Q. Nguyen, Daniel Weitekamp III, Dustin Anderson, Roberto Castello, Olmo Cerri, Maurizio Pierini, Maria Spiropulu, Jean-Roch Vlimant

    Abstract: We show how event topology classification based on deep learning could be used to improve the purity of data samples selected in real time at at the Large Hadron Collider. We consider different data representations, on which different kinds of multi-class classifiers are trained. Both raw data and high-level features are utilized. In the considered examples, a filter based on the classifier's scor… ▽ More

    Submitted 2 September, 2019; v1 submitted 29 June, 2018; originally announced July 2018.

    Comments: This is a pre-print of an article published in Computing and Software for Big Science. The final authenticated version is available online at: https://doi.org/10.1007/s41781-019-0028-1

    Journal ref: Comput Softw Big Sci (2019) 3: 12