Skip to main content

Showing 1–1 of 1 results for author: Clair, C S

.
  1. arXiv:2304.07687  [pdf, other

    cs.LG cs.CL cs.FL

    MLRegTest: A Benchmark for the Machine Learning of Regular Languages

    Authors: Sam van der Poel, Dakotah Lambert, Kalina Kostyszyn, Tiantian Gao, Rahul Verma, Derek Andersen, Joanne Chau, Emily Peterson, Cody St. Clair, Paul Fodor, Chihiro Shibata, Jeffrey Heinz

    Abstract: Synthetic datasets constructed from formal languages allow fine-grained examination of the learning and generalization capabilities of machine learning systems for sequence classification. This article presents a new benchmark for machine learning systems on sequence classification called MLRegTest, which contains training, development, and test sets from 1,800 regular languages. Different kinds o… ▽ More

    Submitted 1 September, 2024; v1 submitted 15 April, 2023; originally announced April 2023.

    Comments: Accepted for publication in the Journal of Machine Learning Research. Dataset available at https://doi.org/10.5061/dryad.dncjsxm4h , code available at https://github.com/heinz-jeffrey/subregular-learning