Skip to main content

Showing 1–2 of 2 results for author: Culnan, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.14272  [pdf, other

    cs.CL cs.HC cs.LG

    Collecting high-quality adversarial data for machine reading comprehension tasks with humans and models in the loop

    Authors: Damian Y. Romero Diaz, Magdalena AnioĊ‚, John Culnan

    Abstract: We present our experience as annotators in the creation of high-quality, adversarial machine-reading-comprehension data for extractive QA for Task 1 of the First Workshop on Dynamic Adversarial Data Collection (DADC). DADC is an emergent data collection paradigm with both models and humans in the loop. We set up a quasi-experimental annotation design and perform quantitative analyses across groups… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 8 pages, 3 figures, for more information about the shared task please go to https://dadcworkshop.github.io/

  2. arXiv:1911.10436  [pdf, other

    cs.CL

    ScienceExamCER: A High-Density Fine-Grained Science-Domain Corpus for Common Entity Recognition

    Authors: Hannah Smith, Zeyu Zhang, John Culnan, Peter Jansen

    Abstract: Named entity recognition identifies common classes of entities in text, but these entity labels are generally sparse, limiting utility to downstream tasks. In this work we present ScienceExamCER, a densely-labeled semantic classification corpus of 133k mentions in the science exam domain where nearly all (96%) of content words have been annotated with one or more fine-grained semantic class labels… ▽ More

    Submitted 23 November, 2019; originally announced November 2019.