Skip to main content

Showing 1–1 of 1 results for author: Leitner, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2003.13016  [pdf, ps, other

    cs.CL cs.IR

    A Dataset of German Legal Documents for Named Entity Recognition

    Authors: Elena Leitner, Georg Rehm, Julián Moreno-Schneider

    Abstract: We describe a dataset developed for Named Entity Recognition in German federal court decisions. It consists of approx. 67,000 sentences with over 2 million tokens. The resource contains 54,000 manually annotated entities, mapped to 19 fine-grained semantic classes: person, judge, lawyer, country, city, street, landscape, organization, company, institution, court, brand, law, ordinance, European le… ▽ More

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). To appear