Showing 1–4 of 4 results for author: Engel, C

Searching in archive cs.
  1. arXiv:2505.12864  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    LEXam: Benchmarking Legal Reasoning on 340 Law Exams

    Authors: Yu Fan, Jingwei Ni, Jakob Merane, Etienne Salimbeni, Yang Tian, Yoan Hermstrüwer, Yinya Huang, Mubashara Akhtar, Florian Geering, Oliver Dreyer, Daniel Brunner, Markus Leippold, Mrinmaya Sachan, Alexander Stremitzer, Christoph Engel, Elliott Ash, Joel Niklaus

    Abstract: Long-form legal reasoning remains a key challenge for large language models (LLMs) in spite of recent advances in test-time scaling. We introduce LEXam, a novel benchmark derived from 340 law exams spanning 116 law school courses across a range of subjects and degree levels. The dataset comprises 4,886 law exam questions in English and German, including 2,841 long-form, open-ended questions and 2,…

    Submitted 29 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    MSC Class: 68T50; ACM Class: I.2

  2. arXiv:2407.16615  [pdf, other]

    cs.CL cs.AI cs.LG

    Lawma: The Power of Specialization for Legal Annotation

    Authors: Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe, Stefan Bechtold, Christoph Engel, Jens Frankenreiter, Krishna Gummadi, Moritz Hardt, Michael Livermore

    Abstract: Annotation and classification of legal text are central components of empirical legal research. Traditionally, these tasks are often delegated to trained research assistants. Motivated by the advances in language modeling, empirical legal scholars are increasingly turning to prompting commercial models, hoping that it will alleviate the significant cost of human annotation. Despite growing use, ou…

    Submitted 23 April, 2025; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: ICLR 2025

  3. arXiv:2302.13340  [pdf]

    q-bio.OT cs.DL

    Standardizing Paediatric Clinical Data: The Development of the conect4children (c4c) Cross Cutting Paediatric Data Dictionary

    Authors: Anando Sen, Victoria Hedley, John Owen, Ronald Cornet, Dipak Kalra, Corinna Engel, Avril Palmeri, Joanne Lee, Jean-Christophe Roze, Joseph F Standing, Adilia Warris, Claudia Pansieri, Rebecca Leary, Mark Turner, Volker Straub

    Abstract: Standardization of data items collected in paediatric clinical trials is an important but challenging issue. The Clinical Data Interchange Standards Consortium (CDISC) data standards are well understood by the pharmaceutical industry but lack the implementation of some paediatric specific concepts. When a paediatric concept is absent within CDISC standards, companies and research institutions take…

    Submitted 26 February, 2023; originally announced February 2023.

    Journal ref: Journal of the Society of Clinical Data Management, Volume 2, Issue 3, 2023

  4. arXiv:2205.04738  [pdf, ps, other]

    cs.LG cs.AI

    AI training resources for GLAM: a snapshot

    Authors: Andrew Darby, Catherine Nicole Coleman, Claudia Engel, Daniel van Strien, Mike Trizna, Zachary W. Painter

    Abstract: We take a snapshot of current resources available for teaching and learning AI with a focus on the Galleries, Libraries, Archives and Museums (GLAM) community. The review was carried out in 2021 and 2022. The review provides an overview of material we identified as being relevant, offers a description of this material and makes recommendations for future work in this area.

    Submitted 10 May, 2022; originally announced May 2022.