Skip to main content

Showing 1–1 of 1 results for author: Torhacs, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.08652  [pdf

    cs.CY cs.AI

    LegalScore: Development of a Benchmark for Evaluating AI Models in Legal Career Exams in Brazil

    Authors: Roberto Caparroz, Marcelo Roitman, Beatriz G. Chow, Caroline Giusti, Larissa Torhacs, Pedro A. Sola, João H. M. Diogo, Luiza Balby, Carolina D. L. Vasconcelos, Leonardo R. Caparroz, Albano P. Franco

    Abstract: This research introduces LegalScore, a specialized index for assessing how generative artificial intelligence models perform in a selected range of career exams that require a legal background in Brazil. The index evaluates fourteen different types of artificial intelligence models' performance, from proprietary to open-source models, in answering objective questions applied to these exams. The re… ▽ More

    Submitted 17 January, 2025; originally announced February 2025.

    Comments: Main article 25 pages, Appendices from page 26