Showing 1–7 of 7 results for author: Noriega-Atala, E

Searching in archive cs.
  1. arXiv:2502.09651  [pdf, other]

    cs.CL cs.CY

    AI-VERDE: A Gateway for Egalitarian Access to Large Language Model-Based Resources For Educational Institutions

    Authors: Paul Mithun, Enrique Noriega-Atala, Nirav Merchant, Edwin Skidmore

    Abstract: We present AI-VERDE, a unified LLM-as-a-platform service designed to facilitate seamless integration of commercial, cloud-hosted, and on-premise open LLMs in academic settings. AI-VERDE streamlines access management for instructional and research groups by providing features such as robust access control, privacy-preserving mechanisms, native Retrieval-Augmented Generation (RAG) support, budget ma…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 7 Pages, includes appendix. Submitted to NAACL System demonstrations track 2025

  2. arXiv:2411.14569  [pdf, other]

    cs.IR cs.LG

    Variable Extraction for Model Recovery in Scientific Literature

    Authors: Chunwei Liu, Enrique Noriega-Atala, Adarsh Pyarelal, Clayton T Morrison, Mike Cafarella

    Abstract: The global output of academic publications exceeds 5 million articles per year, making it difficult for humans to keep up with even a tiny fraction of scientific output. We need methods to navigate and interpret the artifacts -- texts, graphs, charts, code, models, and datasets -- that make up the literature. This paper evaluates various methods for extracting mathematical model variables from epi…

    Submitted 21 November, 2024; originally announced November 2024.

  3. arXiv:2410.07567  [pdf, other]

    cs.CL cs.AI

    When and Where Did it Happen? An Encoder-Decoder Model to Identify Scenario Context

    Authors: Enrique Noriega-Atala, Robert Vacareanu, Salena Torres Ashton, Adarsh Pyarelal, Clayton T. Morrison, Mihai Surdeanu

    Abstract: We introduce a neural architecture finetuned for the task of scenario context generation: the relevant location and time of an event or entity mentioned in text. Contextualizing information extraction helps to scope the validity of automated findings when aggregating them as knowledge graphs. Our approach uses a high-quality curated dataset of time and location annotations in a corpus of epidemiolo…

    Submitted 20 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: 9 pages, 7 figures

  4. arXiv:2205.15281  [pdf, other]

    cs.CL cs.AI

    Learning Open Domain Multi-hop Search Using Reinforcement Learning

    Authors: Enrique Noriega-Atala, Mihai Surdeanu, Clayton T. Morrison

    Abstract: We propose a method to teach an automated agent to learn how to search for multi-hop paths of relations between entities in an open domain. The method learns a policy for directing existing information retrieval and machine reading resources to focus on relevant regions of a corpus. The approach formulates the learning problem as a Markov decision process with a state representation that encodes t…

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at the Structured and Unstructured Knowledge Integration (SUKI) workshop, held at NAACL-HLT 2022

  5. arXiv:2112.09288  [pdf, other]

    cs.CL cs.AI

    Neural Architectures for Biological Inter-Sentence Relation Extraction

    Authors: Enrique Noriega-Atala, Peter M. Lovett, Clayton T. Morrison, Mihai Surdeanu

    Abstract: We introduce a family of deep-learning architectures for inter-sentence relation extraction, i.e., relations where the participants are not necessarily in the same sentence. We apply these architectures to an important use case in the biomedical domain: assigning biological context to biochemical events. In this work, biological context is defined as the type of biological system within which the…

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted at the Scientific Document Understanding workshop at AAAI'22

  6. arXiv:1812.06199  [pdf, other]

    cs.CL cs.LG stat.ML

    Inter-sentence Relation Extraction for Associating Biological Context with Events in Biomedical Texts

    Authors: Enrique Noriega-Atala, Paul D. Hein, Shraddha S. Thumsi, Zechy Wong, Xia Wang, Clayton T. Morrison

    Abstract: We present an analysis of the problem of identifying biological context and associating it with biochemical events in biomedical texts. This constitutes a non-trivial, inter-sentential relation extraction task. We focus on biological context as descriptions of the species, tissue type and cell type that are associated with biochemical events. We describe the properties of an annotated corpus of co…

    Submitted 14 December, 2018; originally announced December 2018.

  7. arXiv:1709.00149  [pdf, other]

    cs.AI cs.CL cs.IR cs.LG

    Learning what to read: Focused machine reading

    Authors: Enrique Noriega-Atala, Marco A. Valenzuela-Escarcega, Clayton T. Morrison, Mihai Surdeanu

    Abstract: Recent efforts in bioinformatics have achieved tremendous progress in the machine reading of biomedical literature, and the assembly of the extracted biochemical interactions into large-scale models such as protein signaling pathways. However, batch machine reading of literature at today's scale (PubMed alone indexes over 1 million papers per year) is unfeasible due to both cost and processing ove…

    Submitted 1 September, 2017; originally announced September 2017.

    Comments: 6 pages, 1 figure, 1 algorithm, 2 tables, accepted to EMNLP 2017

    ACM Class: H.3.3; I.2.6; I.2.7