Skip to main content

Showing 1–5 of 5 results for author: Ayala, O M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.24189  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Fine-Tune an SLM or Prompt an LLM? The Case of Generating Low-Code Workflows

    Authors: Orlando Marquez Ayala, Patrice Bechard, Emily Chen, Maggie Baird, Jingfei Chen

    Abstract: Large Language Models (LLMs) such as GPT-4o can handle a wide range of complex tasks with the right prompt. As per token costs are reduced, the advantages of fine-tuning Small Language Models (SLMs) for real-world applications -- faster inference, lower costs -- may no longer be clear. In this work, we present evidence that, for domain-specific tasks that require structured outputs, SLMs still hav… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2501.04652  [pdf, other

    cs.CL cs.IR cs.LG

    Multi-task retriever fine-tuning for domain-specific and efficient RAG

    Authors: Patrice Béchard, Orlando Marquez Ayala

    Abstract: Retrieval-Augmented Generation (RAG) has become ubiquitous when deploying Large Language Models (LLMs), as it can address typical limitations such as generating hallucinated or outdated information. However, when building real-world RAG applications, practical issues arise. First, the retrieved information is generally domain-specific. Since it is computationally expensive to fine-tune LLMs, it is… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: 9 pages, 2 figures. Submitted to NAACL 2025 Industry Track

  3. arXiv:2412.00239  [pdf, other

    cs.SE cs.AI

    Generating a Low-code Complete Workflow via Task Decomposition and RAG

    Authors: Orlando Marquez Ayala, Patrice Béchard

    Abstract: AI technologies are moving rapidly from research to production. With the popularity of Foundation Models (FMs) that generate text, images, and video, AI-based systems are increasing their complexity. Compared to traditional AI-based software, systems employing FMs, or GenAI-based systems, are more difficult to design due to their scale and versatility. This makes it necessary to document best prac… ▽ More

    Submitted 29 November, 2024; originally announced December 2024.

    Comments: Under review; 12 pages, 8 figures

  4. Reducing hallucination in structured outputs via Retrieval-Augmented Generation

    Authors: Patrice Béchard, Orlando Marquez Ayala

    Abstract: A common and fundamental limitation of Generative AI (GenAI) is its propensity to hallucinate. While large language models (LLM) have taken the world by storm, without eliminating or at least reducing hallucinations, real-world GenAI systems may face challenges in user adoption. In the process of deploying an enterprise application that produces workflows based on natural language requirements, we… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: To be presented at NAACL 2024. 11 pages and 4 figures

    Journal ref: 2024.naacl-industry.19

  5. arXiv:2212.08216  [pdf, other

    cs.LG cs.AI cs.CL cs.HC

    Azimuth: Systematic Error Analysis for Text Classification

    Authors: Gabrielle Gauthier-Melançon, Orlando Marquez Ayala, Lindsay Brin, Chris Tyler, Frédéric Branchaud-Charron, Joseph Marinier, Karine Grande, Di Le

    Abstract: We present Azimuth, an open-source and easy-to-use tool to perform error analysis for text classification. Compared to other stages of the ML development cycle, such as model training and hyper-parameter tuning, the process and tooling for the error analysis stage are less mature. However, this stage is critical for the development of reliable and trustworthy AI systems. To make error analysis mor… ▽ More

    Submitted 18 December, 2022; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: To be published in Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. 13 pages and 14 figures