Showing 1–2 of 2 results for author: Madrigal, D

  1. arXiv:2506.22716  [pdf, ps, other]

    cs.LG cs.AI cs.CL cs.DB

    BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute

    Authors: Dujian Ding, Ankur Mallick, Shaokun Zhang, Chi Wang, Daniel Madrigal, Mirian Del Carmen Hipolito Garcia, Menglin Xia, Laks V. S. Lakshmanan, Qingyun Wu, Victor Rühle

    Abstract: Large language models (LLMs) are powerful tools but are often expensive to deploy at scale. LLM query routing mitigates this by dynamically assigning queries to models of varying cost and quality to obtain a desired trade-off. Prior query routing approaches generate only one response from the selected model and a single response from a small (inexpensive) model was often not good enough to beat a…

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: Accepted to ICML 2025 (main conference)

  2. arXiv:2411.01643  [pdf, other]

    cs.AI cs.CL

    EcoAct: Economic Agent Determines When to Register What Action

    Authors: Shaokun Zhang, Jieyu Zhang, Dujian Ding, Mirian Hipolito Garcia, Ankur Mallick, Daniel Madrigal, Menglin Xia, Victor Rühle, Qingyun Wu, Chi Wang

    Abstract: Recent advancements have enabled Large Language Models (LLMs) to function as agents that can perform actions using external tools. This requires registering, i.e., integrating tool information into the LLM context prior to taking actions. Current methods indiscriminately incorporate all candidate tools into the agent's context and retain them across multiple reasoning steps. This process remains o…

    Submitted 3 November, 2024; originally announced November 2024.

    Comments: 16 pages, 10 figures