BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

Liu, Zhiwei; Yao, Weiran; Zhang, Jianguo; Xue, Le; Heinecke, Shelby; Murthy, Rithesh; Feng, Yihao; Chen, Zeyuan; Niebles, Juan Carlos; Arpit, Devansh; Xu, Ran; Mui, Phil; Wang, Huan; Xiong, Caiming; Savarese, Silvio

arXiv:2308.05960 (cs)

[Submitted on 11 Aug 2023]

Abstract: The success of large language models (LLMs) has encouraged the exploration of LLM-augmented Autonomous Agents (LAAs). An LAA generates actions with its core LLM and interacts with environments, which lets it resolve complex tasks by conditioning on past interactions such as observations and actions. Because the investigation of LAAs is still very recent, few explorations are available. We therefore provide a comprehensive comparison of LAAs in terms of both agent architectures and LLM backbones. Additionally, we propose BOLAA, a new strategy for orchestrating multiple LAAs in which each labor LAA focuses on one type of action and a controller manages the communication among the agents. We conduct simulations on both decision-making and multi-step reasoning environments, which comprehensively demonstrate the capacity of LAAs. Our performance results provide quantitative suggestions for designing LAA architectures, for choosing LLMs, and for the compatibility of the two. We release our implementation code of LAAs to the public at this https URL.
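The orchestration idea in the abstract (a controller that routes each step to a labor agent specialised in one type of action) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the class names `LaborAgent` and `Controller`, the `select` routing function, and the `action_type[argument]` output format are all hypothetical, and plain Python callables stand in for the LLM calls.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# One (observation, action) pair per past interaction.
History = List[Tuple[str, str]]


@dataclass
class LaborAgent:
    """Hypothetical labor agent that only ever emits one type of action."""
    action_type: str
    # Stand-in for an LLM call: maps (observation, history) to an action argument.
    policy: Callable[[str, History], str]

    def act(self, observation: str, history: History) -> str:
        argument = self.policy(observation, history)
        return f"{self.action_type}[{argument}]"


@dataclass
class Controller:
    """Hypothetical controller that picks a labor agent for each step."""
    agents: Dict[str, LaborAgent]
    # Assumed routing rule: maps an observation to an action type.
    select: Callable[[str], str]
    history: History = field(default_factory=list)

    def step(self, observation: str) -> str:
        agent = self.agents[self.select(observation)]
        # The chosen agent conditions on the shared interaction history.
        action = agent.act(observation, self.history)
        self.history.append((observation, action))
        return action
```

In use, one would build a `Controller` from a dictionary of agents keyed by action type (for example, illustrative "search" and "click" agents) and call `step` once per environment observation. Keeping the history on the controller is one possible design choice; it lets every labor agent see the full interaction so far regardless of which agent acted previously.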

Comments:	Preprint
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.05960 [cs.AI]
	(or arXiv:2308.05960v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2308.05960

Submission history

From: Zhiwei Liu
[v1] Fri, 11 Aug 2023 06:37:54 UTC (273 KB)
