Showing 1–7 of 7 results for author: Khizbullin, D

Searching in archive cs.
  1. arXiv:2503.08275

    cs.AI cs.CL

    Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language Models

    Authors: Ruibin Xiong, Yimeng Chen, Dmitrii Khizbullin, Mingchen Zhuge, Jürgen Schmidhuber

    Abstract: Long-form writing agents require flexible integration and interaction across information retrieval, reasoning, and composition. Current approaches rely on predetermined workflows and rigid thinking patterns to generate outlines before writing, resulting in constrained adaptability during writing. In this paper we propose a general agent framework that achieves human-like adaptive writing through r…

    Submitted 25 March, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

    Comments: 29 pages, 2 figures

  2. arXiv:2412.03624

    cs.AI cs.CL cs.LG cs.MA stat.ML

    How to Correctly do Semantic Backpropagation on Language-based Agentic Systems

    Authors: Wenyi Wang, Hisham A. Alyahya, Dylan R. Ashley, Oleg Serikov, Dmitrii Khizbullin, Francesco Faccio, Jürgen Schmidhuber

    Abstract: Language-based agentic systems have shown great promise in recent years, transitioning from solving small-scale research problems to being deployed in challenging real-world tasks. However, optimizing these systems often requires substantial manual labor. Recent studies have demonstrated that these systems can be represented as computational graphs, enabling automatic optimization. Despite these a…

    Submitted 4 December, 2024; originally announced December 2024.

    Comments: 11 pages in main text + 2 pages of references + 15 pages of appendices, 2 figures in main text + 17 figures in appendices, 2 tables in main text + 1 table in appendices, 2 algorithms in main text; source code available at https://github.com/HishamAlyahya/semantic_backprop

    MSC Class: 68T07 ACM Class: I.2.6; I.2.11

  3. arXiv:2411.06402

    cs.CL cs.AI

    Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models

    Authors: Sultan Alrashed, Dmitrii Khizbullin, David R. Pugh

    Abstract: As large language models (LLMs) grow and develop, so do their data demands. This is especially true for multilingual LLMs, where the scarcity of high-quality and readily available data online has led to a multitude of synthetic dataset generation approaches. A key technique in this space is machine translation (MT), where high-quality English text is adapted to a target, comparatively low-resource…

    Submitted 10 November, 2024; originally announced November 2024.

  4. arXiv:2410.10934

    cs.AI

    Agent-as-a-Judge: Evaluate Agents with Agents

    Authors: Mingchen Zhuge, Changsheng Zhao, Dylan Ashley, Wenyi Wang, Dmitrii Khizbullin, Yunyang Xiong, Zechun Liu, Ernie Chang, Raghuraman Krishnamoorthi, Yuandong Tian, Yangyang Shi, Vikas Chandra, Jürgen Schmidhuber

    Abstract: Contemporary evaluation techniques are inadequate for agentic systems. These approaches either focus exclusively on final outcomes -- ignoring the step-by-step nature of agentic systems, or require excessive manual labour. To address this, we introduce the Agent-as-a-Judge framework, wherein agentic systems are used to evaluate agentic systems. This is an organic extension of the LLM-as-a-Judge fr…

    Submitted 16 October, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: The project can be found at https://github.com/metauto-ai/agent-as-a-judge. The dataset is released at https://huggingface.co/DEVAI-benchmark

  5. arXiv:2405.16623

    cs.LG cs.AR cs.PF

    Graph neural networks with configuration cross-attention for tensor compilers

    Authors: Dmitrii Khizbullin, Eduardo Rocha de Andrade, Thanh Hau Nguyen, Matheus Pedroza Ferreira, David R. Pugh

    Abstract: With the recent popularity of neural networks comes the need for efficient serving of inference workloads. A neural network inference workload can be represented as a computational graph with nodes as operators transforming multidimensional tensors. The tensors can be transposed and/or tiled in a combinatorially large number of ways, some configurations leading to accelerated inference. We propose…

    Submitted 25 November, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  6. arXiv:2402.16823

    cs.AI cs.CL cs.LG cs.MA

    Language Agents as Optimizable Graphs

    Authors: Mingchen Zhuge, Wenyi Wang, Louis Kirsch, Francesco Faccio, Dmitrii Khizbullin, Jürgen Schmidhuber

    Abstract: Various human-designed prompt engineering techniques have been proposed to improve problem solvers based on Large Language Models (LLMs), yielding many disparate code bases. We unify these approaches by describing LLM-based agents as computational graphs. The nodes implement functions to process multimodal data or query LLMs, and the edges describe the information flow between operations. Graphs c…

    Submitted 22 August, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Project Website: https://gptswarm.org ; GitHub Repo: https://github.com/metauto-ai/gptswarm . In Forty-first International Conference on Machine Learning (2024)

  7. arXiv:2303.17760

    cs.AI cs.CL cs.CY cs.LG cs.MA

    CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society

    Authors: Guohao Li, Hasan Abed Al Kader Hammoud, Hani Itani, Dmitrii Khizbullin, Bernard Ghanem

    Abstract: The rapid advancement of chat-based language models has led to remarkable progress in complex task-solving. However, their success heavily relies on human input to guide the conversation, which can be challenging and time-consuming. This paper explores the potential of building scalable techniques to facilitate autonomous cooperation among communicative agents, and provides insight into their "cog…

    Submitted 2 November, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted at NeurIPS'2023, 77 pages, project website: https://www.camel-ai.org, github repository: https://github.com/camel-ai/camel