Enhancing Pipeline-Based Conversational Agents with Large Language Models
Authors:
Mina Foosherian,
Hendrik Purwins,
Purna Rathnayake,
Touhidul Alam,
Rui Teimao,
Klaus-Dieter Thoben
Abstract:
The latest advancements in AI and deep learning have led to a breakthrough in large language model (LLM)-based agents such as GPT-4. However, many commercial conversational agent development tools are pipeline-based and have limitations in holding a human-like conversation. This paper investigates the capabilities of LLMs to enhance pipeline-based conversational agents during two phases: 1) in the…
▽ More
The latest advancements in AI and deep learning have led to a breakthrough in large language model (LLM)-based agents such as GPT-4. However, many commercial conversational agent development tools are pipeline-based and have limitations in holding a human-like conversation. This paper investigates the capabilities of LLMs to enhance pipeline-based conversational agents during two phases: 1) in the design and development phase and 2) during operations. In 1) LLMs can aid in generating training data, extracting entities and synonyms, localization, and persona design. In 2) LLMs can assist in contextualization, intent classification to prevent conversational breakdown and handle out-of-scope questions, auto-correcting utterances, rephrasing responses, formulating disambiguation questions, summarization, and enabling closed question-answering capabilities. We conducted informal experiments with GPT-4 in the private banking domain to demonstrate the scenarios above with a practical example. Companies may be hesitant to replace their pipeline-based agents with LLMs entirely due to privacy concerns and the need for deep integration within their existing ecosystems. A hybrid approach in which LLMs' are integrated into the pipeline-based agents allows them to save time and costs of building and running agents by capitalizing on the capabilities of LLMs while retaining the integration and privacy safeguards of their existing systems.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
Experimental Research Data Quality In Materials Science
Authors:
Thorsten Wuest,
Rainer Tinscher,
Robert Porzel,
Klaus-Dieter Thoben
Abstract:
In materials sciences, a large amount of research data is generated through a broad spectrum of different experiments. As of today, experimental research data including meta-data in materials science is often stored decentralized by the researcher(s) conducting the experiments without generally accepted standards on what and how to store data. The conducted research and experiments often involve a…
▽ More
In materials sciences, a large amount of research data is generated through a broad spectrum of different experiments. As of today, experimental research data including meta-data in materials science is often stored decentralized by the researcher(s) conducting the experiments without generally accepted standards on what and how to store data. The conducted research and experiments often involve a considerable investment from public funding agencies that desire the results to be made available in order to increase their impact. In order to achieve the goal of citable and (openly) accessible materials science experimental research data in the future, not only an adequate infrastructure needs to be established but the question of how to measure the quality of the experimental research data also to be addressed. In this publication, the authors identify requirements and challenges towards a systematic methodology to measure experimental research data quality prior to publication and derive different approaches on that basis. These methods are critically discussed and assessed by their contribution and limitations towards the set goals. Concluding, a combination of selected methods is presented as a systematic, functional and practical quality measurement and assurance approach for experimental research data in materials science with the goal of supporting the accessibility and dissemination of existing data sets.
△ Less
Submitted 6 January, 2015;
originally announced January 2015.