Skip to main content

Showing 1–17 of 17 results for author: Takanobu, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.08046  [pdf, other

    cs.CV

    Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

    Authors: Peng Jin, Ryuichi Takanobu, Wancai Zhang, Xiaochun Cao, Li Yuan

    Abstract: Large language models have demonstrated impressive universal capabilities across a wide range of open-ended tasks and have extended their utility to encompass multimodal conversations. However, existing methods encounter challenges in effectively handling both image and video understanding, particularly with limited visual tokens. In this work, we introduce Chat-UniVi, a Unified Vision-language mo… ▽ More

    Submitted 5 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR 2024 (Highlight)

  2. arXiv:2208.07597  [pdf, other

    cs.CL cs.AI

    Manual-Guided Dialogue for Flexible Conversational Agents

    Authors: Ryuichi Takanobu, Hao Zhou, Yankai Lin, Peng Li, Jie Zhou, Minlie Huang

    Abstract: How to build and use dialogue data efficiently, and how to deploy models in different domains at scale can be two critical issues in building a task-oriented dialogue system. In this paper, we propose a novel manual-guided dialogue scheme to alleviate these problems, where the agent learns the tasks from both dialogue and manuals. The manual is an unstructured textual document that guides the agen… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  3. arXiv:2106.11796  [pdf, other

    cs.CL

    End-to-End Task-Oriented Dialog Modeling with Semi-Structured Knowledge Management

    Authors: Silin Gao, Ryuichi Takanobu, Antoine Bosselut, Minlie Huang

    Abstract: Current task-oriented dialog (TOD) systems mostly manage structured knowledge (e.g. databases and tables) to guide the goal-oriented conversations. However, they fall short of handling dialogs which also involve unstructured knowledge (e.g. reviews and documents). In this paper, we formulate a task of modeling TOD grounded on a fusion of structured and unstructured knowledge. To address this task,… ▽ More

    Submitted 1 February, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: IEEE/ACM TASLP, regular paper. arXiv admin note: text overlap with arXiv:2105.06041

  4. arXiv:2105.06041  [pdf, other

    cs.CL

    HyKnow: End-to-End Task-Oriented Dialog Modeling with Hybrid Knowledge Management

    Authors: Silin Gao, Ryuichi Takanobu, Wei Peng, Qun Liu, Minlie Huang

    Abstract: Task-oriented dialog (TOD) systems typically manage structured knowledge (e.g. ontologies and databases) to guide the goal-oriented conversations. However, they fall short of handling dialog turns grounded on unstructured knowledge (e.g. reviews and documents). In this paper, we formulate a task of modeling TOD grounded on both structured and unstructured knowledge. To address this task, we propos… ▽ More

    Submitted 2 June, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: Findings of ACL-IJCNLP 2021, long paper

  5. arXiv:2012.15262  [pdf, other

    cs.CL cs.AI

    Robustness Testing of Language Understanding in Task-Oriented Dialog

    Authors: Jiexi Liu, Ryuichi Takanobu, Jiaxin Wen, Dazhen Wan, Hongguang Li, Weiran Nie, Cheng Li, Wei Peng, Minlie Huang

    Abstract: Most language understanding models in task-oriented dialog systems are trained on a small amount of annotated training data, and evaluated in a small set from the same distribution. However, these models can lead to system failure or undesirable output when being exposed to natural language perturbation or variation in practice. In this paper, we conduct comprehensive evaluation and analysis with… ▽ More

    Submitted 4 June, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: ACL 2021 long paper

  6. arXiv:2012.15022  [pdf, other

    cs.CL cs.AI

    ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning

    Authors: Yujia Qin, Yankai Lin, Ryuichi Takanobu, Zhiyuan Liu, Peng Li, Heng Ji, Minlie Huang, Maosong Sun, Jie Zhou

    Abstract: Pre-trained Language Models (PLMs) have shown superior performance on various downstream Natural Language Processing (NLP) tasks. However, conventional pre-training objectives do not explicitly model relational facts in text, which are crucial for textual understanding. To address this issue, we propose a novel contrastive learning framework ERICA to obtain a deep understanding of the entities and… ▽ More

    Submitted 26 May, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: Accepted by ACL-IJCNLP 2021 main conference

  7. arXiv:2010.10333  [pdf, other

    cs.CL cs.IR

    CR-Walker: Tree-Structured Graph Reasoning and Dialog Acts for Conversational Recommendation

    Authors: Wenchang Ma, Ryuichi Takanobu, Minlie Huang

    Abstract: Growing interests have been attracted in Conversational Recommender Systems (CRS), which explore user preference through conversational interactions in order to make appropriate recommendation. However, there is still a lack of ability in existing CRS to (1) traverse multiple reasoning paths over background knowledge to introduce relevant items and attributes, and (2) arrange selected entities app… ▽ More

    Submitted 3 September, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: EMNLP 2021 long paper

  8. arXiv:2010.05594  [pdf, other

    cs.CL

    MultiWOZ 2.3: A multi-domain task-oriented dialogue dataset enhanced with annotation corrections and co-reference annotation

    Authors: Ting Han, Ximing Liu, Ryuichi Takanobu, Yixin Lian, Chongxuan Huang, Dazhen Wan, Wei Peng, Minlie Huang

    Abstract: Task-oriented dialogue systems have made unprecedented progress with multiple state-of-the-art (SOTA) models underpinned by a number of publicly available MultiWOZ datasets. Dialogue state annotations are error-prone, leading to sub-optimal performance. Various efforts have been put in rectifying the annotation errors presented in the original MultiWOZ dataset. In this paper, we introduce MultiWOZ… ▽ More

    Submitted 14 June, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

  9. arXiv:2005.07362  [pdf, other

    cs.CL cs.AI

    Is Your Goal-Oriented Dialog Model Performing Really Well? Empirical Analysis of System-wise Evaluation

    Authors: Ryuichi Takanobu, Qi Zhu, Jinchao Li, Baolin Peng, Jianfeng Gao, Minlie Huang

    Abstract: There is a growing interest in developing goal-oriented dialog systems which serve users in accomplishing complex tasks through multi-turn conversations. Although many methods are devised to evaluate and improve the performance of individual dialog components, there is a lack of comprehensive empirical study on how different components contribute to the overall performance of a dialog system. In t… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

    Comments: SIGDIAL 2020 long paper

  10. arXiv:2004.03809  [pdf, other

    cs.CL cs.LG

    Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

    Authors: Ryuichi Takanobu, Runze Liang, Minlie Huang

    Abstract: Many studies have applied reinforcement learning to train a dialog policy and show great promise these years. One common approach is to employ a user simulator to obtain a large number of simulated user experiences for reinforcement learning algorithms. However, modeling a realistic user simulator is challenging. A rule-based simulator requires heavy domain expertise for complex tasks, and a data-… ▽ More

    Submitted 22 April, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: ACL 2020 long paper

  11. arXiv:2003.07490  [pdf, other

    cs.CL

    Recent Advances and Challenges in Task-oriented Dialog System

    Authors: Zheng Zhang, Ryuichi Takanobu, Qi Zhu, Minlie Huang, Xiaoyan Zhu

    Abstract: Due to the significance and value in human-computer interaction and natural language processing, task-oriented dialog systems are attracting more and more attention in both academic and industrial communities. In this paper, we survey recent advances and challenges in task-oriented dialog systems. We also discuss three critical topics for task-oriented dialog systems: (1) improving data efficiency… ▽ More

    Submitted 23 June, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

    Comments: Under review of SCIENCE CHINA Technological Science (SCTS)

  12. arXiv:2002.04793  [pdf, other

    cs.CL cs.AI

    ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

    Authors: Qi Zhu, Zheng Zhang, Yan Fang, Xiang Li, Ryuichi Takanobu, Jinchao Li, Baolin Peng, Jianfeng Gao, Xiaoyan Zhu, Minlie Huang

    Abstract: We present ConvLab-2, an open-source toolkit that enables researchers to build task-oriented dialogue systems with state-of-the-art models, perform an end-to-end evaluation, and diagnose the weakness of systems. As the successor of ConvLab (Lee et al., 2019b), ConvLab-2 inherits ConvLab's framework but integrates more powerful dialogue models and supports more datasets. Besides, we have developed… ▽ More

    Submitted 29 April, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: Accepted by ACL 2020 demo track

  13. arXiv:1908.10719  [pdf, other

    cs.CL cs.LG

    Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog

    Authors: Ryuichi Takanobu, Hanlin Zhu, Minlie Huang

    Abstract: Dialog policy decides what and how a task-oriented dialog system will respond, and plays a vital role in delivering effective conversations. Many studies apply Reinforcement Learning to learn a dialog policy with the reward function which requires elaborate design and pre-specified user goals. With the growing needs to handle complex goals across multiple domains, such manually designed reward fun… ▽ More

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: EMNLP 2019 long paper

  14. arXiv:1907.00710  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Deep Conversational Recommender in Travel

    Authors: Lizi Liao, Ryuichi Takanobu, Yunshan Ma, Xun Yang, Minlie Huang, Tat-Seng Chua

    Abstract: When traveling to a foreign country, we are often in dire need of an intelligent conversational agent to provide instant and informative responses to our various queries. However, to build such a travel agent is non-trivial. First of all, travel naturally involves several sub-tasks such as hotel reservation, restaurant recommendation and taxi booking etc, which invokes the need for global topic co… ▽ More

    Submitted 25 June, 2019; originally announced July 2019.

    Comments: 12 pages, 7 figures, submitted to TKDE. arXiv admin note: text overlap with arXiv:1809.07070 by other authors

  15. arXiv:1904.08637  [pdf, other

    cs.CL cs.AI

    ConvLab: Multi-Domain End-to-End Dialog System Platform

    Authors: Sungjin Lee, Qi Zhu, Ryuichi Takanobu, Xiang Li, Yaoqin Zhang, Zheng Zhang, Jinchao Li, Baolin Peng, Xiujun Li, Minlie Huang, Jianfeng Gao

    Abstract: We present ConvLab, an open-source multi-domain end-to-end dialog system platform, that enables researchers to quickly set up experiments with reusable components and compare a large set of different approaches, ranging from conventional pipeline systems to end-to-end neural models, in common environments. ConvLab offers a set of fully annotated datasets and associated pre-trained reference models… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

  16. arXiv:1902.08882  [pdf, other

    cs.IR cs.AI cs.CY

    Aggregating E-commerce Search Results from Heterogeneous Sources via Hierarchical Reinforcement Learning

    Authors: Ryuichi Takanobu, Tao Zhuang, Minlie Huang, Jun Feng, Haihong Tang, Bo Zheng

    Abstract: In this paper, we investigate the task of aggregating search results from heterogeneous sources in an E-commerce environment. First, unlike traditional aggregated web search that merely presents multi-sourced results in the first page, this new task may present aggregated results in all pages and has to dynamically decide which source should be presented in the current page. Second, as pointed out… ▽ More

    Submitted 23 February, 2019; originally announced February 2019.

    Comments: WWW 19, 11 pages

  17. arXiv:1811.03925  [pdf, other

    cs.CL cs.IR

    A Hierarchical Framework for Relation Extraction with Reinforcement Learning

    Authors: Ryuichi Takanobu, Tianyang Zhang, Jiexi Liu, Minlie Huang

    Abstract: Most existing methods determine relation types only after all the entities have been recognized, thus the interaction between relation types and entity mentions is not fully modeled. This paper presents a novel paradigm to deal with relation extraction by regarding the related entities as the arguments of a relation. We apply a hierarchical reinforcement learning (HRL) framework in this paradigm t… ▽ More

    Submitted 9 November, 2018; originally announced November 2018.

    Comments: To appear in AAAI 19