2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue System

      Preprint
      , , , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In this paper, we present Duplex Conversation, a multi-turn, multimodal spoken dialogue system that enables telephone-based agents to interact with customers like a human. We use the concept of full-duplex in telecommunication to demonstrate what a human-like interactive experience should be and how to achieve smooth turn-taking through three subtasks: user state detection, backchannel selection, and barge-in detection. Besides, we propose semi-supervised learning with multimodal data augmentation to leverage unlabeled data to increase model generalization. Experimental results on three sub-tasks show that the proposed method achieves consistent improvements compared with baselines. We deploy the Duplex Conversation to Alibaba intelligent customer service and share lessons learned in production. Online A/B experiments show that the proposed system can significantly reduce response latency by 50%.

          Related collections

          Author and article information

          Journal
          30 May 2022
          Article
          10.1145/3534678.3539209
          2205.15060
          96fdd4cf-9a05-4cc3-8e6f-4392cfbd67c8

          http://creativecommons.org/licenses/by/4.0/

          History
          Custom metadata
          Accepted by KDD 2022, ADS track
          cs.CL

          Theoretical computer science
          Theoretical computer science

          Comments

          Comment on this article