40
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings?

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Automatic Speech Recognition (ASR) systems introduce word errors, which often confuse punctuation prediction models, turning punctuation restoration into a challenging task. These errors usually take the form of homonyms. We show how retrofitting of the word embeddings on the domain-specific data can mitigate ASR errors. Our main contribution is a method for better alignment of homonym embeddings and the validation of the presented method on the punctuation prediction task. We record the absolute improvement in punctuation prediction accuracy between 6.2% (for question marks) to 9% (for periods) when compared with the state-of-the-art model.

          Related collections

          Author and article information

          Journal
          13 April 2020
          Article
          2004.05985
          9c9dbd6e-aa29-4d0e-8ce3-e298967e80cd

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          submitted to INTERSPEECH'20
          cs.CL cs.LG cs.SD eess.AS

          Theoretical computer science,Artificial intelligence,Electrical engineering,Graphics & Multimedia design

          Comments

          Comment on this article