Neural Text Normalization in Speech-to-Text Systems with Rich Features,Applied Artificial Intelligence

当前位置： X-MOL 学术 › Appl. Artif. Intell. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Neural Text Normalization in Speech-to-Text Systems with Rich Features
Applied Artificial Intelligence ( IF 2.8 ) Pub Date : 2021-01-11
Oanh Thi Tran, Viet The Bui

ABSTRACT

This paper presents the task of normalizing Vietnamese transcribed texts in Speech-to-Text (STT) systems. The main purpose is to develop a text normalizer that automatically converts proper nouns and other context-specific formatting of the transcription such as dates, time, and numbers into their appropriate expressions. To this end, we propose a solution that exploits deep neural networks with rich features followed by manually designed rules to recognize and then convert these text sequences. We also introduce a new corpus of 13 K spoken sentences to facilitate the process of the text normalization. The experimental results on this corpus are quite promising. The proposed method yields 90.67% in the F1 score in recognizing sequences of texts that need converting. We hope that this initial work will inspire other follow-up research on this important but unexplored problem.

中文翻译：

具有丰富功能的语音转文字系统中的神经文字标准化

摘要

本文介绍了在语音转文本（STT）系统中规范越南语转录文本的任务。主要目的是开发一种文本规范化器，该规范化器可自动将专有名词和转录的其他特定于上下文的格式（例如日期，时间和数字）转换为它们的合适表达形式。为此，我们提出了一种解决方案，该解决方案利用具有丰富功能的深层神经网络，然后通过人工设计的规则来识别并转换这些文本序列。我们还引入了一个新的13 K口语语料库，以促进文本规范化的过程。该语料库的实验结果很有希望。所提出的方法在识别需要转换的文本序列时在F1分数中产生90.67％。

更新日期：2021-01-11

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>