...
首页> 外文期刊>IEICE transactions on information and systems >Toward Human-Friendly ASR Systems: Recovering Capitalization and Punctuation for Vietnamese Text
【24h】

Toward Human-Friendly ASR Systems: Recovering Capitalization and Punctuation for Vietnamese Text

机译:迈向人友好的ASR系统:恢复越南文本的资本化和标点符号

获取原文

摘要

Speech recognition is a technique that recognizes words and sentences in audio form and converts them into text sentences. Currently, with the advancement of deep learning technologies, speech recognition has achieved very satisfactory results close to human abilities. However, there are still limitations in identification results such as lack of punctuation, capitalization, and standardized numerical data. Vietnamese also contains local words, homonyms, etc, which make it difficult to read and understand the identification results for users as well as to perform the next tasks in Natural Language Processing (NLP). In this paper, we propose to combine the transformer decoder with conditional random field (CRF) to restore punctuation and capitalization for the Vietnamese automatic speech recognition (ASR) output. By chunking input sentences and merging output sequences, it is possible to handle longer strings with greater accuracy. Experiments show that the method proposed in the Vietnamese post-speech recognition dataset delivers the best results.
机译:语音识别是一种识别音频形式的单词和句子的技术,并将它们转换为文本句子。目前,随着深度学习技术的进步,语音识别取得了非常令人满意的结果,接近人类能力。但是,识别结果仍有局限性,例如缺乏标点符号,大写和标准化的数值数据。越南人还包含本地单词,同音异义词等,这使得难以阅读和理解用户的识别结果,以及在自然语言处理(NLP)中执行下一个任务。在本文中,我们建议将变压器解码器与条件随机字段(CRF)相结合,以恢复越南自动语音识别(ASR)输出的标点符号和大写。通过划分输入句子和合并输出序列,可以以更高的准确度处理更长的字符串。实验表明,越南语音识别数据集中提出的方法提供了最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号