首页> 外文会议>International Conference on Electrical Engineering and Computer Science >A prototype of Indonesian dictation component for typing and formatting document using a word processor software
【24h】

A prototype of Indonesian dictation component for typing and formatting document using a word processor software

机译:用于使用Word处理器软件键入和格式化文档的印度尼西亚侦探组件的原型

获取原文

摘要

This paper presents our work to build a dictation component in Indonesian language which can be installed to the OpenOffice Writer. The component can reduce the usage of keyboard in making and formatting document by applying an Indonesian automatic speech recognizer (ASR) to transcribe speech into text and a natural language processor to transform text to document format or command. The Indonesian ASR has two main components: an acoustic model and a language model. The acoustic model was trained based on the hidden Markov model (HMM). It was trained by using recordings of 20 native Indonesian speakers, each pronounces about 340 sentences. The total duration of the recordings was about 14.5 hours. The language model based on the n-gram model with the Witten-Bell smoothing was trained by using about 800 sentences containing the recording transcriptions and some punctuated sentences. The dictation component was evaluated for its accuracy and user experience. For a closed evaluation, from 10 participants, the component achieved 89.13% word recognition accuracy. From 10 other participants, 6 out of 10 preferred to use keyboard rather than speech to make and format document. Those 6 who still preferred to use keyboard were willing to use speech if the dictation component could conduct the job effectively and efficiently.
机译:本文介绍了我们在印度尼西亚语言中建立一个听写组件的工作,可以安装到OpenOffice作家。该组件可以通过应用印度尼西亚自动语音识别器(ASR)将语音转录为文本和自然语言处理器以将文本转换为文档格式或命令的语音和格式化文档中的制作和格式化文档的使用。印度尼西亚ASR有两个主要组件:声学模型和语言模型。声学模型基于隐马尔可夫模型(HMM)培训。它是通过使用20个原始印度尼西亚语扬声器的录音,每次发音约340个句子。录音的总持续时间约为14.5小时。通过使用包含录制转录和一些标点句子的大约800个句子训练了基于N-Gram模型的语言模型。对其准确性和用户体验评估了听写组件。对于封闭式评估,来自10名参与者,该组件实现了89.13%的字识别准确性。从其他10名参与者,10个中的6个优选使用键盘而不是语音来制作和格式化文档。如果听写组件可以有效且有效地进行工作,那么仍然愿意使用键盘的那些仍然愿意使用言论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号