A prototype of Indonesian dictation component for typing and formatting document using a word processor software

机译：用于使用Word处理器软件键入和格式化文档的印度尼西亚侦探组件的原型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents our work to build a dictation component in Indonesian language which can be installed to the OpenOffice Writer. The component can reduce the usage of keyboard in making and formatting document by applying an Indonesian automatic speech recognizer (ASR) to transcribe speech into text and a natural language processor to transform text to document format or command. The Indonesian ASR has two main components: an acoustic model and a language model. The acoustic model was trained based on the hidden Markov model (HMM). It was trained by using recordings of 20 native Indonesian speakers, each pronounces about 340 sentences. The total duration of the recordings was about 14.5 hours. The language model based on the n-gram model with the Witten-Bell smoothing was trained by using about 800 sentences containing the recording transcriptions and some punctuated sentences. The dictation component was evaluated for its accuracy and user experience. For a closed evaluation, from 10 participants, the component achieved 89.13% word recognition accuracy. From 10 other participants, 6 out of 10 preferred to use keyboard rather than speech to make and format document. Those 6 who still preferred to use keyboard were willing to use speech if the dictation component could conduct the job effectively and efficiently.

机译：本文介绍了我们在印度尼西亚语言中建立一个听写组件的工作，可以安装到OpenOffice作家。该组件可以通过应用印度尼西亚自动语音识别器（ASR）将语音转录为文本和自然语言处理器以将文本转换为文档格式或命令的语音和格式化文档中的制作和格式化文档的使用。印度尼西亚ASR有两个主要组件：声学模型和语言模型。声学模型基于隐马尔可夫模型（HMM）培训。它是通过使用20个原始印度尼西亚语扬声器的录音，每次发音约340个句子。录音的总持续时间约为14.5小时。通过使用包含录制转录和一些标点句子的大约800个句子训练了基于N-Gram模型的语言模型。对其准确性和用户体验评估了听写组件。对于封闭式评估，来自10名参与者，该组件实现了89.13％的字识别准确性。从其他10名参与者，10个中的6个优选使用键盘而不是语音来制作和格式化文档。如果听写组件可以有效且有效地进行工作，那么仍然愿意使用键盘的那些仍然愿意使用言论。

著录项

来源
《International Conference on Electrical Engineering and Computer Science》|2014年||共5页
会议地点
作者
Hoesen Devin; Lestari Dessi Puji;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
dictation; natural language processing; speech recognition; text analysis; word processing; ASR; Indonesian dictation component; OpenOffice Writer; automatic speech recognizer; document formatting; document typing; language model; n-gram model; natural language processor; text transformation; word processor software; Accuracy; Acoustics; Hidden Markov models; Keyboards; Speech; Speech recognition; Training; ASR; Dictation; OpenOffice Writer; Speech Command;

机译：听写;语音识别;文本分析;文字处理;asr;印度尼西亚语听力组成;OpenOffice作家;自动语音识别器;文档格式;文档键入;语言模型;n-gram模型;文本转换;文本转换;文本转换;文本转型;文本转换;文本转换;文本转型;文本转型;文本转换;文本转换;文本转型处理器软件;准确性;声学;隐藏的马尔可夫模型;键盘;语音;语音识别;培训;asr;听写;OpenOffice作家;语音命令;

相似文献

外文文献
中文文献
专利

1. An investigation into formatting and layout errors produced by blind word-processor users and an evaluation of prototype error prevention and correction techniques [J] . Evans D.G., Diggle T., Kurniawan S.H., IEEE transactions on neural systems and rehabilitation engineering . 2003,第3期

机译：对盲文字处理器用户产生的格式和布局错误进行调查，并对原型错误预防和纠正技术进行评估
2. Business Card OCR Software by Re-recognition Processing Based on Document Layout and Word Attribute [J] . Ichiro Nakao, Mariko Takenouchi, Saki Takakura Matsushita Technical Journal . 2000,第2期

机译：基于文档布局和单词属性的重新识别处理的名片OCR软件
3. Evaluating the informative quality of documents in SGML format from judgements by means of fuzzy linguistic techniques based on computing with words [J] . Enrique Herrera-Viedma, Eduardo Peis Information Processing & Management . 2003,第2期

机译：通过基于单词计算的模糊语言技术，通过判断来评估SGML格式文档的信息质量
4. A prototype of Indonesian dictation component for typing and formatting document using a word processor software [C] . Hoesen Devin, Lestari Dessi Puji International Conference on Electrical Engineering and Computer Science . 2014

机译：印度尼西亚听写组件的原型，用于使用文字处理器软件对文档进行键入和格式设置
5. The strategies that English as a Second Language students use when spelling words in dictation format and when composing stories. [D] . Jones, Anne Marie. 1991

机译：英语作为第二语言的学生在以听写格式拼写单词和编写故事时使用的策略。
6. From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format [O] . David Remsen, Sandra Knapp, Teodor Georgiev, 2012

机译：从文本到结构化数据：将经过文字处理的植物区系清单转换为Darwin Core Archive格式
7. Dans ce document, nous nous proposons d’étudier les différentes formes de rentabilisation des sites Internet par l’E-publicité. En effet, la mise en place de sites Internet n’est souvent viable que grâce à l’E-publicité qui en couvre en partie les frais de fonctionnement. Quels sont les caractéristiques de la cyberpublicité, les normes, les formats, les types ? Quels sont les acteurs qui interviennent dans ce processus et quels sont leurs rôles ? Comment se rémunère un site ? Quels sont les différents outils de mesure utilisés ? Quels sont les significations et l’intérêt du référencement, du positionnement, de l’affiliation, du netlinking ? Nous dressons un panorama actuel de la situation et nous montrons comment ces différents outils participent à la bonne marche de l’Internet. Enfin, notre dossier propose des ouvertures sur les nouveaux outils du marketing. In this document, we focus on the analysis of the different ways of profitability of the websites performing e-advertising. Indeed the websites' start-up becomes only profitable when the e-advertising appears which covers partially the running costs. What are the features of e-advertising, the rules, the lay-outs, the models? Who are the participants enrolled in this process and what are their roles? How is the payment of a website? What are the different measuring tools used? What are the meanings and the interests of the reference, the positioning and the affiliation of netlinking? We draw up a current study of the situation and explain how those different elements contribute to the smooth running of internet? Finally our paper proposes openings on the new tools of marketing? [O] . Joël Moulhade 100

机译：Dans ce document，nous nous proposons d'étudierlesdifférentesformesde rentabilisation des sites Internet par l'E-publicité。 En effet，la mise en place de sites Internet n'est souvent possiblequegrâceàl'E-publicitéquien couvre en partie les frais de fonctionnement。 Quels sontlescaractéristiquesdelacyberpublicité，les normes，les formats，les types？ Quels sont les acteurs qui interviennent dans ce processus et quels sontleursrôles？评论serémunèreunsite？ Quels sontlesdifférentsoutilsdemesureutilisés？ Quels sont les significations et l'intérêtduréférencement，du positionnement，de l'affiliation，du netlinking？ Nous dressons un panorama actuel de la situation et nous montrons commentcesdifférentsoutilsparticipentàlabonne marche de l'Internet。 Enfin，notre dossier建议des ouvertures sur les nouveaux outils du marketing。在本文档中，我们重点分析了执行电子广告的网站的不同盈利方式。事实上，当电子广告出现时，网站的初创公司才有利可图，这部分地涵盖了运营成本。电子广告，规则，布局，模型有哪些特点？参加此过程的参与者是谁？他们的角色是什么？如何支付网站？使用了哪些不同的测量工具？ netlinking的参考，定位和隶属关系的含义和利益是什么？我们制定了当前的情况研究，并解释了这些不同因素如何促进互联网的顺利运行？最后，我们的论文提出了新的营销工具的开放性？
8. Mexican Market Survey for Dictation and Word Processing Equipment [R] . Lim, A. 1979

机译：墨西哥口述和文字处理设备市场调查

A prototype of Indonesian dictation component for typing and formatting document using a word processor software

摘要

著录项

相似文献

相关主题

期刊订阅