首页> 外文期刊>Journal of computer sciences >Automatic Re-Formulation of user’s Irrational Behavior in Speech Recognition using Acoustic Nudging Model
【24h】

Automatic Re-Formulation of user’s Irrational Behavior in Speech Recognition using Acoustic Nudging Model

机译:使用声学闪烁模型自动重新制定用户的语音识别中的非理性行为

获取原文
       

摘要

In automatic speech recognition for development of automatic speech recognition applications, there has been numerous claims on the presence of speech recognition errors known as classified into lexical and acoustic errors. These errors distort speech signals thereby depreciating the accuracy and performance rate of speech recognition applications. Even though lexical speech recognition error problem has been partially combated, acoustic speech recognition error referred to as user’s acoustic irrational behavior is being ignored causing high error rate with low accuracy which is the bone of contention and an impediment factor in the wide adoption of speech recognition technology. Users do not always behave in a rational manner especially when dealing with a particular speech recognition application. The persistent presence of these user’s acoustic irrational behavior in speech have intensified the essential need to automatically detect and correct such errors, as current researches only focus on detecting user’s acoustic irrational behavior but not correcting/reformulating/re-sizing this error. Hence, this paper provides an acoustic nudging model that will perform automatic correction/reformulation of user’s acoustic irrational behavior in speech to achieve higher performance and accuracy using different acoustic parameters which are based in Pitch, Time gaps between words, Timbre descend and ascend time and Loudness. This study was able to discover a foundation for reducing error rate and achieve higher performance, as well as improve accuracy in speech recognition applications through detection and re-formulation of user’s acoustic irrational behavior in speech signal automatically, thereby making the model applicable to any speech recognition applications. The outcome of this study would be useful in enhancing accuracy and performance in the context of automatic speech recognition.
机译:在用于自动语音识别应用的发展的自动语音识别中,在存在语音识别错误的情况下存在许多已知为词汇和声误差的声明。这些错误扭曲了语音信号,从而贬值语音识别应用程序的准确性和性能率。即使词汇语音识别误差问题已经部分地组合,声音语音识别误差也被忽略了导致具有低精度的高误差率,这是争用的骨骼和障碍因子广泛采用语音识别技术。用户并不总是以合理的方式行事,特别是在处理特定的语音识别应用程序时。由于目前的研究仅关注检测用户的声学非理性行为,因此目前的研究,这些用户的声学非理性行为的持续存在加剧了自动检测和纠正此类错误的必要必要性。因此,本文提供了一种声学亮度模型,它将在语音中执行用户的声学非理性行为的自动校正/重新校正,以实现使用基于音高的不同声学参数的更高的性能和准确性,单词,TimBre下降和上升时间和上升时间响度。本研究能够发现误差率并实现更高性能的基础,并通过自动检测和重新制定用户的语音信号中的用户的声学非理性行为来提高语音识别应用中的准确性,从而使适用于任何语音的模型识别申请。本研究的结果对于在自动语音识别的背景下提高准确性和性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号