Automatic Re-Formulation of user’s Irrational Behavior in Speech Recognition using Acoustic Nudging Model

Lydia Kehinde Ajayi; Ambrose Azeta; Isaac Odun-Ayo; Felix Chidozie; Ajayi Peter Taiwo

首页> 外文期刊>Journal of computer sciences >Automatic Re-Formulation of user’s Irrational Behavior in Speech Recognition using Acoustic Nudging Model

【24h】

Automatic Re-Formulation of user’s Irrational Behavior in Speech Recognition using Acoustic Nudging Model

机译：使用声学闪烁模型自动重新制定用户的语音识别中的非理性行为

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In automatic speech recognition for development of automatic speech recognition applications, there has been numerous claims on the presence of speech recognition errors known as classified into lexical and acoustic errors. These errors distort speech signals thereby depreciating the accuracy and performance rate of speech recognition applications. Even though lexical speech recognition error problem has been partially combated, acoustic speech recognition error referred to as user’s acoustic irrational behavior is being ignored causing high error rate with low accuracy which is the bone of contention and an impediment factor in the wide adoption of speech recognition technology. Users do not always behave in a rational manner especially when dealing with a particular speech recognition application. The persistent presence of these user’s acoustic irrational behavior in speech have intensified the essential need to automatically detect and correct such errors, as current researches only focus on detecting user’s acoustic irrational behavior but not correcting/reformulating/re-sizing this error. Hence, this paper provides an acoustic nudging model that will perform automatic correction/reformulation of user’s acoustic irrational behavior in speech to achieve higher performance and accuracy using different acoustic parameters which are based in Pitch, Time gaps between words, Timbre descend and ascend time and Loudness. This study was able to discover a foundation for reducing error rate and achieve higher performance, as well as improve accuracy in speech recognition applications through detection and re-formulation of user’s acoustic irrational behavior in speech signal automatically, thereby making the model applicable to any speech recognition applications. The outcome of this study would be useful in enhancing accuracy and performance in the context of automatic speech recognition.

机译：在用于自动语音识别应用的发展的自动语音识别中，在存在语音识别错误的情况下存在许多已知为词汇和声误差的声明。这些错误扭曲了语音信号，从而贬值语音识别应用程序的准确性和性能率。即使词汇语音识别误差问题已经部分地组合，声音语音识别误差也被忽略了导致具有低精度的高误差率，这是争用的骨骼和障碍因子广泛采用语音识别技术。用户并不总是以合理的方式行事，特别是在处理特定的语音识别应用程序时。由于目前的研究仅关注检测用户的声学非理性行为，因此目前的研究，这些用户的声学非理性行为的持续存在加剧了自动检测和纠正此类错误的必要必要性。因此，本文提供了一种声学亮度模型，它将在语音中执行用户的声学非理性行为的自动校正/重新校正，以实现使用基于音高的不同声学参数的更高的性能和准确性，单词，TimBre下降和上升时间和上升时间响度。本研究能够发现误差率并实现更高性能的基础，并通过自动检测和重新制定用户的语音信号中的用户的声学非理性行为来提高语音识别应用中的准确性，从而使适用于任何语音的模型识别申请。本研究的结果对于在自动语音识别的背景下提高准确性和性能。

著录项

来源
《Journal of computer sciences》 |2020年第12期|共11页
作者
Lydia Kehinde Ajayi; Ambrose Azeta; Isaac Odun-Ayo; Felix Chidozie; Ajayi Peter Taiwo;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
Acoustic Nudging ModelAutomatic Speech Recognition ErrorUser’s Acoustic Irrational BehaviorAutomatic Speech RecognitionAcoustic Model;

机译：声学亮绿色语音识别Erroruser的声学非理性行为自动语音识别声学模型;

相似文献

外文文献
中文文献
专利

1. Automatic Re-Formulation of user's Irrational Behavior in Speech Recognition using Acoustic Nudging Model [J] . Lydia Kehinde Ajayi, Ambrose Azeta, Isaac Odun-Ayo, Journal of computer sciences . 2020,第12期

机译：使用声学亮模型自动重新制定用户的语音识别中的非理性行为
2. Acoustic Variability of Voice Signal as Factor of Information Security for Automatic Speech Recognition Systems with Tuning to User Voice [J] . V. V. Savchenko Radioelectronics and Communications Systems . 2020,第10期

机译：声音信号的声学可变性是具有调整到用户语音的自动语音识别系统的信息安全因素
3. Acoustic landmarks contain more information about the phone string than other frames for automatic speech recognition with deep neural network acoustic model [J] . He Di, Lim Boon Pang, Yang Xuesong, The Journal of the Acoustical Society of America . 2018,第6aPta1期

机译：声学地标包含与具有深度神经网络声学模型的自动语音识别的其他帧的更多信息
4. Acoustic model merging using acoustic models from multilingual speakers for automatic speech recognition [C] . Tien-Ping Tan, Besacier L., Lecouteux B. International conference on asian language processing . 2014

机译：使用多语言扬声器的声学模型合并声学模型以实现自动语音识别
5. Graph-based Semi-Supervised Learning in Acoustic Modeling for Automatic Speech Recognition. [D] . Liu, Yuzong. 2016

机译：用于自动语音识别的声学建模中基于图的半监督学习。
6. Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion [O] . Prasanta Kumar Ghosh, Shrikanth Narayanan -1

机译：使用从独立于受试者的声学到发音反转的发音特征进行自动语音识别
7. Acoustic Model Merging Using Acoustic Models from Multilingual Speakers for Automatic Speech Recognition [O] . Tien-ping Tan, Laurent Besacier, Benjamin Lecouteux 2015

机译：声学模型融合使用多语言扬声器的声学模型进行自动语音识别

Automatic Re-Formulation of user’s Irrational Behavior in Speech Recognition using Acoustic Nudging Model

摘要

著录项

相似文献

相关主题

期刊订阅