Intra- and Inter-frame Features for Automatic Speech Recognition

Sung Joo Lee; Byung Ok Kang; Hoon Chung; Yunkeun Lee

Abstract

In this paper, alternative dynamic features for speech recognition are proposed. The goal of this work is to improve speech recognition accuracy by deriving a representation of distinctive dynamic characteristics from the speech spectrum. The work is motivated by two temporal dynamics of a speech signal: the highly non-stationary nature of speech, and the inter-frame change of the speech spectrum. We adopt a sub-frame spectrum analyzer to capture very rapid spectral changes within a speech analysis frame. In addition, we attempt to measure spectral fluctuations in a more complex manner than traditional dynamic features such as delta or double-delta. To evaluate the proposed features, speech recognition tests were conducted in smartphone environments. The experimental results show that feature streams that are simply combined with the proposed features improve the recognition accuracy of a hidden Markov model–based speech recognizer.
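For orientation, the two ideas the abstract contrasts can be sketched in a few lines of Python. The abstract does not give the authors' formulas, so this is only an illustration under stated assumptions: `delta` uses the widely used regression form of the traditional delta feature (applying it twice gives a double-delta), and `subframe_spectra` is a hypothetical helper that splits one analysis frame into equal sub-frames and takes a naive DFT magnitude of each. The function names, the regression width of 2, and the choice of 4 sub-frames are all assumptions, not details from the paper.

```python
import cmath
import math


def delta(features, width=2):
    """Regression-based delta over a list of per-frame feature vectors.

    d_t = sum_{n=1..width} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n=1..width} n^2)
    Edge frames are repeated at the boundaries (an assumption of this sketch).
    """
    num_frames = len(features)
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(num_frames):
        row = []
        for k in range(len(features[t])):
            acc = 0.0
            for n in range(1, width + 1):
                ahead = features[min(t + n, num_frames - 1)][k]
                behind = features[max(t - n, 0)][k]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


def subframe_spectra(frame, num_subframes=4):
    """Hypothetical helper (not the paper's analyzer): split one analysis
    frame into equal sub-frames and return the naive DFT magnitude spectrum
    of each sub-frame, keeping only the non-negative frequency bins."""
    size = len(frame) // num_subframes
    spectra = []
    for s in range(num_subframes):
        chunk = frame[s * size:(s + 1) * size]
        mags = []
        for k in range(size // 2 + 1):
            acc = sum(x * cmath.exp(-2j * math.pi * k * i / size)
                      for i, x in enumerate(chunk))
            mags.append(abs(acc))
        spectra.append(mags)
    return spectra


# Inter-frame view: delta and double-delta of a one-dimensional feature track.
track = [[0.0], [1.0], [2.0], [3.0], [4.0]]
d = delta(track)
dd = delta(d)

# Intra-frame view: four short spectra from a single 16-sample frame.
spectra = subframe_spectra([1.0] * 16, num_subframes=4)
```

The delta captures change across neighboring frames, while the sub-frame spectra expose change inside a single frame; how the paper turns either into its final features is not stated in the abstract.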

Bibliographic details

Source: ETRI journal, 2014, No. 3, 4 pages
Authors: Sung Joo Lee; Byung Ok Kang; Hoon Chung; Yunkeun Lee
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunication technology
