Three Steps of Neuron Network Classification for EMG-based Thai Tones Speech Recognition


Abstract

Conventional speech recognition suffers from problems such as noise interference and loss of private data, and many researchers have investigated ways to address them. One approach replaces the voiced signal with electromyography (EMG) recorded from the muscles that produce speech. Following this approach, we aim to develop EMG-based speech recognition for the Thai language. Because tone is an important characteristic of Thai, tone classification was the first task we explored. This paper proposes a new technique that classifies five Thai tones for EMG-based Thai speech recognition, overcoming the limitation of our previous work, which could classify only two tones. EMG was captured from six positions on the strap muscles and facial muscles while a volunteer uttered 21 isolated Thai words in each of the five tones (105 words in total). Sixty-eight EMG features were calculated, and the RES index was used to evaluate the clustering capability of each feature. The five features with the highest RES index values were selected, and a Neuron Network (NN) was used for tone classification. The Modified Mean Absolute Value type 2 (MMAV2) proved to be the best feature, yielding an accuracy of 56.2% for five-tone classification, which is not sufficient for our purposes. To improve the accuracy, we propose a three-step NN classification: a series of three NN classifiers in which each network classifies different tones and uses distinct features. This technique achieved an accuracy of 80% for five-tone classification.
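The abstract names MMAV2 as the best feature but does not define it. The sketch below follows the weighting usually given for this feature in the EMG literature: full weight on the middle half of the analysis window and linearly tapered weights on the first and last quarters. The function name `mmav2`, the 1-based sample index, and the exact form of the falling taper are assumptions for illustration, not details taken from the paper.

```python
def mmav2(window):
    """Modified Mean Absolute Value type 2 of one EMG analysis window.

    Assumed weighting (not stated in the abstract):
      w_n = 1            for 0.25*N <= n <= 0.75*N
      w_n = 4*n/N        for n < 0.25*N   (rising taper)
      w_n = 4*(N - n)/N  for n > 0.75*N   (falling taper)
    with n running from 1 to N.
    """
    n_samples = len(window)
    if n_samples == 0:
        raise ValueError("window must contain at least one sample")

    total = 0.0
    for n, sample in enumerate(window, start=1):
        if n < 0.25 * n_samples:
            weight = 4.0 * n / n_samples
        elif n > 0.75 * n_samples:
            weight = 4.0 * (n_samples - n) / n_samples
        else:
            weight = 1.0
        # Rectify the sample, then apply the position-dependent weight.
        total += weight * abs(sample)

    return total / n_samples
```

In a pipeline like the one described, a function of this kind would be applied to each windowed segment of each of the six EMG channels, and the resulting values would form the input vector of an NN classifier.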


Bibliographic details

Source
International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, 2013, 6 pages
Authors
Niyawadee Srisuwan; Pornchai Phukpattaranont; Chusak Limsakul
Original format: PDF
Chinese Library Classification: TP3-53
Keywords
Electromyography; Speech recognition; Myoelectric signal; Thai tone; Neuron Network


Similar literature


1. Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network [J]. Xiao-Dong Wang, Keikichi Hirose, Jin-Song Zhang. IEICE Transactions on Information and Systems, 2008, No. 6.
2. Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network [J]. Xiao-Dong Wang, Keikichi Hirose, Jin-Song Zhang. 電子情報通信学会技術研究報告. 音声. Speech, 2006, No. 443.
3. Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network [J]. Xiao-Dong Wang, Keikichi Hirose, Jin-Song Zhang. 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication, 2006, No. 441.
4. Three steps of Neuron Network classification for EMG-based Thai tones speech recognition [C]. Srisuwan Niyawadee, Phukpattaranont Pornchai, Limsakul Chusak. International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, 2013.
5. Tone classification of syllable-segmented Thai speech based on multilayer perceptron [D]. Satravaha, Nuttavudh. 2002.
6. The Binaural Masking-Level Difference of Mandarin Tone Detection and the Binaural Intelligibility-Level Difference of Mandarin Tone Recognition in the Presence of Speech-Spectrum Noise [O]. Cheng-Yu Ho, Pei-Chun Li, Yuan-Chuan Chiang.
7. Tone Recognition Of Continuous Thai Speech Under Tonal Assimilation And Declination Effects Using Half-Tone Model [O]. Nuttakorn Thubthong, Boonserm Kijsirikul. 2001.
