首页> 外文会议>International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology >Three Steps of Neuron Network Classification for EMG-based Thai Tones Speech Recognition
【24h】

Three Steps of Neuron Network Classification for EMG-based Thai Tones Speech Recognition

机译:基于EMG的泰文语音识别的神经元网络分类的三个步骤

获取原文

摘要

In order to overcome the problem existing in original speech recognition (e.g. noise interruption and private data loss), many researchers have investigated to deal with these problems. Electromyography (EMG) from the muscles producing speech was used to replace a voiced signal. Similarly, we aim to develop EMG speech recognition based on Thai language. Tone is the important characteristic of this language. Hence, Thai tone classification is the first work that was explored. This paper proposes the new technique that can classify five Thai tones for EMG-based Thai speech recognition. This method can overcome the limitation of our previous work that we can classify only two tones. EMG was captured from six positions of the strap muscles and facial muscles while a volunteer was uttering 21 Thai isolated words and five tones of each word (total 105 words). The 68 EMG features were calculated, and RES index was used to evaluate clustering capability of each feature. Top five features that have high value of RES index were selected. Neuron Network (NN) was used for tone classification. We found that Modify Mean Absolute Value 2~(nd) type (MMAV2) is the best features. It yielded an accuracy rate of 56.2% for five Thai tones classification. However, it is not enough for our work. In order to improve the accuracy rate, the three steps of NN Classification was proposed. This technique is the series of three networks of NN classifier. Each network will classify different tones, and use distinct features. We obtained an accuracy rate of 80% for five Thai tones classification from this technique.
机译:为了克服原始语音识别中存在的问题(例如,噪声中断和私人数据丢失),许多研究人员已经调查处理这些问题。从生产语音的肌肉肌电图(EMG)用于取代浊音信号。同样,我们的目标是基于泰语制定EMG语音识别。语调是这种语言的重要特征。因此,泰语语气分类是探索的第一个工作。本文提出了可以对基于EMG的泰式语音识别进行分类的新技术。此方法可以克服我们以前的工作的限制,我们只能对两个音调进行分类。 EMG被带子肌肉和面部肌肉的六个位置捕获,而志愿者则发出21个泰国孤立的单词和每个单词的五个音调(总共105字)。计算68个EMG功能,使用RES索引来评估每个功能的聚类能力。选择了具有高价值的res索引的前五个功能。神经元网络(NN)用于音调分类。我们发现修改平均值2〜(nd)类型(mmav2)是最好的功能。它产生了五个泰语音调分类的精度为56.2%。但是,这对我们的工作来说是不够的。为了提高准确率,提出了NN分类的三个步骤。该技术是NN分类器的三个网络系列。每个网络都将分类不同的音调,并使用不同的功能。我们从该技术中获得了五个泰语音调分类的准确率为80%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号