...
首页> 外文期刊>International journal of speech technology >Indonesian graphemic syllabification using a nearest neighbour classifier and recovery procedure
【24h】

Indonesian graphemic syllabification using a nearest neighbour classifier and recovery procedure

机译:使用最近邻分类器和恢复程序进行印度尼西亚字素音节化

获取原文
获取原文并翻译 | 示例
           

摘要

An automatic syllabification, decomposing a word into syllables, is an important part in an automatic speech recognition (ASR) that uses both syllable-based acoustic and language models. It can be performed to either phoneme or grapheme sequences. The phonemic syllabification is more complex than the other since it requires a grapheme-to-phoneme conversion (G2P) as a previous process. It generally gives a high accuracy for many formal words but its accuracy may decrease for person-names. In contrast, the graphemic syllabification is simpler and more potential to be applied for person-names. This research focuses on developing a model of graphemic syllabification using a combination of phonotactic rules and Fuzzy k-nearest neighbour in every Class (FkNNC). The phonotactic rules are designed to find some deterministic syllabification points while FkNNC, as a statistical classifier, is expected to search the remaining stochastic syllabification points. A recovery procedure is proposed to correct the wrong syllabification points produced by FkNNC. Fivefold cross-validating on a dataset of 50k formal words, selected from the great dictionary of the Indonesian language, shows that the proposed model gives syllable error rate (SER) of 2.48% and the proposed recovery procedure reduces the SER to be 2.27%, which is higher than that produced by the phonemic syllabification (only 0.99%). But, this model is capable of handling a dataset of 15k high variance person-names with SER of 7.45% and the proposed recovery procedure reduces the SER to be 6.78%.
机译:将单词分解为音节的自动音节化是使用基于音节的声学和语言模型的自动语音识别(ASR)的重要组成部分。它可以对音素序列或字素序列执行。音素音节化比其他音素音节化更为复杂,因为它需要一个音素到音素转换(G2P)作为先前的过程。通常,对于许多正式词来说,它具有很高的准确性,但对于人名,其准确性可能会降低。相反,字形音节化更简单,并且更有可能应用于人名。这项研究的重点是使用音素规则和每个类别中的模糊k最近邻(FkNNC)的组合来发展音素音节化模型。音位规则旨在查找一些确定性的音节化点,而FkNNC作为统计分类器,则有望搜索剩余的随机音节化点。提出了一种恢复程序来纠正FkNNC产生的错误音节化点。从印度尼西亚语大词典中选取的50k个正式单词的数据集进行五重交叉验证,结果表明,该模型的音节错误率(SER)为2.48%,并且所建议的恢复程序将SER降低为2.27%,这高于音素音节化产生的音调(仅0.99%)。但是,此模型能够处理SER为7.45%的15k高方差人名数据集,并且所提出的恢复程序将SER降低为6.78%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号