首页> 外文期刊>Speech Communication >Disambiguating the senses of non-text symbols for Mandarin TTS systems with a three-layer classifier
【24h】

Disambiguating the senses of non-text symbols for Mandarin TTS systems with a three-layer classifier

机译:使用三层分类器消除普通话TTS系统的非文本符号含义

获取原文
获取原文并翻译 | 示例
           

摘要

Various kinds of non-text symbols appear in texts. The oral expressions of these symbols may vary with their senses. This paper proposes a three-layer classifier (TLC) which can disambiguate the senses of these symbols effectively. The layers within TLC are employed in sequence. The 1st layer is composed of two components: pattern table and decision tree. If this layer can disambiguate the sense of the target symbol, the disambiguation task stops. Otherwise the next two layers will be triggered. In such a situation, the procedure will go through the TLC. Based on the Bayesian theory, the 2nd layer adopts the voting scheme to compute the disambiguation score. Several features of token, which may affect the effectiveness of our voting scheme, are analyzed and compared with each other to achieve better accuracy. According to the algorithm confidence of sense disambiguation, the 3rd layer may exploit an alternative model to enhance the performance. Experiments show that our approaches can learn well even with only a small amount of data. The overall accuracies of training and testing sets are 99.8% and 97.5%, respectively.
机译:各种非文本符号出现在文本中。这些符号的口头表达可能会因其感官而异。本文提出了一种三层分类器(TLC),可以有效消除这些符号的含义。 TLC中的各层按顺序使用。第一层由两个部分组成:模式表和决策树。如果该层可以消除目标符号的歧义,则消除歧义任务将停止。否则,将触发下两层。在这种情况下,该过程将通过TLC。根据贝叶斯理论,第二层采用投票方案计算消歧分数。分析并比较了可能影响我们投票方案有效性的代币的几个特征,以实现更好的准确性。根据算法消除歧义的置信度,第三层可以利用替代模型来增强性能。实验表明,即使只有少量数据,我们的方法也可以很好地学习。训练和测试集的总体准确性分别为99.8%和97.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号