首页> 外文会议>International conference on Asian language processing >Domain Specific Intent Classification of Sinhala Speech Data
【24h】

Domain Specific Intent Classification of Sinhala Speech Data

机译:僧伽罗语语音数据的特定领域意图分类

获取原文

摘要

Building an open domain automatic speech recognition(ASR) system can be accomplished by converting voice into text and performing a text classification on top of the converted text. However, with the inherent challenges in the approach mentioned above, it is not the most feasible way of the deriving intent of speech queries in a specific domain. This paper proposes a domain-specific intent classification for Sinhala language utilizing a feed-forward neural network with backpropagation. For the purposes of this research, a Neural network is trained from Mel Frequency Cepstral Coefficients (MFCC) which are extracted from a Sinhala speech corpus of 10 hours and the performance of the system is evaluated using the recognition accuracy of the speech queries. Further, the proposed solution in the paper introduces the first-of-its-kind for domain-specific intent classification for Sinhala language.
机译:通过将语音转换为文本并在转换后的文本之上执行文本分类,可以实现构建开放域自动语音识别(ASR)系统。但是,由于上述方法存在固有的挑战,因此这不是在特定领域推导语音查询意图的最可行方法。本文提出了利用前向神经网络和反向传播技术对僧伽罗语语言进行领域特定的意图分类。为了本研究的目的,从梅尔频率倒谱系数(MFCC)中训练了一个神经网络,该频率是从10小时的僧伽罗语语音库中提取的,并使用语音查询的识别准确性来评估系统的性能。此外,本文提出的解决方案介绍了Sinhala语言针对领域特定的意图分类的同类首创。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号