首页> 外文会议>Asian conference on intelligent information and database systems >Speech Activity Detection for Deaf People: Evaluation on the Developed Smart Solution Prototype
【24h】

Speech Activity Detection for Deaf People: Evaluation on the Developed Smart Solution Prototype

机译:聋人的语音活动检测:对已开发的智能解决方案原型的评估

获取原文

摘要

This research constitutes a relatively new approach by developing a smart solution which has emerged from the research activity using at first Google Glass and usable speech detection services. The authors conducted in the last year a series of developing, testing and evaluating the prototype results in order to decide, which service provides better results than the third-party speech detection service like Google Speech API or IBM Watson Speech To Text. This finding should significantly help the authors during the data evaluation and testing in developed smart solution. The basic idea is that authors have already developed a functional basic solution-a prototype. This solution was properly working and usable, but there are still some disadvantages to be improved. In order to accomplish the best results possible, the authors have added another element to their solution. A challenging problem which arises in this domain is concerned with significant data savings, server load, detection quality, and again opens a space for further improvements, such as following research and testing. This element is part of the statistical analysis and it is called Hidden Markov Model, which is used for speech recognition applications for last twenty years. The authors examined and studied many different articles and scientific sources in order to find the best solution for higher efficiency of speech recognition usable in their developed prototype (and for this article).
机译:通过开发一种智能解决方案,这项研究构成了一种相对较新的方法,该解决方案最初是使用Google Glass和可用的语音检测服务从研究活动中产生的。为了确定原型结果,作者在去年进行了一系列开发,测试和评估,以确定哪种服务比第三方语音检测服务(如Google Speech API或IBM Watson Speech To Text)提供更好的结果。这一发现将极大地帮助作者在开发的智能解决方案中进行数据评估和测试。基本思想是作者已经开发了功能性的基本解决方案-原型。该解决方案可以正常工作并且可以使用,但是仍有一些缺点需要改进。为了获得最佳结果,作者在解决方案中添加了另一个元素。在此领域中出现的一个具有挑战性的问题与节省大量数据,服务器负载,检测质量有关,并再次为进一步改进(例如进行后续研究和测试)开辟了空间。该元素是统计分析的一部分,被称为“隐马尔可夫模型”,该模型已用于语音识别应用近20年。作者检查并研究了许多不同的文章和科学资源,以找到可在其开发的原型(以及本文中使用)中获得更高语音识别效率的最佳解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号