首页> 外文会议>National Conference on Communications >Enhancements in Assamese spoken query system: Enabling background noise suppression and flexible queries

【24h】

Enhancements in Assamese spoken query system: Enabling background noise suppression and flexible queries

机译：阿萨姆语口语查询系统的增强功能：启用背景噪声抑制和灵活的查询

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In the work presented in this paper, the recent improvements incorporated in the earlier developed Assamese spoken query (SQ) system for accessing the price of agricultural commodities are discussed. The developed SQ system consists of interactive voice response (IVR) and automatic speech recognition (ASR) modules. These are developed using open source resources. The speech data used for developing the ASR system was collected in the field conditions, thus contained significantly high level of background noise. On account of the background noise, the recognition performance of earlier version of the SQ system was severely affected. In order to deal with that, a front-end noise suppression module-based on zero frequency filtering has been added in the current version. Furthermore, we have also incorporated the subspace Gaussian mixture (SGMM) and deep neural network (DNN)-based acoustic modeling approaches. These techniques are found to be more effective than the Gaussian mixture model (GMM)-based approach which was employed in the previous version. The combination of noise removal and DNN-based acoustic modeling is found to result in a relative improvement of almost 32% in word error rate in comparison to the earlier reported GMM-HMM-based ASR system. The earlier SQ system was designed expecting the users' queries in form of isolated words only and, therefore, a high degraded recognition performance was noted whenever the queries were in the form of continuous sentences. In order to overcome that, we present a simple technique exploiting the inherent patterns in the user queries. These patterns are then incorporated in the employed language model. The modified language model is observed to result in significant improvements in the recognition performances in case of continuous queries.

机译：在本文介绍的工作中，讨论了早期开发的阿萨姆语口语查询（SQ）系统中用于获取农产品价格的最新改进。开发的SQ系统由交互式语音响应（IVR）和自动语音识别（ASR）模块组成。这些都是使用开源资源开发的。用于开发ASR系统的语音数据是在现场条件下收集的，因此包含很高水平的背景噪声。由于背景噪声，早期版本的SQ系统的识别性能受到严重影响。为了解决这个问题，当前版本中增加了基于零频率滤波的前端噪声抑制模块。此外，我们还结合了基于子空间的高斯混合（SGMM）和基于深度神经网络（DNN）的声学建模方法。发现这些技术比以前版本中使用的基于高斯混合模型（GMM）的方法更有效。与早期报道的基于GMM-HMM的ASR系统相比，噪声消除和基于DNN的声学建模相结合可导致字错误率几乎提高32％。较早的SQ系统被设计为仅期望用户以孤立词的形式进行查询，因此，只要查询以连续句子的形式出现，就会注意到较高的降级识别性能。为了克服这一点，我们提出了一种利用用户查询中固有模式的简单技术。然后将这些模式并入所采用的语言模型中。观察到修改后的语言模型可以在连续查询的情况下显着提高识别性能。

著录项

来源
《National Conference on Communications》|2016年|1-6|共6页
会议地点
作者
Abhishek Dey; S. Shahnawazuddin; Deepak K.T.; Siddika Imani; S.R.M Prasanna; Rohit Sinha;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speech; Hidden Markov models; Noise measurement; Acoustics; Meteorology; Training data; Training;

机译：语音;隐马尔可夫模型;噪声测量;声学;气象学;训练数据;训练;

相似文献

外文文献
中文文献
专利

1. Improvements in IITG Assamese Spoken Query System: Background Noise Suppression and Alternate Acoustic Modeling [J] . Shahnawazuddin S., Thotappa Deepak, Dey Abhishek, Journal of signal processing systems for signal, image, and video technology . 2017,第1期

机译：IITG阿萨姆语口语查询系统的改进：背景噪声抑制和交替声学建模
2. Low Complexity On-Line Adaptation Techniques in Context of Assamese Spoken Query System [J] . Shahnawazuddin S., Deepak K. T., Sarma B. D., Journal of signal processing systems for signal, image, and video technology . 2015,第1期

机译：Assamese口语查询系统中的低复杂度在线自适应技术
3. Robust query-by-singing/humming system against background noise environments [J] . Kichul Kim, Kang Ryoung Park, Sung-Joo Park, Consumer Electronics, IEEE Transactions on . 2011,第2期

机译：针对背景噪声环境的强大的按歌/哼唱查询系统
4. Enhancements in Assamese spoken query system: Enabling background noise suppression and flexible queries [C] . Abhishek Dey, S. Shahnawazuddin, Deepak K.T., National Conference on Communications . 2016

机译：assamese语音查询系统中的增强功能：启用背景噪声抑制和灵活查询
5. ubQL: A distributed query language to program distributed query systems. [D] . Sahuguet, Arnaud. 2002

机译：ubQL：一种分布式查询语言，用于对分布式查询系统进行编程。
6. Enabling Ontology Based Semantic Queries in Biomedical Database Systems [O] . Shuai Zheng, Fusheng Wang, James Lu -1

机译：在生物医学数据库系统中启用基于本体的语义查询
7. Enabling Flexible Queries with Guarantees in P2P Systems [O] . Cristina Schmidt, Manish Parashar 2015

机译：在p2p系统中实现带有保证的灵活查询

Enhancements in Assamese spoken query system: Enabling background noise suppression and flexible queries

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅