Precise detection of speech endpoints dynamically: A wavelet convolution based approach

Roy Tanmoy; Marwala Tshilidzi; Chakrayerty Snehashish

首页> 外文期刊>Communications in Nonlinear Science and Numerical Simulation >Precise detection of speech endpoints dynamically: A wavelet convolution based approach

【24h】

Precise detection of speech endpoints dynamically: A wavelet convolution based approach

机译：动态精确地检测语音端点：基于小波卷积的方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Precise detection of speech endpoints is an important factor which affects the performance of the systems where speech utterances need to be extracted from the speech signal such as Automatic Speech Recognition (ASR) system. Existing endpoint detection (EPD) methods mostly uses Short-Term Energy (STE), Zero-Crossing Rate (ZCR) based approaches and their variants. But STE and ZCR based EPD algorithms often fail in the presence of Non-speech Sound Artifacts (NSAs) produced by the speakers. Pattern recognition and classification techniques are also applied but those methods require labeled data for training. In this article, a novel approach is proposed to extract speech endpoints and the algorithm is termed as Wavelet Convolution based Speech Endpoint Detection (WCSED). WCSED decomposes the speech signal into high-frequency and low-frequency components using wavelet convolution and then computes information-entropy based thresholds for the two frequency components. The low-frequency thresholds are used to extract voiced speech segments, whereas the high-frequency thresholds are used to extract the unvoiced speech segments by filtering out the NSAs. WCSED does not require any labeled data for training and can automatically extract speech segments. Experiments are carried out on two speech databases and the results are promising even in the presence of NSAs. (C) 2018 Elsevier B.V. All rights reserved.

机译：语音端点的精确检测是影响需要从语音信号中提取语音的系统的性能的重要因素，例如自动语音识别（ASR）系统。现有的端点检测（EPD）方法主要使用基于短期能量（STE），零交叉速率（ZCR）的方法及其变体。但是，基于STE和ZCR的EPD算法通常会在扬声器产生非语音声音伪像（NSA）的情况下失败。模式识别和分类技术也被应用，但是那些方法需要标签数据进行训练。在本文中，提出了一种新颖的方法来提取语音端点，该算法称为基于小波卷积的语音端点检测（WCSED）。 WCSED使用小波卷积将语音信号分解为高频和低频分量，然后针对这两个频率分量计算基于信息熵的阈值。低频阈值用于提取带语音的语音段，而高频阈值用于通过滤除NSA来提取清音语音段。 WCSED不需要任何标签数据即可进行训练，并且可以自动提取语音片段。在两个语音数据库上进行了实验，即使存在NSA，结果也很有希望。（C）2018 Elsevier B.V.保留所有权利。

著录项

来源
《Communications in Nonlinear Science and Numerical Simulation》 |2019年第2期|162-175|共14页
作者
Roy Tanmoy; Marwala Tshilidzi; Chakrayerty Snehashish;
展开▼
作者单位

Univ Johannesburg, Elect & Elect Engn, Johannesburg, South Africa;

Natl Inst Technol Rourkela, Dept Math, Rourkela, India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech endpoint detection; Speech recognition; Wavelet convolution; Signal processing; Pattern recognition;

机译：语音端点检测;语音识别;小波卷积;信号处理;模式识别;

相似文献

外文文献
中文文献
专利

1. A fuzzy adaptive smoothing approach to robust endpoint detection based on MDL using sub-band speech [J] . WANG Ming-zheng, ZHANG Wen-jun, LI Jian-hua Journal of Harbin Institute of Technology . 2005,第6期

机译：基于子带语音的基于MDL的鲁棒端点检测的模糊自适应平滑方法
2. A fuzzy adaptive smoothing approach to robust endpoint detection based on MDL using sub-band speech [J] . WANG Ming-zheng, ZHANG Wen-jun, LI Jian-hua Journal of Harbin Institute of Technology . 2005,第6期

机译：基于子带语音的基于MDL的鲁棒端点检测的模糊自适应平滑方法
3. A fuzzy adaptive smoothing approach to robust endpoint detection based on MDL using sub-band speech [J] . WANG Ming-zheng, ZHANG Wen-jun, LI Jian-hua, 哈尔滨工业大学学报（英文版） . 2005,第006期

机译：基于子带语音的基于MDL的鲁棒端点检测的模糊自适应平滑方法
4. Study on Speech Endpoint Detection Algorithm Based on Wavelet Energy Entropy [C] . Yali Cao, Jing Gao, Guang Yang Chinese Control and Decision Conference . 2016

机译：基于小波能量熵的语音端点检测算法研究
5. Wavelet transform approach for adaptive filtering with application to fuzzy neural network based speech recognition. [D] . Jung, Byung-Chul. 2001

机译：小波变换的自适应滤波方法及其在基于模糊神经网络的语音识别中的应用。
6. A New Approach for Motor Imagery Classification Based on Sorted Blind Source Separation Continuous Wavelet Transform and Convolutional Neural Network [O] . César J. Ortiz-Echeverri, Sebastián Salazar-Colores, Juvenal Rodríguez-Reséndiz, 2019

机译：基于分类盲源分离连续小波变换和卷积神经网络的运动图像分类新方法
7. A wavelet-based multivariable approach for fault detection in dynamic systems Uma abordagem multivariável baseada em wavelets para detecção de falhas em sistemas dinâmicos [O] . Henrique Mohallem Paiva, Roberto Kawakami Harrop Galvão, Luis Rodrigues 2009

机译：基于小波的多变量动态系统故障检测方法基于小波的多变量动态系统故障检测方法
8. New Approach to the P-Wave Detection and Classification Based Upon Application of Wavelet Neural Network. [R] . Domider, T., Tkacz, E. J., Kostka, P., 2001

机译：基于小波神经网络的p波检测与分类新方法。

Precise detection of speech endpoints dynamically: A wavelet convolution based approach

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅