Voice Activity Detection Using Wavelet-Based Multiresolution Spectrum and Support Vector Machines and Audio Mixing Algorithm

Abstract

This paper presents a Voice Activity Detection (VAD) algorithm and an efficient speech mixing algorithm for multimedia conferencing. As audio features, the proposed VAD uses the MFCCs of a wavelet-based multiresolution spectrum together with two classical audio parameters. It prejudges silence by detecting the multi-gate zero-crossing ratio and classifies noise and voice with Support Vector Machines (SVM). The new speech mixing algorithm, used in the Multipoint Control Unit (MCU) of a conference, takes the short-time power of each audio stream as the mixing weight vector and is designed for parallel processing. Experiments show that the proposed VAD algorithm achieves better overall performance at all tested SNRs than the G.729b VAD and other VADs, that the output of the new speech mixing algorithm has excellent perceptual quality, that its computational delay is small enough for real-time transmission, and that the MCU computation is lower than that of a mixer based on the G.729b VAD.
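
The abstract gives only a one-line description of the mixer, so the following is a minimal sketch of power-weighted mixing, not the authors' implementation. It assumes that "short-time power" means the mean squared amplitude of a frame and that the weight vector is normalized to sum to 1. The function names `short_time_power` and `mix_frames` and the `eps` threshold are hypothetical.

```python
from typing import List, Sequence


def short_time_power(frame: Sequence[float]) -> float:
    """Mean squared amplitude of one frame (assumed definition of short-time power)."""
    if not frame:
        return 0.0
    return sum(x * x for x in frame) / len(frame)


def mix_frames(frames: Sequence[Sequence[float]], eps: float = 1e-12) -> List[float]:
    """Mix time-aligned frames, one per stream, weighting each stream by its
    share of the total short-time power (assumed normalization)."""
    if not frames:
        return []
    n = len(frames[0])
    if any(len(f) != n for f in frames):
        raise ValueError("all frames must have the same length")

    powers = [short_time_power(f) for f in frames]
    total = sum(powers)
    if total <= eps:
        # Every stream is effectively silent, so emit silence.
        return [0.0] * n

    # Non-negative weights that sum to 1.
    weights = [p / total for p in powers]
    return [sum(w * f[i] for w, f in zip(weights, frames)) for i in range(n)]


if __name__ == "__main__":
    talker = [0.5, -0.5, 0.5, -0.5]
    silent = [0.0, 0.0, 0.0, 0.0]
    print(mix_frames([talker, silent]))
```

Because the weights in this sketch are non-negative and sum to 1, each output sample is a convex combination of the input samples and cannot exceed the largest input amplitude, so no separate clipping step is needed. Each stream's power depends only on that stream's frame, so the power computation could run in parallel across streams.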

Bibliographic information

Source
European Conference on Computer Vision (ECCV 2006) Workshop on Human-Computer Interaction (HCI); 2006-05-13; Graz (AT) | 2006 | pp. 78-88 | 11 pages
Conference location: Graz (AT)
Authors
Wei Xue; Sidan Du; Chengzhi Fang; Yingxian Ye
Author affiliation

Department of Electronics Science and Engineering, Nanjing University, Nanjing 210093, P.R. China;

Original format: PDF
Language: English
Chinese Library Classification: Information processing

Similar literature

1. Improved voice activity detection algorithm using wavelet and support vector machine [J]. Shi-Huang Chen, Rodrigo Capobianco Guido, Trieu-Kien Truong. Computer Speech and Language, 2010, No. 3.
2. A Support Vector Machine-Based Voice Activity Detection Employing Effective Feature Vectors [J]. Q-Haing JO, Yun-Sik PARK, Kye-Hwan LEE. IEICE Transactions on Communications, 2008, No. 6.
3. Throat polyp detection based on compressed big data of voice with support vector machine algorithm [J]. Wei Wang, Zhangliang Chen, Jiasong Mu. EURASIP Journal on Advances in Signal Processing, 2014, No. 1.
4. Voice Activity Detection Using Wavelet-Based Multiresolution Spectrum and Support Vector Machines and Audio Mixing Algorithm [C]. Wei Xue, Sidan Du, Chengzhi Fang. European Conference on Computer Vision, 2006.
5. FPGA implementation of ultrasonic flaw detection algorithm based on support vector machine classification [D]. Jiang, Yiyue. 2016.
6. Sleep Quality Detection Based on EEG Signals Using Transfer Support Vector Machine Algorithm [O]. Wu Wen. 2021.
7. Throat polyp detection based on compressed big data of voice with support vector machine algorithm [O]. Wei Wang, Zhangliang Chen, Jiasong Mu. 2014.
8. Learning Algorithms for Audio and Video Processing: Independent Component Analysis and Support Vector Machine Based Approaches [R]. Qi, Y. 2000.
