Single-speaker/multi-speaker co-channel speech classification


Abstract

The demand for content-based management and real-time manipulation of audio data is constantly increasing. This paper presents a method for labeling temporal regions within a segment of co-channel speech as either single-speaker or multi-speaker speech. The state-of-the-art approach for this purpose relies on the kurtosis. In this paper, a set of complementary time-domain and frequency-domain features is studied. The classification scheme employed is a one-class SVM classifier. A recognition rate of 94.75% is reached, and the set of features providing the best performance is determined.
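The kurtosis baseline mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation or feature set: the frame length, hop size, function names, and the use of Laplacian noise as a stand-in for speech are all assumptions made for illustration. The idea shown is only that a sum of independent peaky signals tends to have lower kurtosis than either signal alone, which is what makes kurtosis usable as a single-speaker/multi-speaker cue.

```python
import random

def excess_kurtosis(frame):
    """Return the excess kurtosis (fourth standardized moment minus 3)."""
    n = len(frame)
    mean = sum(frame) / n
    m2 = sum((x - mean) ** 2 for x in frame) / n
    m4 = sum((x - mean) ** 4 for x in frame) / n
    if m2 == 0.0:
        return 0.0  # constant frame: kurtosis is undefined, report 0 by convention
    return m4 / (m2 * m2) - 3.0

def frame_kurtosis(signal, frame_len=400, hop=200):
    """Excess kurtosis of each full-length frame (frame sizes are assumptions)."""
    last_start = len(signal) - frame_len
    return [excess_kurtosis(signal[i:i + frame_len])
            for i in range(0, last_start + 1, hop)]

def laplace_samples(n, rng):
    """Hypothetical stand-in for speech: zero-mean Laplacian noise."""
    return [rng.expovariate(1.0) * rng.choice((-1.0, 1.0)) for _ in range(n)]

# Synthetic demo: one "talker" versus the sum of two independent "talkers".
rng = random.Random(0)
talker_a = laplace_samples(100_000, rng)
talker_b = laplace_samples(100_000, rng)
mixture = [a + b for a, b in zip(talker_a, talker_b)]

k_single = excess_kurtosis(talker_a)
k_mixed = excess_kurtosis(mixture)
print(f"single talker: {k_single:.2f}, two talkers: {k_mixed:.2f}")
```

In a classifier along the lines the abstract describes, per-frame values such as those returned by `frame_kurtosis` would be one feature among several time-domain and frequency-domain features; the one-class SVM stage is not sketched here.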


Bibliographic record

Source
Annual conference of the International Speech Communication Association; INTERSPEECH 2010 | 2011 | pp. 2322-2325 | 4 pages
Authors
Stephane Rossignol; Olivier Pietquin
Original format
PDF
Chinese Library Classification
Communications
Keywords
speech segmentation; speaker characterization and recognition


