Speaker Role Recognition using question detection and characterization

机译：使用问题检测和表征的说话人角色识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech Data Mining is an area of research dedicated to characterizing audio streams that contain speech from one or more speakers, using descriptors related to the form and the content of the speech signal. Besides the word transcription, information about the type of audio stream and the role and identity of speakers is also crucial to allow complex queries such as: "seek debates on X", "find all the interviews of Y", etc. In this framework we present a study performed on broadcast conversations that focuses on the way speakers express their questions in conversations. The initial intuition is that the type of questions asked can help identify the role (anchor, guest, expert, etc.) of a speaker in a conversation. By tagging these questions with a set of labels and using this information in addition to the commonly used descriptors to classify users' role in broadcast conversations, we improve the role classification accuracy and validate our initial intuition.

机译：语音数据挖掘是一个研究领域，致力于使用与语音信号的形式和内容有关的描述符来表征包含来自一个或多个扬声器的语音的音频流。除转录一词外，有关音频流类型以及讲话者的角色和身份的信息对于允许进行复杂的查询也至关重要，例如：“在X上进行辩论”，“在Y上进行所有采访”等。在此框架中我们提供了一项针对广播对话的研究，重点是演讲者在对话中表达问题的方式。最初的直觉是，所询问的问题类型可以帮助确定对话中讲话者的角色（主持人，嘉宾，专家等）。通过使用一组标签标记这些问题，并使用此信息以及常用的描述符对广播对话中用户的角色进行分类，我们提高了角色分类的准确性并验证了我们的直觉。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2011》|2011年|p.1344-1347|共4页
会议地点
作者
Thierry Bazillon; Benjamin Maza; Michael Rouvier; Frederic Bechet; Alexis Nasr;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
spoken language understanding; speech data mining; speaker role classification; question detection;

机译：口语理解;语音数据挖掘演讲者角色分类;问题检测;

相似文献

外文文献
中文文献
专利

1. A REVIEW ON VOICE ACTIVITY DETECTION AND MEL-FREQUENCY CEPSTRAL COEFFICIENTS FOR SPEAKER RECOGNITION (TREND ANALYSIS) [J] . P Mahalakshmi Asian Journal of Pharmaceutical and Clinical Research . 2016,第9期

机译：扬声器识别的语音活动检测和熔体倒谱系数综述（趋势分析）
2. Novel Detection Algorithm of Speech Activity and the impact of Speech Codecs on Remote Speaker Recognition System [J] . RIADH AJGOU, SALIM SBAA, SAID GHENDIR, WSEAS Transactions on Signal Processing . 2014,第Pta1期

机译：语音活动的新型检测算法及语音编解码器对远程讲话者识别系统的影响
3. A study of voice activity detection techniques for NIST speaker recognition evaluations [J] . Man-Wai Mak, Hon-Bill Yu Computer speech and language . 2014,第1期

机译：用于NIST说话人识别评估的语音活动检测技术的研究
4. Investigation of Spontaneous Speech Characterization Applied to Speaker Role Recognition [C] . Richard Dufour, Yannick Esteve, Paul Dettglise Annual conference of the International Speech Communication Association;INTERSPEECH 2011 . 2011

机译：自发性语音表征在说话人角色识别中的研究
5. Characterization of Speaker Recognition in Noisy Channels [D] . Ghilduta, Robert 2012

机译：嘈杂渠道中扬声器识别的特征
6. Hypothesis testing for evaluating a multimodal pattern recognition framework applied to speaker detection [O] . Patricia Besson, Murat Kunt 2008

机译：假设测试用于评估应用于说话人检测的多模式模式识别框架
7. Speaker Change Detection and Speaker Clustering Using VQ Distortion for Broadcast News Speech Recognition [O] . Kazumasa Mori, Seiichi Nakagawa 2001

机译：利用VQ失真进行广播新闻语音识别的扬声器变化检测和扬声器聚类
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Speaker Role Recognition using question detection and characterization

摘要

著录项

相似文献

相关主题

期刊订阅