首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Speaker Role Recognition using question detection and characterization
【24h】

Speaker Role Recognition using question detection and characterization

机译:使用问题检测和表征的说话人角色识别

获取原文

摘要

Speech Data Mining is an area of research dedicated to characterizing audio streams that contain speech from one or more speakers, using descriptors related to the form and the content of the speech signal. Besides the word transcription, information about the type of audio stream and the role and identity of speakers is also crucial to allow complex queries such as: "seek debates on X", "find all the interviews of Y", etc. In this framework we present a study performed on broadcast conversations that focuses on the way speakers express their questions in conversations. The initial intuition is that the type of questions asked can help identify the role (anchor, guest, expert, etc.) of a speaker in a conversation. By tagging these questions with a set of labels and using this information in addition to the commonly used descriptors to classify users' role in broadcast conversations, we improve the role classification accuracy and validate our initial intuition.
机译:语音数据挖掘是一个研究领域,致力于使用与语音信号的形式和内容有关的描述符来表征包含来自一个或多个扬声器的语音的音频流。除转录一词外,有关音频流类型以及讲话者的角色和身份的信息对于允许进行复杂的查询也至关重要,例如:“在X上进行辩论”,“在Y上进行所有采访”等。在此框架中我们提供了一项针对广播对话的研究,重点是演讲者在对话中表达问题的方式。最初的直觉是,所询问的问题类型可以帮助确定对话中讲话者的角色(主持人,嘉宾,专家等)。通过使用一组标签标记这些问题,并使用此信息以及常用的描述符对广播对话中用户的角色进行分类,我们提高了角色分类的准确性并验证了我们的直觉。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号