Speaker Clustering Aided by Visual Dialogue Analysis

Abstract

Speaker clustering aims to automatically group speech segments by speaker. With speaker clustering, we can discover the main cast list of a long video and retrieve each cast member's relevant clips for efficient browsing. In this paper, we propose a dialogue-supervised speaker clustering method, which uses the results of visual dialogue analysis to improve speaker clustering performance. Compared with the traditional approach based only on acoustic features, the dialogue-supervised approach significantly improves clustering results on movies and TV series.

Bibliographic details

Source
Advances in Multimedia Information Processing - PCM 2008, 2008, pp. 693-702 (10 pages)
Conference location: Tainan (CT)
Authors
Shuang Zhang; Wei Hu; Tao Wang; Jia Liu; Yimin Zhang
Author affiliations

Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China

Intel China Research Center, Beijing, P.R. China

Intel China Research Center, Beijing, P.R. China

Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China

Intel China Research Center, Beijing, P.R. China

Original format: PDF
Language: English
Chinese Library Classification: Computer networks
Keywords
speaker clustering; dialogue analysis; speech segmentation
