SPEAKER DIARIZATION OF HETEROGENEOUS WEB VIDEO FILES: A PRELIMINARY STUDY

Abstract

In the last ten years, internet as well as its applications changed significantly, mainly thanks to the raising of available personal resources. Concerning multimedia, the most impressive evolution is the continuous growing success of the video sharing websites. But with this success come the difficulties to efficiently search, index and access relevant information about these documents. Speaker diarization is an important task in the overall information retrieval process. This paper describes an audio/video database, especially built for the speaker diarization task, based on different video genres. Through some preliminary experiments, it highlights the difficulties encountered in this context, mainly linked to the database heterogeneity.

Bibliographic record

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2011 | 4 pages
Authors
Pierre CLEMENT; Thierry BAZILLON; Corinne FREDOUILLE
Original format: PDF
Chinese Library Classification: TN912-53
Keywords
speaker diarization; heterogeneous web videos; diarization error rate

