首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >SPEAKER DIARIZATION OF HETEROGENEOUS WEB 'VIDEO FILES: A PRELIMINARY STUDY
【24h】

SPEAKER DIARIZATION OF HETEROGENEOUS WEB 'VIDEO FILES: A PRELIMINARY STUDY

机译:异构Web'视频文件的扬声器日益改估:初步研究

获取原文

摘要

In the last ten years, internet as well as its applications changed significantly, mainly thanks to the raising of available personal re-sources. Concerning multimedia, the most impressive evolution is the continuous growing success of the video sharing websites. But with this success come the difficulties to efficiently search, index and access relevant information about these documents. Speaker diarization is an important task in the overall information retrieval process. This paper describes an audio/video database, especially built for the speaker diarization task, based on different video genres. Through some preliminary experiments, it highlights the difficulties encountered in this context, mainly linked to the database heterogeneity.
机译:在过去的十年中,互联网以及其应用程序的升值显着变化,主要归功于提高可用的个人重新来源。关于多媒体,最令人印象深刻的演变是视频共享网站的不断增长的成功。但是,随着这个成功来说,有效地搜索,索引和访问这些文件的相关信息的困难。扬声器日益改估是整体信息检索过程中的重要任务。本文介绍了一种音频/视频数据库,尤其是基于不同的视频类型的扬声器日复速衰期任务。通过一些初步实验,它突出了这种背景下遇到的困难,主要与数据库异质性相关联。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号