【24h】

Multi-scale audio indexing for Chinese spoken document retrieval

机译:用于中文语音文档检索的多尺度音频索引

获取原文
获取外文期刊封面目录资料

摘要

The advent of the information age has brought massive digital libraries of multimedia content. This development creates a high demand for information indexing and retrieval technologies. and the capability of browsing through audio archives is much desired. This paper reports on our initial attempt in the use of syllable units for Chinese spoken document retrieval. Our experiments are based on 1801 news stories from local television broadcasts in Cantonese. a monosyllabic Chinese dialect with a rich tonal structure. Results show that indexing with overlapping bi-syllables (tonal syllables) mapped from text delivers the reference retrieval performance at average inverse rank (AIR)=0.830. Retrieval based on overlapping bisyllables (base syllables) recognized from audio achieved and AIR of 0.460.
机译:信息时代的到来带来了庞大的多媒体内容数字图书馆。这种发展对信息索引和检索技术提出了很高的要求。并且非常需要能够浏览音频档案的功能。本文报告了我们在使用音节单位进行中文语音文档检索方面的初步尝试。我们的实验是基于广东话当地电视台播出的1801个新闻报道。具有丰富音调结构的单音节汉语方言。结果表明,从文本映射的具有重叠双音节(音调音节)的索引在平均逆秩(AIR)= 0.830时提供了参考检索性能。基于实现的重叠双音节(基础音节)进行检索,该音节是从音频和AIR为0.460识别的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号