首页> 外文会议>2012 10th International Workshop on Content-Based Multimedia Indexing >Detecting and labeling folk literature in spoken cultural heritage archives using structural and prosodic features
【24h】

Detecting and labeling folk literature in spoken cultural heritage archives using structural and prosodic features

机译:使用结构特征和韵律特征在口头文化遗产档案中检测和标记民间文学

获取原文
获取原文并翻译 | 示例

摘要

Spoken cultural heritage can present considerably heterogeneous content as tales, stories, recitals, poems, theatrical representations and other form of folk literature. This work investigates the automatic detection and classification of those data type in large spoken audio archives. The corpus used for this study consists of 90 radio broadcast shows collected for preserving a large variety of Swiss French dialects. Given the variability of the language spoken in the recordings, the paper proposes a language-independent system based on structural features obtained using a speaker diarization system and various acoustic/prosodic features. Results reveal that such a system can achieve an F-measure equal to 0.85 (Precision 0.88/Recall 0.84) in retrieving folk literature in those archives. Prosodic features appear more effective and complementary to structural features. Furthermore, the paper investigates whether the same approach can be used to label speech segments into five large classes (Storytelling, Poetry, Theatre, Interviews, Functionals) showing F-measures ranging from 0.52 to 0.88. As last contribution, prosodic features for disambiguating between spoken prose and spoken poetry are investigated. In summary the study shows that simple structural and acoustic/prosodic features can be used to effectively retrieve and label folk literature in broadcast archives.
机译:口语文化遗产可以表现出相当不同的内容,例如故事,故事,独奏会,诗歌,戏剧作品和其他形式的民间文学。这项工作研究了大型语音档案中这些数据类型的自动检测和分类。本研究使用的语料库包括90个无线电广播节目,这些节目被收集来保存各种瑞士法语方言。考虑到录音中所讲语言的可变性,本文提出了一种基于语言的独立系统,该系统基于使用说话者二分系统和各种声学/韵律特征而获得的结构特征。结果表明,在检索那些档案中的民间文学作品时,这种系统可以实现等于0.85(精度0.88 /召回率0.84)的F值。韵律特征似乎更有效,并且是结构特征的补充。此外,本文研究了是否可以使用相同的方法将语音片段标记为五个大类(讲故事,诗歌,戏剧,访谈,功能类),以显示F值在0.52到0.88之间。作为最后的贡献,研究了用于区分口头散文和口头诗的韵律特征。总之,研究表明,简单的结构和声学/韵律特征可用于有效地检索和标记广播档案中的民间文学作品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号