Conference: International Workshop on Content-Based Multimedia Indexing

Detecting and labeling folk literature in spoken cultural heritage archives using structural and prosodic features

Abstract

Spoken cultural heritage can present considerably heterogeneous content, such as tales, stories, recitals, poems, theatrical representations and other forms of folk literature. This work investigates the automatic detection and classification of these data types in large spoken audio archives. The corpus used for this study consists of 90 radio broadcast shows collected to preserve a large variety of Swiss French dialects. Given the variability of the language spoken in the recordings, the paper proposes a language-independent system based on structural features obtained using a speaker diarization system and on various acoustic/prosodic features. Results reveal that such a system can achieve an F-measure of 0.85 (Precision 0.88/Recall 0.84) in retrieving folk literature in those archives. Prosodic features appear more effective than structural features, and complementary to them. Furthermore, the paper investigates whether the same approach can be used to label speech segments into five broad classes (Storytelling, Poetry, Theatre, Interviews, Functionals), showing F-measures ranging from 0.52 to 0.88. As a final contribution, prosodic features for disambiguating between spoken prose and spoken poetry are investigated. In summary, the study shows that simple structural and acoustic/prosodic features can be used to effectively retrieve and label folk literature in broadcast archives.