Detecting and labeling folk literature in spoken cultural heritage archives using structural and prosodic features

Abstract

Spoken cultural heritage can contain considerably heterogeneous content, such as tales, stories, recitals, poems, theatrical performances and other forms of folk literature. This work investigates the automatic detection and classification of these data types in large spoken audio archives. The corpus used for this study consists of 90 radio broadcast shows collected to preserve a large variety of Swiss French dialects. Given the variability of the language spoken in the recordings, the paper proposes a language-independent system based on structural features obtained using a speaker diarization system and various acoustic/prosodic features. Results reveal that such a system can achieve an F-measure of 0.85 (Precision 0.88/Recall 0.84) in retrieving folk literature in those archives. Prosodic features appear more effective than, and complementary to, structural features. Furthermore, the paper investigates whether the same approach can be used to label speech segments into five broad classes (Storytelling, Poetry, Theatre, Interviews, Functionals), obtaining F-measures ranging from 0.52 to 0.88. As a final contribution, prosodic features for disambiguating between spoken prose and spoken poetry are investigated. In summary, the study shows that simple structural and acoustic/prosodic features can be used to effectively retrieve and label folk literature in broadcast archives.
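
For readers unfamiliar with the metric, the following is a minimal sketch of how an F-measure is conventionally derived from precision and recall. It is not taken from the paper: the function name `f_measure` and the `beta` parameter are illustrative assumptions, and the abstract does not state how the reported scores were averaged.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Return the F-beta score (hypothetical helper, not from the paper).

    With beta = 1 this is the harmonic mean of precision and recall.
    """
    if not (0.0 <= precision <= 1.0 and 0.0 <= recall <= 1.0):
        raise ValueError("precision and recall must lie in [0, 1]")
    denominator = beta ** 2 * precision + recall
    if denominator == 0.0:
        # Both precision and recall are zero: define the score as zero.
        return 0.0
    return (1.0 + beta ** 2) * precision * recall / denominator


if __name__ == "__main__":
    # Rounded precision and recall quoted in the abstract.
    score = f_measure(0.88, 0.84)
    print(f"F1 from rounded P and R: {score:.4f}")
```

Applied to the rounded values quoted above, the harmonic mean comes out at roughly 0.86, close to the reported 0.85. The small gap is presumably due to rounding of the published figures or to averaging over several evaluation runs, which is an assumption here rather than something the abstract states.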

Bibliographic details

Source
2012 10th International Workshop on Content-Based Multimedia Indexing | 2012 | pp. 1-6 | 6 pages
Conference location: Annecy (FR)
Authors
Valente Fabio; Motlicek Petr
Affiliation
Idiap Research Institute, CH-1920 Martigny, Switzerland
Original format: PDF
Language of text: eng
CLC classification: Multimedia technology and multimedia computers
Indexed: 2022-08-26 13:46:04
