Wikipedia based semantic metadata annotation of audio transcripts

Abstract

A method to automatically annotate video items with semantic metadata is presented. The method was developed in the context of the Papyrus project to annotate documentary-like broadcast videos with a set of relevant keywords, using automatic speech recognition (ASR) transcripts as a primary complementary resource. The task is complicated by the high word error rate (WER) of ASR on this kind of video. For this reason, a novel relevance criterion based on domain information is proposed. Wikipedia is used both as a source of metadata and as a linguistic resource for disambiguating keywords and for eliminating off-topic or out-of-domain keywords. Documents are annotated with relevant links to Wikipedia pages, concept definitions, synonyms, translations, and concept categories.
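
The abstract does not spell out the relevance criterion, so the following is only an illustrative sketch of the general idea of dropping out-of-domain keywords, not the authors' method. It assumes each candidate keyword can be mapped to a set of categories; the `TOY_CATEGORIES` table, the `filter_by_domain` function, and the `min_support` threshold are all hypothetical stand-ins (a real system would draw categories from Wikipedia itself).

```python
from collections import Counter

# Hypothetical toy lookup standing in for Wikipedia category data.
TOY_CATEGORIES = {
    "volcano": {"Geology", "Natural hazards"},
    "magma": {"Geology"},
    "eruption": {"Geology", "Natural hazards"},
    "guitar": {"Music"},  # imagined ASR misrecognition, unrelated to the topic
}


def filter_by_domain(keywords, categories, min_support=2):
    """Keep keywords that share at least one category with enough candidates.

    A category's support is the number of candidate keywords that carry it
    (the keyword itself included). Keywords with no known category are dropped.
    """
    support = Counter()
    for kw in keywords:
        for cat in categories.get(kw, ()):
            support[cat] += 1
    return [
        kw
        for kw in keywords
        if any(support[cat] >= min_support for cat in categories.get(kw, ()))
    ]


kept = filter_by_domain(["volcano", "magma", "guitar", "eruption"], TOY_CATEGORIES)
print(kept)  # ['volcano', 'magma', 'eruption']
```

In this toy run, "guitar" is the only candidate in its category, so it is treated as out of domain, which mirrors how a misrecognized word in a noisy transcript might be discarded.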

Bibliographic details

Source
11th International Workshop on Image Analysis for Multimedia Interactive Services | 2010 | pp. 1-4 | 4 pages
Authors
Paci Giulio; Pedrazzi Giorgio; Turra Roberta
Original format: PDF
Chinese Library Classification: Multimedia technology and multimedia computers

Similar documents

1. Semantic Annotation of Documents Based on Wikipedia Concepts [J]. Janez Brank, Gregor Leban, Marko Grobelnik. Informatica: An International Journal of Computing and Informatics. 2018, No. 1
2. A Wikipedia-Based Framework for Collaborative Semantic Annotation [J]. N. Fernandez, J. A. Fisteus, D. Fuentes. International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms. 2011, No. 5
3. ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository [J]. Martin Dugas, Alexandra Meidt, Philipp Neuhaus. BMC Medical Research Methodology. 2016, No. 1
4. Wikipedia based semantic metadata annotation of audio transcripts [C]. Paci Giulio, Pedrazzi Giorgio, Turra Roberta. 11th International Workshop on Image Analysis for Multimedia Interactive Services. 2010
5. Faceted metadata for annotation and retrieval of Web-based information [D]. Liu, Yuancheng. 2006
6. ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository [O]. Martin Dugas, Alexandra Meidt, Philipp Neuhaus. 2016
