Generating web-based corpora for video transcripts categorization

Jose M. Perea-Ortega; Arturo Montejo-Raez; M. Teresa Martin-Valdivia; L Alfonso Urena-Lopez

首页> 外文期刊>Expert Systems with Application >Generating web-based corpora for video transcripts categorization

【24h】

Generating web-based corpora for video transcripts categorization

机译：生成基于Web的语料库以进行视频笔录分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes the use of Internet as a rich source of information in order to generate learning corpora for video transcripts categorization systems. Our main goal in this work has been to study the behavior of different learning corpora generated from the Internet and analyze some of their features. Specifically, Wikipedia, Google and the blogosphere have been employed to generate these learning corpora, using the VideoCLEF 2008 track as the evaluation framework for the different experiments carried out. Based on this evaluation framework, we conclude that the proposed approach is a promising strategy for the video classification task using the transcripts of the videos. The different sizes of the corpora generated could lead to believe that better results are achieved when the corpus size is larger, but we demonstrate that this feature may not always be a reliable indicator of the behavior of the learning corpus. The obtained results show that the integration of knowledge from the blogosphere or Google allows generating more reliable corpora for this task than those based on Wikipedia.

机译：本文提出使用Internet作为丰富的信息源，以便为视频笔录分类系统生成学习语料库。我们这项工作的主要目标是研究从互联网生成的不同学习语料库的行为并分析其某些功能。具体而言，已将Wikipedia，Google和Blogosphere用于生成这些学习语料库，并使用VideoCLEF 2008跟踪作为进行的不同实验的评估框架。基于此评估框架，我们得出结论，对于使用视频转录本的视频分类任务，所提出的方法是一种很有前途的策略。生成的语料库的大小不同可能会导致人们相信，当语料库大小较大时，可以获得更好的结果，但是我们证明了此功能可能并不总是可靠地指示学习语料库的行为。获得的结果表明，与基于Wikipedia的知识库相比，来自Blogosphere或Google的知识集成可以为该任务生成更可靠的语料库。

著录项

来源
《Expert Systems with Application》 |2013年第1期|337-344|共8页
作者
Jose M. Perea-Ortega; Arturo Montejo-Raez; M. Teresa Martin-Valdivia; L Alfonso Urena-Lopez;
展开▼
作者单位

SINAI Research Group. Computer Science Department, University of Jaen, 23071 Jaen, Spain;

SINAI Research Group. Computer Science Department, University of Jaen, 23071 Jaen, Spain;

SINAI Research Group. Computer Science Department, University of Jaen, 23071 Jaen, Spain;

SINAI Research Group. Computer Science Department, University of Jaen, 23071 Jaen, Spain;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
video transcripts categorization; video tagging; web-based corpora generation; automatic speech recognition (ASR);

机译：视频笔录分类;视频标记;基于网络的语料库生成;自动语音识别（ASR）;

相似文献

外文文献
中文文献
专利

1. Contrasting user generated videos versus brand generated videos in ecommerce [J] . Diwanji Vaibhav S., Cortese Juliann Journal of retailing and consumer services . 2020,第May期

机译：对比用户生成的视频与电子商务中的品牌生成视频
2. Categorization of Unorganized Text Corpora for better Domain-Specific Language Modeling [J] . Advances in Electrical and Electronic Engineering . 2013,第5期

机译：分类非组织文本语料库，以实现更好的领域特定语言建模
3. Fast method of video genre categorization for temporally aggregated broadcast videos [J] . Choros Kazimierz Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2019,第6aPta1期

机译：用于时间汇总广播视频的视频类型分类的快速方法
4. Joint categorization of queries and clips for web-based video search [C] . Ruofei Zhang, Ramesh Sarukkai, Jyh-Herng Chow, Proceedings of the 8th ACM international workshop on Multimedia information retrieval . 2006

机译：基于Web的视频搜索的查询和剪辑的联合分类
5. Automatic video categorization for massively large corpora: A paradigm shift for applications in lane tracking. [D] . Yang, Xuebo. 2010

机译：大型语料库的自动视频分类：车道跟踪中应用的范例转变。
6. Use of Web-Based Videos in a Community Pharmacy to Optimize Inhalation Technique [O] . Tobias Müller, Maike Möller, Christian Lücker, 2020

机译：在社区药房中使用基于网络的视频来优化吸入技术
7. Semi-automatic Categorization of Videos on VideoLectures.net [O] . Miha Grcar, Dunja Mladenic, Peter Kese 2009

机译：VideOccectures.net上的视频半自动分类

Generating web-based corpora for video transcripts categorization

摘要

著录项

相似文献

相关主题

期刊订阅