首页> 外文期刊>Journal of the American Society for Information Science and Technology >Text-Based Video Content Classification for Online Video-Sharing Sites
【24h】

Text-Based Video Content Classification for Online Video-Sharing Sites

机译:在线视频共享网站的基于文本的视频内容分类

获取原文
获取原文并翻译 | 示例
           

摘要

With the emergence of Web 2.0, sharing personal content, communicating ideas, and interacting with other online users in Web 2.0 communities have become daily routines for online users. User-generated data from Web 2.0 sites provide rich personal information (e.g., personal preferences and interests) and can be utilized to obtain insight about cyber communities and their social networks. Many studies have focused on leveraging user-generated information to analyze blogs and forums, but few studies have applied this approach to video-sharing Web sites. In this study, we propose a text-based framework for video content classification of online-video sharing Web sites. Different types of user-generated data (e.g., titles, descriptions, and comments) were used as proxies for online videos, and three types of text features (lexical, syntactic, and content-specific features) were extracted. Three feature-based classification techniques (C4.5, Naieve Bayes, and Support Vector Machine) were used to classify videos. To evaluate the proposed framework, user-generated data from candidate videos, which were identified by searching user-given keywords on YouTube, were first collected. Then, a subset of the collected data was randomly selected and manually tagged by users as our experiment data.The experimental results showed that the proposed approach was able to classify online videos based on users' interests with accuracy rates up to 87.2%, and all three types of text features contributed to discriminating videos. Support Vector Machine outperformed C4.5 and Naive Bayes techniques in our experiments. In addition, our case study further demonstrated that accurate video-classification results are very useful for identifying implicit cyber communities on video-sharing Web sites.
机译:随着Web 2.0的出现,共享个人内容,交流思想以及与Web 2.0社区中的其他在线用户进行交互已成为在线用户的日常工作。用户从Web 2.0站点生成的数据可提供丰富的个人信息(例如,个人喜好和兴趣),并可用于获取有关网络社区及其社交网络的见解。许多研究集中于利用用户生成的信息来分析博客和论坛,但是很少有研究将这种方法应用于视频共享网站。在这项研究中,我们提出了一个基于文本的框架,用于在线视频共享网站的视频内容分类。不同类型的用户生成的数据(例如标题,描述和评论)被用作在线视频的代理,并且提取了三种类型的文本特征(词法,句法和特定于内容的特征)。三种基于特征的分类技术(C4.5,Naieve Bayes和Support Vector Machine)用于对视频进行分类。为了评估建议的框架,首先收集了通过从YouTube搜索用户指定的关键字来识别的候选视频中的用户生成的数据。然后,随机选择一部分收集的数据并由用户手动标记为我们的实验数据。实验结果表明,该方法能够根据用户兴趣对在线视频进行分类,准确率高达87.2%,并且所有三种文字功能有助于区分视频。在我们的实验中,支持向量机的性能优于C4.5和朴素贝叶斯技术。此外,我们的案例研究进一步表明,准确的视频分类结果对于识别视频共享网站上的隐式网络社区非常有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号