首页> 外文会议>ISPE International Conference on Concurrent Engineering >A method for content-based news story classification in data mining
【24h】

A method for content-based news story classification in data mining

机译:数据挖掘中基于内容的新闻故事分类的方法

获取原文

摘要

Multimedia data mining is a sub field of data mining that deals with the mining of high-level multimedia information and implicit knowledge from large multimedia databases, and the classification is one of data mining modules for mining knowledge in multimedia databases. In this paper, a new method is presented to detect anchorperson shots automatically for digital TV news programs, then we use video OCR technique to extract text from news video stream, finally, Transductive Support Vector Machine (TSVM) is used to perform automated classification of news stories based on the texts obtained from OCR process for the first time. TSVM takes into account a particular test set and try to minimize misclassifications of just those particular examples. Experimental results show that TSVM is better than other learning algorithms such as Decision Trees and SVM, especially for small training sets.
机译:多媒体数据挖掘是数据挖掘的子领域,这些挖掘处理高级多媒体信息和来自大型多媒体数据库的隐式知识的挖掘,并且分类是用于多媒体数据库中的挖掘知识的数据挖掘模块之一。在本文中,提出了一种新方法来检测数字电视新闻节目的自动检测锚点镜头,然后我们使用视频OCR技术从新闻视频流中提取文本,最后,转换支持向量机(TSVM)用于执行自动分类基于第一次从OCR过程获得的文本的新闻故事。 TSVM考虑了特定的测试集,并尝试最小化仅仅是那些特定示例的错误分类。实验结果表明,TSVM比其他学习算法更好,如决策树和SVM,尤其是小型训练集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号