Conference: The 2nd International Conference on Software Engineering and Data Mining

An empirical study on harmonizing classification precision using IE patterns

Abstract

Web pages are conventionally represented for classification purposes by the words found in their contents. However, word-based web page representation suffers from several limitations, such as synonymy and homonymy. Motivated by these limitations, we explore the potential of representing web pages using information extraction (IE) patterns in addition to the words identified within the web contents. In this paper, we share the results and the findings from our experiments. Our empirical study, conducted using the WebKB dataset, indicates that adding information extraction patterns to the web page representation helps to improve classification precision, especially in categories with highly diverse web content.
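As a rough illustration of the idea of augmenting a word-based representation with pattern-based features, the following minimal sketch may help. It is not the authors' method: the abstract does not specify the patterns or the classifier, so the regular expressions, the `PATTERN:` feature names, and the `represent` function are hypothetical stand-ins chosen only for this example.

```python
import re
from collections import Counter

# Hypothetical stand-ins for IE patterns: each feature name maps to a regex.
# The paper's actual patterns are not given in the abstract.
IE_PATTERNS = {
    "PATTERN:teaches_course": re.compile(
        r"\bteach(?:es|ing)?\s+(?:the\s+)?course\b", re.I
    ),
    "PATTERN:office_hours": re.compile(r"\boffice\s+hours\b", re.I),
}


def word_features(text):
    """Bag-of-words counts over lowercased alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def pattern_features(text):
    """Count how often each illustrative pattern matches the text."""
    counts = Counter()
    for name, regex in IE_PATTERNS.items():
        matches = len(regex.findall(text))
        if matches:
            counts[name] = matches
    return counts


def represent(text):
    """Combine word counts and pattern counts into one sparse feature vector."""
    features = word_features(text)
    # Pattern names carry a "PATTERN:" prefix, so they cannot collide
    # with the purely alphabetic word tokens.
    features.update(pattern_features(text))
    return features


page = "Prof. Lee teaches the course on databases. Office hours: Monday."
print(represent(page))
```

The combined vector could then be passed to any standard text classifier. A pattern feature fires for a multi-word construction as a whole, which is one plausible way such features could help where individual words are ambiguous.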
