首页> 外文会议>Pattern recognition and machine intelligence >Hypertext Classification Using Tensor Space Model and Rough Set Based Ensemble Classifier
【24h】

Hypertext Classification Using Tensor Space Model and Rough Set Based Ensemble Classifier

机译:基于张量空间模型和基于粗糙集的集成分类器的超文本分类

获取原文
获取原文并翻译 | 示例

摘要

As WWW grows at an increasing speed, a classifier targeted at hypertext has become in high demand. While document categorization is quite a mature, the issue of utilizing hypertext structure and hyperlinks has been relatively unexplored. In this paper, we introduce tensor space model for representing hypertext documents. We exploit the local-structure and neighborhood recommendation encapsulated in the proposed representation model. Instead of using the text on a page for representing features in a vector space model, we have used features on the page and neighborhood features to represent a hypertext document in a tensor space model. Tensor similarity measure is defined. We have demonstrated the use of rough set based ensemble classifier on proposed tensor space model. Experimental results of classification obtained by using our method outperform existing hypertext classification techniques.
机译:随着WWW的增长,对超文本的分类器的需求量很大。虽然文档分类已经相当成熟,但是利用超文本结构和超链接的问题尚未得到充分探讨。在本文中,我们介绍了用于表示超文本文档的张量空间模型。我们利用封装在提出的表示模型中的局部结构和邻域推荐。我们没有使用页面上的文本来表示向量空间模型中的特征,而是使用页面上的特征和邻域特征来表示张量空间模型中的超文本文档。定义了张量相似性度量。我们已经证明了在提出的张量空间模型上使用基于粗糙集的集成分类器。使用我们的方法获得的分类实验结果优于现有的超文本分类技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号