Conference: IEEE International Conference on Computer and Information Technology
Automatically Organize Web Text Resources with Frequent Term Tree

Abstract

With the expansion of the Web, automatically organizing large-scale text resources, such as Web pages, has become very important. Many Web sites, such as Google and Yahoo, use hierarchical classification trees to organize text resources on the Web. Users can easily find the text resources that meet their requirements by navigating these trees. Typically, text resources on the Web are manually assigned to the nodes of the hierarchical classification tree, which limits the tree's ability to organize large-scale text resources. In this paper, we propose a Frequent Term Tree to improve the ability of hierarchical classification trees to organize large-scale text resources on the Web. Unlike the FP-tree [17], which is used to discover frequent patterns efficiently, the Frequent Term Tree is used to organize resources with frequent-pattern-based classification. The Frequent Term Tree can accurately assign text resources to each node of the classification tree, and its ability to organize resources improves as classified text resources are incrementally added. The evaluation demonstrates that the Frequent Term Tree can organize text resources effectively and efficiently.
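
The abstract does not spell out the tree's construction or its classification rule, so the following Python sketch is only an illustration of the general idea, not the authors' algorithm. It assumes that each category node keeps per-term document counts, that a term is "frequent" at a node when it appears in at least a `min_support` fraction of that node's documents, and that a new document descends to the child whose frequent terms overlap most with its own terms. The names `Node`, `min_support`, `frequent_terms`, `add_document`, and `classify` are all hypothetical.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Node:
    """One category in a hypothetical frequent term tree."""
    label: str
    min_support: float = 0.5  # assumed threshold, not taken from the paper
    doc_count: int = 0
    term_counts: Counter = field(default_factory=Counter)
    children: list = field(default_factory=list)

    def frequent_terms(self) -> set:
        """Terms present in at least min_support of this node's documents."""
        if self.doc_count == 0:
            return set()
        return {term for term, count in self.term_counts.items()
                if count / self.doc_count >= self.min_support}

    def add_document(self, terms: set) -> None:
        """Fold one classified document into this node's term statistics."""
        self.doc_count += 1
        self.term_counts.update(terms)


def classify(root: Node, text: str) -> list:
    """Walk down the tree, choosing the child with the largest overlap
    between the document's terms and the child's frequent terms.
    Every node on the chosen path is updated incrementally."""
    terms = set(text.lower().split())
    path, node = [root.label], root
    root.add_document(terms)
    while node.children:
        scored = [(len(terms & child.frequent_terms()), child)
                  for child in node.children]
        best_score, best = max(scored, key=lambda pair: pair[0])
        if best_score == 0:
            break  # no child shares a frequent term; stay at this node
        best.add_document(terms)
        path.append(best.label)
        node = best
    return path


# Toy example with two seeded categories (made-up data).
root = Node("root")
sports, tech = Node("sports"), Node("tech")
root.children = [sports, tech]
for doc in ["football match score", "football league match"]:
    sports.add_document(set(doc.split()))
for doc in ["python code compiler", "python code editor"]:
    tech.add_document(set(doc.split()))

print(classify(root, "new python code release"))
```

In this toy run the new document shares "python" and "code" with the `tech` node and nothing with `sports`, so it is routed to `tech`, and the node's counts are updated so that later documents are matched against the refreshed frequent terms. A real system would need tokenization, stop-word removal, and frequent term sets rather than single terms; those are left out here.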