首页> 外文会议>8th World Multi-Conference on Systemics, Cybernetics and Informatics(SCI 2004) vol.1: Information Systems, Technologies and Applications >A Large-Itemset-Based Index Structure for Supporting Personalized Information Filtering on the Internet
【24h】

A Large-Itemset-Based Index Structure for Supporting Personalized Information Filtering on the Internet

机译:基于大项目集的索引结构,用于支持Internet上的个性化信息过滤

获取原文
获取原文并翻译 | 示例

摘要

The World Wide Web creates many new challenges for information retrieval. Information Filtering (IF) can find good matches between the web pages and the users' information needs. In an information filtering system, users are associated with profiles that describe what they need, while data are represented in the same form of user profiles. Comparing data with profiles, the users who are interested in the data are identified and informed. Therefore, a critical issue of the information filtering service is how to index the user profiles for an efficient matching process. In this paper, first, we propose a count-based tree method, to reduce the large storage space as needed by Yan and Garcia-Molina's tree method. Next, by applying the technique for mining association rules, we propose a large-item set-based method, the count-major large item-set, to further reduce the storage space. From our simulation results, the cost of storage space of our methods is less than that of the tree method.
机译:万维网为信息检索带来了许多新挑战。信息过滤(IF)可以找到网页与用户信息需求之间的良好匹配。在信息过滤系统中,用户与描述他们需要的配置文件相关联,而数据以相同形式的用户配置文件表示。将数据与配置文件进行比较,即可识别并告知对数据感兴趣的用户。因此,信息过滤服务的关键问题是如何为有效的匹配过程索引用户配置文件。在本文中,首先,我们提出了一种基于计数的树方法,以减少Yan和Garcia-Molina的树方法所需的大存储空间。接下来,通过应用挖掘关联规则的技术,我们提出了一种基于大项目集的方法,即计数-大型项目集,以进一步减少存储空间。从我们的仿真结果来看,我们的方法的存储空间成本比树方法的成本低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号