首页> 外文学位 >Concept drift learning and its application to adaptive information filtering.
【24h】

Concept drift learning and its application to adaptive information filtering.

机译:概念漂移学习及其在自适应信息过滤中的应用。

获取原文
获取原文并翻译 | 示例

摘要

Tracking the evolution of user interests is a problem instance of concept drift learning. Keeping track of multiple interest categories is a natural phenomenon as well as an interesting tracking problem because interests can emerge and diminish at different time frames. The first part of this dissertation presents a M&barbelow;ultiple T&barbelow;hree-D&barbelow;escriptor R&barbelow;epresentation (MTDR) algorithm, a novel algorithm for learning concept drift especially built for tracking the dynamics of multiple target concepts in the information filtering domain. The learning process of the algorithm combines the long-term and short-term interest (concept) models in an attempt to benefit from the strength of both models. The MTDR algorithm improves over existing concept drift learning algorithms in the domain.; Being able to track multiple target concepts with a few examples poses an even more important and challenging problem because casual users tend to be reluctant to provide the examples needed, and learning from a few labeled data is generally difficult. The second part presents a computational F&barbelow;ramework for E&barbelow;xtending I&barbelow;ncomplete L&barbelow;abeled D&barbelow;ata S&barbelow;tream (FEILDS). The system modularly extends the capability of an existing concept drift learner in dealing with incomplete labeled data stream. It expands the learner's original input stream with relevant unlabeled data; the process generates a new stream with improved learnability. FEILDS employs a concept formation system for organizing its input stream into a concept (cluster) hierarchy. The system uses the concept and cluster hierarchy to identify the instance's concept and unlabeled data relevant to a concept. It also adopts the persistence assumption in temporal reasoning for inferring the relevance of concepts. Empirical evaluation indicates that FEILDS is able to improve the performance of existing learners particularly when learning from a stream with a few labeled data.; Lastly, a new concept formation algorithm, one of the key components in the FEILDS architecture, is presented. The main idea is to discover intrinsic hierarchical structures regardless of the class distribution and the shape of the input stream. Experimental evaluation shows that the algorithm is relatively robust to input ordering, consistently producing a hierarchy structure of high quality.
机译:跟踪用户兴趣的演变是概念漂移学习的一个问题实例。跟踪多个兴趣类别是一种自然现象,也是一个有趣的跟踪问题,因为兴趣会在不同的时间范围出现并减少。本文的第一部分提出了一种MDR算法,一种用于学习概念漂移的新颖算法,该算法专门为跟踪信息过滤领域中多个目标概念的动态而构建。该算法的学习过程结合了长期和短期兴趣(概念)模型,试图从这两个模型的优势中受益。 MTDR算法改进了该领域中现有的概念漂移学习算法。能够用几个示例跟踪多个目标概念提出了一个甚至更重要和更具挑战性的问题,因为临时用户往往不愿提供所需的示例,并且通常很难从一些标记数据中学习。第二部分介绍了E的计算F线,延伸的I线,不完整的L线,A的D线,ata S的线(FEILDS)。该系统以模块化方式扩展了现有概念漂移学习器处理不完整标记数据流的能力。它使用相关的未标记数据扩展了学习者的原始输入流;该过程产生了一个新的流,具有更好的可学习性。 FEILDS使用概念形成系统来将其输入流组织到概念(集群)层次结构中。系统使用概念和集群层次结构来标识实例的概念和与概念相关的未标记数据。它还在时间推理中采用持久性假设,以推断概念的相关性。实证评估表明,FEILDS能够提高现有学习者的表现,特别是在从带有少量标记数据的流中学习时。最后,提出了一种新的概念形成算法,它是FEILDS体系结构中的关键组成部分。主要思想是发现固有的层次结构,而不管类的分布和输入流的形状如何。实验评估表明,该算法对输入排序相对稳健,始终如一地产生高质量的层次结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号