首页> 外文会议>IEEE Students Conference on Engineering and Systems >Concept Drift Detection in Email Dataset through Intention Based Segmentation
【24h】

Concept Drift Detection in Email Dataset through Intention Based Segmentation

机译:通过基于意图的分割在电子邮件数据集中进行概念漂移检测

获取原文

摘要

In a dynamic environment, detection of changes in concept is very important for prediction and decision-based applications. Concept drift detection helps decision makers to perform smarter maintenance and operations at an appropriate time. In the context of Emails, concept drift is defined as how the concepts in Emails are changing with time. In this paper, a novel method is presented based on intention based segmentation to calculate concept drift in Email dataset. In intention based segmentation instead of comparing the contents of email as whole, it is divided into segments focusing on same intention and compared. Division of sentences is done on the basis of voices and tenses with the help of POS Tagging. After that, Hierarchical Clustering is used for clustering of segments and comparison is done using Vector Space Model. Importance of concept drift detection in Email domain is to find the emails which will be of less interest to user. It also helps to find the current interests of user by analyzing the changes in concepts over time.
机译:在动态环境中,概念更改的检测对于基于预测和决策的应用非常重要。概念漂移检测可帮助决策者在适当的时间执行更明智的维护和操作。在电子邮件的上下文中,概念漂移定义为电子邮件中的概念随时间变化的方式。本文提出了一种基于意图分割的新方法来计算电子邮件数据集中的概念漂移。在基于意图的细分中,不是将电子邮件的整体内容进行比较,而是将其分为关注相同意图的细分并进行比较。借助POS标记,根据语音和时态完成句子的划分。之后,将层次聚类用于段的聚类,并使用向量空间模型进行比较。在电子邮件域中,概念漂移检测的重要性在于找到用户不感兴趣的电子邮件。通过分析概念随时间的变化,它还有助于找到用户的当前兴趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号