首页> 外文期刊>Decision support systems >On learning to predict Web traffic
【24h】

On learning to predict Web traffic

机译:学习预测网络流量

获取原文
获取原文并翻译 | 示例
       

摘要

The ease of collecting data about customers through the Internet has facilitated the process of developing large repositories of data. These data can and do contain patterns that are useful for the decision maker. Knowledge discovery and data mining methods have been widely used to extract these patterns. It is acknowledged that about 80% of the resources in a majority of data mining applications are spent on cleaning and preprocessing the data. However, there have been relatively few studies on preprocessing data used as input in these data mining systems. In this study, we present a feature selection method based on the Hausdorff distance measure, and evaluate its effectiveness in preprocessing input data for inducing decision trees. Message traffic data from a Web site are used to illustrate performance of the proposed method.
机译:通过Internet收集有关客户的数据的便捷性促进了开发大型数据存储库的过程。这些数据可以而且确实包含对决策者有用的模式。知识发现和数据挖掘方法已被广泛用于提取这些模式。公认的是,在大多数数据挖掘应用程序中,大约80%的资源都用于清理和预处理数据。但是,关于在这些数据挖掘系统中用作输入的预处理数据的研究相对较少。在这项研究中,我们提出一种基于Hausdorff距离测度的特征选择方法,并评估其在预处理输入数据以诱导决策树的有效性。来自网站的消息流量数据用于说明所提出方法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号