首页> 外文会议>Proceedings of the IASTED international conferences on informatics >A PROCEDURE FOR BUILDING REDUCED RELIABLE TRAINING DATASETS FROM REAL-WORLD DATA
【24h】

A PROCEDURE FOR BUILDING REDUCED RELIABLE TRAINING DATASETS FROM REAL-WORLD DATA

机译:从实数数据构建减少的可靠训练数据集的过程

获取原文
获取原文并翻译 | 示例

摘要

Dimensionality reduction and anomalous data detection are important tasks in machine learning and data mining applications. Many real-world datasets are affected by errors and variable redundancy and this fact can generate problems when the data are used to develop accurate models exploiting some training procedures for parameters tuning. In this paper an automatic procedure is proposed combining detection of unreliable data and reduction of dimensionality to be adopted before exploiting the data to develop a model for prediction purposes. The method has been tested on several datasets belonging to the UCI repository and industrial fields. The results of tests are showed and discussed in the paper. The proposed approach provide a good prediction accuracy providing a minimal but essential dataset.
机译:降维和异常数据检测是机器学习和数据挖掘应用程序中的重要任务。许多现实世界的数据集都受到错误和变量冗余的影响,当使用数据通过一些参数调整的训练程序来开发准确的模型时,此事实可能会产生问题。在本文中,提出了一种自动程序,该程序结合了对不可靠数据的检测和降维,在开发数据以开发用于预测目的的模型之前将其采用。该方法已在属于UCI存储库和工业领域的几个数据集上进行了测试。测试结果在本文中显示和讨论。所提出的方法提供了良好的预测精度,从而提供了最小但必不可少的数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号