首页> 外文会议>International conference on machine learning and data mining >Evaluating Frequent-Set Mining Approaches in Machine-Learning Problems with Several Attributes: A Case Study in Healthcare
【24h】

Evaluating Frequent-Set Mining Approaches in Machine-Learning Problems with Several Attributes: A Case Study in Healthcare

机译:具有多个属性的机器学习问题中的频繁集挖掘方法评估:医疗保健案例研究

获取原文

摘要

Often datasets may involve thousands of attributes, and it is important to discover relevant features for machine-learning (ML) algorithms. Here, approaches that reduce or select features may become difficult to apply, and feature discovery may be made using frequent-set mining approaches. In this paper, we use the Apriori frequent-set mining approach to discover the most frequently occurring features from among thousands of features in datasets where patients consume pain medications. We use these frequently occurring features along with other demographic and clinical features in specific ML algorithms and compare algorithms' accuracies for classifying the type and frequency of consumption of pain medications. Results revealed that Apriori implementation for features discovery improved the performance of a large majority of ML algorithms and decision tree performed better among many ML algorithms. The main implication of our analyses is in helping the machine-learning community solves problems involving thousands of attributes.
机译:数据集通常可能涉及成千上万个属性,因此重要的是发现与机器学习(ML)算法相关的功能。在这里,减少或选择特征的方法可能变得难以应用,并且可以使用频繁设置的挖掘方法进行特征发现。在本文中,我们使用Apriori频繁集挖掘方法从患者使用止痛药的数据集中的数千个特征中发现最频繁出现的特征。在特定的ML算法中,我们将这些频繁出现的特征与其他人口统计特征和临床特征一起使用,并比较算法的准确性以对止痛药的消费类型和频率进行分类。结果表明,用于特征发现的Apriori实现提高了大多数ML算法的性能,并且决策树在许多ML算法中表现更好。我们分析的主要含义是帮助机器学习社区解决涉及数千个属性的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号