首页> 美国卫生研究院文献>other >ExSTraCS 2.0: Description and Evaluation of a Scalable Learning Classifier System
【2h】

ExSTraCS 2.0: Description and Evaluation of a Scalable Learning Classifier System

机译:ExSTraCS 2.0:可扩展学习分类器系统的描述和评估

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Algorithmic scalability is a major concern for any machine learning strategy in this age of ‘big data’. A large number of potentially predictive attributes is emblematic of problems in bioinformatics, genetic epidemiology, and many other fields. Previously, ExS-TraCS was introduced as an extended Michigan-style supervised learning classifier system that combined a set of powerful heuristics to successfully tackle the challenges of classification, prediction, and knowledge discovery in complex, noisy, and heterogeneous problem domains. While Michigan-style learning classifier systems are powerful and flexible learners, they are not considered to be particularly scalable. For the first time, this paper presents a complete description of the ExS-TraCS algorithm and introduces an effective strategy to dramatically improve learning classifier system scalability. ExSTraCS 2.0 addresses scalability with (1) a rule specificity limit, (2) new approaches to expert knowledge guided covering and mutation mechanisms, and (3) the implementation and utilization of the TuRF algorithm for improving the quality of expert knowledge discovery in larger datasets. Performance over a complex spectrum of simulated genetic datasets demonstrated that these new mechanisms dramatically improve nearly every performance metric on datasets with 20 attributes and made it possible for ExSTraCS to reliably scale up to perform on related 200 and 2000-attribute datasets. ExSTraCS 2.0 was also able to reliably solve the 6, 11, 20, 37, 70, and 135 multiplexer problems, and did so in similar or fewer learning iterations than previously reported, with smaller finite training sets, and without using building blocks discovered from simpler multiplexer problems. Furthermore, ExS-TraCS usability was made simpler through the elimination of previously critical run parameters.
机译:在当今“大数据”时代,算法可伸缩性是任何机器学习策略的主要关注点。大量潜在的预测属性是生物信息学,遗传流行病学和许多其他领域中问题的象征。以前,ExS-TraCS是作为扩展的密歇根州式有监督学习分类器系统引入的,该系统结合了一组强大的启发式方法,可以成功解决复杂,嘈杂和异构问题域中分类,预测和知识发现的挑战。虽然密歇根州风格的学习分类器系统功能强大且灵活,但并未被认为具有特别的可扩展性。本文首次全面介绍了ExS-TraCS算法,并介绍了一种有效提高学习分类器系统可伸缩性的有效策略。 ExSTraCS 2.0通过以下方式解决了可扩展性:(1)规则特异性限制;(2)专家知识指导的覆盖和变异机制的新方法;(3)TuRF算法的实现和利用,以提高大型数据集中专家知识发现的质量。在复杂的模拟遗传数据集范围内的性能表明,这些新机制显着改善了具有20个属性的数据集上几乎每个性能指标,并使ExSTraCS能够可靠地扩展以在相关200个和2000个属性的数据集上执行。 ExSTraCS 2.0还能够可靠地解决6、11、20、37、70和135个多路复用器的问题,并且与以前报告的学习迭代次数相近或更少,而且训练集更小,而且无需使用从更简单的多路复用器问题。此外,通过消除以前的关键运行参数,使ExS-TraCS的可用性变得更简单。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号