首页> 外文会议>Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on >Multiple Sources Classification of Gene Position on Chromosomes Using Statistical Significance of Individual Classification Results
【24h】

Multiple Sources Classification of Gene Position on Chromosomes Using Statistical Significance of Individual Classification Results

机译:利用单个分类结果的统计意义对染色体上基因位置进行多源分类

获取原文

摘要

In data mining applications it is common to have more than one data source available to describe the same record. For example, in biological sciences, the same genes may be characterized through many types of experiments. Which of the data sources proves to be most reliable in predictions may depend on the record in question. For some records pieces of information may be unavailable because an experiment has not yet been done, or certain type of inferences may not be applicable, such as when a gene does not have a homologue in some species. We demonstrate how multi-classifier systems can allow classification in cases where any individual source is scarce or unreliable to provide an accurate prediction model by itself. We propose a method to predict a class label using statistical significance of individual classification results. We show that the proposed approach increases the accuracy of results compared with conventional techniques in a problem related to gene mapping in wheat.
机译:在数据挖掘应用程序中,通常有多个可用的数据源来描述同一记录。例如,在生物科学中,可以通过许多类型的实验来表征相同的基因。哪个数据源被证明在预测中最可靠,可能取决于相关记录。对于某些记录,由于尚未进行实验,或者某些类型的推论(例如某个基因在某些物种中没有同源物),因此可能无法获得信息。我们演示了多分类器系统如何在缺乏或不可靠地单独提供准确的预测模型的情况下允许分类。我们提出了一种使用各个分类结果的统计显着性来预测类别标签的方法。我们表明,在与小麦基因图谱相关的问题中,与传统技术相比,所提出的方法提高了结果的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号