IEEE International Conference on Bioinformatics and Biomedicine

A novel phylogeny-based pattern selection algorithm and its application to microbiomic data

Abstract

Discriminative patterns describe significant differences between types of subjects and often provide insight into critical properties of the problem at hand. Pattern-based classifiers can use discriminative patterns directly to predict unseen samples through a majority-voting or aggregation mechanism. We are therefore concerned not only with finding useful individual patterns but also with the effectiveness of the pattern set as a whole, and it is imperative to ensure that the discriminative patterns are relevant and non-redundant. Few studies have evaluated pattern redundancy by examining the samples covered by the patterns; those that do have focused mostly on the proportion of overlapping samples, which means that much of the information in non-overlapping samples is overlooked. To address this issue, we present a novel pattern selection algorithm that estimates pattern redundancy from not only the proportion of overlapping samples but also the resemblance of non-overlapping samples. The proposed method was applied to two real microbiomic datasets, with the aim of providing new insights into the interactions between microbes and their effects on the host. Compared with other robust classifiers and feature selection heuristics, our pattern selection algorithm produced diverse and compact sets of final patterns with comparable or even superior predictive performance.
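
The abstract does not state the formulas, so the following is only a minimal sketch of the general idea it describes: scoring redundancy between two patterns from both the proportion of overlapping samples and the resemblance of the non-overlapping ones, then keeping a compact, non-redundant set. Everything specific here is an assumption for illustration and not the authors' method: the Jaccard index as the overlap measure, feature agreement between binary sample profiles as a stand-in for resemblance (the phylogeny-based measure indicated by the title is not reproduced), the way the two terms are combined, the greedy selection loop, the 0.7 threshold, and all function and variable names.

```python
from itertools import product


def jaccard(a, b):
    """Proportion of overlapping samples between two coverage sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def sample_similarity(x, y):
    """Fraction of features on which two binary profiles agree (assumed stand-in)."""
    return sum(1 for u, v in zip(x, y) if u == v) / len(x)


def redundancy(cover_a, cover_b, profiles):
    """Assumed score: overlap plus resemblance of the non-overlapping remainder."""
    overlap = jaccard(cover_a, cover_b)
    only_a = cover_a - cover_b
    only_b = cover_b - cover_a
    if only_a and only_b:
        pairs = list(product(only_a, only_b))
        resemblance = sum(
            sample_similarity(profiles[i], profiles[j]) for i, j in pairs
        ) / len(pairs)
    else:
        # No samples to compare on one side: fall back to overlap alone.
        resemblance = 0.0
    return overlap + (1.0 - overlap) * resemblance


def select_patterns(patterns, profiles, max_redundancy=0.7):
    """Greedy pass: visit patterns by descending relevance, keep non-redundant ones."""
    selected = []
    for name, relevance, cover in sorted(patterns, key=lambda p: p[1], reverse=True):
        if all(
            redundancy(cover, kept_cover, profiles) < max_redundancy
            for _, _, kept_cover in selected
        ):
            selected.append((name, relevance, cover))
    return [name for name, _, _ in selected]


# Toy data (hypothetical): sample id -> binary feature profile.
profiles = {
    0: (1, 1, 0, 0),
    1: (1, 1, 0, 0),
    2: (1, 1, 0, 1),
    3: (0, 0, 1, 1),
    4: (0, 0, 1, 1),
}
# Each pattern: (name, relevance score, set of covered sample ids).
patterns = [
    ("p1", 0.9, {0, 1}),
    ("p2", 0.8, {1, 2}),
    ("p3", 0.7, {3, 4}),
]
print(select_patterns(patterns, profiles))
```

In this toy example, p1 and p2 share only one of three covered samples, so an overlap-only criterion would treat them as fairly distinct; the samples they do not share are nearly identical, however, which pushes the combined score above the threshold and causes p2 to be dropped.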