首页> 外文会议>Progress in Artificial Intelligence >Efficient and Tight Upper Bounds for Haplotype Inference by Pure Parsimony Using Delayed Haplotype Selection
【24h】

Efficient and Tight Upper Bounds for Haplotype Inference by Pure Parsimony Using Delayed Haplotype Selection

机译:利用延迟单倍型选择进行纯简约的单倍型推断的有效且紧的上界

获取原文

摘要

Haplotype inference from genotype data is a key step towards a better understanding of the role played by genetic variations on inherited diseases. One of the most promising approaches uses the pure parsimony criterion. This approach is called Haplotype Inference by Pure Parsimony (HIPP) and is NP-hard as it aims at minimising the number of haplotypes required to explain a given set of genotypes. The HIPP problem is often solved using constraint satisfaction techniques, for which the upper bound on the number of required haplotypes is a key issue. Another very well-known approach is Clark's method, which resolves genotypes by greedily selecting an explaining pair of haplotypes. In this work, we combine the basic idea of Clark's method with a more sophisticated method for the selection of explaining haplotypes, in order to explicitly introduce a bias towards parsimonious explanations. This new algorithm can be used either to obtain an approximated solution to the HIPP problem or to obtain an upper bound on the size of the pure parsimony solution. This upper bound can then used to efficiently encode the problem as a constraint satisfaction problem. The experimental evaluation, conducted using a large set of real and artificially generated examples, shows that the new method is much more effective than Clark's method at obtaining parsimonious solutions, while keeping the advantages of simplicity and speed of Clark's method.
机译:从基因型数据推断单倍型是迈向更好地了解遗传变异对遗传性疾病所起的作用的关键一步。最有前途的方法之一是使用纯简约标准。这种方法被称为纯简约的单倍型推论(HIPP),由于其目的是最大程度地减少解释给定基因型集所需的单倍型数量,因此具有NP难度。 HIPP问题通常使用约束满足技术来解决,对于这些问题,所需单倍型数量的上限是一个关键问题。另一个非常著名的方法是Clark方法,该方法通过贪婪地选择一对解释性单倍型来解析基因型。在这项工作中,我们将Clark方法的基本思想与更复杂的方法用于选择单倍型,以明确引入对简约解释的偏见。这种新算法既可以用于获得HIPP问题的近似解,也可以用于获得纯简约解的大小的上限。然后可以使用此上限将问题有效地编码为约束满足问题。使用大量真实且人工生成的示例进行的实验评估表明,该新方法在获得简约解时比Clark方法更有效,同时保留了Clark方法的简便性和速度优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号