首页> 外文会议>Algorithms in bioinformatics >Reducing Multi-state to Binary Perfect Phylogeny with Applications to Missing, Removable, Inserted, and Deleted Data
【24h】

Reducing Multi-state to Binary Perfect Phylogeny with Applications to Missing, Removable, Inserted, and Deleted Data

机译:将多态还原为二进制完美的系统发育,并适用于丢失,可移动,插入和删除的数据

获取原文
获取原文并翻译 | 示例

摘要

Multi-State Perfect Phylogeny is an extension of Binary Perfect Phylogeny where characters are allowed more than two states. In this paper we consider four problems that extend its utility: In the Missing Data (MD) Problem some entries in the input are missing and the question is whether (bounded) values can be imputed so that the resulting data has a multi-state Perfect Phylogeny; In the Character-Removal (CR) Problem we want to minimize the number of characters to remove from the data so that the resulting data has a multi-state Perfect Phylogeny; In the Missing-Data Character-Removal (MDCR) Problem we want to impute values for the missing data to minimize the solution to the resulting Character-Removal Problem; In the Insertion and Deletion (ID) Problem insertion and deletion mutational events spanning multiple characters are also allowed. In this paper, we introduce a new general conceptual solution to these four problems. The method reduces fc-state problems to binary problems with missing data. This gives a new conceptual solution to the multi-state Perfect Phylogeny problem, and conceptual solutions to the MD, CR, MDCR and ID problems for any k significantly improving previous work. Empirical evaluations of our implementations show that they are faster and effective for larger input than previously established methods for general k.
机译:多状态完美系统发生学是二进制完美系统发生学的扩展,其中字符被允许具有两个以上的状态。在本文中,我们考虑了四个问题,这些问题扩展了其效用:在数据丢失(MD)问题中,输入中的某些条目丢失了,问题是是否可以对(有界)值进行插值,从而使所得数据具有多状态Perfect系统发育在字符删除(CR)问题中,我们希望最小化要从数据中删除的字符数量,以使生成的数据具有多态的完美系统发育;在缺少数据的字符删除(MDCR)问题中,我们希望为丢失的数据计算值,以最大程度地减少由此产生的字符删除问题的解决方案。在插入和删除(ID)中,还允许跨多个字符的问题插入和删除突变事件。在本文中,我们为这四个问题引入了新的一般概念解决方案。该方法将fc状态问题简化为缺少数据的二进制问题。这为多态完美系统发育问题提供了新的概念性解决方案,为任何k都显着改善了以前的工作提供了MD,CR,MDCR和ID问题的概念性解决方案。对我们的实现的经验评估表明,与以前为通用k建立的方法相比,它们对于更大的输入更快,更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号