首页> 外文会议>Latin American Computing Conference >Computing translocation distance by a genetic algorithm
【24h】

Computing translocation distance by a genetic algorithm

机译:用遗传算法计算易位距离

获取原文

摘要

Translocation is a useful operation on strings with challenging questions in combinatorics of permutations and interesting applications in analysis of sequences. A translocation operation essentially is the interchange of prefixes and suffixes among two substrings of a string. For the case of genomes represented as strings, symbols that represent genes and chromosomes are modeled as substrings of the genomes; thus, translocation is an operation that models the interaction between chromosomes inside a genome. The translocation distance between two genomes is defined as the minimum number of translocations to convert one genome into another and has been proved to be a meaningful manner of modeling the evolutive distance between organisms. The particular case of unsigned genomes, those in which the orientation of the genes are not considered, is particularly difficult, while the signed case, in which the orientation of genes is considered, has been proved to be polynomially decidable. This paper presents an innovative Genetic Algorithm (GA) approach to solve the unsigned translocation distance problem. A distinguishing feature of the proposed GA is that it uses as fitness function the translocation distance for randomly generated signed versions of the input (that is an unsigned genome). Experiments over randomly generated strings (synthetic genomes) showed that the proposed GA approach computes answers that are better than those computed by an L5+ε-approximation algorithm, the latter also implemented as part of this work.
机译:易位是对字符串的有用操作,它在排列组合中具有挑战性的问题,并且在序列分析中很有用。移位操作本质上是字符串的两个子字符串之间的前缀和后缀的互换。对于以字符串表示的基因组,代表基因和染色体的符号被建模为基因组的子字符串。因此,易位是一种模拟基因组内部染色体之间相互作用的操作。将两个基因组之间的易位距离定义为将一个基因组转化为另一个基因组的最小易位数,并且已被证明是对生物体之间进化距离进行建模的一种有意义的方式。未签名基因组的特殊情况,即不考虑基因方向的基因组,特别困难,而已考虑基因取向的已签名情况,已被证明是多项式决定的。本文提出了一种创新的遗传算法(GA)方法来解决无符号易位距离问题。拟议的遗传算法的一个显着特征是,它将随机生成的输入带符号版本(即无符号基因组)的易位距离用作适应度函数。对随机生成的字符串(合成基因组)进行的实验表明,所提出的GA方法比L5 +ε-近似算法所计算出的答案更好,后者也是该工作的一部分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号