首页> 外文会议>2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences >Workshop: Bioinformatics pipeline for fosmid based molecular haplotype sequencing
【24h】

Workshop: Bioinformatics pipeline for fosmid based molecular haplotype sequencing

机译:讲习班:基于fosmid的分子单倍型测序的生物信息学流水线

获取原文

摘要

A new bioinformatics pipeline for fosmid based analysis was developed by extending the standard SOLiD pipeline for NGS. The experimental approach starts by sequencing pools of up to 15000 DNA molecules called fosmids. Each fosmid has an average length of 40kb and is sampled at random from the genome. The pipeline includes an algorithm for fosmids detection which clusters SOLiD reads aligned to the reference genome based on a custom made set of proximity rules. It also includes a module to make homozygous allele calling on regions identified as potential fosmid locations. These allele calls are collected in a matrix for single individual haplotyping. The pipeline includes a new algorithm for this bioinformatics problem which tries to find the cut of fosmids consistent with their haplotype origin. The algorithm reduces the problem to the well known NP-Complete problem called Max-CUT which was approximately solved by combining well known heuristics. Finally, the algorithm calculates the consensus haplotypes assuming that the cut is correct. After running the pipeline on 48 different pools, 32347 SNPs in 102 blocks on chromosome 22 of an individual with a predicted switch error rate of about 1% were phased.
机译:通过扩展用于NGS的标准SOLiD管道,开发了用于基于化石质分析的新生物信息学管道。实验方法从测序最多15000个称为fosmids的DNA分子的池开始。每个fosmid的平均长度为40kb,并从基因组中随机取样。该流水线包括一种用于检测软体动物的算法,该算法基于一组定制的邻近性规则将与参考基因组对齐的SOLiD读物聚类。它还包括一个模块,用于产生纯合的等位基因,调用识别为潜在的质粒位置的区域。这些等位基因调用被收集在一个矩阵中,用于单个个体单体型分析。该管道包括针对该生物信息学问题的新算法,该算法试图查找与单倍型来源一致的fosmids片段。该算法将问题简化为众所周知的称为Max-CUT的NP-Complete问题,该问题已通过组合众所周知的试探法得到了近似解决。最终,该算法在假定切分正确的情况下计算出一致性单倍型。在48个不同的池上运行管道后,将一个人的22号染色体上102个区块中的32347个SNP进行了阶段转换,预测的转换错误率约为1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号