Finding anchors for genomic sequence comparison

机译：寻找锚进行基因组序列比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent sequencing of the human and other mammalian genomes has brought about the necessity to align them, to identify and characterize their commonalities and differences. Programs that align whole genomes generally use a seed-and-extend technique, starting from exact or near-exact matches and selecting a reliable subset of these, called anchors, and then filling in the remaining portions between the anchors using a combination of local and global alignment algorithms, but their choices for the parameters so far have been primarily heuristic. We present a statistical framework and practical methods for selecting a set of matches that is both sensitive and specific and can constitute a reliable set of anchors for a one-to-one mapping of two genomes from which a whole-genome alignment can be built. Starting from exact matches, we introduce a novel per-base repeat annotation, the $Z$-score, from which noise and repeat filtering conditions are explored. Dynamic programming-based chaining algorithms are also evaluated as context-based filters. We apply the methods described here to the comparison of two progressive assemblies of the human genome, NCBI build 28 and build 34 http://genome.ucsc.edu), and show that a significant portion of the two genomes can be found in selected exact matches, with very limited amount of sequence duplication.

机译：人类和其他哺乳动物基因组的最新测序带来了对它们进行比对，鉴定和表征其共性和差异的必要性。对齐整个基因组的程序通常使用种子扩展技术，从精确或接近精确的匹配开始，选择一个可靠的子集，称为锚，然后使用局部和局部组合填充锚之间的其余部分。全局对齐算法，但到目前为止，它们对参数的选择主要是启发式的。我们提供了一个统计框架和实用的方法，用于选择一组既敏感又特异的匹配项，可以构成一组可靠的锚，用于两个基因组的一对一映射，从中可以构建全基因组比对。从完全匹配开始，我们介绍了一种新颖的每碱基重复注释，即$ Z $分数，从中可以探讨噪声和重复过滤条件。基于动态编程的链接算法也被评估为基于上下文的过滤器。我们将此处描述的方法用于比较人类基因组的两个渐进装配，即NCBI build 28和build 34 http://genome.ucsc.edu ），并表明在选定的精确匹配中可以找到两个基因组，序列重复的数量非常有限。 展开▼

著录项

来源
《International conference on Computational molecular biology;Annual international conference on Computational molecular biology》|2004年|P.233-241|共9页

会议地点

作者
Ross A. Lippert; Xiaoyue Zhao; Liliana Florea; Clark Mobarry; Sorin Istrail;
展开▼

作者单位

展开▼

会议组织

原文格式 PDF

正文语种

中图分类计算技术、计算机技术;

关键词
whole-genome alignments;

机译：全基因组比对;

相似文献

外文文献

中文文献

专利

1. Finding anchors for genomic sequence comparison [J] . Lippert RA, Zhao XY, Florea L, Journal of computational biology: A journal of computational molecular cell biology . 2005,第6期

机译：寻找锚进行基因组序列比较

2. Development and characterization of tomato SSR markers from genomic sequences of anchored BAC clones on chromosome 6 [J] . Subramaniam Geethanjali, Kai-Yi Chen, Davidson V. Pastrana, Euphytica . 2010,第1期

机译：从6号染色体上锚定BAC克隆的基因组序列开发和鉴定番茄SSR标记

3. Rice pseudomolecule-anchored cross-species DNA sequence alignments indicate regional genomic variation in expressed sequence conservation [J] . Ian Armstead, Lin Huang, Julie King, BMC Genomics . 2007,第1期

机译：水稻假分子锚定的跨物种DNA序列比对表明表达序列保守性中的区域基因组变异

4. Finding Anchors for Genomic Sequence Comparison [C] . Ross A. Lippert, Xiaoyue Zhao, Liliana Florea, Annual International Conference on Research in Computational Molecular Biology . 2004

机译：寻找基因组序列比较的锚点

5. Finding noncoding RNA genes in genomic sequences. [D] . Klein, Robert Jared. 2003

机译：在基因组序列中寻找非编码RNA基因。

6. Rice pseudomolecule-anchored cross-species DNA sequence alignments indicate regional genomic variation in expressed sequence conservation [O] . Ian Armstead, Lin Huang, Julie King, 2007

机译：水稻假分子锚定的跨物种DNA序列比对表明表达序列保守性中的区域基因组变异

7. Rice pseudomolecule-anchored cross-species DNA sequence alignments indicate regional genomic variation in expressed sequence conservation [O] . Armstead, Ian P., Huang, Lin S., King, Julie, 2007

机译：水稻假分子锚定的跨物种DNA序列比对表明表达序列保守性中的区域基因组变异

8. Genomic Sequence Comparisons: Progress Report, August 1, 1987-April 7, 1988 [R] . Church, G. M. 1988

机译：基因组序列比较：进展报告，1987年8月1日 - 1988年4月7日

1. 人类基因组序列的变异与寻找脑卒中相关基因的工作 [J] . 张陆 . 中国分子心脏病学杂志 . 2003,第5期

2. 抛左锚与抛右锚的比较 [J] . 王泉 ,张建水 . 天津航海 . 1998,第002期

3. 寻找男人,寻找女人——寻找"人"——《查太莱夫人的情人》和《情爱画廊》比较谈 [J] . 陈琳 . 安徽师范大学学报（人文社会科学版） . 2002,第003期

4. 小麦主栽品种济麦22与良星99的基因组序列多态性比较分析 [J] . 杨正钊 ,王梓豪 ,胡兆荣 . 作物学报 . 2020,第012期

5. 一株猪蓝耳病毒的分离鉴定及其全基因组序列的比较分析 [J] . 罗修鑫 ,张华伟 ,郝根喜 . 中国兽药杂志 . 2020,第011期

6. 预埋锚板与植筋锚板的设计比较 [C] . 廖文彬 . 第八届全国建筑物鉴定与加固改造学术会议 . 2006

7. 大学生员工生涯锚寻找中的企业行为研究——以宝胜集团为例 [A] . 张晓海 . 2006

1. 快速包裹快速寻找系统及进行快速寻找的方法 [P] . 中国专利： CN104537510A . 2015-04-22

2. 基于比较器偏置对比较器进行分类 [P] . 中国专利： CN113227806A . 2021-08-06

3. Representation, visualization, comparison and reporting of genomic / proteomic sequences using bioinformatics character sets and mapped bioinformatics fonts [P] . 外国专利： JP6352804B2 . 2018-07-04

机译：使用生物信息学字符集和映射的生物信息学字体表示，可视化，比较和报告基因组/蛋白质组序列

4. Representation, visualization, comparison and reporting of genomic / proteomic sequences using bioinformatics character sets and mapped bioinformatics fonts [P] . 外国专利： JP2014525080A . 2014-09-25

机译：使用生物信息学字符集和映射的生物信息学字体表示，可视化，比较和报告基因组/蛋白质组序列

5. Evaluating potential for success in sports based on comparisons between genomic sequences [P] . 外国专利： US2003084042A1 . 2003-05-01

机译：根据基因组序列之间的比较评估运动成功的潜力

相关主题

Finding anchors for genomic sequence comparison

摘要

著录项

相似文献

相关主题

期刊订阅