Large Scale Analysis of Small Repeats via Mining of the Human Genome



Abstract

Small repetitive sequences, called tandem repeats, are abundant throughout the human genome, in both coding and non-coding regions. Their role is still mostly unknown, but at least 20 of these repetitive sequences have been linked to neurodegenerative disorders. The mutational process underlying these disorders is not yet fully understood. Understanding the origin, function and possible usefulness of tandem repeats will require analysis of very large data sets from various sources. In this paper we attempt such a large-scale analysis of short repeats. We describe and discuss the steps needed to perform large-scale genomic analysis. We define tandem repeats and compare the results of repeat localization with genome annotations. We show that the degree of repetitiveness differs among the human chromosomes: chromosomes 19 and 17 have more repeats per megabase pair than any other chromosome, and the Y chromosome has the fewest. We also demonstrate that some repeat motifs are much more common than others. Mono- and dinucleotide repeats are the most abundant, with A and AAC the most common motifs, while CG is hardly present in the genome. Repeats with unit length 3 are underrepresented in the genome, and repeats with unit length 9 are extremely rare.
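The abstract does not state the repeat definition or the tooling the authors used, so the following is only a minimal sketch of the kind of analysis described: locating perfect tandem repeats, counting motifs, and computing repeats per megabase pair. The function names, the thresholds (`max_unit=9`, `min_copies=3`, `min_length=6`) and the choice to merge rotations of a motif (so that CA counts as AC) are all assumptions made for illustration, not the paper's method.

```python
import re
from collections import Counter


def is_primitive(unit):
    """True if unit is not itself a repetition of a shorter motif (e.g. 'AA', 'ACAC')."""
    return (unit + unit).find(unit, 1) == len(unit)


def canonical_motif(unit):
    """Lexicographically smallest rotation, so 'CA' and 'AC' count as one motif (assumption)."""
    return min(unit[i:] + unit[:i] for i in range(len(unit)))


def find_tandem_repeats(seq, max_unit=9, min_copies=3, min_length=6):
    """Return sorted (start, motif, copies) tuples for perfect tandem repeats.

    Thresholds are hypothetical defaults; imperfect repeats are not detected.
    """
    seq = seq.upper()
    hits = []
    for unit_len in range(1, max_unit + 1):
        # A unit of unit_len bases followed by at least (min_copies - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_copies - 1))
        for match in pattern.finditer(seq):
            unit = match.group(1)
            length = match.end() - match.start()
            # Skip units such as 'AA' that are already reported at a shorter unit length.
            if is_primitive(unit) and length >= min_length:
                hits.append((match.start(), canonical_motif(unit), length // unit_len))
    return sorted(hits)


def repeats_per_mbp(hits, sequence_length):
    """Repeat density: number of repeats per one million base pairs."""
    return len(hits) * 1_000_000 / sequence_length


# Toy sequence (not real genomic data): an AC repeat and a poly-A run.
example = "CGT" + "AC" * 5 + "GGT" + "A" * 8 + "CGT"
example_hits = find_tandem_repeats(example)
motif_counts = Counter(motif for _, motif, _ in example_hits)
print(example_hits)
print(motif_counts)
print(repeats_per_mbp(example_hits, len(example)))
```

Applied per chromosome, the same density function would give the kind of repeats-per-megabase comparison the abstract reports. A real analysis would also need to handle imperfect repeats, unknown bases and reverse-complement motifs, all of which this sketch ignores.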


Bibliographic record

Source: International Workshop on Database and Expert Systems Applications, 2009, 5 pages
Authors: Inge van den Berg; Dragan Bosnacki; Peter A. J. Hilbers
Original format: PDF
Chinese Library Classification: TP311.13-53

Similar literature

1. Large-scale analysis of tandem repeat variability in the human genome [J]. Duitama Jorge, Zablotskaya Alena, Gemayel Rita. Nucleic Acids Research. 2014, No. 9

2. Mining and analysis of simple sequence repeats in the chloroplast genomes of genus Vigna [J]. Nidhi Shukla, Himani Kuntal, Asheesh Shanker. Biotechnology Research and Innovation. 2018, No. 1

3. Large-scale analysis reveals that the genome features of simple sequence repeats are generally conserved at the family level in insects [J]. Simin Ding, Shuping Wang, Kang He. BMC Genomics. 2017, No. 1

4. Large Scale Analysis of Small Repeats via Mining of the Human Genome [C]. Inge van den Berg, Dragan Bosnacki, Peter A. J. Hilbers. International Workshop on Database and Expert Systems Applications. 2009

5. A Study of the Variability of Minisatellite Tandem Repeat Loci in the Human Genome Based on High-Throughput Sequencing Data [D]. Hernandez, Yozen. 2019

6. Large-scale analysis of tandem repeat variability in the human genome [O]. Jorge Duitama, Alena Zablotskaya, Rita Gemayel. 2014

7. Large-scale analysis of tandem repeat variability in the human genome [O]. Duitama Jorge, Zablotskaya Alena, Gemayel Rita. 2014
