...
首页> 外文期刊>Bioinformatics >Triplet repeats in human genome: distribution and their association with genes and other genomic regions
【24h】

Triplet repeats in human genome: distribution and their association with genes and other genomic regions

机译:人类基因组中的三联体重复序列:分布及其与基因和其他基因组区域的关联

获取原文
获取原文并翻译 | 示例
           

摘要

Simple sequence repeats (SSRs) or microsatellite repeats are found abundantly in many prokaryotic and eukaryotic genomes. Among SSRs, triplet repeats are of special significance because some of them have been linked to various genetic disorders. The objective of the study is to analyze the triplet repeats of complete human genome and to identify the genes that contain the triplet repeats in their coding region. The analysis will help us to identify the candidate genes that have potential for repeat expansion. We have analyzed triplet repeats in the complete human genome from the publicly available sequences. Our analysis revealed that AGC and CCG repeat were predominantly present in the coding regions of the genome while UTRs and the upstream sequences contained CCG repeats in relative abundance. Analysis of density of triplet repeats (bp/Mb) revealed that AAT and AAC were the abundant repeats whereas ACT and ACG were the rare repeats found in human genome. We could identify about 2135 known or predicted genes that were associated with at least one of the triplet repeat types. A large proportion of putative transcripts that were identified by gene finding programs were found to be associated with triplet repeats. These transcripts will be the candidate genes for analysis of triplet repeat expansion and a possible association with disease phenotypes. Identification of 171 genes which contain a minimum of ten repeat units will be of particular interest in future in correlating their association with any disease phenotype due to the expansion potential of repeats present in them. The list of genes and other details of analysis are given in the online supplementary data.
机译:在许多原核和真核基因组中都发现了简单序列重复(SSR)或微卫星重复。在SSR中,三联体重复具有特殊意义,因为其中一些与各种遗传疾病有关。该研究的目的是分析完整人类基因组的三联体重复序列,并鉴定在其编码区中包含三联体重复序列的基因。该分析将帮助我们确定具有重复扩增潜能的候选基因。我们已经从可公开获得的序列中分析了完整人类基因组中的三联体重复序列。我们的分析表明,AGC和CCG重复序列主要存在于基因组的编码区域,而UTR和上游序列包含相对丰富的CCG重复序列。对三联体重复序列密度(bp / Mb)的分析表明,AAT和AAC是丰富的重复序列,而ACT和ACG是在人类基因组中发现的罕见重复序列。我们可以鉴定出大约2135个与至少一种三联体重复类型相关的已知或预测基因。通过基因发现程序鉴定的大部分推定的转录物被发现与三联体重复有关。这些转录本将是分析三联体重复扩增以及可能与疾病表型相关的候选基因。由于它们中存在的重复序列的扩增潜力,未来将特别需要鉴定至少包含十个重复单元的171个基因,使其与任何疾病表型相关联。在线补充数据中提供了基因列表和其他分析细节。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号