...
首页> 外文期刊>Open access Bioinformatics >Statistical analysis of exon lengths in various eukaryotes
【24h】

Statistical analysis of exon lengths in various eukaryotes

机译:各种真核生物外显子长度的统计分析

获取原文
           

摘要

Purpose: The principal goals of this research were to investigate correlations between certain properties of exons in a gene (ie, between exon density and the corresponding protein length) and to compare genomic trees obtained with different approaches of clustering based on exonic parameters. The aim was a better understanding of exon–intron structures and their origin and development. The exon–intron structures of eukaryote genes are quite different from each other, and the evolution of such structures raises many problematic questions. As a preliminary attempt to address some of these questions, we performed a statistical analysis of gene exon–intron structures.Methods: Taking whole genomes of eukaryotes, we went through all the protein-coding genes in each chromosome separately and calculated the portion of intron-containing genes and average values of the net length of all the exons in a gene, the number of the exons, and the average length of an exon. Comparing those chromosomal and genomic averages, we developed a technique of clustering based on characteristics of the exon–intron structure. This technique of clustering separates different species, grouping them according to eukaryote taxonomy.Conclusion: Our conclusion is that the best approach is based on distances among four principal components obtained by factor analysis and followed by application of clustering algorithms, such as neighbor-joining, k-means, and partitioning around medoids.
机译:目的:本研究的主要目的是研究基因外显子某些特性之间的相关性(即外显子密度和相应的蛋白质长度之间),并比较基于外显子参数通过不同聚类方法获得的基因组树。目的是更好地了解外显子-内含子的结构及其起源和发展。真核生物基因的外显子-内含子结构彼此非常不同,并且这种结构的进化提出了许多有问题的问题。作为解决这些问题的初步尝试,我们对基因外显子-内含子的结构进行了统计分析。方法:采用真核生物的整个基因组,我们分别遍历每个染色体中的所有蛋白质编码基因,并计算出内含子的比例-含基因以及基因中所有外显子的净长度的平均值,外显子的数量和外显子的平均长度。比较那些染色体平均值和基因组平均值,我们开发了一种基于外显子-内含子结构特征的聚类技术。结论:我们的结论是,最好的方法是基于因子分析获得的四个主要成分之间的距离,然后应用聚类算法(例如邻域连接, k-均值,并在类固醇周围分配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号