Published in: Nucleic Acids Research

Pseudofam: the pseudogene families database.


Abstract

Pseudofam (http://pseudofam.pseudogene.org) is a database of pseudogene families based on the protein families from the Pfam database. It provides resources for analyzing the family structure of pseudogenes, including query tools, statistical summaries and sequence alignments. The current version of Pseudofam contains more than 125 000 pseudogenes identified from 10 eukaryotic genomes and aligned within nearly 3000 families (approximately one-third of the total families in PfamA). Pseudofam uses a large-scale parallelized homology search algorithm (implemented as an extension of the PseudoPipe pipeline) to identify pseudogenes. Each identified pseudogene is assigned to its parent protein family, and the pseudogenes within a family are then aligned to one another by transferring the parent domain alignments from the Pfam family. Pseudogenes are also given additional annotation based on an ontology that reflects their mode of creation and subsequent history. In particular, our annotation highlights the association of pseudogene families with genomic features, such as segmental duplications. In addition, pseudogene families are associated with key statistics, which identify outlier families with an unusual degree of pseudogenization. The statistics also show how the number of genes and pseudogenes in families correlates across different species. Overall, they highlight the fact that housekeeping families tend to be enriched with a large number of pseudogenes.
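The abstract does not say which statistic is used to flag outlier families, so the following is only an illustrative sketch of one way such a screen could work. It assumes a per-family pseudogene-to-gene ratio, scored with a robust (median/MAD) modified z-score. The family names, counts, pseudocount, cutoff and function names are all hypothetical and are not taken from Pseudofam.

```python
import math
from statistics import median

# Hypothetical (gene count, pseudogene count) per family.
# These numbers are made up for illustration; they are NOT Pseudofam data.
family_counts = {
    "FAM_A": (40, 12),
    "FAM_B": (25, 9),
    "FAM_C": (60, 20),
    "FAM_D": (10, 3),
    "FAM_E": (30, 11),
    "FAM_F": (15, 900),  # deliberately extreme family
}


def log_ratio(genes, pseudogenes, pseudocount=1.0):
    """log2 pseudogene-to-gene ratio; the pseudocount avoids division by zero."""
    return math.log2((pseudogenes + pseudocount) / (genes + pseudocount))


def outlier_families(counts, cutoff=3.5):
    """Return families whose log ratio has a modified z-score above `cutoff`.

    Assumed scoring (not the paper's method): 0.6745 * |x - median| / MAD,
    where MAD is the median absolute deviation of the log ratios.
    """
    scores = {name: log_ratio(g, p) for name, (g, p) in counts.items()}
    center = median(scores.values())
    mad = median(abs(s - center) for s in scores.values())
    if mad == 0:
        return []  # no spread among families, so nothing can be called an outlier
    return sorted(
        name
        for name, s in scores.items()
        if 0.6745 * abs(s - center) / mad > cutoff
    )


print(outlier_families(family_counts))
```

A median-based score is used here rather than a mean and standard deviation because, with only a handful of families, a single extreme family inflates the standard deviation enough to hide itself.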
