首页> 美国卫生研究院文献>Plant Physiology >Focus Issue on Plant Databases: Genome Cluster Database. A Sequence Family Analysis Platform for Arabidopsis and Rice

【2h】

Focus Issue on Plant Databases: Genome Cluster Database. A Sequence Family Analysis Platform for Arabidopsis and Rice

机译：关于植物数据库的重点问题：基因组簇数据库。拟南芥和水稻的序列家族分析平台

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

The genome-wide protein sequences from Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) spp. japonica were clustered into families using sequence similarity and domain-based clustering. The two fundamentally different methods resulted in separate cluster sets with complementary properties to compensate the limitations for accurate family analysis. Functional names for the identified families were assigned with an efficient computational approach that uses the description of the most common molecular function gene ontology node within each cluster. Subsequently, multiple alignments and phylogenetic trees were calculated for the assembled families. All clustering results and their underlying sequences were organized in the Web-accessible Genome Cluster Database () with rich interactive and user-friendly sequence family mining tools to facilitate the analysis of any given family of interest for the plant science community. An automated clustering pipeline ensures current information for future updates in the annotations of the two genomes and clustering improvements. The analysis allowed the first systematic identification of family and singlet proteins present in both organisms as well as those restricted to one of them. In addition, the established Web resources for mining these data provide a road map for future studies of the composition and structure of protein families between the two species.

机译：来自拟南芥（Arabidopsis thaliana）和水稻（Oryza sativa）spp的全基因组蛋白序列。利用序列相似性和基于域的聚类将粳稻聚类为科。两种根本不同的方法导致了具有互补属性的单独聚类集，以弥补准确族分析的局限性。通过使用每个簇中最常见的分子功能基因本体论节点的描述的有效计算方法，为识别出的家族的功能名称分配了名称。随后，为组装的科计算了多个比对和系统发育树。所有聚类结果及其潜在序列都通过可访问Web的基因组聚类数据库（）进行了组织，该数据库具有丰富的交互式且用户友好的序列族挖掘工具，可促进植物科学界对任何给定感兴趣家族的分析。自动化的聚类流水线可确保当前信息，以便将来在两个基因组的注释中进行更新以及改善聚类。该分析允许对两种生物体以及局限于其中一种生物体中的家族蛋白和单线态蛋白进行首次系统鉴定。此外，用于挖掘这些数据的已建立Web资源为将来研究这两个物种之间蛋白质家族的组成和结构提供了路线图。

著录项

期刊名称 Plant Physiology
作者
Kevin Horan; Josh Lauricha; Julia Bailey-Serres; Natasha Raikhel; Thomas Girke;
展开▼
作者单位

展开▼
年(卷),期 2005(138),1
年度 2005
页码 47–54
总页数 8
原文格式 PDF
正文语种
中图分类人体生理学;
关键词

相似文献

外文文献
中文文献
专利

1. Genome Cluster Database. A Sequence Family Analysis Platform for Arabidopsis and Rice [J] . Kevin Horan Josh Lauricha Julia Bailey-Serres Natasha Raikhel and Thomas Girke* Plant Physiology . 2005,第1期

机译：基因组簇数据库。拟南芥和水稻的序列家族分析平台
2. Genome cluster database. A sequence family analysis platform for Arabidopsis and rice [J] . Horan K, Lauricha J, Bailey-Serres J, Plant physiology . 2005,第1期

机译：基因组集群数据库。拟南芥和水稻的序列家族分析平台
3. Systematization of the protein sequence diversity in enzymes related to secondary metabolic pathways in plants, in the context of big data biology inspired by the KNApSAcK Motorcycle database. (Special Focus Issue: Phytochemical genomics.) [J] . Ikeda S., Abe T., Nakamura Y., Plant and cell physiology . 2013,第5期

机译：在KNApSAcK摩托车数据库的启发下，在大数据生物学的背景下，与植物次级代谢途径相关的酶中蛋白质序列多样性的系统化。（特别关注的话题：植物化学基因组学。）
4. Genome-wide analysis of the associations between polyadenylation sites and repeated sequences in Arabidopsis thaliana [C] . Yuntian Wang, Cheng Sun, Huaibin Hong, International Conference on BioMedical Engineering and Informatics . 2015

机译：全基因组分析拟南芥中聚腺苷酸化位点与重复序列之间的关联
5. Analysis of expressed sequence tags in aspen tissues and characterization of copia elements in Arabidopsis genome: A bioinformatics approach. [D] . Ranjan, Priya. 2006

机译：分析白杨组织中表达的序列标签和拟南芥基因组中的Copia元素特征：一种生物信息学方法。
6. Focus Issue on Biochemistry of Plant Volatiles: The Plant-Specific Database. Classification of Arabidopsis Proteins Based on Their Phylogenetic Profile [O] . Rodrigo A. Gutiérrez, Matthew D. Larson, Curtis Wilkerson 2004

机译：植物挥发物生物化学的重点问题：植物特定数据库。基于系统发育谱的拟南芥蛋白分类
7. Genome Cluster Database. A Sequence Family Analysis Platform for Arabidopsis and Rice1 [O] . Horan, Kevin, Lauricha, Josh, Bailey-Serres, Julia, 2005

机译：基因组簇数据库。拟南芥和Rice1的序列家族分析平台
8. Database Federation Platform for Gene Chips and the Human Genome Database. [R] . Fu, B., Zhang, S., Chuang, W., 2001

机译：基因芯片和人类基因组数据库的数据库联合平台。

Focus Issue on Plant Databases: Genome Cluster Database. A Sequence Family Analysis Platform for Arabidopsis and Rice

摘要

著录项

相似文献

相关主题

期刊订阅