Clustering Analysis of Proteins from Microbial Genomes at Multiple Levels of Resolution

机译：微生物基因组蛋白质在多个分辨率水平上的聚类分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Microbial genomes at NCBI represent a large collection containing almost 30,000 genomes from more than 5,000 species. The quality and sampling density of the bacterial genome assemblies vary greatly: human pathogens are densely sampled while other bacteria are less represented. The variation in frequency of occurrences of different proteins in genome annotation is another factor contributing to the complexity of the analysis and presentation of the data. Redundancy in the results make them difficult to analyze and use, as the nearest-neighbor lists may often contain many nearly identical objects making it difficult or impossible to reflect more distant neighbor relationships. The complex data we work with requires the information to be organized, processed and shown at multiple levels of resolution, with appropriate levels of phylogenomic resolution and protein similarity and an adequate sampling strategy.

机译：NCBI的微生物基因组代表了一个庞大的集合，其中包含来自5,000多个物种的近30,000个基因组。细菌基因组集合的质量和采样密度相差很大：人类病原体采样密集，而其他细菌的代表较少。基因组注释中不同蛋白质出现频率的变化是导致数据分析和表示复杂性的另一个因素。结果中的冗余使得它们难以分析和使用，因为最近邻居列表可能经常包含许多几乎相同的对象，从而难以或不可能反映更远的邻居关系。我们处理的复杂数据要求以多种分辨率组织，处理和显示信息，并具有适当水平的植物学分辨率和蛋白质相似性以及适当的采样策略。

著录项

来源
《International symposium on bioinformatics research and applications》|2015年|438-439|共2页
会议地点
作者
Leonid Zaslavsky; Tatiana Tatusova;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Clustering analysis of proteins from microbial genomes at multiple levels of resolution [J] . Leonid Zaslavsky, Stacy Ciufo, Boris Fedorov, BMC Bioinformatics . 2016,第8期

机译：来自微生物基因组蛋白质的多种分辨率的聚类分析
2. Meta-analysis of genome-wide association studies in >80 000 subjects identifies multiple loci for C-reactive protein levels. [J] . Dehghan A, Dupuis J, Barbalic M, Circulation: An Official Journal of the American Heart Association . 2011,第7期

机译：超过80 000名受试者的全基因组关联研究的荟萃分析确定了C基因反应蛋白水平的多个基因座。
3. Genome-wide linkage analysis reveals evidence of multiple regions that influence variation in plasma lipid and apolipoprotein levels associated with risk of coronary heart disease. [J] . Klos KL, Kardia SL, Ferrell RE, Arteriosclerosis, thrombosis, and vascular biology . 2001,第6期

机译：全基因组连锁分析揭示了多个区域影响与冠心病风险相关的血浆脂质和载脂蛋白水平变化的证据。
4. Clustering Analysis of Proteins from Microbial Genomes at Multiple Levels of Resolution [C] . Leonid Zaslavsky Tatiana Tatusova ISBRA 2013 . 2015

机译：多种分辨率微生物基因组蛋白的聚类分析
5. Identification of protein coding regions in microbial genomes using unsupervised clustering. [D] . Konda, Jayashree. 2009

机译：使用无监督聚类鉴定微生物基因组中的蛋白质编码区。
6. Clustering analysis of proteins from microbial genomes at multiple levels of resolution [O] . Leonid Zaslavsky, Stacy Ciufo, Boris Fedorov, 2016

机译：来自微生物基因组的蛋白质在多个分辨率级别上的聚类分析
7. Clustering analysis of proteins from microbial genomes at multiple levels of resolution [O] . 2016

机译：来自微生物基因组的蛋白质在多个分辨率级别上的聚类分析

Clustering Analysis of Proteins from Microbial Genomes at Multiple Levels of Resolution

摘要

著录项

相似文献

相关主题

期刊订阅