首页> 美国卫生研究院文献>Metabolites >hcapca: Automated Hierarchical Clustering and Principal Component Analysis of Large Metabolomic Datasets in R
【2h】

hcapca: Automated Hierarchical Clustering and Principal Component Analysis of Large Metabolomic Datasets in R

机译:HCAPCA:r的大型代谢组数据集的自动分层聚类和主成分分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Microbial natural product discovery programs face two main challenges today: rapidly prioritizing strains for discovering new molecules and avoiding the rediscovery of already known molecules. Typically, these problems have been tackled using biological assays to identify promising strains and techniques that model variance in a dataset such as PCA to highlight novel chemistry. While these tools have shown successful outcomes in the past, datasets are becoming much larger and require a new approach. Since PCA models are dependent on the members of the group being modeled, large datasets with many members make it difficult to accurately model the variance in the data. Our tool, , first groups strains based on the similarity of their chemical composition, and then applies PCA to the smaller sub-groups yielding more robust PCA models. This allows for scalable chemical comparisons among hundreds of strains with thousands of molecular features. As a proof of concept, we applied our open-source tool to a dataset with 1046 LCMS profiles of marine invertebrate associated bacteria and discovered three new analogs of an established anticancer agent from one promising strain.
机译:今天微生物天然产品发现程序面临两个主要挑战:迅速优先考虑发现新分子并避免已知分子重新发现的菌株。通常,已经使用生物学测定来解决这些问题以确定有前途的菌株和技术,即模型在数据集中的模式差异,例如PCA以突出新的化学性。虽然这些工具过去已经表明了成功的结果,但数据集变得更大并且需要一种新的方法。由于PCA模型依赖于所建模的组成员,因此具有许多成员的大型数据集使得难以准确地模拟数据的方差。我们的工具,第一组基于其化学成分的相似性,然后将PCA施加到较小的子组产生更强大的PCA模型。这允许数百种菌株中具有成千上万的分子特征的可扩展化学比较。作为概念证明,我们将开源工具应用于具有1046个液体无脊椎动物相关细菌的数据集,并从一个有前途的菌株发现了三种新的抗癌剂的新类似物。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号