首页> 外文期刊>Informatics in Medicine Unlocked >Cluster analysis to identify prominent patterns of anti-hypertensives: A three-tiered unsupervised learning approach
【24h】

Cluster analysis to identify prominent patterns of anti-hypertensives: A three-tiered unsupervised learning approach

机译:群集分析确定抗高血压的突出模式:三层无监督的学习方法

获取原文
           

摘要

Clustering brings molecules having similar patterns together and is governed mainly by the structural features (SFs). The challenge is to cluster in such a way that the minimum number of groups with significant molecules having similar prevalent patterns comes together with minimal human intervention. Determining an automatic and reliable approach to cluster molecules is crucial for clinical assessment of medical conditions. Hypertension is one of such health conditions and anti-hypertensives (AHs) are the approved drugs to treat it. Here, an attempt has been made to cluster the AHs to identify the prominent patterns within a group. Principal component analysis (PCA) and k-means are well established independent algorithms, however, in this work, clustering is proceeded by PCA and is followed by one-way analysis of similarities (ANOSIM). The additional step of statistical relevance brings novelty in the final selection of the cluster. The latter highlights the significant difference between the two or more groups to enhance the clustering based on the similarity of the features within a group. Clustering of the United States Food and Drug Administration agency approved anti-hypertensives into six groups show a success rate of 94.73%. Kruskal-Wallis test for k?=?6 suggest that there is a significant difference between the sample medians. Analysis of the cluster identifies the prominent pattern within a group. The average specificity and sensitivity achieved for k?=?6 are 98.0?±?1.6% and 97.3?±?4.5%, respectively. A brief overview of the structural or functional relevance of molecules overlapping in a PCA plot for k?=?6 has been discussed. The study is likely to be useful for a preliminary assessment to identify prevalent trends in SFs of AHs.
机译:聚类将具有相似模式的分子聚集在一起,主要由结构特征(SFS)治理。挑战是以这样的方式聚类,即具有相似普遍模式的具有相似分子的最小群体的最小数量与最小的人类干预一起。确定群体分子的自动和可靠方法对于医疗病症的临床评估至关重要。高血压是这样的健康状况之一,抗高血压(AHS)是治疗它的批准药物。这里,已经尝试群集AHS来识别组内的突出模式。主成分分析(PCA)和K-Means是完善的独立算法,但是,在这项工作中,PCA进行了聚类,然后进行了相似性的单向分析(Anosim)。统计相关性的附加步骤在集群的最终选择中带来了新颖性。后者突出显示两个或更多组之间的显着差异,以基于组内的特征的相似性来增强聚类。美国食品和药物管理局的聚类批准抗高血压分为六组,表现出94.73%的成功率。 Kruskal-Wallis测试K?=?6表明样品中位数之间存在显着差异。群集的分析标识组内的突出模式。对于K?=α6实现的平均特异性和灵敏度分别为98.0?±1.6%和97.3?±4.5%。讨论了在PCA图中重叠的分子结构或功能相关性的简要概述了k?= 6。该研究可能有助于初步评估,以确定AHS的SFS中的普遍趋势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号