首页> 外文会议>IEEE International Symposium on Computer-Based Medical Systems >Combining Subgroup Discovery and Clustering to Identify Diverse Subpopulations in Cohort Study Data
【24h】

Combining Subgroup Discovery and Clustering to Identify Diverse Subpopulations in Cohort Study Data

机译:结合亚组发现和聚类来识别队列研究数据中的不同亚群

获取原文

摘要

Subgroup discovery (SD) exploits its full value in applications where the goal is to generate understandable models. Epidemiologists search for statistically significant relationships between risk factors and outcome in large and heterogeneous datasets encompassing information about the participants health status gathered from questionnaires, medical examinations and image acquisition. SD algorithms can help epidemiologists by automatically detecting such relationships presented as comprehensible rules, aiming to ultimately improve prevention, diagnosis and treatment of diseases. However, SD algorithms often produce large and overlapping rule sets requiring the expert to conduct a manual post-filtering step that is time-consuming and tedious. In this work, we propose a clustering-based algorithm that hierarchically reorganizes rule sets and summarizes all important concepts while maintaining diversity between the rule clusters. For each cluster, a representative rule is selected and then displayed to the expert who in turn can drill-down to other cluster members. We evaluate our algorithm on two cohort study datasets where the diseases hepatic steatosis and goiter serve as target variable, respectively. We report on our findings with respect to effectiveness of our algorithm and we present selected subpopulations.
机译:子组发现(SD)在旨在生成易于理解的模型的应用程序中充分利用了其全部价值。流行病学家在庞大且异类的数据集中搜索风险因素与结果之间的统计上显着的关系,这些数据集包含从问卷,医学检查和图像采集中收集到的有关参与者健康状况的信息。 SD算法可以通过自动检测以易于理解的规则表示的关系来帮助流行病学家,从而最终改善疾病的预防,诊断和治疗。但是,SD算法通常会产生大量且重叠的规则集,这需要专家进行耗时且繁琐的手动后过滤步骤。在这项工作中,我们提出了一种基于聚类的算法,该算法按层次结构重组规则集并总结所有重要概念,同时保持规则集群之间的多样性。对于每个集群,将选择一个代表规则,然后显示给专家,专家又可以向下钻取其他集群成员。我们在两个队列研究数据集上评估我们的算法,其中肝脂肪变性和甲状腺肿分别作为目标变量。我们就算法的有效性报告了我们的发现,并介绍了选定的亚群。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号