首页> 外文期刊>Applied Soft Computing >A clustering based methodology to support the translation of medical specifications to software models
【24h】

A clustering based methodology to support the translation of medical specifications to software models

机译:基于聚类的方法,支持向软件模型翻译医疗规范

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper we propose a methodology to reduce the complexity to realize a software validation model, starting from medical specifications written in Italian natural language text. In order to obtain an automatic validation system it is necessary to manually translate the specification documents into software models. This task is long, tedious and error prone, due to the manual effort needed. To speed up this process and to reduce the errors that can occur, an important boost can be obtained from the grouping of the conformance rules belonging to the same pattern. Clustering algorithms can accomplish this task, but there is the need to know a priori the total cluster number, and this is not possible in this kind of problem. At this aim, we propose two innovative automatic cluster selection methodologies able to evaluate the optimal number of clusters, based on an iterative internal cluster measure evaluation. These approaches consider three different Vector Space Models (VSMs), two different clustering algorithms and the impact of the using the Principal Component Analysis technique. The experimental assessment has been performed on four different datasets extracted from the HL7 CDA R2 Italian language conformance rules specification documents, demonstrating the effectiveness of the proposed methodology. Finally, in order to compare the results of all possible configurations, we realized a non-parametric statistical analysis. The obtained results demonstrated the effectiveness of the proposed methodology for automatic cluster number selection. (C) 2018 Elsevier B.V. All rights reserved.
机译:在本文中,我们提出了一种方法来减少从意大利自然语言文本编写的医疗规范开始实现软件验证模型的复杂性。为了获得自动验证系统,必须手动将规范文档手动转换为软件模型。由于所需的手动努力,这项任务很长,乏味,易于错误。为了加快该过程并减少可能发生的错误,可以从属于相同模式的符合规则的分组获得重要的提升。群集算法可以完成此任务,但需要知道先验的总群集号,并且在这种问题中是不可能的。在此目的,我们提出了两种创新的自动聚类选择方法,能够根据迭代内部集群测量评估评估最佳的集群数量。这些方法考虑三种不同的矢量空间模型(VSM),两个不同的聚类算法和使用主成分分析技术的影响。实验评估已经在从HL7 CDA R2意大利语一致规则规范规则规则规范文件中提取的四种不同的数据集进行,证明了所提出的方法的有效性。最后,为了比较所有可能的配置的结果,我们实现了非参数统计分析。所获得的结果证明了所提出的自动聚类数字选择方法的有效性。 (c)2018 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号