...
首页> 外文期刊>Journal of Biomedical Semantics >The languages of health in general practice electronic patient records: a Zipf’s law analysis
【24h】

The languages of health in general practice electronic patient records: a Zipf’s law analysis

机译:普通电子病历中的健康语言:Zipf法则分析

获取原文
           

摘要

Background Natural human languages show a power law behaviour in which word frequency (in any large enough corpus) is inversely proportional to word rank - Zipf’s law. We have therefore asked whether similar power law behaviours could be seen in data from electronic patient records. Results In order to examine this question, anonymised data were obtained from all general practices in Salford covering a seven year period and captured in the form of Read codes. It was found that data for patient diagnoses and procedures followed Zipf’s law. However, the medication data behaved very differently, looking much more like a referential index. We also observed differences in the statistical behaviour of the language used to describe patient diagnosis as a function of an anonymised GP practice identifier. Conclusions This works demonstrate that data from electronic patient records does follow Zipf’s law. We also found significant differences in Zipf’s law behaviour in data from different GP practices. This suggests that computational linguistic techniques could become a useful additional tool to help understand and monitor the data quality of health records.
机译:背景自然人类语言表现出幂律行为,其中单词频率(在足够大的语料库中)与单词等级成反比-Zipf定律。因此,我们询问了在电子病历中的数据中是否可以看到类似的幂律行为。结果为了检查该问题,从索尔福德的所有常规实践中获得了为期7年的匿名数据,并以读取代码的形式捕获了这些数据。发现用于患者诊断和程序的数据遵循Zipf的定律。但是,药物数据的行为却大不相同,看起来更像是参考指数。我们还观察到用于描述患者诊断的语言的统计行为与匿名GP惯例标识符有关。结论这项工作表明,来自电子病历的数据确实符合Zipf的定律。我们还发现Zipf的法律行为在来自不同GP实践的数据中存在显着差异。这表明计算语言技术可能会成为有用的附加工具,以帮助理解和监视健康记录的数据质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号