...
首页> 外文期刊>BMC Medical Research Methodology >Optimising the use of electronic health records to estimate the incidence of rheumatoid arthritis in primary care: what information is hidden in free text?
【24h】

Optimising the use of electronic health records to estimate the incidence of rheumatoid arthritis in primary care: what information is hidden in free text?

机译:优化电子健康记录的使用以估计初级保健中的类风湿关节炎的发病率:自由文本中隐藏了哪些信息?

获取原文

摘要

Background Primary care databases are a major source of data for epidemiological and health services research. However, most studies are based on coded information, ignoring information stored in free text. Using the early presentation of rheumatoid arthritis (RA) as an exemplar, our objective was to estimate the extent of data hidden within free text, using a keyword search. Methods We examined the electronic health records (EHRs) of 6,387 patients from the UK, aged 30?years and older, with a first coded diagnosis of RA between 2005 and 2008. We listed indicators for RA which were present in coded format and ran keyword searches for similar information held in free text. The frequency of indicator code groups and keywords from one year before to 14?days after RA diagnosis were compared, and temporal relationships examined. Results One or more keyword for RA was found in the free text in 29% of patients prior to the RA diagnostic code. Keywords for inflammatory arthritis diagnoses were present for 14% of patients whereas only 11% had a diagnostic code. Codes for synovitis were found in 3% of patients, but keywords were identified in an additional 17%. In 13% of patients there was evidence of a positive rheumatoid factor test in text only, uncoded. No gender differences were found. Keywords generally occurred close in time to the coded diagnosis of rheumatoid arthritis. They were often found under codes indicating letters and communications. Conclusions Potential cases may be missed or wrongly dated when coded data alone are used to identify patients with RA, as diagnostic suspicions are frequently confined to text. The use of EHRs to create disease registers or assess quality of care will be misleading if free text information is not taken into account. Methods to facilitate the automated processing of text need to be developed and implemented.
机译:背景初级保健数据库是流行病学和卫生服务研究的主要数据来源。但是,大多数研究都是基于编码信息,而忽略了自由文本中存储的信息。以类风湿关节炎(RA)的早期呈现为例,我们的目标是使用关键字搜索来估计隐藏在自由文本中的数据范围。方法我们检查了2005年至2008年间英国6387名30岁及30岁以上首次诊断为RA的患者的电子健康记录(EHR)。我们列出了RA的指标,这些指标以编码格式出现并且使用关键字搜索以自由文本保存的类似信息。比较RA诊断前一年至诊断后14天的指示符代码组和关键字的频率,并检查时间关系。结果在RA诊断代码之前,有29%的患者在自由文本中找到了一个或多个RA关键字。 14%的患者使用了炎性关节炎诊断的关键字,而只有11%的患者具有诊断代码。在3%的患者中发现了滑膜炎代码,但在另外17%的患者中发现了关键词。在13%的患者中,只有未经编码的文本中有类风湿因子试验阳性的证据。未发现性别差异。关键字通常在类风湿关节炎的编码诊断时及时出现。通常在指示字母和通讯的代码下找到它们。结论仅将编码数据用于识别RA患者时,可能会漏诊或误标日期,因为诊断性怀疑通常仅限于文本。如果不考虑自由文本信息,则使用EHR创建疾病登记簿或评估护理质量将产生误导。需要开发和实现促进文本自动处理的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号