Source: 《郑州大学学报(理学版)》 (Journal of Zhengzhou University, Natural Science Edition)

Research on Chinese Obstetric Electronic Medical Records Based on Natural Language Processing

         

Abstract

Electronic medical records (EMRs) contain a large amount of medical knowledge and patient health information. Structuring obstetric EMRs and extracting information from them is important for clinical decision support and for improving reproductive health in the population. This study first analyzes the structural characteristics and content of Chinese obstetric EMRs, then cleans and structures the record data with a rule-based method. Next, a maximum entropy (ME) model combined with rule-based methods classifies the records by treatment type, reaching an F value of 88.16%. Finally, to support further information extraction and knowledge mining, a support vector machine (SVM) model, taking the short sentence as the unit and similarity as the criterion, is applied to first course-of-disease records for deduplication and automatic difference analysis; the analysis screened out 68.6% of the duplicate and similar short sentences. The study is expected to contribute to further research on information extraction from obstetric EMRs.
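The deduplication step, which treats the short sentence as the unit and similarity as the criterion, can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not give the SVM's features or training setup, so the sketch swaps in a plain similarity threshold over character bigrams (Jaccard overlap). The function names, the punctuation set used for splitting, the threshold value, and the sample record text are all assumptions made up for illustration.

```python
import re


def split_short_sentences(text):
    """Split a record into short sentences on clause-level punctuation.

    The punctuation set is an assumption; '.' is left out so that decimals
    such as 36.5 stay intact.
    """
    parts = re.split(r"[,。;!?,;!?\n]+", text)
    return [p.strip() for p in parts if p.strip()]


def char_bigrams(s):
    """Return the set of character bigrams of s (or {s} if s has one character)."""
    return {s[i:i + 2] for i in range(len(s) - 1)} or {s}


def jaccard(a, b):
    """Jaccard similarity between the character-bigram sets of two strings."""
    ga, gb = char_bigrams(a), char_bigrams(b)
    return len(ga & gb) / len(ga | gb)


def deduplicate(sentences, threshold=0.8):
    """Keep a sentence only if it is not too similar to one already kept.

    A fixed threshold stands in for the SVM decision described in the
    abstract; it is a simplification, not the paper's classifier.
    """
    kept, dropped = [], []
    for s in sentences:
        if any(jaccard(s, k) >= threshold for k in kept):
            dropped.append(s)
        else:
            kept.append(s)
    return kept, dropped


# Invented example text, not taken from any real record.
record = "患者孕39周,无腹痛;患者孕39周。"
kept, dropped = deduplicate(split_short_sentences(record))
print(kept, dropped)
```

Lowering `threshold` makes the filter more aggressive, so near-duplicates that differ by a single digit or word are dropped as well. A trained classifier, as in the study, would replace the fixed cut-off with a learned decision boundary.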
