首页> 美国卫生研究院文献>BioMed Research International >Natural Language Processing Based Instrument for Classification of Free Text Medical Records
【2h】

Natural Language Processing Based Instrument for Classification of Free Text Medical Records

机译:基于自然语言处理的自由文本病历分类工具

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

According to the Ministry of Labor, Health and Social Affairs of Georgia a new health management system has to be introduced in the nearest future. In this context arises the problem of structuring and classifying documents containing all the history of medical services provided. The present work introduces the instrument for classification of medical records based on the Georgian language. It is the first attempt of such classification of the Georgian language based medical records. On the whole 24.855 examination records have been studied. The documents were classified into three main groups (ultrasonography, endoscopy, and X-ray) and 13 subgroups using two well-known methods: Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). The results obtained demonstrated that both machine learning methods performed successfully, with a little supremacy of SVM. In the process of classification a “shrink” method, based on features selection, was introduced and applied. At the first stage of classification the results of the “shrink” case were better; however, on the second stage of classification into subclasses 23% of all documents could not be linked to only one definite individual subclass (liver or binary system) due to common features characterizing these subclasses. The overall results of the study were successful.
机译:佐治亚州劳工,卫生和社会事务部表示,必须在不久的将来引入新的卫生管理系统。在这种情况下,出现了对包含所有提供的医疗服务历史的文件进行结构化和分类的问题。本工作介绍了根据格鲁吉亚语言对医疗记录进行分类的工具。这是对基于格鲁吉亚语言的病历进行这种分类的首次尝试。总体上研究了24.855个检查记录。使用两种众所周知的方法将文档分为三个主要组(超声检查,内窥镜检查和X射线检查)和13个子组:支持向量机(SVM)和K最近邻(KNN)。获得的结果表明,两种机器学习方法均成功执行,而SVM的优势却很小。在分类过程中,引入并应用了基于特征选择的“收缩”方法。在分类的第一阶段,“收缩”案例的结果更好。但是,在归类为子类的第二阶段,由于这些子类的共同特征,因此无法将所有文档的23%仅链接到一个确定的子类(肝脏或二进制系统)。研究的总体结果是成功的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号