首页> 外国专利> Method and system for creating computer-understandable structured medical data from natural language reports

Method and system for creating computer-understandable structured medical data from natural language reports

机译:从自然语言报告中创建计算机可理解的结构化医学数据的方法和系统

摘要

A natural language translation method and system translating medical reports created in natural language into structured data frames that can be utilized in computer databases for decision support, billing, research, and other purposes. Structured data entry is elicited from a patient in order to identify an appropriate disease signature corresponding to his or her condition and symptoms. In turn, the disease signature identifies the appropriate lexical domain with which to analyze the natural language report. The translation method and system use statistical analysis based on empirical data that particular combinations of words have interdepended previously within a modeled context and how frequently individual words interdepend generally and with what kinds of words. For each sentence in the report, the words in the medical report are looked up in the lexical domain individually and in combination with all other words coexisting in the same sentence. The word combinations are parsed to determine the likelihood the words interdepend in the report. For those words determined to interdepend, a semantic interpreter defines the semantic relationship between the words. A frame generator compiles the word relationships into records having fields recognized as pertinent by the disease signature and that can be searched and sorted by computers on those fields.
机译:一种自然语言翻译方法和系统,可以将以自然语言创建的医学报告转换为结构化的数据框架,可以在计算机数据库中利用该数据库进行决策支持,计费,研究和其他目的。从患者中引出结构化数据输入,以识别与其病情和症状相对应的适当疾病特征。反过来,疾病特征标识了用于分析自然语言报告的适当词汇域。翻译方法和系统使用基于经验数据的统计分析,该经验数据是单词的特定组合先前已在建模上下文中相互依赖,以及单个单词通常相互依赖的频率以及与哪种类型的单词相互依赖。对于报告中的每个句子,医学报告中的单词将在词法域中单独查找,并与同一句子中共存的所有其他单词组合使用。解析单词组合以确定单词在报告中相互依赖的可能性。对于确定为相互依赖的那些词,语义解释器定义了词之间的语义关系。帧生成器将单词关系编译成记录,记录中包含由疾病特征识别为相关字段,并且可以由这些字段上的计算机进行搜索和分类。

著录项

  • 公开/公告号US2003105638A1

    专利类型

  • 公开/公告日2003-06-05

    原文格式PDF

  • 申请/专利权人 TAIRA RICK K.;

    申请/专利号US20010996522

  • 发明设计人 RICK K. TAIRA;

    申请日2001-11-27

  • 分类号G10L21/00;

  • 国家 US

  • 入库时间 2022-08-22 00:08:24

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号