首页> 外国专利> SYSTEMS, METHODS AND COMPUTER PROGRAMS FOR CUSTOMIZED NATURAL LANGUAGE PROCESSING AND SEARCHING BASED ON TECHNICAL TERMS WITHIN ELECTRONIC DOCUMENTS

SYSTEMS, METHODS AND COMPUTER PROGRAMS FOR CUSTOMIZED NATURAL LANGUAGE PROCESSING AND SEARCHING BASED ON TECHNICAL TERMS WITHIN ELECTRONIC DOCUMENTS

机译:基于电子文档内技术术语的自定义自然语言处理和搜索的系统,方法和计算机程序

摘要

Methods, systems, and computer readable media concern natural language processing and searching for identifying biological products in an electronic document. The method includes extracting, from the electronic document, a candidate text phrase representing a potential biological product reference in the electronic document and parsing the candidate text phrase into a syntactic structure including one or more terms. The method includes tagging each of the one or more terms in the syntactic structure with a vocabulary tag. The vocabulary tag represents a technical meaning of a term in the potential biological product reference. The method includes calculating a total score for the candidate text phrase based on relative tag scores associated with each vocabulary tag for the one or more terms. The method includes classifying the candidate text phrase as a biological product reference and includes searching a database for one or more product entries based on the biological product references.
机译:方法,系统和计算机可读介质涉及自然语言处理和搜索以识别电子文档中的生物产品。该方法包括从电子文档中提取表示电子文档中潜在生物制品参考的候选文本短语,并将该候选文本短语解析为包括一个或多个术语的句法结构。该方法包括用词汇标签来标记句法结构中的一个或多个术语中的每一个。词汇标签表示潜在的生物产品参考中术语的技术含义。该方法包括基于与针对一个或多个术语的每个词汇标签相关联的相对标签得分来计算候选文本短语的总得分。该方法包括将候选文本短语分类为生物产品参考,并且包括基于生物产品参考在数据库中搜索一个或多个产品条目。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号