首页> 外国专利> SYSTEMS AND METHODS TO DETERMINE AND UTILIZE SEMANTIC RELATEDNESS BETWEEN MULTIPLE NATURAL LANGUAGE SOURCES TO DETERMINE STRENGTHS AND WEAKNESSES

SYSTEMS AND METHODS TO DETERMINE AND UTILIZE SEMANTIC RELATEDNESS BETWEEN MULTIPLE NATURAL LANGUAGE SOURCES TO DETERMINE STRENGTHS AND WEAKNESSES

机译:确定和利用多种自然语言源之间的语义相关性来确定强度和弱点的系统和方法

摘要

A microprocessor executable method transforms unstructured natural language texts by way of a preprocessing pipeline into a structured data representation of the entities described in the original text. The structured data representation is conducive to further processing by machine methods. The transformation process is learned by a machine learned model trained to identify relevant text segments and disregard irrelevant text segments The resulting structured data representation is refined to more accurately represent the respective entities.
机译:微处理器可执行方法通过预处理管道将非结构化自然语言文本转换为原始文本中描述的实体的结构化数据表示。结构化的数据表示有利于通过机器方法进行进一步处理。转换过程是通过训练有素的机器学习模型学习的,该模型可以识别相关文本段,而忽略无关文本段。最终的结构化数据表示形式经过改进,可以更准确地表示各个实体。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号