首页> 外国专利> Extracting data from semi-structured information utilizing a discriminative context free grammar

Extracting data from semi-structured information utilizing a discriminative context free grammar

机译:利用区分上下文无关文法从半结构化信息中提取数据

摘要

A discriminative grammar framework utilizing a machine learning algorithm is employed to facilitate in learning scoring functions for parsing of unstructured information. The framework includes a discriminative context free grammar that is trained based on features of an example input. The flexibility of the framework allows information features and/or features output by arbitrary processes to be utilized as the example input as well. Myopic inside scoring is circumvented in the parsing process because contextual information is utilized to facilitate scoring function training.
机译:采用利用机器学习算法的判别语法框架来促进学习用于解析非结构化信息的评分功能。框架包括基于示例输入的特征进行训练的判别上下文无关文法。框架的灵活性允许信息特征和/或任意过程输出的特征也可以用作示例输入。在分析过程中避免了近视内部评分,因为利用了上下文信息来促进评分功能的训练。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号