首页> 外国专利> Extracting data from semi-structured information utilizing a discriminative context free grammar

Extracting data from semi-structured information utilizing a discriminative context free grammar

机译：利用区分上下文无关文法从半结构化信息中提取数据

页面导航

摘要
著录项
相似文献

摘要

A discriminative grammar framework utilizing a machine learning algorithm is employed to facilitate in learning scoring functions for parsing of unstructured information. The framework includes a discriminative context free grammar that is trained based on features of an example input. The flexibility of the framework allows information features and/or features output by arbitrary processes to be utilized as the example input as well. Myopic inside scoring is circumvented in the parsing process because contextual information is utilized to facilitate scoring function training.

机译：采用利用机器学习算法的判别语法框架来促进学习用于解析非结构化信息的评分功能。框架包括基于示例输入的特征进行训练的判别上下文无关文法。框架的灵活性允许信息特征和/或任意过程输出的特征也可以用作示例输入。在分析过程中避免了近视内部评分，因为利用了上下文信息来促进评分功能的训练。

著录项

公开/公告号US2006245641A1

专利类型
公开/公告日2006-11-02

原文格式PDF
申请/专利权人 PAUL A. VIOLA;MUKUND NARASIMHAN;MICHAEL SHILMAN;
展开▼

申请/专利号US20050119467
发明设计人 PAUL A. VIOLA;MUKUND NARASIMHAN;MICHAEL SHILMAN;
展开▼

申请日2005-04-29
分类号G06K9/62;G06F17/27;G06K9/46;
国家 US
入库时间 2022-08-21 21:45:50

相似文献

专利
外文文献
中文文献