首页> 外国专利> LEARNING SYNTACTIC PATTERNS FOR AUTOMATIC DISCOVERY OF CAUSAL RELATIONS FROM TEXT

LEARNING SYNTACTIC PATTERNS FOR AUTOMATIC DISCOVERY OF CAUSAL RELATIONS FROM TEXT

机译:用于从文本中自动发现因果关系的学习句法模式

摘要

The present invention provides a method for extracting relationships between words in textual data. Initially, training relationship data, such as word triplets describing a cause-effect relationship, is received and used to collect additional textual data including the training relationship data. Distributed data collection is used to receive the training data and collect the additional textual data, allowing a broad range of data to be acquired from multiple sources. Syntactic patterns are extracted from the additional textual data and a distributed data source is scanned to extract additional relationship data describing one or more causal relationships using the extracted syntactic patterns. The extracted additional relationship data is then stored, and can be validated by a supervised learning algorithm before storage and used to train a classifier for automatic validation of additional relationship data.
机译:本发明提供了一种用于提取文本数据中的单词之间的关系的方法。最初,训练关系数据,例如描述因果关系的单词三联词,被接收并用于收集包括训练关系数据的附加文本数据。分布式数据收集用于接收训练数据并收集其他文本数据,从而可以从多个来源获取广泛的数据。从附加文本数据中提取语法模式,并扫描分布式数据源,以使用提取的语法模式提取描述一个或多个因果关系的附加关系数据。然后存储提取的附加关系数据,并可以在存储之前通过监督学习算法对其进行验证,并用于训练分类器以自动验证附加关系数据。

著录项

  • 公开/公告号US2007282814A1

    专利类型

  • 公开/公告日2007-12-06

    原文格式PDF

  • 申请/专利权人 RAKESH GUPTA;

    申请/专利号US20070754966

  • 发明设计人 RAKESH GUPTA;

    申请日2007-05-29

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 20:10:44

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号