首页> 外国专利> Syntactic pattern learning to automatically discover causality from text

Syntactic pattern learning to automatically discover causality from text

机译:句法模式学习可自动发现文本的因果关系

摘要

The present invention provides a method for extracting relationships between words in textual data. Initially, training relationship data, such as word triplets describing a cause-effect relationship, is received and used to collect additional textual data including the training relationship data. Distributed data collection is used to receive the training data and collect the additional textual data, allowing a broad range of data to be acquired from multiple sources. Syntactic patterns are extracted from the additional textual data and a distributed data source is scanned to extract additional relationship data describing one or more causal relationships using the extracted syntactic patterns. The extracted additional relationship data is then stored, and can be validated by a supervised learning algorithm before storage and used to train a classifier for automatic validation of additional relationship data.
机译:本发明提供了一种用于提取文本数据中的单词之间的关系的方法。最初,训练关系数据,例如描述因果关系的单词三联词,被接收并用于收集包括训练关系数据的附加文本数据。分布式数据收集用于接收训练数据并收集其他文本数据,从而可以从多个来源获取广泛的数据。从附加文本数据中提取语法模式,并扫描分布式数据源,以使用提取的语法模式提取描述一个或多个因果关系的附加关系数据。然后存储提取的附加关系数据,并可以在存储之前通过监督学习算法对其进行验证,并用于训练分类器以自动验证附加关系数据。

著录项

  • 公开/公告号JP2009539191A

    专利类型

  • 公开/公告日2009-11-12

    原文格式PDF

  • 申请/专利权人 本田技研工業株式会社;

    申请/专利号JP20090513428

  • 发明设计人 グプタ、ラケシュ;

    申请日2007-05-30

  • 分类号G06F17/27;

  • 国家 JP

  • 入库时间 2022-08-21 19:01:05

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号