【24h】

Extracting Semantic Networks from Text Via Relational Clustering

机译:通过关系聚类从文本中提取语义网络

获取原文

摘要

Extracting knowledge from text has long been a goal of AI. Initial approaches were purely logical and brittle. More recently, the availability of large quantities of text on the Web has led to the development of machine learning approaches. However, to date these have mainly extracted ground facts, as opposed to general knowledge. Other learning approaches can extract logical forms, but require supervision and do not scale. In this paper we present an unsupervised approach to extracting semantic networks from large volumes of text. We use the TextRunner system [1] to extract tuples from text, and then induce general concepts and relations from them by jointly clustering the objects and relational strings in the tuples. Our approach is defined in Markov logic using four simple rules. Experiments on a dataset of two million tuples show that it outperforms three other relational clustering approaches, and extracts meaningful semantic networks.
机译:从文本中提取知识一直是AI的目标。最初的方法纯粹是逻辑性和脆弱性。最近,Web上大量文本的可用性导致了机器学习方法的发展。但是,迄今为止,这些知识主要是提取了事实,而不是常识。其他学习方法可以提取逻辑形式,但是需要监督并且不能扩展。在本文中,我们提出了一种从大量文本中提取语义网络的无监督方法。我们使用TextRunner系统[1]从文本中提取元组,然后通过将元组中的对象和关系字符串联合聚类而从中导出一般概念和关系。我们的方法是使用四个简单规则在马尔可夫逻辑中定义的。在200万个元组的数据集上进行的实验表明,它优于其他三种关系聚类方法,并且提取了有意义的语义网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号