首页> 外文期刊>SIGKDD explorations >Constructing High Precision Knowledge Bases with Subjective and Factual Attributes
【24h】

Constructing High Precision Knowledge Bases with Subjective and Factual Attributes

机译:用主观和事实属性构建高精度知识库

获取原文
获取原文并翻译 | 示例
       

摘要

Knowledge bases (KBs) are the backbone of many ubiquitous applications and are thus required to exhibit high precision. However, for KBs that store subjective attributes of entities, e.g., whether a movie is kid friendly, simply estimating precision is complicated by the inherent ambiguity in measuring subjective phenomena. In this work, we develop a method for constructing KBs with tunable precision-i.e., KBs that can be made to operate at a specific false positive rate, despite storing both difficult-to-evaluate subjective attributes and more traditional factual attributes. The key to our approach is probabilistically modeling user consensus with respect to each entity-attribute pair, rather than modeling each pair as either True or False. Uncertainty in the model is explicitly represented and used to control the KB's precision. We propose three neural networks for fitting the consensus model and evaluate each one on data from Google Maps-a large KB of locations and their subjective and factual attributes. The results demonstrate that our learned models are well-calibrated and thus can successfully be used to control the KB's precision. Moreover, when constrained to maintain 95% precision, the best consensus model matches the F-score of a baseline that models each entity-attribute pair as a binary variable and does not support tunable precision. When unconstrained, our model dominates the same baseline by 12% F-score. Finally, we perform an empirical analysis of attribute-attribute correlations and show that leveraging them effectively contributes to reduced uncertainty and better performance in attribute prediction.
机译:知识库(KBS)是许多普遍存在的应用的骨干,因此需要高精度。然而,对于存储实体的主观属性的KBS,例如电影是儿童友好的,只需估算精度是通过测量主观现象的固有模糊性的复杂性。在这项工作中,我们开发了一种用可调谐精度-i.e构建KB的方法,可以以特定的假阳性率操作,尽管存储难以评估主体属性和更传统的事实属性。我们方法的关键是概率地对每个实体属性对的用户共识,而不是将每对的对真实或虚假建模。模型中的不确定性明确表示并用于控制KB的精度。我们提出了三个神经网络,用于拟合共识模型,并评估来自Google地图的数据 - 一个大KB的位置及其主观和事实属性。结果表明,我们的学习模型很校准,因此可以成功用于控制KB的精确度。此外,当约束为维持95%的精度时,最好的共识模型与基线的F分数匹配,该基线模拟每个实体属性对作为二进制变量,并且不支持可调精度。当不受约束时,我们的模型将相同的基线占据12%F分数。最后,我们对属性属性相关性进行了实证分析,并显示利用它们有效地有助于降低属性预测中的不确定性和更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号