首页> 外文期刊>International journal of software engineering and knowledge engineering >Identifying Notable Tuples in Multi-concept Web Tables
【24h】

Identifying Notable Tuples in Multi-concept Web Tables

机译:Identifying Notable Tuples in Multi-concept Web Tables

获取原文
获取原文并翻译 | 示例
           

摘要

Identifying notable tuples in a web table is of great help for table understanding and table summarization. However, existing document-internal feature-based methods are inappropriate for identifying notable tuples in web tables. Additionally, for the web table describing multiple concepts, the notability evaluation of a tuple needs to take into account multiple entities as well as their importance in this tuple. In this paper, we investigate the task of identifying notable tuples in a multi-concept web table and propose a framework that includes three tasks: (1) identify multiple entity columns and their importance weights by building a column correlation graph based on types and relationships in the table; (2) obtain fine-grained entity notability scores based on entity link graph and provide solution for entity link failure and entity domain neglection; and (3) evaluate tuple notability by a weighted sum of notability scores of all entities in the tuple. Comprehensive evaluation of our approach is based on real-world web tables. The results demonstrate that our approach outperforms the state-of-the-art baselines by 4.6% on the precision of detecting multiple entity columns and by 12.5% on the metric normalized discounted cumulative gain (NDCG) of evaluating entity notability.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号