Zero-shot Entity Extraction from Web Pages

机译：从网页中抽取零镜头实体

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In order to extract entities of a fine-grained category from semi-structured data in web pages, existing information extraction systems rely on seed examples or redundancy across multiple web pages. In this paper, we consider a new zero-shot learning task of extracting entities specified by a natural language query (in place of seeds) given only a single web page. Our approach defines a log-linear model over latent extraction predicates, which select lists of entities from the web page. The main challenge is to define features on widely varying candidate entity lists. We tackle this by ing list elements and using aggregate statistics to define features. Finally, we created a new dataset of diverse queries and web pages, and show that our system achieves significantly better accuracy than a natural baseline.

机译：为了从网页中的半结构化数据中提取细粒度类别的实体，现有的信息提取系统依赖于种子示例或跨多个网页的冗余。在本文中，我们考虑了一个新的零击学习任务，该任务提取仅由单个网页提供的由自然语言查询（代替种子）指定的实体。我们的方法在潜在提取谓词上定义了对数线性模型，该模型从网页中选择实体列表。主要的挑战是在广泛变化的候选实体列表上定义特征。我们通过列出列表元素并使用汇总统计信息来定义特征来解决此问题。最后，我们创建了一个包含各种查询和网页的新数据集，并表明我们的系统比自然基准具有更高的准确性。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2014年|391-401|共11页
会议地点
作者
Panupong Pasupat; Percy Liang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Webly-supervised zero-shot learning for artwork instance recognition [J] . Del Chiaro Riccardo, Bagdanov Andrew D., Del Bimbo Alberto Pattern recognition letters . 2019,第Deca期

机译：网络指导的零镜头学习，用于艺术品实例识别
2. Zero-Shot Embedding for Unseen Entities in Knowledge Graph [J] . Yu ZHAO, Sheng GAO, Patrick GALLINARI, IEICE transactions on information and systems . 2017,第7期

机译：知识图中看不见实体的零射嵌入
3. Multi-Distribution Characteristics Based Chinese Entity Synonym Extraction from The Web [J] . Xiangfeng Luo, Xiuxia Ma, Subin Huang, International Journal of Intelligent Information Technologies . 2019,第3期

机译：基于多分发特征的Web中的中国实体同义词提取
4. Zero-shot Entity Extraction from Web Pages [C] . Panupong Pasupat, Percy Liang Annual meeting of the Association for Computational Linguistics . 2014

机译：零拍实体从网页提取
5. Using a named entity tagger and a syntactic parser to improve Web-based answer extraction [D] . Kamel, Yasser. 2004

机译：使用命名实体标记器和语法解析器来改进基于Web的答案提取
6. Liberal Entity Extraction: Rapid Construction of Fine-Grained Entity Typing Systems [O] . Lifu Huang, Jonathan May, Xiaoman Pan, -1

机译：自由实体提取：细粒度实体键入系统的快速构建
7. Zero-shot Entity Extraction from Web Pages [O] . Panupong Pasupat, Percy Liang 2015

机译：从网页中提取零镜头实体
8. Entity Came to Rescue - Leveraging Entities to Minimize Risks in Web Search. [R] . Liu, X., Yang, P., Fang, H. 2014

机译：实体拯救 - 利用实体最大限度地减少网络搜索中的风险。

Zero-shot Entity Extraction from Web Pages

摘要

著录项

相似文献

相关主题

期刊订阅