Conference: International World Wide Web Conference

Organizing and Searching the World Wide Web of Facts - Step Two: Harnessing the Wisdom of the Crowds

Abstract

As part of a broader effort to acquire large repositories of facts from unstructured text on the Web, a seed-based framework for textual information extraction allows for weakly supervised extraction of class attributes (e.g., side effects and generic equivalent for drugs) from anonymized query logs. The extraction is guided by a small set of seed attributes, without any need for handcrafted extraction patterns or further domain-specific knowledge. The attributes extracted for classes pertaining to various domains of interest to Web search users reach accuracy levels significantly exceeding the current state of the art. Inherently noisy search queries are shown to be a highly valuable, albeit unexplored, resource for Web-based information extraction, in particular for the task of class attribute extraction.
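To make the idea of seed-guided ranking concrete, the following is a minimal toy sketch and not the method of the paper, whose details the abstract does not give. It assumes that a candidate attribute is whatever remains of a query once a known class instance is removed, that each candidate is represented by counts of the instances it co-occurs with, and that candidates are ranked by cosine similarity to the summed vectors of the seed attributes. All function names, the sample queries, and the instance list are hypothetical.

```python
from collections import Counter, defaultdict
from math import sqrt


def remainder(query, instance):
    """Return the query with one whole-word occurrence of `instance` removed.

    Returns None if the instance does not occur or nothing else is left.
    """
    tokens = query.lower().split()
    inst = instance.lower().split()
    n = len(inst)
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == inst:
            rest = tokens[:i] + tokens[i + n:]
            return " ".join(rest) if rest else None
    return None


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(queries, instances, seeds):
    """Rank non-seed candidate phrases by similarity to the seed attributes."""
    # Assumption: a candidate's vector counts the instances it appears with.
    vectors = defaultdict(Counter)
    for query in queries:
        for instance in instances:
            phrase = remainder(query, instance)
            if phrase is not None:
                vectors[phrase][instance] += 1

    # The reference vector is the sum of the seed attributes' vectors.
    reference = Counter()
    for seed in seeds:
        reference.update(vectors.get(seed, Counter()))

    scored = [(phrase, cosine(vec, reference))
              for phrase, vec in vectors.items() if phrase not in seeds]
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Hypothetical sample data for the class "drugs".
QUERIES = [
    "aspirin side effects", "ibuprofen side effects", "tylenol side effects",
    "aspirin generic equivalent", "ibuprofen generic equivalent",
    "aspirin dosage", "ibuprofen dosage", "tylenol dosage",
    "aspirin bayer", "cheap flights",
]
INSTANCES = ["aspirin", "ibuprofen", "tylenol"]
SEEDS = {"side effects", "generic equivalent"}

ranked = rank_candidates(QUERIES, INSTANCES, SEEDS)
for phrase, score in ranked:
    print(f"{phrase}\t{score:.3f}")
```

In this toy data, "dosage" co-occurs with the same instances as the seeds and therefore ranks above "bayer", which appears with a single instance only. A realistic system would need far richer evidence than instance co-occurrence to cope with noisy queries.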