Conference: IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference

Research on open domain Named entity recognition based on Chinese query logs


Abstract

Search engine query logs contain large numbers of named entities. Named-entity extraction is a basic task in information extraction, but traditional methods can extract only specific categories of entities, which makes them difficult to apply directly to named-entity recognition in query logs. This paper proposes a novel approach for extracting named entities from user query logs. To avoid dependence on a large-scale tagged corpus, we annotate the data automatically using distant supervision, which removes the need for human annotation of the training data. Open-domain named entities are then extracted from user query logs with a conditional random field model. Evaluation on user query logs shows that the approach is effective at extracting open-domain named entities.
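
The abstract does not say which knowledge source or label set the authors use for distant supervision, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a hypothetical seed dictionary `entity_dict` that maps known entity strings to category labels, and it tags each character of a Chinese query with BIO labels by longest dictionary match. The function name `distant_label` and the labels `LOC` and `ORG` are illustrative assumptions.

```python
from typing import Dict, List, Tuple


def distant_label(query: str, entity_dict: Dict[str, str]) -> List[Tuple[str, str]]:
    """Tag each character of `query` with a BIO label.

    `entity_dict` is a hypothetical seed dictionary (entity string -> category);
    the paper's real supervision source is not given in the abstract.
    """
    # Longest entity length bounds how far ahead we need to look.
    max_len = max((len(e) for e in entity_dict), default=0)
    tags: List[Tuple[str, str]] = []
    i = 0
    while i < len(query):
        match = None
        # Try the longest candidate span first so a longer entity
        # wins over a shorter one that is its prefix.
        for span in range(min(max_len, len(query) - i), 0, -1):
            candidate = query[i:i + span]
            if candidate in entity_dict:
                match = candidate
                break
        if match is None:
            # No dictionary entry starts here: mark the character as outside.
            tags.append((query[i], "O"))
            i += 1
        else:
            label = entity_dict[match]
            tags.append((match[0], "B-" + label))
            for ch in match[1:]:
                tags.append((ch, "I-" + label))
            i += len(match)
    return tags


# Hypothetical seed entries, for illustration only.
seed = {"北京": "LOC", "北京大学": "ORG"}
labeled = distant_label("北京大学地址", seed)
```

Sequences labeled this way could then serve as automatically generated training data for a sequence model such as a conditional random field, in place of a hand-annotated corpus. Dictionary matching of this kind is noisy (ambiguous strings and missing entries both produce wrong tags), which is a known trade-off of distant supervision.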
