首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Using Latent Topic Features for Named Entity Extraction in Search Queries
【24h】

Using Latent Topic Features for Named Entity Extraction in Search Queries

机译:使用潜在主题功能在搜索查询中提取命名实体

获取原文

摘要

Search is one of the most quickly growing applications in the mobile market. As people rely more on portable devices for performing search, it becomes increasingly important to analyze user queries in order to achieve more targetted results over a broad set of search entities. While most previous work has relied on lexico-syntactic features and handcrafted knowledge sources, this paper investigates methods for learning latent semantic features from unlabelled user-generated content. We extract word-topic associations by training a Latent Dirichlet Allocation model on a corpus of online reviews, and show that this information improves named-entity classification performance over broad domain search queries. We believe that topical features provide a rich source of information from data with minimal manual effort, and no dependency on a specific language.
机译:搜索是移动市场上增长最快的应用程序之一。随着人们越来越依赖便携式设备来执行搜索,分析用户查询以在广泛的搜索实体上获得更多目标结果变得越来越重要。虽然大多数以前的工作都依赖于词汇句法功能和手工制作的知识源,但本文研究了从未标记的用户生成的内容中学习潜在语义功能的方法。我们通过在在线评论语料库上训练潜在Dirichlet分配模型来提取单词-主题关联,并显示此信息可改善广域搜索查询中的命名实体分类性能。我们相信,主题功能可以以最少的人工工作就可以从数据中获取丰富的信息,并且不依赖于特定的语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号