首页> 外文会议>Annual conference of the International Speech Communication Association >Using Latent Topic Features for Named Entity Extraction in Search Queries
【24h】

Using Latent Topic Features for Named Entity Extraction in Search Queries

机译:在搜索查询中使用潜在的主题特征在搜索查询中的命名实体提取

获取原文

摘要

Search is one of the most quickly growing applications in the mobile market. As people rely more on portable devices for performing search, it becomes increasingly important to analyze user queries in order to achieve more targetted results over a broad set of search entities. While most previous work has relied on lexico-syntactic features and handcrafted knowledge sources, this paper investigates methods for learning latent semantic features from unlabelled user-generated content. We extract word-topic associations by training a Latent Dirichlet Allocation model on a corpus of online reviews, and show that this information improves named-entity classification performance over broad domain search queries. We believe that topical features provide a rich source of information from data with minimal manual effort, and no dependency on a specific language.
机译:搜索是移动市场中最迅速增长的应用之一。随着人们在用于执行搜索的便携式设备上依赖于便携式设备,越来越重要的是分析用户查询,以便通过广泛的搜索实体实现更多目标结果。虽然最先前的工作依赖于词典语法特征和手工制作的知识来源,但本文研究了从未标识的用户生成内容学习潜在语义特征的方法。通过在线评论语料库培训潜在的Dirichlet分配模型,提取单词主题关联,并显示此信息通过广泛的域搜索查询提高了命名实体分类性能。我们认为,主题功能从具有最小手动努力的数据提供丰富的信息来源,并且没有对特定语言的依赖性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号