首页> 外文会议>International conference on very large data bases >Supporting Keyword Search in Product Database: A Probabilistic Approach
【24h】

Supporting Keyword Search in Product Database: A Probabilistic Approach

机译:在产品数据库中支持关键字搜索:一种概率方法

获取原文

摘要

The ability to let users search for products conveniently in product database is critical to the success of e-commerce. Although structured query languages (e.g. SQL) can be used to effectively access the product database, it is very difficult for end users to learn and use. In this paper, we study how to optimize search over structured product entities (represented by specifications) with keyword queries such as "cheap gaming laptop". One major difficulty in this problem is the vocabulary gap between the specifications of products in the database and the keywords people use in search queries. To solve the problem, we propose a novel probabilistic entity retrieval model based on query generation, where the entities would be ranked for a given keyword query based on the likelihood that a user who likes an entity would pose the query. Different ways to estimate the model parameters would lead to different variants of ranking functions. We start with simple estimates based on the specifications of entities, and then leverage user reviews and product search logs to improve the estimation. Multiple estimation algorithms are developed based on Maximum Likelihood and Maximum a Posteriori estimators. We evaluate the proposed product entity retrieval models on two newly created product search test collections. The results show that the proposed model significantly outperforms the existing retrieval models, benefiting from the modeling of attribute-level relevance. Despite the focus on product retrieval, the proposed modeling method is general and opens up many new opportunities in analyzing structured entity data with unstructured text data. We show the proposed probabilistic model can be easily adapted for many interesting applications including facet generation and review annotation.
机译:让用户方便地在产品数据库中搜索产品的能力对于电子商务的成功至关重要。尽管可以使用结构化查询语言(例如SQL)有效访问产品数据库,但最终用户很难学习和使用。在本文中,我们研究如何通过关键字查询(例如“廉价游戏笔记本电脑”)优化结构化产品实体(以规格表示)的搜索。这个问题的主要困难是数据库中产品规格与人们在搜索查询中使用的关键字之间的词汇差异。为了解决该问题,我们提出了一种基于查询生成的新型概率实体检索模型,其中,将根据喜欢实体的用户提出查询的可能性,对给定关键字查询对实体进行排名。估计模型参数的不同方法将导致排序函数的不同变体。我们从基于实体规范的简单估算开始,然后利用用户评论和产品搜索日志来改进估算。基于最大似然和最大后验估计量,开发了多种估计算法。我们在两个新创建的产品搜索测试集合上评估建议的产品实体检索模型。结果表明,受益于属性级别相关性的建模,所提出的模型明显优于现有的检索模型。尽管侧重于产品检索,但是所提出的建模方法是通用的,并为分析具有非结构化文本数据的结构化实体数据开辟了许多新机会。我们展示了所提出的概率模型可以轻松地适用于许多有趣的应用程序,包括构面生成和评论注释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号