首页> 外文会议>International Conference on Text, Speech and Dialogue >FAQIR - A Frequently Asked Questions Retrieval Test Collection
【24h】

FAQIR - A Frequently Asked Questions Retrieval Test Collection

机译:Faqir - 常见问题检索测试收集

获取原文

摘要

Frequently asked question (FAQ) collections are commonly used across the web to provide information about a specific domain (e.g., services of a company). With respect to traditional information retrieval, FAQ retrieval introduces additional challenges, the main ones being (1) the brevity of FAQ texts and (2) the need for topic-specific knowledge. The primary contribution of our work is a new domain-specific FAQ collection, providing a large number of queries with manually annotated relevance judgments. On this collection, we test several unsupervised baseline models, including both count based and semantic embedding based models, as well as a combined model. We evaluate the performance across different setups and identify potential venues for improvement. The collection constitutes a solid basis for research in supervised machine-learning-based FAQ retrieval.
机译:常见问题(常见问题解答)集合通常在网站上使用,以提供有关特定域的信息(例如,公司的服务)。关于传统信息检索,常见问题解答引入了额外的挑战,主要是(1)常见问题文本的简洁性和(2)对特定主题知识的需要。我们的工作的主要贡献是一个新的域特定的常见问题解答集合,提供了大量查询,手动注释相关性判断。在此集合上,我们测试了几种无监督的基线模型,包括基于计数和基于语义嵌入的模型,以及组合模型。我们评估不同设置的表现,并识别潜在的改进场地。该集合构成了基于监督基于机器学习的常见问题解答的研究的坚实基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号