首页> 美国卫生研究院文献>other >A Topic Clustering Approach to Finding Similar Questions from Large Question and Answer Archives
【2h】

A Topic Clustering Approach to Finding Similar Questions from Large Question and Answer Archives

机译:从大型问答档案库中查找相似问题的主题聚类方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

With the blooming of Web 2.0, Community Question Answering (CQA) services such as Yahoo! Answers (), WikiAnswer (), and Baidu Zhidao (), etc., have emerged as alternatives for knowledge and information acquisition. Over time, a large number of question and answer (Q&A) pairs with high quality devoted by human intelligence have been accumulated as a comprehensive knowledge base. Unlike the search engines, which return long lists of results, searching in the CQA services can obtain the correct answers to the question queries by automatically finding similar questions that have already been answered by other users. Hence, it greatly improves the efficiency of the online information retrieval. However, given a question query, finding the similar and well-answered questions is a non-trivial task. The main challenge is the word mismatch between question query (query) and candidate question for retrieval (question). To investigate this problem, in this study, we capture the word semantic similarity between query and question by introducing the topic modeling approach. We then propose an unsupervised machine-learning approach to finding similar questions on CQA Q&A archives. The experimental results show that our proposed approach significantly outperforms the state-of-the-art methods.
机译:随着Web 2.0的兴起,诸如Yahoo!等社区问题解答(CQA)服务。 Answers(),WikiAnswer()和Baidu Zhidao()等已成为知识和信息获取的替代方法。随着时间的流逝,已经积累了许多由人为智能组成的高质量的问答(Q&A)对,作为一个全面的知识库。与返回一长串结果的搜索引擎不同,在CQA服务中进行搜索可以通过自动查找其他用户已经回答的类似问题来获得对问题查询的正确答案。因此,极大地提高了在线信息检索的效率。但是,对于一个问题查询,查找相似且答案明确的问题并非易事。主要的挑战是问题查询(查询)和候选检索对象(问题)之间的单词不匹配。为了研究这个问题,在本研究中,我们通过引入主题建模方法来捕获查询和问题之间的单词语义相似性。然后,我们提出了一种无监督的机器学习方法,以在CQA问答档案中找到类似的问题。实验结果表明,我们提出的方法明显优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号