Enhancing Question Retrieval in Community Question Answering Using Word Embeddings

Nouha Othman; Rim Faiz; Kamel Sma?li

首页> 外文期刊>Procedia Computer Science >Enhancing Question Retrieval in Community Question Answering Using Word Embeddings

【24h】

Enhancing Question Retrieval in Community Question Answering Using Word Embeddings

机译：使用词嵌入法增强社区问题解答中的问题检索

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Community Question Answering (CQA) services have evolved into a popular way of online information seeking, where users can interact and exchange knowledge in the form of questions and answers. In this paper, we study the problem of finding historical questions that are semantically equivalent to the queried ones, assuming that the answers to the similar questions should also answer the new ones. The major challenge of question retrieval is the word mismatch problem between questions, as users can formulate the same question using different wording. Most existing methods measure the similarity between questions based on the bag-of-words (BOWs) representation capturing no semantics between words. Therefore, this study proposes to use word embeddings, which can capture semantic and syntactic information from contexts, to vectorize the questions. The questions are clustered using Kmeans to speed up the search and ranking tasks. The similarity between the questions is measured using cosine similarity based on their weighted continuous valued vectors. We run our experiments on real world data set from Yahoo! Answers in English and Arabic to show the efficiency and generality of our proposed method.

机译：社区问答（CQA）服务已发展成一种流行的在线信息搜索方式，用户可以在其中以问答形式进行交互和交换知识。在本文中，我们假设在语义上等同于所查询问题的历史问题的查找问题，假设类似问题的答案也应回答新问题。问题检索的主要挑战是问题之间的词不匹配问题，因为用户可以使用不同的措词来表述相同的问题。现有的大多数方法都是基于词袋（BOW）表示法来度量问题之间的相似性，而这些词袋表示法在词之间没有语义。因此，本研究建议使用词嵌入，可以从上下文中捕获语义和句法信息，以对问题进行矢量化处理。使用Kmeans对问题进行聚类，以加快搜索和排名任务。问题之间的相似性是基于它们的加权连续值向量使用余弦相似性来衡量的。我们对Yahoo!的真实数据集进行了实验。用英语和阿拉伯语回答，说明我们提出的方法的效率和普遍性。

著录项

来源
《Procedia Computer Science》 |2019年第11期|共10页
作者
Nouha Othman; Rim Faiz; Kamel Sma?li;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Community Question AnsweringQuestion retrievalWord embeddings;

机译：社区问题解答问题检索单词嵌入;

相似文献

外文文献
中文文献
专利

1. Computing Word Semantic Relatedness for Question Retrieval in Community Question Answering [J] . Jung-Tae LEE, Young-In SONG, Hae-Chang RIM IEICE Transactions on Information and Systems . 2009,第4期

机译：计算词语义相关性以解决社区提问中的问题
2. Cross-lingual embedding for cross-lingual question retrieval in low-resource community question answering [J] . HajiAminShirazi Shahrzad, Momtazi Saeedeh Machine translation . 2020,第4期

机译：低资源社区问题回答中的交叉语言问题的交叉嵌入
3. Hybrid query expansion using lexical resources and word embeddings for sentence retrieval in question answering [J] . Information Sciences: An International Journal . 2020,第期

机译：混合查询扩展使用词汇资源和Word Embeddings用于句子检索中的句子回答
4. Learning Continuous Word Embedding with Metadata for Question Retrieval in Community Question Answering [C] . Guangyou Zhou, Tingting He, Jun Zhao, Annual meeting of the Association for Computational Linguistics;International joint conference on natural language processing of the Asian Federation of Natural Languages processing . 2015

机译：学习带有元数据的连续单词嵌入以在社区问答中检索问题
5. Automatic Neural Question Generation Using Community-Based Question Answering Systems [D] . Baghaee, Tina. 2018

机译：使用基于社区的问题应答系统的自动神经问题
6. Using the Weighted Keyword Model to Improve Information Retrieval for Answering Biomedical Questions [O] . Hong Yu, Yong-gang Cao 2009

机译：使用加权关键字模型改善回答生物医学问题的信息检索
7. Enhancing Question Retrieval in Community Question Answering Using Word Embeddings [O] . Nouha Othman, Rim Faiz, Kamel Smaïli 2019

机译：使用Word Embeddings回答社区问题中的提高问题

Enhancing Question Retrieval in Community Question Answering Using Word Embeddings

摘要

著录项

相似文献

相关主题

期刊订阅