首页> 外文期刊>Information Processing & Management >A novel community answer matching approach based on phrase fusion heterogeneous information network
【24h】

A novel community answer matching approach based on phrase fusion heterogeneous information network

机译:基于短语融合异构信息网络的新型社区回答匹配方法

获取原文
获取原文并翻译 | 示例
       

摘要

Community Question Answering (CQA) allows users to ask or answer questions in a social way, so it is becoming the primary means for people acquiring knowledge. However, the asker must wait until a satisfactory answer appears, which reduces user activity. In this paper, we propose an innovative answering method that matches the most relevant answers for the new issue automatically. Firstly, we utilize phrases to represent the semantic of the posts (answers/questions) and construct a Phrase Fusion Heterogeneous Information Network, called PFHIN, to represent complex entity relationships in CQA. So, the answer selection is regarded as the related entity retrieval task. Then, we define the distance between entities in PFHIN, which is independent of the meta path. Finally, the Type-constrained Top-k Similarity Entity Finding Algorithm (TTSEF) is proposed for finding the nearest entities according to the known start entity and end-entity type, which can match the most relevant answers automatically.To the best of our knowledge, it is the first work to define the phrase information network for answer selection and provide a novel idea for the heterogeneous information network fusion. Experimental results on three large-scale datasets (Stack Overflow, Super User, and Mathematics) from Stack Exchange demonstrate that our proposed approaches significantly outperform the state-of-the-art answer retrieval methods. Moreover, we conduct an in-depth analysis of the meta path to the optimal answer and reveal the critical role of phrases in community answer matching.
机译:社区问题回答(CQA)允许用户以社会方式询问或回答问题,因此它正在成为获取知识的人的主要方式。但是,提问者必须等到出现令人满意的答案,这减少了用户活动。在本文中,我们提出了一种创新的应答方法,它会自动匹配新问题的最相关的答案。首先,我们利用短语来代表帖子的语义(答案/问题)并构建一个名为Pfhin的短语融合异构信息网络,以表示CQA中的复杂实体关系。因此,答案选择被视为相关实体检索任务。然后,我们定义了PFHIN中的实体之间的距离,它与元路径无关。最后,提出了根据已知的起始实体和终端实体类型查找最近实体的类型约束的Top-K类似实体查找算法(TTSEF),其可以自动匹配最相关的答案。我们的知识,它是第一个定义答案选择的短语信息网络的工作,并为异构信息网络融合提供新的思路。来自堆栈交换的三个大型数据集(堆栈溢出,超用户和数学)的实验结果表明,我们的提出方法显着优于最先进的答案检索方法。此外,我们对最佳答案的元路径进行了深入的分析,并揭示了短语在社区回答匹配中的关键作用。

著录项

  • 来源
    《Information Processing & Management》 |2021年第1期|102408.1-102408.22|共22页
  • 作者单位

    School of Mathematical Sciences Hebei Normal University Hebei China;

    College of Computer and Cyber Security Hebei Normal University Hebei China Hebei Provincial Key Laboratory of Network and Information Security Hebei China Hebei Provincial Engineering Research Center for Supply Chain Big Data Analytics & Data Security Hebei China;

    College of Computer and Cyber Security Hebei Normal University Hebei China Hebei Provincial Key Laboratory of Network and Information Security Hebei China Hebei Provincial Engineering Research Center for Supply Chain Big Data Analytics & Data Security Hebei China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Community question answering; Heterogeneous information network fusion; Phrase embedding; Related entity matching;

    机译:社区问题回答;异构信息网络融合;短语嵌入;相关实体匹配;
  • 入库时间 2022-08-18 22:53:25

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号