首页> 外文会议> >Learning to Recognize Reliable Users and Content in Social Media with Coupled Mutual Reinforcement
【24h】

Learning to Recognize Reliable Users and Content in Social Media with Coupled Mutual Reinforcement

机译:学会通过相互强化来识别社交媒体中的可靠用户和内容

获取原文

摘要

Community Question Answering (CQA) has emerged as a popular forum for users to pose questions for other users to answer. Over the last few years, CQA portals such as Naver and Yahoo! Answers have exploded in popularity, and now provide a viable alternative to general purpose Web search. At the same time, the answers to past questions submitted in CQA sites comprise a valuable knowledge repository which could be a gold mine for information retrieval and automatic question answering. Unfortunately, the quality of the submitted questions and answers varies widely - increasingly so that a large fraction of the content is not usable for answering queries. Previous approaches for retrieving relevant and high quality content have been proposed, but they require large amounts of manually labeled data - which limits the applicability of the supervised approaches to new sites and domains. In this paper we address this problem by developing a semi-supervised coupled mutual reinforcement framework for simultaneously calculating content quality and user reputation, that requires relatively few labeled examples to initialize the training process. Results of a large scale evaluation demonstrate that our methods are more effective than previous approaches for finding high-quality answers, questions, and users. More importantly, our quality estimation significantly improves the accuracy of search over CQA archives over the state-of-the-art methods.
机译:社区问题解答(CQA)已经成为用户提出问题以供其他用户回答的流行论坛。在过去的几年中,CQA门户网站,例如Naver和Yahoo!答案已迅速普及,现在提供了通用Web搜索的可行替代方案。同时,在CQA网站上提交的过去问题的答案包括一个有价值的知识库,该知识库可能是信息检索和自动问题解答的金矿。不幸的是,提交的问题和答案的质量差异很大-越来越多,因此大部分内容不可用于回答查询。已经提出了用于检索相关和高质量内容的先前方法,但是它们需要大量的手动标记数据-这限制了有监督方法对新站点和域的适用性。在本文中,我们通过开发用于同时计算内容质量和用户信誉的半监督耦合互助框架来解决此问题,该框架仅需要相对较少的标记示例即可初始化训练过程。大规模评估的结果表明,在寻找高质量答案,问题和用户方面,我们的方法比以前的方法更有效。更重要的是,我们的质量估算大大提高了通过最新方法对CQA档案进行搜索的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号