首页> 外文会议>International Conference on Computational Science and Computational Intelligence >Topic Extraction and Classification for Questions Posted in Community-Based Question Answering Services
【24h】

Topic Extraction and Classification for Questions Posted in Community-Based Question Answering Services

机译:基于社区的问答服务中发布的问题的主题提取和分类

获取原文
获取外文期刊封面目录资料

摘要

This paper presents methods of simultaneously performing topic/keyword extraction and unsupervised classification for questions posted in community-based question answering services (CQA) or Q&A websites, using topic models and hybrid models. Large-scale experiments on two kinds of data, one called category data and the other called subtyping data, show the effectiveness of our methods. The purity and correct rate show that the topic models outperform clustering methods, hybrid models outperform topic models in question classification, and the adoption of term frequency-inverse document frequency is effective for the subtyping data. Manual evaluations with the extracted keywords show the effectiveness of the topic models in topic extraction.
机译:本文介绍了使用主题模型和混合模型同时对发布在基于社区的问答服务(CQA)或Q&A网站中的问题进行主题/关键字提取和无监督分类的方法。对两种数据(一种称为类别数据,另一种称为子类型数据)的大规模实验证明了我们方法的有效性。纯度和正确率表明主题模型优于聚类方法,混合模型在问题分类方面优于主题模型,并且采用词频-逆文档频率对子类型化数据有效。使用提取的关键字进行的手动评估显示了主题模型在主题提取中的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号