【24h】

Semantic Enriched Short Text Clustering

机译:语义丰富的短文本聚类

获取原文

摘要

The paper is devoted to the issue of clustering short texts, which are free answers gathered during brain storming seminars. Those answers are short, often incomplete, and highly biased toward the question, so establishing a notion of proximity between texts is a challenging task. In addition, the number of answers is counted up to hundred instances, which causes sparsity. We present three text clustering methods in order to choose the best one for this specific task, then we show how the method can be improved by a semantic enrichment, including neural-based distributional models and external knowledge resources. The algorithms have been evaluated on the unique seminar's data sets.
机译:本文致力于聚类短信问题,这是在脑势袭击研讨会期间收集的免费答案。那些答案很短,通常不完整,高度偏向的问题,因此在文本之间建立近距离的概念是一个具有挑战性的任务。此外,答案次数计入百次,导致稀疏性。我们提出了三种文本聚类方法,以便为此特定任务选择最佳选择,然后我们展示了如何通过语义富集来提高该方法,包括基于神经的分布模型和外部知识资源。已经在唯一的研讨会的数据集上进行了评估了该算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号