Semantic Enriched Short Text Clustering

机译：语义丰富的短文本聚类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper is devoted to the issue of clustering short texts, which are free answers gathered during brain storming seminars. Those answers are short, often incomplete, and highly biased toward the question, so establishing a notion of proximity between texts is a challenging task. In addition, the number of answers is counted up to hundred instances, which causes sparsity. We present three text clustering methods in order to choose the best one for this specific task, then we show how the method can be improved by a semantic enrichment, including neural-based distributional models and external knowledge resources. The algorithms have been evaluated on the unique seminar's data sets.

机译：本文致力于聚类短信问题，这是在脑势袭击研讨会期间收集的免费答案。那些答案很短，通常不完整，高度偏向的问题，因此在文本之间建立近距离的概念是一个具有挑战性的任务。此外，答案次数计入百次，导致稀疏性。我们提出了三种文本聚类方法，以便为此特定任务选择最佳选择，然后我们展示了如何通过语义富集来提高该方法，包括基于神经的分布模型和外部知识资源。已经在唯一的研讨会的数据集上进行了评估了该算法。

著录项

来源
《International Symposium on Methodologies for Intelligent Systems》|2017年|747p|共11页
会议地点
作者
Marek Kozlowski; Henryk Rybinski;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Document clustering; Information retrieval; Semantic enrichment;

机译：文档聚类;信息检索;语义富集;

相似文献

外文文献
中文文献
专利

1. Clustering of semantically enriched short texts [J] . Kozlowski Marek, Rybinski Henryk Journal of Intelligent Information Systems . 2019,第1期

机译：语义丰富的短文本的聚类
2. Clustering of semantically enriched short texts [J] . Kozlowski Marek, Rybinski Henryk Journal of Intelligent Information Systems . 2019,第1期

机译：聚类语义丰富的短文本
3. Understanding Short Texts through Semantic Enrichment and Hashing [J] . Yu Zheng, Wang Haixun, Lin Xuemin, Knowledge and Data Engineering, IEEE Transactions on . 2016,第2期

机译：通过语义丰富和散列理解短文本
4. Semantic Enriched Short Text Clustering [C] . Marek Kozlowski, Henryk Rybinski International symposium on methodologies for intelligent systems . 2017

机译：语义丰富的短文本聚类
5. Semantic preserving text representation and its applications in text clustering. [D] . Howard, Michael. 2012

机译：语义保留文本表示及其在文本聚类中的应用。
6. ADEPt, a semantically-enriched pipeline for extracting adverse drug events from free-text electronic health records [O] . Ehtesham Iqbal, Robbie Mallah, Daniel Rhodes, 2011

机译：ADEPt，一种语义丰富的管道，用于从自由文本电子健康记录中提取不良药物事件
7. Text mining with semantic annotation : using enriched text representation for entity-oriented retrieval, semantic relation identification and text clustering [O] . Hou Jun 2014

机译：具有语义注释的文本挖掘：使用丰富的文本表示法进行面向实体的检索，语义关系识别和文本聚类

Semantic Enriched Short Text Clustering

摘要

著录项

相似文献

相关主题

期刊订阅