首页> 外文会议>Advances in information retrieval theory >Avoiding Bias in Text Clustering Using Constrained K-means and May-Not-Links

【24h】

Avoiding Bias in Text Clustering Using Constrained K-means and May-Not-Links

机译：使用约束K均值和May-Not-Links避免文本聚类中的偏差

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present a new clustering algorithm which extends the traditional batch k-means enabling the introduction of domain knowledge in the form of Must, Cannot, May and May-Not rules between the data points. Besides, we have applied the presented method to the task of avoiding bias in clustering. Evaluation carried out in standard collections showed considerable improvements in effectiveness against previous constrained and non-constrained algorithms for the given task.

机译：在本文中，我们提出了一种新的聚类算法，该算法扩展了传统的批处理k均值，从而可以在数据点之间以Must，Cannot，May和May-Not规则的形式引入领域知识。此外，我们将提出的方法应用于避免聚类偏差的任务。在标准集合中进行的评估显示，针对给定任务，与以前的约束和非约束算法相比，有效性有显着提高。

著录项

来源
《Advances in information retrieval theory 》|2009年|322-329|共8页
会议地点 Cambridge(GB);Cambridge(GB)
作者
M. Eduardo Ares; Javier Parapar; Alvaro Barreiro;
展开▼
作者单位

IRLab, Department of Computer Science, University of A Coruna, Spain;

IRLab, Department of Computer Science, University of A Coruna, Spain;

IRLab, Department of Computer Science, University of A Coruna, Spain;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工） ;
关键词

相似文献

外文文献
中文文献
专利

1. DIC-DOC-K-means: Dissimilarity-based Initial Centroid selection for DOCument clustering using K-means for improving the effectiveness of text document clustering [J] . Lakshmi R., Baskar S. Journal of Information Science . 2019 ,第6期

机译：DIC-DOC-K-means：使用K-means的DOCument聚类基于不相似性的初始质心选择，以提高文本文档聚类的效率
2. An Improved Clustering Algorithm for Text Mining: Multi-Cluster Spherical K-Means [J] . Tunali Volkan, Bilgin Turgay, Camurcu Ah The international arab journal of information technology . 2016 ,第1期

机译：一种改进的文本挖掘聚类算法：多簇球形K-均值
3. A Robust k-Means Type Algorithm for Soft Subspace Clustering and Its Application to Text Clustering [J] . Tiantian Yang, Jun Wang Journal of software . 2014 ,第8期

机译：一种用于软子空间聚类的鲁棒k均值类型算法及其在文本聚类中的应用
4. Avoiding Bias in Text Clustering Using Constrained K-means and May-Not-Links [C] . M. Eduardo Ares, Javier Parapar, Alvaro Barreiro International Conference on the Theory of Information Retrieval . 2009

机译：使用受限k-means和May-Not-Links避免文本聚类中的偏差
5. Evaluation of Text Document Clustering Using k-Means [D] . Beumer, Lisa. 2020

机译：使用K-Means的文本文档聚类评估
6. Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm [O] . Yaofang Xu, Jiayi Wu, Chang-Cheng Yin, 2011

机译：自适应约束K均值算法的无监督Cryo-EM数据聚类
7. Avoiding Bias in Text Clustering Using Constrained K-means and May-Not-Links [O] . M. Eduardo Ares, Javier Parapar, Álvaro Barreiro 2010

机译：使用约束K均值和May-Not-Links避免文本聚类中的偏差

Avoiding Bias in Text Clustering Using Constrained K-means and May-Not-Links

摘要

著录项

相似文献

相关主题

期刊订阅