首页> 外文会议>International Conference on Information Reuse and Integration for Data Science >Data Clustering Using Online Variational Learning of Finite Scaled Dirichlet Mixture Models
【24h】

Data Clustering Using Online Variational Learning of Finite Scaled Dirichlet Mixture Models

机译:使用在线变分学习有限缩放的Dirichlet混合模型的数据聚类

获取原文

摘要

With a massive amount of data created on a daily basis, the ubiquitous demand for data analysis is obvious. Recent development of technology has made machine learning techniques applicable to various problems. In this paper, we emphasize on cluster analysis, an important aspect of data analysis. In other words, being able to automatically discover different groups containing similar data is crucial for further information retrieving and anomaly detection tasks. Thus, we propose an online variational inference framework for finite Scaled Dirichlet mixture models. By efficiently handling large scale data, online approach is capable of enhancing the scalability of finite mixture models for demanding applications in real time. The proposed method can simultaneously update the model's parameters and determine the optimal number of components without the complex computation of conventional Bayesian algorithm. The effectiveness of our model is affirmed with challenging problems including spam detection and image clustering.
机译:随着每天创建的大量数据,对数据分析的无处不在的需求是显而易见的。最近的技术发展使机器学习技术适用于各种问题。在本文中,我们强调集群分析,是数据分析的一个重要方面。换句话说,能够自动发现包含类似数据的不同组对于进一步的信息检索和异常检测任务至关重要。因此,我们提出了一种用于有限缩放的Dirichlet混合模型的在线变分推理框架。通过有效处理大规模数据,在线方法能够实时提高有限混合模型的可扩展性,实时提高苛刻应用。所提出的方法可以同时更新模型的参数,并在没有传统贝叶斯算法的复杂计算的情况下确定最佳数量的组件。我们的模型的有效性是肯定的具有挑战性的问题,包括垃圾邮件检测和图像聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号