Conference: IEEE International Symposium on Applied Computational Intelligence and Informatics

Cloud-based classification of text documents using the Gridgain platform


Abstract

The motivation for the research presented in this paper is to use the storage and computational capabilities of cloud computing for text mining tasks. Cloud computing is now a favored approach in data analysis and related fields because it provides data storage and computational capabilities as services. The main aim of our research is to design and develop an experimental cloud platform for text mining tasks. In this paper we describe the design and implementation of a distributed tree-based algorithm for text categorization. We used our own implementation of a decision tree classification algorithm and the Gridgain framework for its cloud implementation. The cloud also provides storage services for handling large data collections and increases computational efficiency, since the algorithm is implemented in a distributed fashion. We describe the experiments we performed on a private cloud using two datasets and analyze the results.
