首页> 外文会议>Applied Computational Intelligence and Informatics (SACI), 2012 7th IEEE International Symposium on >Cloud-based classification of text documents using the Gridgain platform
【24h】

Cloud-based classification of text documents using the Gridgain platform

机译:使用Gridgain平台基于云的文本文档分类

获取原文
获取原文并翻译 | 示例

摘要

Motivation for the research effort presented in this paper is to use the cloud computing storage and computational capabilities for text mining tasks. Cloud computing is nowadays favored approach in area of data- analysis and related fields by providing data storage and computational capabilities as the services. Main aim of our research activities is to design and develop experimental cloud platform for text mining tasks. In this particular paper we describe the design and implementation of a distributed tree-based algorithm for text categorization purposes. We used our own implementation of decision tree classification algorithm and used Gridgain framework for its cloud implementation. Cloud also provides storage services for handling large data collections as well as increases computational effectiveness as the algorithm is implemented in distributed fashion. We describe the experiments we have performed on the private cloud using the two datasets and analyze the results.
机译:本文提出的研究目的是为了将云计算存储和计算功能用于文本挖掘任务。如今,通过提供数据存储和计算功能作为服务,云计算已成为数据分析和相关领域中的首选方法。我们研究活动的主要目的是设计和开发用于文本挖掘任务的实验性云平台。在这篇特别的论文中,我们描述了用于文本分类的基于分布式树的算法的设计和实现。我们使用自己的决策树分类算法实现,并使用Gridgain框架进行云实现。云还提供用于处理大型数据收集的存储服务,并且由于该算法以分布式方式实现,因此提高了计算效率。我们使用两个数据集描述了我们在私有云上执行的实验并分析了结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号