首页> 外文会议>2012 Seventh Open Cirrus Summit >Design and Implementation of Parallel Term Contribution Algorithm Based on Mapreduce Model
【24h】

Design and Implementation of Parallel Term Contribution Algorithm Based on Mapreduce Model

机译:基于Mapreduce模型的并行项贡献算法的设计与实现

获取原文
获取原文并翻译 | 示例

摘要

MapReduce is a software framework introduced byGoogle in 2004 to support distributed computing on large datasets on clusters of computers. The term contribution(TC)algorithm is a relatively new algorithm in text mining to selectfeatures for clustering. In this paper, we design and implement a parallel term contribution(PTC) algorithm based on MapReduce model. By experiment, we come to the conclusion that the performance of TC is greatly enhanced using MapReduce framework.
机译:MapReduce是Google在2004年推出的一种软件框架,用于支持对计算机集群上的大型数据集进行分布式计算。术语贡献(TC)算法是文本挖掘中相对较新的算法,用于选择用于聚类的功能。本文基于MapReduce模型设计并实现了并行项贡献(PTC)算法。通过实验,我们得出的结论是,使用MapReduce框架大大提高了TC的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号