【24h】

MapReduce : Summarization Design Patterns for Processing Kernel Functions

机译:MapReduce:用于处理内核功能的摘要设计模式

获取原文

摘要

There is a growing need for pattern analysis algorithms on datasets to extract and analyses information. As datasets grow in size for applications such as topic modeling, recommender systems and internet search queries, there is a need for scalable implementations of pattern analysis algorithms require manual tuning on specialized hardware and methods to parallelize individual learning algorithms on a cluster on a cluster of machines. Practically, MapReduce provides an active device for dripping huge amount of data glitches. But then again, beyond that, MapReduce is vital in how it has changed the way of organization computations at a enormous scale. MapReduce characterizes the first widely accepted step away procedure of von Neumann prototype, which is called bridging model. A Bridging is a conceptual bridge between the physical execution of a hardware and software which has to be executed on that machine. Planning a data through nonlinear function into a appropriate characteristics space allows the use of the similar tools for finding nonlinear patterns. Kernels can make it feasible to use multi-dimensional feature space. The in detailed computation of the feature mapping are avoided. The proposed model of MapReduce Pattern design is done with Hadoop cluster. Algorithm of pattern analysis will take finite sample data from source and pattern may be anything like text, audio, video files. But here only text files are considered. The output of a detected regularity or pattern function will be displayed. The main three features such as efficiency, robustness and stability are expected to expose. The performance of an algorithm to noise in the training examples is inferred by computational efficiency. Summarization kernel function helps to find top view by summing data and grouping data. This helps to generate a performance profile in terms of computations, algorithms and Kernel functions.
机译:越来越需要在数据集上进行模式分析算法以提取和分析信息。随着用于主题建模,推荐系统和互联网搜索查询等应用的数据集规模的增长,需要模式分析算法的可扩展实现,这些算法需要在专用硬件和方法上进行手动调整,以并行化集群中集群上的单个学习算法。机器。实际上,MapReduce提供了一种主动设备来滴入大量数据故障。但是,除此之外,MapReduce对于它如何大规模改变组织计算方式至关重要。 MapReduce表征了冯·诺伊曼(von Neumann)原型机的第一个被广泛接受的步进程序,称为桥接模型。桥接是硬件的物理执行和必须在该计算机上执行的软件之间的概念性桥梁。通过非线性函数将数据计划到适当的特征空间中,可以使用类似的工具来查找非线性模式。内核可以使使用多维特征空间变得可行。避免了特征映射的详细计算。所提出的MapReduce模式设计模型是通过Hadoop集群完成的。模式分析算法将从源中获取有限的样本数据,模式可能是文本,音频,视频文件之类的东西。但是这里只考虑文本文件。将显示检测到的规律性或图案功能的输出。效率,鲁棒性和稳定性等主要三大特征有望暴露出来。通过计算效率可以推断出训练示例中针对噪声的算法的性能。汇总内核功能有助于通过汇总数据和分组数据来查找顶视图。这有助于根据计算,算法和内核功能生成性能概况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号