Artificial neural networks with millions of adjustable parameters and a comparable number of training examples are a potential solution to difficult, large-scale pattern recognition problems in areas such as speech and face recognition, classification of large volumes of web data, and finance. The bottleneck is that neural network training involves iterative gradient descent and is extremely computationally intensive. In this paper we present a technique for the distributed training of Ultra Large Scale Neural Networks (ULSNNs) on Bunyip, a Linux-based cluster of 196 Pentium III processors. To illustrate ULSNN training, we describe an experiment in which a neural network with 1.73 million adjustable parameters was trained to recognize machine-printed Japanese characters from a database containing 9 million training patterns. The training ran at an average sustained performance of 163.3 Gflops/s (single precision). At a machine cost of $150,913, this yields a price/performance ratio of 92.4¢/Mflops/s (single precision).
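The quoted price/performance ratio follows directly from the machine cost and the sustained throughput; a one-line check of the arithmetic (values taken from the abstract above):

```python
# Reproduce the abstract's price/performance figure.
machine_cost_dollars = 150_913
sustained_mflops = 163.3e3          # 163.3 Gflops/s expressed in Mflops/s

cents_per_mflops = machine_cost_dollars * 100 / sustained_mflops
print(round(cents_per_mflops, 1))   # 92.4 cents per Mflops/s
```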
Keywords: neural network, Linux cluster, matrix multiply
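The iterative gradient descent described above reduces, per minibatch, to a handful of large matrix multiplies, which is why the keywords pair neural networks with matrix multiplication and why single-precision Gflops/s is the relevant metric. A minimal NumPy sketch of one such training step (the layer sizes here are tiny stand-ins, not the paper's 1.73-million-parameter network, and the two-layer tanh/softmax architecture is assumed for illustration only):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy dimensions; the real network had 1.73 million
# adjustable parameters trained on 9 million character patterns.
n_in, n_hidden, n_out = 64, 32, 10
W1 = (0.1 * rng.standard_normal((n_in, n_hidden))).astype(np.float32)
W2 = (0.1 * rng.standard_normal((n_hidden, n_out))).astype(np.float32)

def forward(X):
    """Forward pass: two dense layers, tanh hidden units, softmax output."""
    H = np.tanh(X @ W1)
    logits = H @ W2
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return H, e / e.sum(axis=1, keepdims=True)

def sgd_step(X, Y, lr=0.1):
    """One minibatch gradient-descent step.  The matrix products
    (X @ W1, H @ W2, and the two gradient products below) are the
    flops-dominant kernels that a cluster implementation distributes."""
    global W1, W2
    H, P = forward(X)
    G = (P - Y) / len(X)                    # softmax cross-entropy gradient
    dW2 = H.T @ G
    dW1 = X.T @ ((G @ W2.T) * (1.0 - H**2)) # backprop through tanh
    W2 -= lr * dW2
    W1 -= lr * dW1

# Toy minibatch: random inputs, random one-hot labels.
X = rng.standard_normal((8, n_in)).astype(np.float32)
Y = np.eye(n_out, dtype=np.float32)[rng.integers(0, n_out, 8)]
sgd_step(X, Y)
```

Repeating `sgd_step` on successive minibatches is the iterative loop whose cost motivates the distributed implementation; at scale, each of the four matrix products becomes a call to an optimized single-precision GEMM kernel.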
Ultra Large Scale Neural Network Training at 92¢/Mflops/s on a PIII Cluster