| | |
Summary: PARALLEL NEURAL NETWORK TRAINING ON
MULTISPERT
PHILIPP F ¨
ARBER AND KRSTE ASANOVI '
C
International Computer Science Institute,
Berkeley, CA 94704
MultiSpert is a scalable parallel system built from multiple SpertII nodes which
we have constructed to speed error backpropagation neural network training for
speech recognition research. We present the MultiSpert hardware and software
architecture, and describe our implementation of two alternative parallelization
strategies for the backprop algorithm. We have developed detailed analytic models
of the two strategies which allow us to predict performance over a range of network
and machine parameters. The models' predictions are validated by measurements
for a prototype five node MultiSpert system. This prototype achieves a neural
network training performance of over 530 million connection updates per second
(MCUPS) while training a realistic speech application neural network. The model
predicts that performance will scale to over 800 MCUPS for eight nodes.
1 Background and Motivation
The SpertII board is a general purpose workstation accelerator based on the
|