TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML
- Authors:
-
- National Energy Research Scientific Computing CenterLawrence Berkeley National Laboratory Berkeley California
- Software and Services GroupIntel Corporation Moscow Russia
- Cray Programming Environments Performance EngineeringCray Inc Bloomington Minnesota
- Parallel Computing LabsIntel Corporation Karnataka India
- Data Center GroupIntel Corporation Hillsboro Oregon
- Publication Date:
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1479562
- Grant/Contract Number:
- AC02-05CH11231
- Resource Type:
- Publisher's Accepted Manuscript
- Journal Name:
- Concurrency and Computation. Practice and Experience
- Additional Journal Information:
- Journal Name: Concurrency and Computation. Practice and Experience; Journal ID: ISSN 1532-0626
- Publisher:
- Wiley Blackwell (John Wiley & Sons)
- Country of Publication:
- United Kingdom
- Language:
- English
Citation Formats
Kurth, Thorsten, Smorkalov, Mikhail, Mendygral, Peter, Sridharan, Srinivas, and Mathuriya, Amrita. TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML. United Kingdom: N. p., 2018.
Web. doi:10.1002/cpe.4989.
Kurth, Thorsten, Smorkalov, Mikhail, Mendygral, Peter, Sridharan, Srinivas, & Mathuriya, Amrita. TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML. United Kingdom. doi:10.1002/cpe.4989.
Kurth, Thorsten, Smorkalov, Mikhail, Mendygral, Peter, Sridharan, Srinivas, and Mathuriya, Amrita. Sun .
"TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML". United Kingdom. doi:10.1002/cpe.4989.
@article{osti_1479562,
title = {TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML},
author = {Kurth, Thorsten and Smorkalov, Mikhail and Mendygral, Peter and Sridharan, Srinivas and Mathuriya, Amrita},
abstractNote = {},
doi = {10.1002/cpe.4989},
journal = {Concurrency and Computation. Practice and Experience},
number = ,
volume = ,
place = {United Kingdom},
year = {2018},
month = {10}
}
Free Publicly Available Full Text
Publisher's Version of Record
DOI: 10.1002/cpe.4989
DOI: 10.1002/cpe.4989
Other availability
Cited by: 1 work
Citation information provided by
Web of Science
Web of Science
Save to My Library
You must Sign In or Create an Account in order to save documents to your library.
Works referenced in this record:
Backpropagation and stochastic gradient descent method
journal, June 1993
- Amari, Shun-ichi
- Neurocomputing, Vol. 5, Issue 4-5
Deep learning at 15PF: supervised and semi-supervised classification for scientific data
conference, January 2017
- Kurth, Thorsten; Smorkalov, Mikhail; Deslippe, Jack
- Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '17
DELPHES 3: a modular framework for fast simulation of a generic collider experiment
journal, February 2014
- de Favereau, J.; Delaere, C.; Demin, P.
- Journal of High Energy Physics, Vol. 2014, Issue 2