Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
An Evaluation of the TensorFlow Programming Model for Solving Traditional HPC Problems
KTH, School of Electrical Engineering and Computer Science (EECS), Computational Science and Technology (CST). KTH, School of Electrical Engineering and Computer Science (EECS), Centres, Centre for High Performance Computing, PDC.ORCID iD: 0000-0001-6408-3333
KTH, School of Electrical Engineering and Computer Science (EECS), Computational Science and Technology (CST). KTH, School of Electrical Engineering and Computer Science (EECS), Centres, Centre for High Performance Computing, PDC.
KTH, School of Electrical Engineering and Computer Science (EECS), Computational Science and Technology (CST). KTH, School of Electrical Engineering and Computer Science (EECS), Centres, Centre for High Performance Computing, PDC.ORCID iD: 0000-0003-0639-0639
KTH, School of Electrical Engineering and Computer Science (EECS), Computational Science and Technology (CST). KTH, School of Electrical Engineering and Computer Science (EECS), Centres, Centre for High Performance Computing, PDC.
Show others and affiliations
2018 (English)In: Proceedings of the 5th International Conference on Exascale Applications and Software, The University of Edinburgh , 2018, p. 34-Conference paper, Published paper (Refereed)
Abstract [en]

Computational intensive applications such as pattern recognition, and natural language processing, are increasingly popular on HPC systems. Many of these applications use deep-learning, a branch of machine learning, to determine the weights of artificial neural network nodes by minimizing a loss function. Such applications depend heavily on dense matrix multiplications, also called tensorial operations. The use of Graphics Processing Unit (GPU) has considerably speeded up deep-learning computations, leading to a Renaissance of the artificial neural network. Recently, the NVIDIA Volta GPU and the Google Tensor Processing Unit (TPU) have been specially designed to support deep-learning workloads. New programming models have also emerged for convenient expression of tensorial operations and deep-learning computational paradigms. An example of such new programming frameworks is TensorFlow, an open-source deep-learning library released by Google in 2015. TensorFlow expresses algorithms as a computational graph where nodes represent operations and edges between nodes represent data flow. Multi-dimensional data such as vectors and matrices which flows between operations are called Tensors. For this reason, computation problems need to be expressed as a computational graph. In particular, TensorFlow supports distributed computation with flexible assignment of operation and data to devices such as GPU and CPU on different computing nodes. Computation on devices are based on optimized kernels such as MKL, Eigen and cuBLAS. Inter-node communication can be through TCP and RDMA. This work attempts to evaluate the usability and expressiveness of the TensorFlow programming model for traditional HPC problems. As an illustration, we prototyped a distributed block matrix multiplication for large dense matrices which cannot be co-located on a single device and a Conjugate Gradient (CG) solver. We evaluate the difficulty of expressing traditional HPC algorithms using computational graphs and study the scalability of distributed TensorFlow on accelerated systems. Our preliminary result with distributed matrix multiplication shows that distributed computation on TensorFlow is extremely scalable. This study provides an initial investigation of new emerging programming models for HPC.

Place, publisher, year, edition, pages
The University of Edinburgh , 2018. p. 34-
Keywords [en]
TensorFlow, HPC, GPU
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-232985ISBN: 978-0-9926615-3-3 (electronic)OAI: oai:DiVA.org:kth-232985DiVA, id: diva2:1237206
Conference
International Conference on Exascale Applications and Software
Note

Published in Proceedings of the 5th International Conference on Exascale Applications and Software. Edinburgh: The University of Edinburgh (2018), ISBN: 978-0-9926615-3-3, pp.34, Published under license CC BY-ND 4.0.

Available from: 2018-08-07 Created: 2018-08-07 Last updated: 2019-05-10Bibliographically approved

Open Access in DiVA

fulltext(158 kB)179 downloads
File information
File name FULLTEXT01.pdfFile size 158 kBChecksum SHA-512
ec5a3954d940695fbb322c849ac3e868f1e615be910db767c12928c34e93fd34a42eb2e81fa790f3ef92425e2888fd9de172956b3c3f1518281e9ffeccb63a88
Type fulltextMimetype application/pdf

Other links

Conference webpage

Authority records BETA

Peng, Ivy Bo

Search in DiVA

By author/editor
Chien, Steven Wei DerSishtla, Chaitanya PrasadMarkidis, StefanoJun, ZhangPeng, Ivy BoLaure, Erwin
By organisation
Computational Science and Technology (CST)Centre for High Performance Computing, PDC
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 179 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 2485 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf