Unifying Data, Model and Hybrid Parallelism in Deep...

Articles

Scholar

Unifying data, model and hybrid parallelism in deep learning via tensor tiling

M Wang, C Huang, J Li - arXiv preprint arXiv:1805.04170, 2018 - arxiv.org

Deep learning systems have become vital tools across many fields, but the increasing model
sizes mean that training must be accelerated to maintain such systems' utility. Current
systems like Tensorflow and MXNet focus on one specific parallelization strategy, data
parallelism, which requires large training batch sizes in order to scale. We cast the problem
of finding the best parallelization strategy as the problem of finding the best tiling to partition
tensors with the least overall communication. We propose an algorithm that can find the …

Save Cite Cited by 30 Related articles All 2 versions View as HTML

[CITATION][C] Unifying Data, Model and Hybrid Parallelism in Deep Learning via Tensor Tiling. CoRR abs/1805.04170 (2018)

M Wang, CC Huang, J Li - arXiv preprint arXiv:1805.04170, 2018

Save Cite Cited by 2 Related articles

Showing the best results for this search. See all results

Cite

Advanced search

Saved to My library

Unifying data, model and hybrid parallelism in deep learning via tensor tiling

[CITATION][C] Unifying Data, Model and Hybrid Parallelism in Deep Learning via Tensor Tiling. CoRR abs/1805.04170 (2018)