Optimizing communication for a 2D-partitioned scalable BFS
Recent research projects have investigated partitioning, acceleration, and data reduction techniques for improving the performance of Breadth First Search (BFS) and the related HPC benchmark, Graph500. However, few implementations have focused on cloud-based systems like Amazon's Web Services,...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Chapter/Article Conference Paper |
| Language: | English |
| Published: |
2016
|
| In: |
2016 IEEE High Performance Extreme Computing Conference (HPEC)
Year: 2016, Pages: ? |
| Subjects: | |
| Online Access: | Verlag, Volltext: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7761596 Verlag, Volltext: http://ieeexplore.ieee.org/document/7761596/ |
| Author Notes: | J. Young, J. Romera, M. Hauck, H. Fröning |
| Summary: | Recent research projects have investigated partitioning, acceleration, and data reduction techniques for improving the performance of Breadth First Search (BFS) and the related HPC benchmark, Graph500. However, few implementations have focused on cloud-based systems like Amazon's Web Services, which differ from HPC systems in several ways, most importantly in terms of network interconnect. This work looks at optimizations to reduce the communication overhead of an accelerated, distributed BFS on an HPC system and a smaller cloud-like system that contains GPUs. We demonstrate the effects of an efficient 2D partitioning scheme and allreduce implementation, as well as different CPU-based compression schemes for reducing the overall amount of data shared between nodes. Timing and Score-P profiling results demonstrate a dramatic reduction in row and column frontier queue data (up to 91%) and show how compression can improve performance for a bandwidth-limited cluster. |
|---|---|
| Item Description: | Gesehen am 03.08.2017 |
| Physical Description: | Online Resource |
| ISBN: | 9781509035250 |