Performance analysis of parallel gravitational N-body codes on large GPU cluster

We compare the performance of two very different parallel gravitational $N$-body codes for astrophysical simulations on large GPU clusters, both pioneer in their own fields as well as in certain mutual scales - NBODY6++ and Bonsai. We carry out the benchmark of the two codes by analyzing their perfo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Huang, Si-Yi (VerfasserIn) , Spurzem, Rainer (VerfasserIn) , Berczik, Peter (VerfasserIn)
Dokumenttyp: Article (Journal) Kapitel/Artikel
Sprache:Englisch
Veröffentlicht: 2015
In: Arxiv

Online-Zugang:Verlag, kostenfrei, Volltext: http://arxiv.org/abs/1508.02510
Volltext
Verfasserangaben:Siyi Huang, Rainer Spurzem, Peter Berczik

MARC

LEADER 00000caa a2200000 c 4500
001 1565132459
003 DE-627
005 20220814015532.0
007 cr uuu---uuuuu
008 171108s2015 xx |||||o 00| ||eng c
035 |a (DE-627)1565132459 
035 |a (DE-576)495132454 
035 |a (DE-599)BSZ495132454 
035 |a (OCoLC)1340981785 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 29  |2 sdnb 
100 1 |a Huang, Si-Yi  |e VerfasserIn  |0 (DE-588)1150788453  |0 (DE-627)1010994611  |0 (DE-576)468135162  |4 aut 
245 1 0 |a Performance analysis of parallel gravitational N-body codes on large GPU cluster  |c Siyi Huang, Rainer Spurzem, Peter Berczik 
264 1 |c 2015 
300 |a 15 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Gesehen am 08.11.2017 
520 |a We compare the performance of two very different parallel gravitational $N$-body codes for astrophysical simulations on large GPU clusters, both pioneer in their own fields as well as in certain mutual scales - NBODY6++ and Bonsai. We carry out the benchmark of the two codes by analyzing their performance, accuracy and efficiency through the modeling of structure decomposition and timing measurements. We find that both codes are heavily optimized to leverage the computational potential of GPUs as their performance has approached half of the maximum single precision performance of the underlying GPU cards. With such performance we predict that a speed-up of $200-300$ can be achieved when up to 1k processors and GPUs are employed simultaneously. We discuss the quantitative information about comparisons of two codes, finding that in the same cases Bonsai adopts larger time steps as well as relative energy errors than NBODY6++, typically ranging from $10-50$ times larger, depending on the chosen parameters of the codes. While the two codes are built for different astrophysical applications, in specified conditions they may overlap in performance at certain physical scale, and thus allowing the user to choose from either one with finetuned parameters accordingly. 
650 4 |a Astrophysics - Instrumentation and Methods for Astrophysics 
700 1 |a Spurzem, Rainer  |e VerfasserIn  |0 (DE-588)1019645636  |0 (DE-627)690981015  |0 (DE-576)35852086X  |4 aut 
700 1 |a Berczik, Peter  |d 1964-  |e VerfasserIn  |0 (DE-588)1020741473  |0 (DE-627)691330328  |0 (DE-576)361912587  |4 aut 
773 0 8 |i Enthalten in  |t Arxiv  |d Ithaca, NY : Cornell University, 1991  |g (2015) Artikel-Nummer 1508.02510, 15 Seiten  |h Online-Ressource  |w (DE-627)509006531  |w (DE-600)2225896-6  |w (DE-576)28130436X  |7 nnas  |a Performance analysis of parallel gravitational N-body codes on large GPU cluster 
773 1 8 |g year:2015  |g extent:15  |a Performance analysis of parallel gravitational N-body codes on large GPU cluster 
856 4 0 |u http://arxiv.org/abs/1508.02510  |x Verlag  |z kostenfrei  |3 Volltext 
951 |a AR 
992 |a 20171108 
993 |a Article 
998 |g 1020741473  |a Berczik, Peter  |m 1020741473:Berczik, Peter  |d 500000  |d 500881  |e 500000PB1020741473  |e 500881PB1020741473  |k 0/500000/  |k 1/500000/500881/  |p 3  |y j 
998 |g 1019645636  |a Spurzem, Rainer  |m 1019645636:Spurzem, Rainer  |d 700000  |d 714000  |d 714100  |e 700000PS1019645636  |e 714000PS1019645636  |e 714100PS1019645636  |k 0/700000/  |k 1/700000/714000/  |k 2/700000/714000/714100/  |p 2 
999 |a KXP-PPN1565132459  |e 2986759955 
BIB |a Y 
JSO |a {"note":["Gesehen am 08.11.2017"],"relHost":[{"part":{"extent":"15","text":"(2015) Artikel-Nummer 1508.02510, 15 Seiten","year":"2015"},"id":{"zdb":["2225896-6"],"eki":["509006531"]},"origin":[{"publisher":"Cornell University ; Arxiv.org","dateIssuedKey":"1991","dateIssuedDisp":"1991-","publisherPlace":"Ithaca, NY ; [Erscheinungsort nicht ermittelbar]"}],"disp":"Performance analysis of parallel gravitational N-body codes on large GPU clusterArxiv","recId":"509006531","title":[{"title":"Arxiv","title_sort":"Arxiv"}],"pubHistory":["1991 -"],"language":["eng"],"physDesc":[{"extent":"Online-Ressource"}],"type":{"bibl":"edited-book","media":"Online-Ressource"},"note":["Gesehen am 28.05.2024"],"titleAlt":[{"title":"Arxiv.org"},{"title":"Arxiv.org e-print archive"},{"title":"Arxiv e-print archive"},{"title":"De.arxiv.org"}]}],"physDesc":[{"extent":"15 S."}],"language":["eng"],"type":{"bibl":"chapter","media":"Online-Ressource"},"person":[{"given":"Si-Yi","role":"aut","family":"Huang","display":"Huang, Si-Yi"},{"display":"Spurzem, Rainer","family":"Spurzem","given":"Rainer","role":"aut"},{"display":"Berczik, Peter","given":"Peter","role":"aut","family":"Berczik"}],"title":[{"title":"Performance analysis of parallel gravitational N-body codes on large GPU cluster","title_sort":"Performance analysis of parallel gravitational N-body codes on large GPU cluster"}],"name":{"displayForm":["Siyi Huang, Rainer Spurzem, Peter Berczik"]},"recId":"1565132459","id":{"eki":["1565132459"]},"origin":[{"dateIssuedKey":"2015","dateIssuedDisp":"2015"}]} 
SRT |a HUANGSIYISPERFORMANC2015