Development and implementation of a temperature monitoring system for HPC systems

Abstract: In the context of high-performance computing (HPC), the removal of released heat is one challenging topic due to the continuously increasing density of computing power. A temperature monitoring system provides insight into the heat development of an HPC cluster. The effectiveness of this i...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Baumann, Martin (VerfasserIn) , Gebhart, Fabian (VerfasserIn) , Mattes, Oliver (VerfasserIn) , Nikas, Sotirios (VerfasserIn) , Heuveline, Vincent (VerfasserIn)
Dokumenttyp: Buch/Monographie
Sprache:Englisch
Veröffentlicht: Heidelberg Univ.-Bibliothek 2017
Schriftenreihe:Preprint series of the Engineering Mathematics and Computing Lab (EMCL) Preprint no. 2017-07
In: Preprint series of the Engineering Mathematics and Computing Lab (EMCL) (Preprint no. 2017-07)

DOI:10.11588/emclpp.2017.7.43398
Online-Zugang:Verlag, kostenfrei, Volltext: https://doi.org/10.11588/emclpp.2017.7.43398
Verlag, kostenfrei, Volltext: http://nbn-resolving.de/urn:nbn:de:bsz:16-emclpp-433989
Volltext
Verfasserangaben:Martin Baumann, Fabian Gebhart, Oliver Mattes, Sotirios Nikas, Vincent Heuveline

MARC

LEADER 00000cam a2200000 c 4500
001 1657399389
003 DE-627
005 20180208105046.0
007 cr uuu---uuuuu
008 171212s2017 xx |||||o 00| ||eng c
024 7 |a urn:nbn:de:bsz:16-emclpp-433989  |2 urn 
024 7 |a 10.11588/emclpp.2017.7.43398  |2 doi 
035 |a (DE-627)1657399389 
035 |a (DE-576)496316265 
035 |a (DE-599)BSZ496316265 
035 |a (OCoLC)1018241482 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 28  |2 sdnb 
100 1 |a Baumann, Martin  |e VerfasserIn  |0 (DE-588)1020895454  |0 (DE-627)691398887  |0 (DE-576)358440009  |4 aut 
245 1 0 |a Development and implementation of a temperature monitoring system for HPC systems  |c Martin Baumann, Fabian Gebhart, Oliver Mattes, Sotirios Nikas, Vincent Heuveline 
264 1 |a Heidelberg  |b Univ.-Bibliothek  |c 2017 
300 |a 1 Online-Ressource (18 Seiten) 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
490 1 |a Preprint series of the Engineering Mathematics and Computing Lab (EMCL)  |v Preprint no. 2017-07 
500 |a Gesehen am 12.12.2017 
520 |a Abstract: In the context of high-performance computing (HPC), the removal of released heat is one challenging topic due to the continuously increasing density of computing power. A temperature monitoring system provides insight into the heat development of an HPC cluster. The effectiveness of this is directly related to the number of sensors, their placing and the accuracy of the temperature measurements. Monitoring is important not only to investigate the efficiency of the cooling system for purposes of detecting defective operation of the HPC system, but also to improve the cooling of the servers and by this the achievable performance. The main purpose of a fine-grained and unified temperature monitoring is the possibility to optimize the applications and their execution regarding the temperature spreading on HPC systems. Based on this, we present a highly flexible and scalable – in terms of cable length and number of sensors – and at the same time budget-friendly monitoring infrastructure. It is based on low-cost components such as Raspberry Pi as monitoring client and a setup using the DS18B20 digital thermometer as temperature sensor. Focus is given on the selection of adequate temperature sensors and we explain in detail how the sensors are assembled and the quality assurance is done before these are used in the monitoring setup. Keywords: temperature monitoring; HPC monitoring; energy efficiency 
650 0 7 |0 (DE-588)4532701-4  |0 (DE-627)265680166  |0 (DE-576)21342469X  |a Hochleistungsrechnen  |2 gnd 
650 0 7 |0 (DE-588)7660153-5  |0 (DE-627)60111941X  |0 (DE-576)307187136  |a Energieeffizienz  |2 gnd 
650 4 |a Temperature monitoring 
650 4 |a HPC monitoring 
650 4 |a Energy efficiency 
700 1 |a Gebhart, Fabian  |e VerfasserIn  |0 (DE-588)1148481745  |0 (DE-627)1008751774  |0 (DE-576)496317148  |4 aut 
700 1 |a Mattes, Oliver  |e VerfasserIn  |0 (DE-588)1148481966  |0 (DE-627)1008751820  |0 (DE-576)496317113  |4 aut 
700 1 |a Nikas, Sotirios  |e VerfasserIn  |0 (DE-588)1148482083  |0 (DE-627)1008751863  |0 (DE-576)496317229  |4 aut 
700 1 |a Heuveline, Vincent  |d 1968-  |e VerfasserIn  |0 (DE-588)1046579266  |0 (DE-627)776691880  |0 (DE-576)399904727  |4 aut 
810 2 |a Engineering Mathematics and Computing Lab  |t Preprint series of the Engineering Mathematics and Computing Lab (EMCL)  |v Preprint no. 2017-07  |9 2017,7  |w (DE-627)776852515  |w (DE-576)399725873  |w (DE-600)2750748-8  |x 2191-0693  |7 am 
856 4 0 |u https://doi.org/10.11588/emclpp.2017.7.43398  |x Verlag  |x Resolving-System  |z kostenfrei  |3 Volltext 
856 4 0 |u http://nbn-resolving.de/urn:nbn:de:bsz:16-emclpp-433989  |x Verlag  |z kostenfrei  |3 Volltext 
951 |a BO 
992 |a 20171212 
993 |a Book 
994 |a 2017 
998 |g 1046579266  |a Heuveline, Vincent  |m 1046579266:Heuveline, Vincent  |d 700000  |d 708000  |e 700000PH1046579266  |e 708000PH1046579266  |k 0/700000/  |k 1/700000/708000/  |p 5  |y j 
998 |g 1148482083  |a Nikas, Sotirios  |m 1148482083:Nikas, Sotirios  |d 700000  |d 704000  |e 700000PN1148482083  |e 704000PN1148482083  |k 0/700000/  |k 1/700000/704000/  |p 4 
998 |g 1148481966  |a Mattes, Oliver  |m 1148481966:Mattes, Oliver  |d 700000  |d 704000  |e 700000PM1148481966  |e 704000PM1148481966  |k 0/700000/  |k 1/700000/704000/  |p 3 
998 |g 1148481745  |a Gebhart, Fabian  |m 1148481745:Gebhart, Fabian  |d 700000  |d 704000  |e 700000PG1148481745  |e 704000PG1148481745  |k 0/700000/  |k 1/700000/704000/  |p 2 
998 |g 1020895454  |a Baumann, Martin  |m 1020895454:Baumann, Martin  |d 700000  |d 704000  |e 700000PB1020895454  |e 704000PB1020895454  |k 0/700000/  |k 1/700000/704000/  |p 1  |x j 
999 |a KXP-PPN1657399389  |e 3395882055 
BIB |a Y 
JSO |a {"relMultPart":[{"title":[{"title":"Preprint series of the Engineering Mathematics and Computing Lab (EMCL)","title_sort":"Preprint series of the Engineering Mathematics and Computing Lab (EMCL)"}],"physDesc":[{"extent":"Online-Ressource"}],"id":{"zdb":["2750748-8"],"eki":["776852515"],"issn":["2191-0693"]},"origin":[{"dateIssuedKey":"2009","publisherPlace":"Heidelberg","publisher":"Univ.-Bibliothek","dateIssuedDisp":"2009-"}],"recId":"776852515","disp":"Preprint series of the Engineering Mathematics and Computing Lab (EMCL)","type":{"media":"Online-Ressource","bibl":"serial"},"corporate":[{"role":"aut","display":"Engineering Mathematics and Computing Lab"}],"language":["eng"],"part":{"number_sort":["2017,7"],"number":["Preprint no. 2017-07"]},"dispAlt":"Engineering Mathematics and Computing Lab: Preprint series of the Engineering Mathematics and Computing Lab (EMCL)","pubHistory":["2009 -"]}],"name":{"displayForm":["Martin Baumann, Fabian Gebhart, Oliver Mattes, Sotirios Nikas, Vincent Heuveline"]},"language":["eng"],"note":["Gesehen am 12.12.2017"],"person":[{"role":"aut","family":"Baumann","given":"Martin","display":"Baumann, Martin"},{"family":"Gebhart","role":"aut","display":"Gebhart, Fabian","given":"Fabian"},{"display":"Mattes, Oliver","given":"Oliver","family":"Mattes","role":"aut"},{"given":"Sotirios","display":"Nikas, Sotirios","role":"aut","family":"Nikas"},{"family":"Heuveline","role":"aut","given":"Vincent","display":"Heuveline, Vincent"}],"recId":"1657399389","id":{"uri":["urn:nbn:de:bsz:16-emclpp-433989"],"eki":["1657399389"],"doi":["10.11588/emclpp.2017.7.43398"]},"physDesc":[{"extent":"1 Online-Ressource (18 Seiten)"}],"title":[{"title_sort":"Development and implementation of a temperature monitoring system for HPC systems","title":"Development and implementation of a temperature monitoring system for HPC systems"}],"type":{"media":"Online-Ressource","bibl":"book"},"origin":[{"publisher":"Univ.-Bibliothek","dateIssuedDisp":"2017","dateIssuedKey":"2017","publisherPlace":"Heidelberg"}]} 
SRT |a BAUMANNMARDEVELOPMEN2017