PerfMiner: Cluster-wide collection, storage and presentation of application level hardware performance data
2005 (English). In: EURO-PAR 2005 PARALLEL PROCESSING, PROCEEDINGS / [ed] Cunha, JC; Medeiros, PD, 2005, Vol. 3648, 124-133 p. Conference paper (Refereed)
We present PerfMiner, a system for the transparent collection, storage and presentation of thread-level hardware performance data across an entire cluster. Every sub-process/thread spawned by the user through the batch system is measured with near-zero overhead and no dilation of run-time. Performance metrics are collected at the thread level using a tool built on top of the Performance Application Programming Interface (PAPI). As the hardware counters are virtualized by the OS, the resulting counts are largely unaffected by other kernel or user processes. PerfMiner correlates this performance data with metadata from the batch system and places it in a database. Through a command line and web interface, the user can query the database to report information on everything from overall workload characterization and system utilization to the performance of a single thread in a specific application. This is in contrast to other monitoring systems that report aggregate system-wide metrics sampled over a period of time. In this paper, we describe our implementation of PerfMiner and present some results from the test deployment of PerfMiner across three different clusters at the Center for Parallel Computers at The Royal Institute of Technology in Stockholm, Sweden.
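The idea of correlating per-thread counter values with batch-system metadata in a database can be illustrated with a small sketch. This is not PerfMiner's actual schema or code: the table names (`jobs`, `thread_counts`), the column names, the sample rows, and the helper functions are all hypothetical, and SQLite stands in for whatever database the real system uses. The sketch only shows how one store could answer both a single-thread query and a workload-level query.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical two-table layout."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Batch-system metadata (hypothetical columns).
        CREATE TABLE jobs (
            job_id      INTEGER PRIMARY KEY,
            username    TEXT,
            application TEXT
        );
        -- One row per measured thread (hypothetical columns).
        CREATE TABLE thread_counts (
            job_id       INTEGER REFERENCES jobs(job_id),
            host         TEXT,
            thread_id    INTEGER,
            cycles       INTEGER,
            instructions INTEGER
        );
        """
    )
    # Made-up sample data, for illustration only.
    conn.executemany(
        "INSERT INTO jobs VALUES (?, ?, ?)",
        [(1, "alice", "solver"), (2, "bob", "render")],
    )
    conn.executemany(
        "INSERT INTO thread_counts VALUES (?, ?, ?, ?, ?)",
        [
            (1, "node01", 100, 2000, 1000),
            (1, "node01", 101, 4000, 1000),
            (2, "node02", 200, 1000, 2000),
        ],
    )
    return conn


def ipc_per_thread(conn, job_id):
    """Instructions per cycle for each thread of one job."""
    cur = conn.execute(
        "SELECT thread_id, CAST(instructions AS REAL) / cycles "
        "FROM thread_counts WHERE job_id = ? ORDER BY thread_id",
        (job_id,),
    )
    return cur.fetchall()


def ipc_per_application(conn):
    """Aggregate instructions per cycle for each application across all jobs."""
    cur = conn.execute(
        "SELECT j.application, "
        "       CAST(SUM(t.instructions) AS REAL) / SUM(t.cycles) "
        "FROM thread_counts t JOIN jobs j ON j.job_id = t.job_id "
        "GROUP BY j.application ORDER BY j.application"
    )
    return cur.fetchall()
```

Calling `ipc_per_thread(conn, 1)` on the demo database gives the thread-level view of one job, while `ipc_per_application(conn)` joins against the job metadata to give a per-application summary.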
Place, publisher, year, edition, pages
2005. Vol. 3648, 124-133 p.
LECTURE NOTES IN COMPUTER SCIENCE, ISSN 0302-9743 ; 3648
Computer and Information Science
Identifiers
URN: urn:nbn:se:kth:diva-42722
ISI: 000232259500017
ScopusID: 2-s2.0-27144432473
ISBN: 3-540-28700-0
OAI: oai:DiVA.org:kth-42722
DiVA: diva2:448095
11th International Euro-Par Conference. Location: Lisbon, PORTUGAL. Date: AUG 30-SEP 02, 2005
QC 20111014
2011-10-14, 2011-10-12, 2011-10-14
Bibliographically approved