2004 MCS Divisional Seminars & Colloquia

A Systemwide Performance Monitoring Tool for Large Linux Clusters

Abstract: NWPerf is a new system for analyzing fine-grained performance
metric data on large-scale supercomputing clusters. The tool measures
application efficiency systemwide, providing both a global view of the
system and a detailed view of individual applications, while minimizing its
impact on the performance of user applications. The talk will describe the
type of information that can be derived from the system and demonstrate how
the system was used to detect and eliminate a performance problem in an
application, improving that application's performance by up to several
thousand percent. The talk will also review the performance characteristics
of the large 1954-CPU production Linux cluster at PNNL.