Real Time Visualization Of Monitoring Data For Large Scale Hpc Systems

2015 IEEE International Conference on Cluster Computing(2015)

引用 2|浏览7
暂无评分
摘要
High Performance Computing (HPC) system users and administrators are often hampered in their ability understand application performance and system behavior due to a lack of sufficient information about how resources, such as memory, CPU, networks and filesystems are being used. While obtaining the related data is a necessary step, it is insufficient without tools that can turn the data into actionable information. Required capabilities of such tools are the ability to efficiently handle vast amounts of data in a timely fashion, the presentation of effective and understandable information representations for large node counts, and the correlation of that data with job and system events.This paper presents visualization approaches and tools that NCSA is developing, combined with the use of freely available web interfaces, to turn the eight billion platform related data points per day being collected from their 27,648 compute node Blue Waters platform into actionable information for both system administrators and users. Insights from the visualizations both at the system and the job levels are also presented.
更多
查看译文
关键词
Data visualization,resource monitoring
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要