Scalable Compression and Replay of Communication Traces in Massively Parallel Environments

Title: Scalable Compression and Replay of Communication Traces in Massively Parallel Environments
Author: Noeth, Michael James
Advisors: Dr. Tao Xie, Committee Member; Dr. Xiaosong Ma, Committee Member; Dr. Frank Mueller, Committee Chair
Abstract: Characterizing the communication behavior of large-scale applications is a difficult and costly task due to code and system complexity as well as the time to execute such codes. An alternative to running actual codes is to gather their communication traces and then replay them, which facilitates application tuning and future procurements. While past approaches lacked lossless scalable trace collection, we contribute an approach that provides near constant-size communication traces regardless of the number of nodes while preserving structural information. We introduce intra- and inter-node compression techniques of MPI events and present results of our implementation. Given this novel capability, we discuss its impact on communication tuning and beyond.
Date: 2006-10-02
Degree: MS
Discipline: Computer Science
URI: http://www.lib.ncsu.edu/resolver/1840.16/1116


Files in this item

File     Size      Format
etd.pdf  401.4 KB  PDF