User Application Monitoring through Assessment of Abnormal Behaviours Recorded in RAS Logs... Conference Paper May, 2011
Memphis on an XT5: Pinpointing Memory Performance Problems on Cray Platforms... Conference Paper May, 2011
Preserving Collective Performance Across Process Failure for a Fault Tolerant MPI... Conference Paper May, 2011
Building a Fault Tolerant MPI Application: A Ring Communication Example... Conference Paper May, 2011
Providing Runtime Clock Synchronization With Minimal Node-to-Node Time Deviation on XT4s and XT5s... Conference Paper May, 2011
Time Utility Functions for Modeling and Evaluating Resource Allocations in a Heterogeneous Computing System... Journal May, 2011
A Technique for Moving Large Data Sets over High-Performance Long Distance Networks... Conference Paper May, 2011