Skip to main content
SHARE
Publication

End-to-End Data Movement Using MPI-IO Over Routed Terabots Infrastructures...

by Geoffroy R Vallee, Edward S Atchley, Youngjae Kim, Galen M Shipman
Publication Type
Conference Paper
Publication Date
Conference Name
3rd IEEE/ACM International Workshop on Network-aware Data Management (NDM 2013)
Conference Location
Denver, Colorado, United States of America
Conference Sponsor
IEEE/ACM
Conference Date

Scientific discovery is nowadays driven by large-scale simulations running on massively parallel high-performance computing (HPC) systems. These applications each generate a large amount of data, which then needs to be post-processed for example for data mining or visualization. Unfortunately, the computing platform used for post processing might be different from the one on which the data is initially generated, introducing the challenge of moving large amount of data between computing platforms. This is especially challenging when these two platforms are geographically
separated since the data needs to be moved between computing facilities. This is even more critical when scientists tightly couple their domain specific applications with a post processing application.

The paper presents a solution for the data transfer between MPI applications
using a dedicated wide area network (WAN) terabit infrastructure. The proposed solution is based on parallel access to data files and the Message Passing Interface (MPI) over the Common Communication Infrastructure (CCI) for the data transfer over a routed infrastructure. In the context of this research, the Energy Sciences Network (ESnet) of the U.S. Department of Energy (DOE) is targeted for the transfer of data between DOE national laboratories.