Christian Engelmann
Senior Scientist and Group Leader, Intelligent Systems and Facilities Research
Contact: engelmannc@ornl.gov | 865.574.3132

All Publications
Supporting the Development of Resilient Message Passing Applications using Simulation...
Scaling To A Million Cores And Beyond: Using Light-Weight Simulation to Understand The Challenges Ahead On The Road To Exasca...
A Runtime Environment for Supporting Research in Resilient HPC System Software & Tools...
Detection and Correction of Silent Data Corruption for Large-Scale High-Performance Computing...
Tools for Simulation and Benchmark Generation at Exascale...
Toward a Performance/Resilience Tool for Hardware/Software Co-Design of High-Performance Computing Systems...
Investigating Operating System Noise in Extreme-Scale High-Performance Computing Systems using Simulation...
Combining Partial Redundancy and Checkpointing for HPC...
A Tunable, Software-based DRAM Error Detection and Correction Library for HPC...
NVMalloc: Exposing an Aggregate SSD Store as a Memory Partition in Extreme-Scale Machines...
File I/O for MPI Applications in Redundant Execution Scenarios...
Proactive Process-Level Live Migration and Back Migration in HPC Environments...
Simulation of Large-Scale HPC Architectures...
A case for Virtual Machine based Fault Injection in a High-Performance Computing Environment...
xSim: The Extreme-Scale Simulator...
Redundant Execution of HPC Applications with MR-MPI...
Hybrid Checkpointing for MPI Jobs in HPC Environments...
Functional Partitioning to Optimize End-to-End Performance on Many-core Architectures...
Aggregation of Real-Time System Monitoring Data for Analyzing Large-Scale Parallel and Distributed Computing Environments...
Facilitating Co-Design for Extreme-Scale Systems Through Lightweight Simulation...
System-Level Virtualization Research at Oak Ridge National Laboratory...
Symmetric Active/Active Metadata Service for High Availability Parallel File Systems...
Nonparametric Multivariate Anomaly Analysis in Support of HPC Resilience...
Evaluating the Shared Root File System Approach for Diskless High-Performance Computing Systems...
High Performance Computing with Harness over InfiniBand...

Key Links
Curriculum Vitae, Google Scholar, ORCID, LinkedIn, Researcher Website, INTERSECT Initiative

Organizations
Computing and Computational Sciences Directorate
Computer Science and Mathematics Division
Advanced Computing Systems Research Section
Intelligent Systems and Facilities Group