A Flexible-blocking Based Approach for Performance Tuning of Matrix Multiplication Routines for Large Matrices with Edge Case... Conference Paper December, 2018
SORA: Scalable Overlap-graph Reduction Algorithms for Genome Assembly using Apache Spark in the Cloud... Conference Paper December, 2018
Exploring flexible communications for streamlining DNN ensemble training pipelines... Conference Paper November, 2018
167-PFlops deep learning for electron microscopy: from learning physics to atomic manipulation... Conference Paper November, 2018
Coupling Exascale Multiphysics Applications: Methods and Lessons Learned... Conference Paper October, 2018
Mathematically Rigorous Verification & Validation of Scientific Machine Learning... Conference Paper September, 2018
FAWCA: A Flexible-greedy Approach to find Well-tuned CNN Architecture for Image Recognition Problem... Conference Paper August, 2018
Partitioning and Communication Strategies for Sparse Non-negative Matrix Factorization... Conference Paper August, 2018