Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores (Conference Paper, November 2023)
CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types... (Conference Paper, November 2023)
Experimental Characterization of OpenMP Offloading Memory Operations and Unified Shared Memory Support (Conference Paper, September 2023)
Optimizing Data Movement for GPU-Based In-Situ Workflow Using GPUDirect RDMA (Conference Paper, September 2023)
Experience Migrating OpenCL to SYCL: A Case Study on Searches for Potential Off-Target Sites of Cas9 RNA-Guided Endonucleases... (Conference Paper, September 2023)
Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation (Conference Paper, August 2023)
Scalable Incremental Checkpointing using GPU-Accelerated De-Duplication (Conference Paper, August 2023)