Skip to main content
SHARE
Publication

Advanced data science toolkit for non-data scientists – A user guide...

by Jian Peng, Sangkeun M Lee, Andrew T Williams, James A Haynes, Dongwon Shin
Publication Type
Journal
Journal Name
Calphad (Computer Coupling of Phase Diagrams and Thermochemistry)
Publication Date
Page Number
101733
Volume
68

Emerging modern data analytics attracts much attention in materials research and shows great potential for enabling data-driven design. Data populated from the high-throughput CALPHAD approach enables researchers to better understand underlying mechanisms and to facilitate novel hypotheses generation, but the increasing volume of data makes the analysis extremely challenging. Herein, we introduce an easy-to-use, versatile, and open-source data analytics frontend, ASCENDS (Advanced data SCiENce toolkit for Non-Data Scientists), designed with the intent of accelerating data-driven materials research and development. The toolkit is also of value beyond materials science as it can analyze the correlation between input features and target values, train machine learning models, and make predictions from the trained surrogate models of any scientific dataset. Various algorithms implemented in ASCENDS allow users performing quantified correlation analyses and supervised machine learning to explore any datasets of interest without extensive computing and data science background. The detailed usage of ASCENDS is introduced with an example of experimental high-temperature alloy data.