基于机器学习的材料弹性性能预测及可视化分析

Prediction of the elastic properties of materials based on machine learning and visualization analysis

  • 摘要: 在工程材料的应用中,弹性模量是重要的性能参数,找到特定弹性性能的材料是新材料合成领域的热点问题,如何快速且准确的预测弹性在工程上具有重要意义. 通过实际实验测量大量材料的弹性性能并不现实. 因此,通过计算机模拟筛选材料数据,选出候选材料,再通过实际实验进行验证,是一种理想的新材料发现方法. 目前材料性能预测的主要计算方法是基于第一性原理的高通量计算,这类方法效率低下,难以高效地完成大批量的材料筛选任务. 而基于材料统计学的机器学习预测方法,可通过大数据挖掘,快速预测材料性能,成为一种有可能替代高通量计算的方案. 本文将特征选择方法和机器学习模型进行组合,从中选择最有效的弹性模量预测组合方案,并设计交互界面对输入特征和材料弹性性能的关系进行可视化分析. 实验表明Pearson/RFE和GBDT的组合模型性能最好,同时通过可视化分析发现每原子能量、熔点、密度等特征对于预测结果的影响较大. 这些重要的特征可以从特征–目标关系中初步预测弹性模量的范围,目标属性值也可反过来估计材料的重要特征. 这些研究成果可应用于探究弹性的影响因素、预测大批量材料性能和可视化分析指导材料合成.

     

    Abstract: The elastic modulus is an important performance parameter that measures the ability of materials to resist deformation and is critical for assessing their reliability and stability. Thus, the elastic modulus serves as a guide in engineering design and material selection, and finding materials with specific elastic properties is a hot issue in the field of novel materials synthesis. Predicting elasticity quickly and accurately is of great significance in engineering. It is not practical to measure the elastic properties of many materials using practical experiments because this requires a significant amount of time and cost. For many material samples, each sample needs to be tested and analyzed, which is a time-consuming and expensive task. Thus, screening material data through computer simulation, choosing candidate materials, and then confirming them through actual experiments is an ideal method for new material discovery. Currently, the main calculation methods for material performance prediction are first-principles high-throughput calculation, which is inefficient and difficult to efficiently complete the high-volume material screening. Machine learning prediction methods based on material statistics can rapidly predict material properties through big data mining, which has become a possible alternative to high-throughput computing. In this work, the feature selection method and machine learning model are integrated to choose the most effective combination scheme for elastic modulus prediction, and an interactive interface is developed to perform a visual analysis of the relationship between input features and elastic properties of materials. For the analysis of the prediction results, the root mean square error (Rmse) and R-Square (R2) are employed as evaluation indicators for the performance of the prediction model. The experiment shows that the Pearson/RFE/LASSO-GBDT combination model possesses the best performance. On the other hand, by visualization analysis, it is revealed that the energy of each atom, melting point, density, and other characteristics have a great effect on the prediction results. These important characteristics can be utilized to preliminarily predict the range of elastic moduli from the feature–target relationship, and the value of target attributes can be used for the estimation of important characteristics of materials. These findings can be applied to investigate the influencing factors of elasticity, predict the properties of large quantities of materials, and guide the synthesis of materials by visualization analysis. This work has certain significance for guiding the discovery of novel materials and exploring the influencing factors of material properties.

     

/

返回文章
返回