Performance Evaluation of Machine Learning Algorithms for Predicting Organic Photovoltaic Efficiency

Article Open Access

Performance Evaluation of Machine Learning Algorithms for Predicting Organic Photovoltaic Efficiency

Author Information
Electrical and Computer Engineering, FAMU-FSU College of Engineering, Florida State University, Tallahassee, FL 32310, USA
*
Authors to whom correspondence should be addressed.

Received: 26 July 2025 Accepted: 21 October 2025 Published: 31 October 2025

Creative Commons

© 2025 The authors. This is an open access article under the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).

Views:48
Downloads:21
Clean Energy Sustain. 2025, 3(4), 10016; DOI: 10.70322/ces.2025.10016
ABSTRACT: This study forecasts the power conversion efficiency (PCE) of organic solar cells using data from experiments with donors and non-fullerene acceptor materials. We built a dataset that includes both numerical and categorical features by using standard scaling and one-hot encoding. We developed and compared several machine learning (ML) models, including multilayer perceptron, random forest, XGBoost, multiple linear regression, and partial least squares. The modified XGBoost model performed best, achieving a root mean squared error (RMSE) of 0.564, a mean absolute error (MAE) of 0.446, and a coefficient of determination (R2) of 0.980 on the test set. We also assessed the model’s ability to generalize and its reliability by examining learning curve trends, calibration curve analysis, and residual distribution. Plots of feature correlation and permutation importance showed that ionization potential and electron affinity were key predictors. The results demonstrate that with proper tuning, gradient boosting methods can provide highly accurate and easy-to-understand predictions of organic solar cell efficiency. This work establishes a repeatable machine learning process to quickly screen and thoughtfully design high-efficiency photovoltaic materials.
Keywords: Organic solar cell; Power conversion efficiency; Machine learning; XGBoost; Multilayer perceptron; Feature importance; Photovoltaic material; Data modeling
TOP