Integration of the Machine Learning Algorithms and I-MR Statistical Process Control for Solar Energy

dc.authoridAtalan, Abdulkadir/0000-0003-0924-3685
dc.authoridAyaz Atalan, Dr. Yasemin/0000-0001-7767-0342
dc.contributor.authorAtalan, Yasemin Ayaz
dc.contributor.authorAtalan, Abdulkadir
dc.date.accessioned2025-01-27T20:22:46Z
dc.date.available2025-01-27T20:22:46Z
dc.date.issued2023
dc.departmentÇanakkale Onsekiz Mart Üniversitesi
dc.description.abstractThe importance of solar power generation facilities, as one of the renewable energy types, is increasing daily. This study proposes a two-way validation approach to verify the validity of the forecast data by integrating solar energy production quantity with machine learning (ML) and I-MR statistical process control (SPC) charts. The estimation data for the amount of solar energy production were obtained by using random forest (RF), linear regression (LR), gradient boosting (GB), and adaptive boost or AdaBoost (AB) algorithms from ML models. Data belonging to eight independent variables consisting of environmental and geographical factors were used. This study consists of approximately two years of data on the amount of solar energy production for 636 days. The study consisted of three stages: First, descriptive statistics and analysis of variance tests of the dependent and independent variables were performed. In the second stage of the method, estimation data for the amount of solar energy production, representing the dependent variable, were obtained from AB, RF, GB, and LR algorithms and ML models. The AB algorithm performed best among the ML models, with the lowest RMSE, MSE, and MAE values and the highest R2 value for the forecast data. For the estimation phase of the AB algorithm, the RMSE, MSE, MAE, and R2 values were calculated as 0.328, 0.107, 0.134, and 0.909, respectively. The RF algorithm performed worst with performance scores for the prediction data. The RMSE, MSE, MAE, and R2 values of the RF algorithm were calculated as 0.685, 0.469, 0.503, and 0.623, respectively. In the last stage, the estimation data were tested with I-MR control charts, one of the statistical control tools. At the end of all phases, this study aimed to validate the results obtained by integrating the two techniques. Therefore, this study offers a critical perspective to demonstrate a two-way verification approach to whether a system's forecast data are under control for the future.
dc.identifier.doi10.3390/su151813782
dc.identifier.issn2071-1050
dc.identifier.issue18
dc.identifier.scopus2-s2.0-85172879422
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.3390/su151813782
dc.identifier.urihttps://hdl.handle.net/20.500.12428/22025
dc.identifier.volume15
dc.identifier.wosWOS:001074267000001
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherMdpi
dc.relation.ispartofSustainability
dc.relation.publicationcategoryinfo:eu-repo/semantics/openAccess
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WoS_20250125
dc.subjectsolar energy
dc.subjectmachine learning
dc.subjectrandom forest
dc.subjectAdaBoost
dc.subjectgradient boosting
dc.subjectlinear regression
dc.subjectstatistical process control
dc.subjectI-MR control chart
dc.titleIntegration of the Machine Learning Algorithms and I-MR Statistical Process Control for Solar Energy
dc.typeArticle

Dosyalar