Electronics 2025
1 Electrical Power and Renewable Energy, College of Engineering and Technology, University of Doha for
Science and Technology, Doha 24449, Qatar
2 Qatar Environment and Energy Research Institute, Hamad Bin Khalifa University, Qatar Foundation,
Doha 34110, Qatar; asanfilippo@[Link] (A.P.S.); dbachour@[Link] (D.B.);
dastudillo@[Link] (D.P.-A.)
* Correspondence: [Link]@[Link] or qat7@[Link]
Abstract: Solar energy is an inherently variable energy resource, and the ensuing un-
certainty in matching energy demand presents a challenge in its operational use as an
alternative energy source. The factors influencing solar energy power generation include
geographic location, solar radiation, weather conditions, and solar panel performance.
Solar energy forecasting is performed using machine learning for better accuracy and
performance. Due to the variability of solar energy, the forecasting window is an important
aspect of solar energy forecasting that must be integrated into any machine learning model.
This study evaluates the suitability of selected machine learning (ML) models comprising
Linear Regression, Decision Tree, Random Forest and XGBoost, which have been proven
to be effective at forecasting. The data forecasting horizon used was a 24-h window in
steps of 30 min. We focused on the first 30-min, 3-h, 6-h, 12-h, and 24-h windows to gain
an appreciation of the impact of forecasting duration on the accuracy of prediction using
the selected machine learning algorithms. The study results show that Random Forest
outperformed all other tested algorithms. It recorded the best values in all evaluation
metrics: an average mean absolute error of 0.13, mean absolute percentage error of 0.6,
root-mean-square error of 0.28 and R-squared value of 0.89.
will respond to the already variable energy demand. This challenge can be addressed by developing a solar energy forecasting model, which will be beneficial in several ways [4]. First, the model will ensure a reliable control system. This will maintain grid stability and optimize operating costs by committing appropriate amounts of solar energy through co-generation strategies [5]. Second, it will help in the effective integration of solar energy and storage to optimize energy resource use [6]. Third, the forecasting model will address the demand response. This maximizes the use of solar energy in times of peak consumption to reduce stress on the power grid and increase energy efficiency [7]. The forecasting model will help plants implement dynamic electricity pricing. Dynamic pricing is vital for adjusting electricity sales to energy demand that varies over time. The solar energy forecasting model will enable installed plants to optimise PV plant performance [8]. This contributes to increasing the productivity and longevity of solar PV plants. By developing and implementing an effective forecasting model, PV plants will avoid the injection of excessive solar power into the grid during times of low demand and high PV productivity [9]. The forecasting model will help with the protection of On Load Tap Changers (OLTC), which regulate the voltage ratio in sub-station transformers.
1.1. Review of Related Work
According to Mellit et al. [10], four major PV forecasting methods have dominated the field in the period from 2010. These are physical methods, statistical methods, artificial intelligence methods and emergent hybrid methods, as presented in Figure 1 below.
effectiveness of the evaluated machine learning model. Further to MAE and RMSE, we
included the mean absolute percentage error (MAPE) and R-squared measures to assess
the prediction reliability of each of the evaluated machine learning models. This article
contributes to revolutionizing the design and development of solar-based energy projects
by improving forecasting methods and narrowing down the options for the best algorithms
for developing PV forecasting models.
y = n + mx (1)
where:
- y — values of the (dependent) second dataset
- x — values of the (independent) first dataset
- n — y-intercept of the line
- m — slope of the line
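The slope m and intercept n of Equation (1) can be estimated by ordinary least squares. A minimal sketch in plain Python (the function name `fit_line` is an illustrative choice, not taken from the study's code):

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = n + m*x.

    Returns (n, m): the y-intercept and slope of Equation (1).
    """
    k = len(x)
    mean_x = sum(x) / k
    mean_y = sum(y) / k
    # Slope: covariance of (x, y) divided by the variance of x.
    m = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) \
        / sum((xi - mean_x) ** 2 for xi in x)
    # Intercept: the fitted line passes through the mean point.
    n = mean_y - m * mean_x
    return n, m

# Example: points lying exactly on y = 2 + 3x recover n = 2, m = 3.
n, m = fit_line([0, 1, 2, 3], [2, 5, 8, 11])
```

In practice a library routine such as `numpy.polyfit` would be used, but the closed-form solution above is all that Equation (1) requires.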
is mapped to the right as an outcome. The decision tree structure can be summarized as consisting of a root node (the entire dataset), internal nodes (decisions or tests), branches (outcomes of tests) and leaf nodes (final outcomes).

Figure 2. Decision tree algorithm structure [17].
The test applied represents the most appropriate attribute of the target dataset that will lead to the optimal dichotomy of the dataset. The selection process employs approaches like Gini impurity, entropy, and information gain. Gini impurity is a metric which evaluates the probability of occurrence of an incorrect classification of a new data point that was randomly classified. It is determined based on Equation (2) below.

Gini = 1 − ∑_{i=1}^{n} (p_i)^2 (2)

The entropy metric evaluates the level of uncertainty in the data set and is calculated according to Equation (3) below.

Entropy = − ∑_{i=1}^{n} p_i log2(p_i) (3)
The information gain splitting method evaluates the reduction in entropy or Gini impurity after a dataset has been split based on an attribute. The formula for the implementation of information gain is shown in Equation (4) below.

Info. Gain = Entropy(D) − ∑_{i=1}^{n} (|D_i|/|D|) · Entropy(D_i) (4)

In some cases, the dichotomisation may lead to too little data in the given subtree, and this results in overfitting. Decision trees, therefore, tend to have a preference for dichotomies that culminate in as few branches as possible. If the dichotomy leads to features of less significance, a process known as pruning is conducted. When a decision tree adopts an ensemble approach to ensure the accuracy of the classification process, it changes to a random forest algorithm [17], described in Section 1.5.3 below.

1.5.3. Random Forest
Random forests take note of the fact that no single model can fit all aspects of the problem to be modelled. Therefore, the method encompasses a variety of modelling techniques, each applied at the appropriate stage as deemed suitable to give better results. As in some of the previously discussed methods, it starts with an input feed that then follows a decision-tree-like analysis and, at each stage, an appropriate technique is applied to give an outcome, which forms an input into the next stage. It is best described as a collaboration among decision trees giving a single output. Random forest addresses the weakness of its parent algorithm, the decision tree, which has inherent overfitting. It prevents overfitting by introducing randomness in the construction of the decision trees. Because random forest effectively deals with overfitting and handles the issue of missing data, it presents itself as one of the most effective methods for the forecasting process.
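Equations (2)–(4) can be checked numerically. A small sketch in plain Python (illustrative helper names, not the authors' implementation):

```python
from math import log2

def gini(probs):
    """Gini impurity, Equation (2): 1 - sum of p_i squared."""
    return 1 - sum(p * p for p in probs)

def entropy(probs):
    """Entropy, Equation (3): -sum of p_i * log2(p_i)."""
    return -sum(p * log2(p) for p in probs if p > 0)

def info_gain(parent, subsets):
    """Information gain, Equation (4): parent entropy minus the
    size-weighted entropy of the subsets produced by a split."""
    total = sum(len(s) for s in subsets)

    def class_probs(labels):
        return [labels.count(c) / len(labels) for c in set(labels)]

    weighted = sum(len(s) / total * entropy(class_probs(s)) for s in subsets)
    return entropy(class_probs(parent)) - weighted

# A perfectly separating split recovers the full parent entropy (1 bit here).
parent = ["sunny", "sunny", "cloudy", "cloudy"]
gain = info_gain(parent, [["sunny", "sunny"], ["cloudy", "cloudy"]])
```

A split that leaves both subsets pure has zero weighted entropy, so the gain equals the parent entropy — the best case the splitting criterion can achieve.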
1.5.4. XGBoost
Extreme gradient boosting (XGBoost) is a gradient-boosted machine learning model.
XGBoost is one of the ensemble machine learning algorithms that is renowned for the
efficient treatment of missing values in a dataset.
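The boosting idea behind XGBoost — fitting each new learner to the residuals of the current ensemble — can be illustrated with a stripped-down sketch using single-split "stump" learners. This is purely illustrative and omits XGBoost's regularised tree construction and missing-value handling:

```python
def fit_stump(x, residuals):
    """Best single-threshold predictor for the residuals: tries each
    threshold and predicts the mean residual on each side of the split."""
    best = None
    for t in sorted(set(x)):
        left = [r for xi, r in zip(x, residuals) if xi <= t]
        right = [r for xi, r in zip(x, residuals) if xi > t]
        if not left or not right:
            continue
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        sse = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, t, lm, rm)
    return best[1:]  # (threshold, left_value, right_value)

def boost(x, y, rounds=50, lr=0.3):
    """Gradient boosting for squared loss: each stump fits the residuals
    of the ensemble so far, scaled by a learning rate."""
    pred = [sum(y) / len(y)] * len(y)  # start from the mean prediction
    for _ in range(rounds):
        resid = [yi - pi for yi, pi in zip(y, pred)]
        t, lv, rv = fit_stump(x, resid)
        pred = [pi + lr * (lv if xi <= t else rv) for xi, pi in zip(x, pred)]
    return pred

# The ensemble's squared error shrinks as boosting rounds are added.
x = [1, 2, 3, 4, 5, 6]
y = [1.0, 1.2, 3.9, 4.1, 8.0, 8.2]
pred = boost(x, y)
```

Each round corrects what the previous rounds got wrong, which is why boosted ensembles can capture structure that a single weak learner cannot.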
2.1.2. R-Squared
The R2 value is another measure that is used to evaluate an algorithm in predicting the
expected outcome. The value of R2 in a model is determined by Equation (6). The numerator
is the sum of the squares of residuals. The denominator of the function represents the
total sum of the squares. A value of R2 that is closer to 1 signifies greater accuracy of the
regression model.
R^2 = 1 − ∑(y_i − ŷ_i)^2 / ∑(y_i − ȳ)^2 (6)
MAPE = (100/n) ∑_{t=1}^{n} |A_t − F_t| / A_t (8)
where:
At —Actual value
Ft —Forecast value
n—Number of fitted points
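All four evaluation metrics can be computed directly from paired actual/forecast series. A sketch in plain Python (the helper name `metrics` is an illustrative choice, not from the study):

```python
from math import sqrt

def metrics(actual, forecast):
    """Return MAE, MAPE (Equation (8), in percent), RMSE,
    and R-squared (Equation (6)) for paired series."""
    n = len(actual)
    errors = [a - f for a, f in zip(actual, forecast)]
    mae = sum(abs(e) for e in errors) / n
    mape = 100 / n * sum(abs((a - f) / a) for a, f in zip(actual, forecast))
    rmse = sqrt(sum(e * e for e in errors) / n)
    mean_a = sum(actual) / n
    ss_res = sum(e * e for e in errors)               # residual sum of squares
    ss_tot = sum((a - mean_a) ** 2 for a in actual)   # total sum of squares
    r2 = 1 - ss_res / ss_tot
    return mae, mape, rmse, r2

# A perfect forecast gives MAE = MAPE = RMSE = 0 and R-squared = 1.
mae, mape, rmse, r2 = metrics([1.0, 2.0, 4.0], [1.0, 2.0, 4.0])
```

Note that MAPE divides by the actual value, so it is undefined when A_t = 0 — a practical concern for solar data, where night-time output is zero.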
3. Results
3.1. Linear Regression Model
Table 1 below provides the MAE, MAPE, RMSE and R-squared values for the linear regression model. The evaluation metrics were analysed for each of the five forecast windows.
To appreciate the impact of the forecast horizon (window) on the quality of predictions
for linear regression, we developed a regression plot based on the predicted value. We
chose the 3-h window and the 24-h window. Figure 3 presents the regression plot for
the 3-h window and shows that the points are less symmetrically distributed along the
regression line.
Figure 4 shows the plot for the 24-h window for the linear regression model. The 24-h forecast window shows an improved distribution of points along the regression line in the linear regression model.
Figure 4. Linear regression based on a 24-h period.
Figure 5 shows MAE values across the forecast windows for the linear regression model. It shows a higher MAE at the 3-h window, which declines to the lowest value of 0.2434 at the 12-h forecast window.
Figure 5. Linear regression graph of MAE across the forecast windows.
A similar trend in the values of MAPE can be observed for the linear regression model (Figure 6). There is a general reduction in the value of MAPE with increases in the span of the forecast window in the linear regression model.
Figure 6. Linear regression graph of MAPE across the forecast windows.
Figure 7 represents the RMSE value for the linear regression model. It depicts a sine profile for the RMSE values across the forecast windows.
Figure 7. Linear regression graph of RMSE across the forecast windows.
The R-squared trend for the linear regression model is presented in Figure 8. It shows an irregular profile for the values of R-squared across the forecast windows.
Figure 8. Linear regression graph of R-squared across the forecast windows.
3.2. Results of Decision Tree Model
Table 2 shows the evaluation results for the decision tree model based on the five forecasting windows of 0.5, 3, 6, 12 and 24 h, respectively.
Table 2. Results of decision tree model.

Metric   0.5-h    3-h      6-h      12-h     24-h     Average
MAE      0.0431   0.0445   0.0856   0.2961   0.2927   0.1524
MAPE     0.4021   0.2812   0.3376   1.6362   1.0623   0.7439
RMSE     0.1410   0.1429   0.2380   0.5878   0.6446   0.3509
R2       0.9803   0.9793   0.9450   0.6119   0.6297   0.8292
For the decision tree prediction model, the visual results for a 3-h forecast window are presented in Figure 9. The results show that the data points are symmetrically distributed along the regression line.
Figure 10 shows results for a 24-h forecast window for the decision tree regression plot. The outcome maintains a symmetric distribution of the data points along the regression line.
Figure 10. Decision tree regression based on a 24-h period.
Figure 11 represents the graph of MAE values across the forecast windows for the decision tree algorithm. The curve depicts a sigmoid shape with a peak at the 12-h forecast window.
Figure 11. Decision tree graph of MAE across the forecast windows.
The graph of MAPE values for the decision tree algorithm across the forecast windows is shown in Figure 12. The graph reflects the sigmoid shape revealed by the MAE profile above.
Figure 12. Decision tree graph of MAPE across the forecast windows.
The corresponding sigmoid curve for the RMSE values of the decision tree across the forecasting windows is shown in Figure 13.
Figure 13. Decision tree graph of RMSE across the forecast windows.
Figure 14 represents the graph of R-squared values across the forecast windows for the decision tree. It is evident that the graph is a reverse sigmoid curve with a peak at the 6-h forecast window.
Figure 14. Decision tree graph of R-squared across the forecast windows.
3.3. Random Forest Results
Table 3 displays the MAE, MAPE, RMSE and R-squared results for the random forest model prediction analysis.

Table 3. Results of random forest model.
Metric   0.5-h    3-h      6-h      12-h     24-h      Average
MAE      0.0318   0.0347   0.0673   0.2392   0.2741    0.1294
MAPE     0.3031   0.3647   0.2315   1.1180   1.0212    0.6077
RMSE     0.1053   0.1162   0.1951   0.4164   0.54996   0.27659
R2       0.9890   0.9863   0.9631   0.8052   0.7305    0.8948
The regression plot for the evaluation of the random forest model is presented in Figure 15, showing a 3-h forecast window. As with the decision tree model, the data points for the random forest model are distributed symmetrically along the regression plot.
Figure 16 shows reduced point density at the 24-h prediction window. However, the random forest regression maintains a symmetric distribution of the points.
Figure 17. Random Forest graph of MAE across the forecast windows.
Figure 18 shows the MAPE curve across the forecast windows for the random forest model. The graph is like that of the preceding machine learning model, albeit with some level of sensitivity at the 3-h window.
Figure 18. Random Forest graph of MAPE across the forecast windows.
A sigmoid curve for the RMSE values, corresponding to the MAE and MAPE values, for the random forest model is shown in Figure 19. Unlike the decision tree model, the RMSE curve has a transition phase at the 12-h window instead of the peak.
Figure 19. Random Forest graph of RMSE across the forecast windows.
Figure 20 presents the R-squared values for the random forest model. Compared to the corresponding curve for the decision tree model, the random forest has a less prominent peak at the 3-h window on the reverse sigmoid curve.
Figure 20. Random Forest graph of R-squared across the forecast windows.
3.4. XGBoost Results
Table 4 shows the evaluation results of MAE, MAPE, RMSE and R-squared for the XGBoost algorithm. The analysis was performed for each of the five data forecast windows from 30 min to 24 h.

Table 4. Results of XGBoost model.
Metric   0.5-h    3-h      6-h      12-h     24-h     Average
MAE      0.0350   0.0441   0.0738   0.2060   0.2893   0.1296
MAPE     0.3499   0.4805   0.2372   0.9386   0.9065   0.5825
RMSE     0.1051   0.1162   0.1939   0.3616   0.6272   0.2808
R2       0.9891   0.9863   0.9635   0.8531   0.6465   0.8877
Figure 21, below, presents the regression plot for a 3-h forecast window for the XGBoost model. The graph shows a strong linear relationship based on this model as depicted by the symmetric distribution of data points along the regression line.
Figure 22 provides results for the 24-h forecast window for the XGBoost regression analysis.
Figure 23 presents the MAE values for XGBoost over all forecast windows. The trend is a curve approaching an exponential curve, with growth starting at the 6-h window.
Figure 23. XGBoost graph of MAE across the forecast windows.
The curve of the MAPE values across the forecasting windows is a sigmoid curve with regularity at the 3-h forecasting window, as shown in Figure 24.
Figure 24. XGBoost graph of MAPE across the forecast windows.
The RMSE curve for XGBoost across the forecast windows, Figure 25, depicts an exponential curve, unlike the decision tree and random forest models. An exponential increase in the RMSE value starts at the 3-h forecast window.
Figure 25. XGBoost graph of RMSE across the forecast windows.
Correspondingly, the graph of R-squared values for XGBoost across the forecast windows shows a reverse exponential curve with its maximum value at the 3-h window; Figure 26.
Figure 26. XGBoost graph of R-squared across the forecast windows.
4. Discussion
The results show that the highest accuracy was recorded at the 30-min prediction window, with the highest R-squared measurement at 0.9890 and the lowest at 0.8745. The lowest prediction accuracy was recorded for the 24-h forecast window, with random forest scoring 0.7305 and decision tree scoring the lowest, 0.6297, in this category. Predictability becomes challenging and cannot remain reliable over a longer forecasting period. Predictions within half an hour can be extremely accurate but may not be of much use to the intended application in the solar energy industry and energy production and distribution.
The linear regression model's performance was unpredictable over the five forecast windows. The underperformance of linear regression is seen here in terms of high values of MAE (up to 0.49) and RMSE (up to 0.58), accompanied by low values of R-squared, as low as 0.2 in the outlier instance. Despite the poor model outcome, the algorithm depicted considerable forecast reliability in the 24-h window, with an R-squared value approaching 0.7 (0.69699), which is similar to the results presented in [22].

Decision tree was one of the best-performing algorithms over all forecast categories. It showed consistency, recording a high of 0.98 in the half-hour window and 0.6297 in the 24-h forecast window. However, the performance of the decision tree dropped suddenly in the 12-h window, to 0.6119 from 0.9450 in the 6-h window. Generally, predictions by the decision tree model returned low values of MAE (to the tune of 0.0431) and RMSE (the lowest being 0.141).
Random forest was the most outstanding machine-learning algorithm in this study.
The values were consistent and decreased gradually across the forecast window tests. This
underscores the strength of the random forest algorithm in modelling solar radiation given
the input data. The algorithm had the lowest mean absolute error to the tune of 0.0318 and
an R2 of 0.989. However, in forecasting the 24-h window period, it did not outperform the
decision tree in terms of point–cluster symmetry. This does not, however, make it inferior to the decision tree in this test.
The last machine model analysed was XGBoost, which also depicted consistency across
the set of test windows. XGBoost maintained high values for the test metrics above all the
tested algorithms, presenting it as one of the most effective forecasting models [23,24].
Random forest outperformed all the tested algorithms in this research. It recorded the best values on all the evaluation metrics: an average mean absolute error of 0.1294, mean absolute percentage error of 0.6077, root-mean-square error of 0.27659 and R-squared value of 0.8948. These summary results are presented in Table 5 below.
Random forest and XGBoost models showed higher prediction accuracies compared to the other models. This may be attributed to their ensemble nature, which enables these models to
capture complex patterns within the dataset. Both XGBoost and random forest share a basic
prediction approach. Both algorithms are founded on the fact that data can be complex, and
a single prediction model cannot satisfactorily model the data. Therefore, both algorithms
adopt a tree-like analysis approach, which makes them robust in the prediction of data.
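The variance-reduction benefit of combining many imperfect learners can be illustrated with a toy simulation, where independent random noise stands in for each individual model's error (illustrative only; real tree ensembles also differ in how members are built):

```python
import random

random.seed(0)  # deterministic demo

def noisy_predictor(truth):
    """One 'model': the true value plus independent zero-mean noise."""
    return [t + random.gauss(0, 1.0) for t in truth]

def rmse(a, b):
    return (sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)) ** 0.5

truth = [5.0] * 200
single = noisy_predictor(truth)

# Ensemble: average the outputs of 50 independent noisy predictors.
members = [noisy_predictor(truth) for _ in range(50)]
ensemble = [sum(col) / len(col) for col in zip(*members)]

# Averaging independent errors shrinks RMSE by roughly sqrt(ensemble size).
```

This averaging effect is one reason ensemble methods such as random forest tolerate the overfitting of their individual trees.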
Figure 27 provides the average MAE values for the evaluated machine learning
models. Random forest and XGBoost recorded the lowest MAE values, indicating them as
potentially reliable forecasting algorithms.
Figures 28 and 29 echo the outcome of the MAE metric, presenting random forest and
XGBoost as the models with the lowest error among the studied group of algorithms.
To conclude that random forest and XGBoost are the most reliable machine learning
algorithms suited for solar power forecasting, Figure 30 shows that both random forest and
XGBoost recorded the highest average values of R-squared at 0.89.
Figure 28. Graph of average MAPE values for each algorithm.
Figure 30. Graph of average R-squared values for each algorithm.
5. Conclusions
The Random Forest and XGBoost algorithms provide the most reliable ML models for forecasting PV power output. The 6-h window provided the best forecast period for all the models except linear regression. The longer the forecast window, the less reliable the prediction model became, as seen in the diminishing R-squared values of the tested ML models.
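The windowing scheme evaluated here (a 24-h horizon issued in 30-min steps, with the first 30-min, 3-h, 6-h, 12-h and 24-h windows scored separately) amounts to taking leading slices of the forecast sequence. The sketch below illustrates this; the helper names are hypothetical, not from the study's code:

```python
# Slice a 24-h forecast issued in 30-min steps into the evaluation
# windows used in the study (30 min, 3 h, 6 h, 12 h, 24 h).
STEP_MINUTES = 30
WINDOWS_HOURS = [0.5, 3, 6, 12, 24]

def window_steps(hours, step_minutes=STEP_MINUTES):
    """Number of forecast steps covered by a window of the given length."""
    return int(hours * 60 // step_minutes)

def window_slices(forecast):
    """Return the leading slice of the forecast for each evaluation window."""
    return {h: forecast[:window_steps(h)] for h in WINDOWS_HOURS}

full_horizon = list(range(48))  # 48 half-hourly steps = 24 h
slices = window_slices(full_horizon)
# e.g. the 6-h window covers the first 12 half-hourly steps
```

Scoring each slice against its observed values with the same metrics then reproduces the per-window comparison reported above.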
The forecasting of solar radiation is most reliable over short window periods, which can be attributed to the sensitivity of GHI to changing weather conditions. Developing a regression model requires a good understanding of both the input and output variables. The forecast of solar PV power output, which is represented by trends in global horizontal irradiance, is susceptible to changes in weather conditions; this is the underlying reason the predictions were reliable over short window periods.
The study findings present an opportunity for the solar energy industry to improve decision-making and the efficient design and implementation of solar energy plants. They also present an opportunity to develop software packages that can take the design variables for any selected location and produce design parameters quickly and efficiently. This study was limited to solar radiation as the variable affecting solar power output; future research should incorporate other factors affecting solar energy output to develop a comprehensive machine learning model.
The application of machine learning models to predicting PV power output requires a substantial amount of historical data to draw reliable conclusions, and such data may be lacking in the geographical regions of interest. Establishing a PV plant requires feasibility studies that may identify new locations for which historical meteorological data are not readily available. This presents a limitation, but also an opportunity to develop meteorological databases covering the entire globe.
Conflicts of Interest: The authors declare no conflicts of interest or situations in which financial or personal interests could compromise the research.
References
1. Global Solar Atlas. World Bank, ESMAP and Solargis. 2024. Available online: [Link]info/map (accessed on 1 September 2024).
2. Mangherini, G.; Diolaiti, V.; Bernardoni, P.; Andreoli, A.; Vincenzi, D. Review of Façade Photovoltaic Solutions for Less Energy-
Hungry Buildings. Energies 2023, 16, 6901. [CrossRef]
3. IEA. Global Energy Review 2021. 2021. Available online: [Link] (accessed on 1 September 2024).
4. Impram, S.; Nese, S.V.; Oral, B. Challenges of renewable energy penetration on power system flexibility: A survey. Energy Strategy
Rev. 2020, 31, 100539. [CrossRef]
5. Ahmed, R. A review and evaluation of the state-of-the-art in PV solar power forecasting: Techniques and optimization. Renew.
Sustain. Energy Rev. 2020, 124, 109792. [CrossRef]
6. Kabir, M. Coordinated control of grid-connected photovoltaic reactive power and battery energy storage systems to improve the voltage profile of a residential distribution feeder. IEEE Trans. Ind. Inform. 2014, 10, 967–977.
7. Trondle, T. Trade-offs between geographic scale, cost, and infrastructure requirements for fully renewable electricity in Europe.
Joule 2020, 4, 1929–1948. [CrossRef] [PubMed]
8. Abdelshafy, A.M.; Hassan, H.; Jurasz, J. Optimal design of a grid-connected desalination plant powered by renewable energy
resources using a hybrid PSO-GWO approach. Energy Convers. Manag. 2018, 173, 331–347. [CrossRef]
9. Johnson, D.O.; Hassan, K.A. Issues of power quality in electrical systems. Int. J. Energy Power Eng. 2016, 5, 148–154. [CrossRef]
10. Mellit, A.; Massi Pavan, A.; Ogliari, E.; Leva, S.; Lughi, V. Advanced methods for photovoltaic output power forecasting: A
review. Appl. Sci. 2020, 10, 487. [CrossRef]
11. Iweh, C.D. Distributed generation and renewable energy integration into the grid: Prerequisites, push factors, practical options,
issues and merits. Energies 2021, 14, 5375. [CrossRef]
12. Abbassi, R. An efficient salp swarm-inspired algorithm for parameters identification of photovoltaic cell models. Energy Convers.
Manag. 2019, 179, 362–372. [CrossRef]
13. Khare, V.; Nama, S.; Baredar, P. Solar-wind hybrid renewable energy system. Renew. Sustain. Energy Rev. 2015, 10, 23–33.
[CrossRef]
14. Hassan, A. Thermal management and uniform temperature regulation of photovoltaic modules using hybrid change materials-
nanofluids system. Renew. Energy 2020, 145, 282–293. [CrossRef]
15. Abdel-Nasser, M.; Mahmoud, K. Accurate photovoltaic power forecasting models using deep LSTM-RNN. Neural Comput. Appl.
2019, 31, 2727–2740. [CrossRef]
16. Solar Energy Research Institute. Basic Photovoltaic Principles and Methods; Technical Information Office: Washington, DC, USA,
1981.
17. Madhavan, B.L.; Ratnam, M.V. Impact of a solar eclipse on surface radiation and photovoltaic energy. Sol. Energy 2021, 223,
351–366. [CrossRef]
18. IBM. IBM Research. 2024. Available online: [Link] (accessed on 31 August 2024).
19. Khandakar, A. Machine learning-based photovoltaics (PV) power prediction using different environmental parameters of Qatar. Energies 2019, 12, 2782. [CrossRef]
20. Long, C.N.; Dutton, E.G. BSRN Global Network Recommended QC Tests; V2.0 BSRN Technical Report, BSRN; 2010. Available
online: [Link] (accessed on 1 September 2024).
21. Perez-Astudillo, D.; Bachour, D.; Martin-Pomares, L. Improved quality control protocols on solar radiation measurements. Sol.
Energy 2018, 169, 425–433. [CrossRef]
22. Hao, J.; Ho, T.K. Machine learning made easy: A review of Scikit-learn package in Python programming language. J. Educ. Behav. Stat. 2019, 44, 348–361. [CrossRef]
23. Babatunde, A.A.; Abbasoglu, S. Predictive analysis of photovoltaic plants specific field with the implementation of multiple linear
regression tool. Environ. Prog. Sustain. Energy 2019, 38, 13098. [CrossRef]
24. Wang, J. A short-term photovoltaic power prediction model based on the gradient boost decision tree. Appl. Sci. 2018, 8, 689.
[CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.