Abstract
To compensate for the massive variations in power output brought on by unpredictability, enormous quantities of pricy battery storage or power reserve capacity are required. Precise prediction of the Solar Wind Power Forecasting system improves energy conversion efficiency, reduces the risk of overloading the system, and optimizes unit commitment. In this paper, design a SWIFT-Net (Solar & Wind Integrated Forecasting Technology Network) model for precise solar and wind energy forecasting. Unlike existing models, SWIFT-Net uniquely integrates a deep learning ensemble framework with a dual-stage hybrid feature engineering mechanism using Hybrid Kookaburra Optimization and Botox Optimization, which has not been previously applied in this context. This ensures the selection of the most impactful features and enhances generalization across variable weather conditions. The weighted averaging method is employed to combine solar and wind power predictions. Finally, the designed model gained results are validated with existing classifiers in terms of MSE, RMSE, and MAE, ensuring continual refinement for enhanced forecasting accuracy, reliability.
Keywords
Introduction
The primary means of addressing the global warming issue is becoming more and more focused on Renewable Energy Sources (RES). Researchers are paying more and more consideration to RES and sustainable energy sources, including wind, solar, wave, and tidal energy because fossil fuels are becoming scarcer and have negative environmental effects (AlKandari and Ahmad, 2020). Because of this, the use of RES has grown significantly in recent years and is predicted to grow considerably more in the ensuing decades (Carneiro et al., 2022). This poses a new issue because solar and wind energy systems only produce electricity when sunlight and wind are present. Because the primary RES are stochastic, their proper integration depends on precise and dependable forecasts (Rosa De Jesus et al., 2020). However, both physically and temporally integrating RES into the electrical system is difficult, and the cost of doing so remains high. The two most widely utilized RES are solar and wind (Kumari and Toshniwal, 2021; Wu and Wang, 2021). One distinctive quality of consumable electricity is that it is produced and consumed with little to no storage. Conventional electricity is produced in response to consumer demand from businesses, industries, and residences (Singla et al., 2022). The main issue in integrating RES into the electricity grid is their unpredictable character, such as the sporadic nature of wind speed (Massaoudi et al., 2021). In addition, the power output of individual wind turbines inside a wind farm varies according to the wind direction and turbine placement. The haze effect, solar elevation angle, and cloud cover all affect solar irradiation (Mora et al., 2021; Ribeiro et al., 2022). Cloud cover and haze impact are stochastic, but the sun elevation angle is theoretically determinant (Malhan and Mittal, 2022).
Inconsistent power sources interfere with traditional techniques of energy system planning and operation presenting challenges for power generation (Chakraborty et al., 2023). Grid operators are forced to modify their day-ahead, hour-ahead, minute-ahead, and real-time operations procedures due to the variability and fluctuation of renewable electricity output on numerous time scales (Xing and He, 2023). The building and organizing of wind and solar (W&S) power frameworks, the creation of electricity markets, and the planning and execution of power generation are only a few of the issues that depend on the accuracy and dependability of the prediction models for these energy sources (Marndi et al., 2020). However, given that errors in W&S power forecasting are inevitable, predictive modeling is a crucial tool for assessing risk in power systems, quantifying the unpredictability of traditional point predictions, and providing insightful data (Tarek et al., 2023). For this reason, load forecasting and immediate RES (such as W&S power) are essential in modern power plants for scheduling servicing, generating planning, and security evaluation (Guermoui et al., 2022).
Huge amounts of expensive battery storage or power reserve capacity are needed to smooth out the huge fluctuations in power outputs caused by unpredictability. It is possible to decrease these reserves by increasing forecasting accuracy (Devaraj et al., 2021). Accurate forecasting of the W&S power system enhances unit commitment optimization, lowers the danger of system overloading, and increases the efficiency of energy conversion. Power systems need accurate W&S power predictions because this RES has the potential to enhance both the economy and the environment (Jalali et al., 2021; Ma et al., 2020).
In response to these challenges, this paper suggesting a new deep learning ensemble framework, SWIFT-Net, which is the first attempt to combine hybrid optimization and multi-modal deep learning networks to improve the forecasting of solar and wind. In contrast to the traditional models, SWIFT-Net has a number of novel contributions that contribute to the significant improvement of accuracy, robustness, and interpretability. The following are the paper’s primary contributions: • To effectively extract the features, the BOA optimization is hybrid with the KOA optimization method. The hunting approach of the KOA is improved using the mean value of the BOA algorithm. • Utilizes LSTM network with an optimized MA mechanism to capture sequential data patterns effectively. By allowing the model to concentrate on pertinent elements of the input data, the attention mechanism improves the model’s capacity to identify significant temporal correlations. • Integrates Inceptionv2 for efficient feature extraction from the input images, enabling the model to incorporate additional relevant information. Utilizes Efficient Capsule Networks for robust feature representation, improving the model’s capability to represent intricate spatial relationships in the information. • Combines the forecasts from O-MA-LSTM and I2EMv2-E-CapsNet using a weighted averaging approach. By utilizing both models’ advantages, this integration method creates a forecasting system that is more reliable and accurate. • All these contributions make SWIFT-Net stand out among the current forecasting models that incorporate a hybrid metaheuristic optimization, temporal attention modeling, and advanced feature fusion techniques. The obtained model shows better forecasting ability when tested against conventional measures of MSE, RMSE, and MAE.
The manuscript’s remaining section is arranged as follows: Section 2 lists the publications that are currently available on the solar and wind forecasting. In Section 3, the architecture of the proposed model is described. The experimental findings for the suggested method are shown in Section 4. Section 5 serves as the conclusion.
Related works
Wu and Wang (2021) created an ensemble neural network system for W&S power forecasting in China using LSTM, SVM, BP neural network, and ELM. To lessen the negative consequences of the original series’ fluctuation, VMD first breaks down unstable W&S power time series to smooth portions. Then, four fundamental models that are optimized using the EOSSA method are utilized to forecast W&S power based on the deconstructed subsequences. Ultimately, weighing the predictions from the four designs, the ENN prediction results were rebuilt.
Cannizzaro et al. (2021) offer a brand-new approach for both short- and long-term time frame Global Horizontal Irradiance (GHI) solar forecasting. It examines real-world datasets containing various time-series weather information, including GHI. Next, employ Random Forest (RF) with LSTM in conjunction with two Convolutional Neural Networks (CNN) and Variational Mode Decomposition (VMD). The creation of more precise energy management policies is aided by GHI forecasting.
Massaoudi et al. (2021) offer an accurate and dependable ensemble learning framework for short-term PV power generation prediction to address the significant volatility of PV electricity. The forecasting tool that is being presented includes layers of both base and meta-models. To obtain excellent predicting accuracy, the tree-structured Parzen estimators’ method is used to tune the hyper-parameters of the suggested stacking ensemble. The suggested approach outperforms the current forecasting techniques with the lowest Mean Absolute Percentage Error (MAPE) of 2.30%, as confirmed by the simulation outcomes.
For W&S power forecasting, Ahmad et al. (2021) introduced the ensemble-based k-nearest neighbor (KNN) model. This method’s nonlinear modeling problem in W&S energy allows for a more precise assessment of the spatial characteristics of wind and solar power. Specifically, the technique known as KNN was trained and optimized using Makowski metric evaluation. The chi-square distance was employed to evaluate the k-nearest neighbor method’s theoretical goodness of fit among observed and predicted values. The 5-fold cross-validation technique was used for extremely complex solar/wind data to ascertain the precise lasso-penalty strength.
To forecast solar energy, Khan et al. (2022) suggested an enhanced stacked ensemble method (DSEXGB) that utilizes two DL techniques: LSTM and Artificial Neural Network (ANN). To improve the forecast accuracy of solar power production, an extreme gradient boosting technique is used that integrates forecasts from the basic models. Furthermore, the designed study employed the shapely additive explanation paradigm to offer a more profound understanding of the algorithm’s learning process. The suggested model’s effectiveness was assessed by contrasting the prediction outcomes with those of individual ANN, LSTM, and bagging.
For wind speed ensemble prediction, Ibrahim et al. (2021) suggested an Adaptive Dynamic Particle Swarm System (AD-PSO) for machine learning in combination with the Guided Whale Optimization Algorithm (Guided WOA). The best hyperparameter variable for the LSTM DL system is chosen by the AD-PSO-Guided WOA method to anticipate wind speed. The outcomes showed that compared to multiple comparative efficiency and DL techniques, the AD-PSO-Guided WOA method performs well and offers excellent accuracy.
Lee et al. (2020) ensemble learning-based techniques to forecast time-dependent characteristics of the wind power data to build a method to estimate wind power output with an acceptable level of accuracy based on a variety of criteria. In essence, ensemble learning techniques pool the knowledge of several learners to provide predictions that are better than those of traditional standalone learners. The ensemble algorithms’ prediction performance with two regression techniques is contrasted. Comparing the ensemble approaches against the solo models, experimental data demonstrate that the latter is less accurate in predicting the output of wind power.
Suárez-Cetrulo et al. (2022) presents a novel method that accounts for constraints like curtailment and turbine deterioration to anticipate generating electricity at high frequencies 1 day ahead of schedule. Conducted an initial investigation to examine the correlation between all possible combinations of electrical power, predicted wind, and observed wind. Second, to evaluate the levels of predictability of various algorithms and geographies, a broad range of machine learning methods were applied to each site. In comparison to existing Machine Learning methods that estimate wind power 1 day in advance, our study and experimental findings demonstrate how enhancing ensembles is a more cost-effective approach in terms of runtime.
Peng et al. (2020) created a high-accuracy Wavelet Soft Threshold Denoising (WSTD) technique to forecast wind speed. Furthermore, a brand-new DL technique called the Gated Recurrent Unit (GRU) has been effectively created and used in conjunction with WSTD to forecast the wind speed series. The ensemble WSTDGRU approach uses WSTD to remove unnecessary details from the initial wind speed series data, and GRU to forecast later multi-step wind speed numbers. GRU is very useful for learning features when working in conjunction with WSTD. Ultimately, the test results confirm this methodology’s great performance, good flexibility, and reasonably high accuracy.
De Jesús et al. (2020) offer a new DL ensemble-based wind power prediction technique with very short forecast periods. To forecast wind power, a number of hybrid DL techniques—including CNN, LSTM, and HDNN—are integrated. To arrive at the ultimate wind power estimate, the suggested method uses ensemble averaging to take past wind speed data into account. An actual wind farm in Texas is used to simulate the stated approach and show that it produces a greater degree of very short-term wind power prediction precision.
Although these methods have been developed, several limitations remain many models consider solar or wind power but not both, little use is made of deep feature interactions between weather and energy variables, metaheuristic optimization is seldom combined. Furthermore, the combination of a high-level temporal attention module and spatial feature extraction is not well studied. The proposed SWIFT-Net addresses these issues by proposing a hybrid feature selection approach (HKOBO), combining attention-enhanced LSTM and Inceptionv2-CapsNet blocks and employing ensemble averaging to make robust predictions. This single architecture captures multimodal dependencies well and it performs better than both traditional and new ensemble models in terms of forecasting accuracy.
Problem definition
It is still difficult to forecast daily, weekly, monthly, seasonal, and yearly information for several sites with various observation/data sample sizes while utilizing an identical forecasting method. Data-driven methods offer a chance to enhance solar generation prediction because of the increased granularity of data available (Banik et al., 2020). Power systems with high wind penetration, experience increased security and stability with the invention and implementation of efficient forecasting of wind speed technology. Accurate W&S power forecasting is quite difficult because of the wind’s erratic and unstable characteristics. For this reason, several methods were put forth to raise the forecasting dependability threshold. These days, bidding procedures require estimates at the minute level 1 day ahead of time, which presents an additional challenge for many energy markets. It is crucial for safety to predict wind speed and efficiency of the power system’s functioning (Jalali et al., 2022). Three characteristics of wind speed—randomness, volatility, and unpredictability make reliable wind speed forecasting still difficult. Even though decomposition-based pre-processing techniques have been extensively researched in the past, their use in actual prediction is challenging since the values of the first decomposed segments would be significantly impacted by the recently received data.
Proposed methodology
The suggested approach starts with careful data gathering and pre-processing, filling in any missing values and using normalization to make sure all the features are the same. To reveal underlying links, feature extraction uses statistical measures, time-based indicators, frequency-based features, and metrics based on entropy correlation. Robust characteristics are chosen by hybrid optimization techniques such as Botox and Hybrid Kookaburra optimizations. For effective feature representation, the model architecture consists of an O-MA-LSTM and I2EMv2-E-CapsNet. Forecasts are incorporated using weighted averaging. Splitting data, training models, and adjusting hyperparameters are all part of training. The accuracy and dependability of the SWIFT-Net model, which has been verified against other classifiers, are demonstrated by the results, highlighting its potential to improve energy systems and facilitate sustainable energy transitions. The block diagram of the proposed solar and wind energy prediction model is shown in Figure 1. Block diagram of the proposed energy prediction model.
Data collection and pre-processing
Pre-processing data is essential for fixing missing value issues and enhancing data quality. While min-max scaling is employed to normalize the data and ensure that each characteristic has a uniform scale, interpolation is employed to fill in the missing values. Figure 2 shows the pre-processing process in a diagrammatic representation. Step by step procedure of the pre-processing phase.
Missing values handling
Two methods of interpolation quadratic and linear are employed in this stage for estimating the missing values (Noor et al., 2014).
Linear interpolation
Making a straight-line connection between two data points is the most basic type of interpolation. Equation (1) depicts the linear interpolation function
Let,
Data normalization
To make sure that all of the characteristics are scaled similarly, standardize the data using methods such as min-max scaling.
Min-max normalization
The values of a feature are scaled to lie between 0 and 1 using this method. This is achieved by subtracting the lowest value of the feature from each value and dividing the resulting number by the width of the feature. This data normalization process involves a linear transformation of the original data. The next equation (4) modifies each value after retrieving the minimum and maximum values of the data
Feature extraction
Extraction of pertinent features, such as statistical, time-, frequency-, entropy-, and correlation-based features, are done from the gathered data.
Statistical features
Mean
It calculates the data points’ average values, which are given in equation (5)
Standard deviation
Equation (6) is utilized to compute the dispersion or spread of the given collection of values
Minimum and maximum
It measures the highest and lowest values in the collected dataset using equation (7)
Percentiles of solar irradiance and wind speed
If the value is below the given percentage, the values mean the observation falls and is measured using equation (8)
Time-based features
Seasonality indicators
Time-based seasonality indicators, like the month, days, and hour of the day, are retrieved, representing the monthly, weekly, and hourly aspects of the data’s seasonality.
Time lags
Retrieve historical values for the wind speed and sun irradiance variable. For instance, the solar radiance value is
Moving averages
It calculates rolling averages over a given period. If the wind power’s moving average over the last
Frequency-based features
Wavelet transform is used to retrieve features from various frequency ranges. It is a mathematical transformation that allows frequency analysis by breaking down a signal into different frequency components. Furthermore, wavelet transform is used to extract features from various frequency bands.
Entropy and correlation-based features
Determine possible relationships by calculating the correlation and entropy between different features.
Entropy
Equation (10) is used to determine the dataset’s randomness and variability
Let,
Correlation
It establishes the two factors’ linear relationship’s intensity and direction. The correlation-based features are extracted using equation (11), which is the Pearson correlation coefficient (PCC) method
Let,
Feature selection
Use a hybrid optimization called HKOBO at this phase to determine which features are most important for model performance. The Kookaburra Optimization Algorithm (KOA) (Dehghani et al., 2023) assesses feature combinations according to prediction accuracy by imitating the Kookaburra bird’s hunting technique. A swarm intelligence method called the Botox Optimization Algorithm (BOA) (Hubálovská et al., 2024) assists in determining the best feature subsets. Hybrid Kookaburra–Botox feature selection strategy aims to improve the performance of the model in terms of efficiency and accuracy by choosing the most relevant features of high-dimensional input data. This strategy combines the global search ability of KOA with local exploitation capability of BOA. KOA, based on the intelligent foraging behavior of kookaburras, is able to explore the various areas of the search space well and avoid premature convergence. Conversely, BOA emulates the muscle-relaxation mechanism of botulinum toxin in biological systems, which allows it to carry out fine-tuning of results locally around good solutions. These two algorithms are hybridized to achieve a balanced trade-off between exploration and exploitation in feature selection. Consequently, the strategy discovers a small subset of features that are informative, minimizing the computational complexity and still retaining essential information that is needed in subsequent learning. The input to the SWIFT-Net ensemble model is then the selected features in order to achieve better predictive performance.
Process of the HKOBO
The kookaburra’s instinctive hunting habits, which involve striking their prey with trees to kill it, are far more fascinating than any scientific explanation. The clever techniques used in the architecture to choose the greatest features are these innate kookaburra habits. Equation (12) is used to represent the KOA population matrix, which is made up of kookaburras combined
Using equation (13), the starting arbitrary position of the kookaburras during KOA installation is determined
Let,
Equation (14) states that the set of examined values for the problem’s goal function and is expressed as a vector
Let,
The proposed KOA approach modifies kookaburra locations in the two separate phases of exploration and exploitation through an iterative process based on modeling real kookaburra actions in the wild. Provided next is the procedure for upgrading the KOA population in finding space.
Hunting approach (Exploration)
The way kookaburras choose and attack their prey causes them to move significantly in position. In this strategy, the term “exploration” refers to the method of closely examining the problem-solving space in order to identify the primary optimal area without becoming entangled in the local ideal. This procedure represents the global search.
Each kookaburra imitates its companions’ hunting approach by designating its position in the KOA mechanism as the prey location, since it has a higher goal value. Equation (15) is used to identify the possible prey set for every kookaburra based on the contrast of the function’s goal values
Let
Each kookaburra is thought to choose its target at random and attack it according to the KOA design. Equation (16) is used to identify the kookaburra’s updated position, which simulates the movement of the reptile toward its target in the hunting method
During this step, update the Botox optimization variables as
Let,
Making certain the prey is killed (Exploitation)
After assaulting its prey, kookaburras carry the victim with them and ensure that it dies by repeatedly striking it against trees. This is their second distinguishing trait. The kookaburra eats its meal after crushing it with its powerful claws. Kookaburras exhibit this behavior near their hunting grounds, which causes slight shifts in their location. This procedure, which embodies the idea of exploitation in local search, speaks to the method’s capacity to produce superior results close to discovered solutions and areas of potential. Equation (18) is used to determine an arbitrary location in the KOA architecture, which mimics kookaburra behavior based on where they travel about the hunting region
Let,
SWIFT-net: DL ensemble model
SWIFT-Net is a deep learning ensemble model, which incorporates the processing of temporal and visual data to increase the accuracy of forecasting. The model is based on two essential elements O-MA-LSTM and I2EMv2-E-CapsNet. The O-MA-LSTM module is used to learn sequential patterns in time-series data with the use of a multi-attention mechanism that selectively highlights the most relevant input features in various time steps. This increases the interpretability and the predictive power of the temporal model. In addition to this, the I2EMv2-E-CapsNet module is specialized in extracting and representing features of images of sky conditions. It integrates the multi-scale feature extraction properties of Inceptionv2 and the powerful spatial representation of Enhanced Capsule Networks (E-CapsNet) which enables it to preserve the hierarchical relationship in the visual input. To obtain the final output, SWIFT-Net uses weighted averaging approach to combine the predictions of the two sub-models, with the weights being computed using validation performance. Such ensemble strategy allows SWIFT-Net to obtain more consistent and generalizable predictions on a wide range of data modalities.
O-MA-LSTM model
To identify sequential data sequences and concentrate on pertinent elements of the input, this model makes use of an LSTM network with an improved MA strategy. The attention mechanism’s primary goal is to draw attention to a particular weight differential. The attention layer increased by the final representation, aids in the strategy performance improvement. By detecting sequential data patterns, the study’s improved MA system using LSTM predicts the W&S power (Zhao et al., 2021).
LSTM model
The memory gate in an LSTM cell is used to update the cell state and remove unnecessary information from the previous instant. The updating method makes use of the output of the previously buried layer as well as the present moment’s input. Several structures have been suggested to enhance the performance of LSTM. The trials’ findings indicate that the attention gate improves how well LSTM models are trained. Therefore, to capture consecutive data trends and concentrate on pertinent features of the input, the study updates the LSTM in the MA approach, shown in Figure 3. Architecture of the LSTM model.
Optimized MA
Their research identified the benefits of various attention heads for the queries, values, and keys by using an attention mechanism as a way to connect a variety of key-value pairings and a query to the outcomes. The multi-headed focus layer performs better than single-head attentive by computing the undetectable data by exponentially mapping the context vectors into several subspaces. Weighted numbers are calculated by the proposed framework using queries and their corresponding keys to determine the output. Equations (20) and (21), when applied to single-head attention, yield the attention time-dimension computation for attention weighting
Let,
Let,
Let,
I2EMv2 and efficient-CapsNet (I2EMv2-E-CapsNet)
This approach integrates Efficient Capsule Networks for reliable feature representation with Inceptionv2 for effective feature extraction on sky condition photos (if available).
I2EMv2
To accomplish the classification job, the I2EMv2 model (Wu et al., 2023) stacks successive convolution blocks to increase the receptive area and capture deep semantic data. The deep separable convolution feature, which splits the standard convolution into depth-wise and pointwise convolutions and significantly reduces the number of parameter values and calculations while maintaining accuracy, is the main innovation of the I2EMv2 system. • Inverted inception
The inception module is the basis for the inverted inception layout concept, which expands on it by employing a range of size convolution kernels to produce distinct responsive fields, enhancing the network’s width, and enhancing the model’s susceptibility to forecast W&S power. The inverted inception increases the feature information by using a 1 × 1 convolution opposite dimension grow, extracts features using a variety of convolution kernels, and fully employs the two pooling techniques—maximum pooling and average pooling—to boost feature richness and enhance network expansion capabilities. 1 × 1 convolution reduction in dimension matching is performed after multi-scale feature fusion. After convolution, BN layer batch standardization is applied to remove gradient distribution, prevent network saturation, and speed up convergence. • Efficient excitation and filtering bottleneck
By using mask vectors to apply weights to characteristic channels, the attention mechanism simulates the link of dependency among crucial information in features and channels, avoids internal data in channels, and enhances the efficacy of semantic features. As opposed to a fully linked layer learning channel weights, the ECANet utilizes 1 × 1 convolution to prevent information loss that results from dimension reduction followed by dimension increase. 1 × 1 convolution has fewer parameters than fully linked layers, making it appropriate for creating lightweight convolutional neural networks. Equation (28) defines the ECA formula
Let,
Let, Block diagram of the I2EMv2 model.
E-CapsNet
E-CapsNet consists of three components: the main capsule layer, the convolution layer, and the self-attention mechanism-based routing (Mazzia et al., 2021). To get ready to create the capsule, the input is converted to a more complex space in the convolution layer using BN and multi-layer convolution. In the second section, depth separable convolution is used to map the features with high dimensions. Comprehensive convolution: the number of input channels and the amount of feature map layers are identical. Each channel is associated with a convolution kernel. Point-by-point convolution is used to convert the generated feature map into channels, with each input channel denoting a filter. By using point-by-point convolution, the depth layer’s output is merged sequentially, and the channel from the previous layer’s depth convolution output is projected into the new channel space. The self-attention method is used in the last phase to route the first capsule to the broad target object. It is possible to obtain each layer’s trained feature distribution using equations (30), (31), and (32), respectively
The routing portion of the self-attention system is comparable to a fully connected network. The input to the top capsule is the weighted sum of all the “forecasting matrix” from the bottom capsule, and every Wind and power category is represented by an output capsule. The network’s squeezing f unction is distinct from CapsNet’s. An implementation of the squeeze function variation described in equation (33), aimed at preventing the network from becoming convergent due to an initial vector length of 0
Let,
Final forecasting
Initially, generate prediction results of both techniques O-MA-LSTM and I2EMv2-E-CapsNet, and normalize the prediction values into the mean of 0 and standard deviation of 1. Apply weight to both models based on the performance. Assign higher weights to the model that historically performs better or has more credibility. Finally, calculate the weighted average by multiplying both prediction results based on the sum and weight to form a combined forecasting. The calculation of the weighted average technique is detailed in equation (34)
Let,
Results
This section compares the results of the suggested model with those of other methods. We use the Python platform for the implementation. Two versions of the model must be created in order to train it on a single subgroup and then assess its ability to generalize. Thirty percent goes toward testing, while the remaining 70 percent is used for training. Utilize measures such as Explained Variance Score (EVS), Mean Squared Error (MSE), Mean Percentage Error (MPL), Mean Squared Logarithmic Error (MSLE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and Root Mean Squared Error (RMSE) to assess the performance of the ensemble model.
Dataset description
The SWIFT-Net model suggested is trained and validated on the publicly accessible Renewable Energy and Weather Conditions dataset provided by Kaggle (https://www.kaggle.com/datasets/samanemami/renewable-energy-and-weather-conditions/code). The data set consists of about 42,000 records measured at an hourly frequency with a time span between January 2016 and December 2020. It has 17 features altogether, including various meteorological and energy related variables, including temperature, pressure, humidity, wind speed, precipitation (rain and snow), cloud cover, solar radiation (GHI), and energy consumption delta (Energy delta (Wh)). There are also other attributes like availability of sunlight (isSun), time of sunlight, length of day, and SunlightTime/dayLength, as well as weather_type, hour, and month, which assist in temporal and categorical modeling. Preprocessing entailed missing value imputation through linear interpolation, continuous feature normalization with the Min–Max method, and categorical variables one-hot encoding. The data was divided into training and testing data with the ratio of 80:20, which was chosen due to the hyperparameter tuning to maximize the performance and minimize the overfitting. Despite the completeness of the data, it seems to be geographically limited to a particular area in Europe, that be a limitation when it comes to generalizing the results to other climatic regions.
Overall comparison by varying the train data split
The results on comparison of the performance of the proposed SWIFT-Net model with other models such as KNN (22), ANN + LSTM (23), GRU (27), and EDL (28)) on the basis of various evaluation metrics like EVS, MAE, MAPE, MPL, MSE, MSLE, and RMSE) reveal that the proposed SWIFT-Net model outperforms other models in all evaluation metrics in both training-validation splits (70% and 80%). In particular, SWIFT-Net has the highest explained variance score (EVS), which means that it is better at explaining variance in the data. At the same time, it shows the least values of errors among all the other measures, such as MAE, MAPE, and RMSE, which demonstrates its accuracy and strength. The lower prediction loss (MPL) and smaller values of MSE, MSLE also prove the effectiveness of the model in reducing both small and large errors. These findings confirm the high generalization power and better prediction accuracy of SWIFT-Net to process the complex and multimodal renewable energy data. Figure 5 displays the comparison of various metrics with the proposed method. Comparison of various metrics with the proposed method.
MSE comparison showing improved performance with HKOBO over KO, BO, and no optimization.
Comparison of O-MA-LSTM, I2EMv2-E-CapsNet, and SWIFT-Net under different data splits. SWIFT-Net shows the best performance across all metrics.
Statistical significance testing of SWIFT-Net vs. existing models (MSE & RMSE).
Discussion
The high performance of the proposed SWIFT-Net model can be explained by the fact that it combines the hybrid feature optimization (HKOBO), deep time modeling with the attention-enhanced LSTM, and advanced spatial representation with the Inceptionv2 and Capsule Networks. The hybrid optimization assists in the identification of highly relevant features, which minimizes noise and dimensionality. In the meantime, the temporal attention process enables the model to concentrate on the important time durations, which enhances sequence learning. The ensemble design also stabilizes the predictions through the exploitation of the complementary power of the two branches of the model. SWIFT-Net outperforms the baseline models in all metrics of evaluation; this is because the model is capable of capturing complex dependencies in multimodal data as compared to the baseline models. Nevertheless, this improved performance is at the expense of greater computational complexity and more time-consuming training. Also, the model deep architecture can only be used with large datasets to avoid overfitting, and additional research is required to enhance its interpretability. These limitations notwithstanding, SWIFT-Net is a good framework to forecast renewable energy, and the trade-off between performance and model complexity is reasonable. The evaluation is provided using several metrics, such as MSE, RMSE, and MAE, as they measure various properties of errors. It is not guaranteed that the proposed model outperforms in each of the metrics individually, but it consistently performs well and reliably in all of them, which means that it generalizes well. Small variations in scores indicate trade-offs between the sensitivity of the metric and do not compromise the overall performance of the approach.
Conclusion
In conclusion, the paper introduces a comprehensive methodology for forecasting solar and wind power outputs, crucial for efficient energy conversion and system optimization. Through meticulous data handling, feature extraction, and the use of hybrid optimization algorithms like HKOBO, we identify key factors essential for accurate predictions. The proposed SWIFT-Net ensemble model, combining O-MA-LSTM and I2EMv2-E-CapsNet, demonstrates promising results in capturing complex data relationships. By validating the proposed model against established classifiers using MSE, RMSE, and MAE metrics, it ensures the reliability and continual refinement. This approach not only reduces reliance on costly storage solutions but also contributes to the efficient utilization of renewable energy resources, facilitating a sustainable energy transition. Overall, the SWIFT-Net model represents a significant step forward in solar and wind power forecasting, offering a robust framework for optimizing energy systems and supporting a more sustainable energy future.
Footnotes
Acknowledgments
The authors would like to thank the Puducherry Technological University for supporting this work.
Author’s contributions
Mr R. Tharwin Kumar, He performed the conceptualization, Methodology, Data collection and writing the study. Dr C. Christober Asir Rajan, He analysis the dataset and conceptualization in the study.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
All the data is collected from the simulation reports of the software and tools used by the authors. Data will be made available upon reasonable request.
