Abstract
To estimate a wind turbine output, optimize its dimensioning, and predict the economic profitability and risks of a wind energy project, wind speed distribution modeling is crucial. Many researchers use directly Weibull distribution basing on a priori acceptance. However, Weibull does not fit some wind speed regimes. The goal of this work is to model the wind speed distribution at Agadir. For that, we compare the accuracy of four distributions (Weibull, Rayleigh, Gamma, and lognormal) which have given good results in this yield. The goodness-of-fit tests are applied to select the effective distribution. The obtained results explain that Weibull distribution is fitting the histogram of observations better than the other distributions. The analysis deals with comparing the error in estimating the annual wind power density using the examined distributions. It was found that Weibull distribution presents minimum error. Thus, wind energy assessors in Agadir can use directly Weibull distribution basing on a scientific decision made via statistical tests. Moreover, assessors worldwide can use the followed methodology to model their wind speed measurements.
Keywords
Introduction
Morocco is the only country in North Africa which has no reserves in fossil fuel. The demographic and economic growth of the kingdom will increase the energy demand up to 70% till 2025. Therefore, Morocco imports 97% of its energy needs. That makes this developing country sensitive to fluctuations of energy prices in global market. Since 2008, the government has planned a new strategy for development and investment in renewable energy projects. Its aim is to increase the share of renewable energy to 42% by 2020, where 14% is wind energy (Ettaik, 2016). For that reason, an accurate wind power assessment is extremely necessary.
Jourdier (2015) has stated that in recent years, industrialists in North America have observed an under-production (approximately 10%) of existing wind farms and that a negative bias in estimating the expected wind energy on more than 100 wind farms is noted in the United States and Canada. For Morocco, there are no official statistics. In fact, if the installed wind turbines produce less than expected, the profitability of the wind power project is in risk, especially if these differences continue during the project lifetime (20 years in general). In addition, the overestimating of potential increases the loan interest. Indeed, to assess the economic profitability for a wind energy farm, all economic indicators (net present value, payback period, income, annual saving, etc.) are based on the estimated cost of energy (COE), which is defined as the average cost per kWh of the produced wind electricity. It is obtained by dividing the annualized cost (AC) by the annual electricity generation
The previous equation shows that COE depends strongly on the annual electricity generation. Therefore,
where T is the time in hours of a full year, f(v) is the wind probability density function (PDF), P(v) is the machine power curve, and v is the wind speed (m/s).
As one can see in equation above, to avoid the overestimating of potential, it is crucial to choose the PDF that models the wind speed distribution accurately. Moreover, that helps to analyze and interpret the wind speed variation, manage the wind farms, and optimize its dimensioning. The power curve has also impact on the accuracy of wind power assessment. But, its uncertainty is less than estimation of the PDF (Morgan et al., 2011) (investigation on impact of modeling the power curve will be the subject of a future study).
To model the wind speed, many researchers have used directly the Weibull distribution. However, they are based on a priori acceptance, because it does not always display some wind speed characteristics well, and it may not fit all wind speed regimes encountered in nature. For that, several distributions (such as Weibull, lognormal, Rayleigh, Inverse Gaussian, Gamma, generalized Normal, etc.) are examined by many researches.
In this wise, the goal of this study is to model the wind speed at Agadir in Morocco in order to reduce the financial risk in investing in a wind energy project and give investors the confidence in its economic feasibility. For that, after presenting a literature overview of many studies in wind speed modeling, we compare four distributions (Weibull, Rayleigh, Gamma, and lognormal distributions) which are commonly used by researches (Alavi et al., 2016; Aries et al., 2018; Sohoni et al., 2016; Wang et al., 2016) and which have shown good results in fitting the unimodal wind speed distribution according to the literature. To carry out those distribution performances regarding the modelization of wind speed regime at Agadir, we conduct a statistical study using four tests of goodness of fit, namely, the coefficient of determination, the root mean square error (RMSE), the Chi-square test, and the mean bias absolute error (MBAE). For more accuracy in modeling the wind speed at Agadir, we deal with calculating the error between the estimated wind power density via the studied theoretical distributions and the calculated wind power density using measurements.
Literature review
Many researches compared several PDFs in order to select the most efficient in fitting their measurement. Morgan et al. (2011) have compared the non-parametric kernel density distribution with six parametric distributions (Weibull, Rayleigh, lognormal, Gamma, Normal, and loglogistic) in four sites in China. The achieved results show that the non-parametric estimation outperforms the parametric models. Pobočíková et al. (2017) have evaluated the suitability of four analytical distributions (two-parameter Weibull; three-parameter Weibull, Gamma, and lognormal) in fitting the wind speed distribution in Slovakia. It was found that the three-parameter Weibull performs as the best distribution. Yilmaz and Heçeli (2008) have compared 10 PDFs, namely, Erlang, Beta, exponential, loglogistic, lognormal, Gamma, Pearson V, Pearson VI, Weibull, and Uniform. Weibull was found to be the best one to fit the wind speed distribution at Aegean region in Turkey. Morgan et al. (2011) have modeled the wind speed at 178 stations in United States. They have concluded that the bimodal Weibull gives the best results. For higher wind speeds, authors have found that the Wakeby and Kappa distributions are the most suited, those distributions estimate the average wind turbine power output very with more accuracy. For extreme wind speeds, the two-parameter lognormal distribution performs better. Sohoni et al. (2016) have compared Gamma, Weibull, Rayleigh, lognormal, Inverse Gaussian PDFs in India; they have found that Weibull distribution is efficient in fitting the wind speed data with less skewness, and Gamma distribution fitted better to wind speed data which presented highly skewed histograms. Ben Amar and Elamouri (2011) have developed a new model using maximum entropy. They have compared this model with Weibull distribution. As a result, the proposed model adjusts better the measured data.
To reduce the uncertainty in wind potential evaluation in four sites in Algeria, Aries et al. (2018) have assessed the wind speed distribution models. They have compared eight distributions, namely, Weibull, Gamma, Inverse Gaussian, lognormal, Gumbel, Generalized Extreme Value, Nakagami, and Generalized Logistic. As a result, the Generalized Extreme Value and Gamma are the most suited. Furthermore, authors deal with comparing power density error. Gamma, Inverse Gaussian, and lognormal were found to be the most appropriate. Parajuli (2016) has fitted the measured data using Weibull and Rayleigh PDFs; the author has concluded that Weibull is accurate in estimation of wind power at Jumla in Nepal. Akgül et al. (2016) have used the Inverse Weibull distribution to model the wind speed variation at two sites in Turkey. This PDF has been compared with Weibull distribution and it was concluded that Inverse Weibull performances better than Weibull. In order to estimate the wind speed distribution at five stations in Iran, Alavi et al. (2016) have evaluated a various PDFs using the actual and the truncated data. They have compared the performance of four typically used distribution functions (Weibull, Gamma, Rayleigh, and lognormal functions). Results indicate that for actual wind speed data, the effective PDF is lognormal while Weibull PDF performs better for the truncated data. Ouarda et al. (2015) have analyzed the wind speed distribution at nine stations in United Arab Emirates. The authors have carried out the performance of non-parametric models, parametric models, and mixture models. Results show that Generalized Gamma and Kappa distributions give the best fit to the observations. The two-parameter Weibull shows also a good fit comparing with Generalized Extreme Value and three-parameter lognormal. For bimodal regimes, the mixture distributions provide a very good result than non-parametric distribution
Methodology
Case study and wind speed data
The wind speed data in hourly time series format were recorded at a height of 10 m at the International Airport of Agadir Al Massira over the year 2016 (geographical coordinates: 30.33N, 9.40O). Figure 1 represents the monthly averages of the wind speed. The maximum monthly average is 3.71 m/s, while the minimum is 3.05 m/s. The wind speed at Agadir appeared to be uniform during the year. The annual standard deviation was found to be equal to 1.91. It is weak enough, which means that observations revolve around the average value. The monthly mean wind speed and the annual standard deviation are calculated using the following equations, respectively
where N is the number of measurement.

Monthly averages of wind speed at height of 10 m.
The available data are summarized in Table 1. For each wind speed class with an equal width of 1 m/s, the number of occurrences is presented (second column), the measured wind speed frequency
Wind speed number of occurrences, its frequency, and its cumulative probability.
Wind speed distribution modeling
In order to choose the PDF that fits better the wind speed at Agadir region, we compare in this study four distributions namely, Weibull, Rayleigh, Gamma, and lognormal. Those distributions were selected because they have been used commonly by researches and they have shown good performances regarding the unimodal wind speed distribution modeling. Those distributions definitions including their PDF and cumulative distribution function (CDF) are described below. Their other statistical properties are reported in Table 2, in which Γ is the Gamma function. It is expressed as
Statistical properties of the studied distributions.
Weibull distribution function
The Weibull distribution is a two-parameter function, namely, scale parameter c (m/s) and shape parameter k (dimensionless). It is the most used distribution in wind energy field, because it presents the majority of wind distribution in worldwide. Furthermore, it is flexible and simple to use (Tizgui et al., 2016). The Weibull PDF (f) and CDF (F) are given by equations (6) and (7), respectively
Rayleigh distribution function
The Rayleigh PDF has just one parameter that makes it the simplest distribution and the most used by researchers to model the wind speed. It is a particular case of Weibull PDF in which the shape parameter k = 2. The Rayleigh PDF and CDF are expressed mathematically as
where c is the scale parameter.
Gamma distribution function
The gamma distribution is also applied to wind speed by many researches. It indicates the sum of the exponentially distributed random variables. The expression of the gamma distribution PDF and CDF are presented by the following equations, respectively (Pobočíková et al., 2017)
where k is the shape parameter, θ is the scale parameter, and
Lognormal distribution function
The lognormal distribution is one of the versatile distributions. Since it provides a good result regarding the wind speed distributions modeling, it has been used by many researches. If the logarithm of the wind speed is distributed in a normal manner, the wind speed can be modeled by lognormal PDF. The formula of the lognormal distribution is (Aries et al., 2018)
where
Estimation the distributions parameters
Many methods can be used to estimate the studied distributions parameters (Tizgui et al., 2017). In this study, the maximum likelihood method (MLM) is applied. This method gives the values of the parameters which maximize the probability of obtaining the measured data. For each parameter
The likelihood function (L) for a random sample
Table 3 lists the equations to estimate the parameters of the used PDFs using the MLM.
Parameters estimation using MLM.
MLM: maximum likelihood method
Goodness-of-fit tests
In order to determinate the distribution that fit better the wind speed, we conduct a statistical study, which is based on calculating the differences between the predicted values from the models and the experimental values. The used statistical tests are given below.
For all equations (17)–(20), n is the number of bins,
The coefficient of determination
The coefficient of determination
RMSE
RMSE calculates the residuals of frequency of examined PDF and the measured data (Pobočíková et al., 2017). A model trend to be precise, if the RMSE is close to zero. The RMSE formula is given by
Chi-square test
Chi-square test
MBAE
MBAE can be also used to assess the quality of each used distributions. It measures the average of total absolute bias errors between the theoretical PDF and the frequency obtained by actual observations. MBAE is defined by Jung and Schindler (2017) as
Available wind power evaluation
One distribution may be better in fitting the histogram of observations but not on estimating the power density. For example, Weibull and Rayleigh models were compared by Ahmed and Mahammed (2012); they found that for the whole year, Weibull distribution fit the measured data better than the Rayleigh, but the Rayleigh distribution provides more accuracy in estimating the power density in 9 months. For that, we deal with comparing the estimated wind power using the four distributions and observed data. For a wind device located perpendicular to the direction of the wind speed, the average of available wind power density is
where
For a theoretical distribution, the wind power density estimated using the theoretical PDF
Absolute error in estimating the wind power density
In order to select the PDF that estimates the wind power density with more accuracy, we calculate the absolute error (AE) between the wind power density estimated using the measured data and that obtained using the four theoretical distributions. This error can be expressed as (Ben Amar and Elamouri, 2011)
Results and discussion
Measured data adjustment
As we mentioned, in this study, we highlight the effectiveness of four distributions which are widely applied to fit wind speed, namely, Weibull, Rayleigh, Gamma, and lognormal. Figures 2 and 3 depict the histogram of the wind speed data adjusted by the four PDFs and its CDFs, respectively. Graphically, it can be noticed that Weibull, Rayleigh, and Gamma PDFs may be acceptable, but lognormal PDF is poor in fitting the data distribution and its cumulative function. Table 4 presents the parameters of the considered PDFs obtained using the MLM under MATLAB software.

Data distribution fitted by the four theoretical PDFs.

Data cumulative frequency and the four theoretical cumulative functions.
Estimated parameters for the studied distributions.
Goodness of fits
To address the sensibility of the studied PDFs in term of fitting with more accuracy the wind speed distribution at Agadir region, we applied the most used statistical indicators, namely
Comparison of Weibull, Rayleigh, Gamma, and lognormal PDFs.
PDF: probability density function; RMSE: root mean square error; MBAE: mean bias absolute error.

Goodness-of-fit tests: (a)
The values of
In general, it can be seen that the Weibull PDF gives the maximum value of coefficient of determination, minimum values of RMSE, Chi-square, and MBAE, followed by Rayleigh and Gamma distributions. While the lognormal PDF is the worst distribution in modeling the measurements, it is ranked fourth for all indicators.
Error in estimating the power density
In addition to goodness-of-fit indicators (
Table 6 presents the annual wind power density calculated using measured data and the annual wind power density estimated via the four studied distributions. The error on estimating the power density using the considered PDFs is also given in the table. The average error was found to be 0.88% for Weibull PDF, 6.12%, 18.49%, and 83.14% for Rayleigh, Gamma, and lognormal, respectively. We concluded that Weibull PDF presents a minimum error in estimating the annual wind power density. So, it is the effective and most accurate distribution to model the wind speed at Agadir.
Errors in estimating wind power density using the four studied distributions.
AE: absolute error.
Conclusion
The overestimating of potential influences negatively the profitability of a wind energy project. It may be caused by the uncertainty in wind speed distribution modeling. In fact, it is common to fit the wind speed distribution by Weibull PDF. However, Weibull PDF may not always model the wind speed. Therefore, in this study, the wind speed data in hourly time series format over the year 2016 at Agadir are statistically analyzed and fitted by four candidate PDFs (Weibull, Rayleigh, Gamma, and lognormal) were fitted to a wind speed sample at Agadir. Their parameters were estimated using MLM. Graphically, the PDFs and cumulative functions curves indicate that the lognormal gives the worst adjustment to the measurements and the other PDFs are acceptable. In order to make a scientific decision, four tests of goodness of fit (coefficient of determination, RMSE, Chi-square test, and MBAE) are used to select the effective distribution that fit better the histogram of observations. As a result, lognormal is ranked fourth for all statistical indicators. The Weibull PDF was found to be the best one. It is ranked first and fits well to wind speed data with maximum value of coefficient of determination and minimum values of RMSE, Chi-square, and MBAE. Gamma distribution is ranked second according to
Furthermore, we deal with calculating the error between the annual estimated wind power density via the studied theoretical distributions and the calculated wind power density using measurements. The error was found to be 0.88%, 6.12%, 18.49% and 83.14% for Weibull, Rayleigh, Gamma, and lognormal, respectively. We note that Weibull PDF presents minimum error in estimating the annual wind power density. Therefore, we concluded that Weibull PDF is the most suitable distribution that models wind speed at Agadir. Thus, wind energy assessors in the studied region can use directly Weibull distribution basing on a scientific decision made via statistical tests. Moreover, assessors worldwide can use the followed methodology to model their wind speed measurements with more accuracy.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
This study was supported by the Moroccan National Center for Scientific and Technical Research (CNRST).
