Abstract
The demands to food in China rural areas are strong while construction of food supply chain lags. Lack of statistical data presents difficulties in construction of food supply chain in China rural areas. This paper designed the prediction technique based on nonlinear combined AWNG-BP. Firstly, this prediction technique used BP neural network to nonlinearly combine conventional regression prediction, three exponential smoothing prediction and grey sequence prediction, in order to reduce negative effects on prediction results caused by data missing and abnormal data. Secondly, modified the inherent defect and especially solve structure optimization problems of general BP neural network, by modifying error function and introducing dynamic self-adapting and genetic algorithm of ecological niche. Thirdly, made sensitivity analysis to three input variables of the prediction technique based on nonlinear combined AWNG-BP, then turned the sensitivity analysis data into a sensitivity curve and synthesized the three sensitivity curves into an image. The image prediction results indicate: first, each single prediction technique has imbalanced effects on prediction results of food supply chain scale in rural areas of Hubei Province of China and the multiple regression and three exponential smoothing prediction affect system evidently; second, total consumption of cold chain food in Hubei Province has been in saturation and will decline slightly in future.
Keywords
Introduction
The rural economy in China has been developed rapidly and the living standard of rural residents is improved obviously, but the development of rural food supply chain lags so much that it cannot meet rural residents’ active demands to food. This deficit between supply and demand has caused huge social contradictions, therefore governments and related functional departments urgently need to know tendency and scale of future development of food supply chain in rural areas well to achieve reasonable planning and guidance to it. However, during the prediction of development scale of food supply chain in rural areas, several difficulties had been countered: first, food in rural areas of China is self-sufficient all the time and the food supply chain – this modern management mode – is introduced lately, so theoretical circle has not mastered law of development and approach of food supply chain in rural areas. Second, rural areas in China is vast and population flow is huge, the statistical work related to food is difficult to conduct, thus the statistical caliber in various area is different and the statistical data lacks greatly. Third, food supply chain is a complex nonlinear system, so the prediction accuracy of existing prediction technique is not ideal.
The combination prediction technique means appropriate methods are applied to combine various single prediction techniques to highlight the advantages and avoid disadvantages of single prediction technique, aiming at obtain better prediction results than that of single prediction technique [1]. In 1969, J.N. Bates and G.w.J. Granger first proved that the output results from prediction technique combined with two unbiased single prediction techniques evidently preceded the prediction results from each single prediction technique. Thereafter, foreign and domestic research scholars all were devoted to study prediction accuracy, combining mode and theoretical support of combined prediction technique and obtained abundant achievements [2–4]. Spyros and Robert [5] combined two different prediction techniques and found that the error rate of prediction results could be reduced by 7.2% during the extrapolation experiment of 111-time series data. When the single technique was added to five in the combined prediction model, the error rate of prediction results could be reduced by 16.3%. In the same sample data, if prediction model based on econometrics was applied to predict and calculate, the same conclusion could be drawn. Meantime, the research showed, the prediction accuracy combined prediction models based on time series and based on regression statistics was 51% higher than the average accuracy predicted by any single model and 44% higher than the best accuracy of two single models.
Logistics scale in the area is important basis for formulating development policies and plans of logistics in the areas [6]. But there are many factors affecting logistics scale, so traditional single prediction technique cannot fully consider various factors affecting predicted objects and the prediction accuracy is low. Given this, we planned to apply the method of combination prediction, that is, nonlinearly combining the conventional regression prediction, three exponential smoothing prediction and grey sequence prediction using BP neural network, modify and optimize inherent defects of general BP neural network, propose a brand-new nonlinear combined AWNG-BP prediction technique and the sensitivity analysis method based on image, demonstrate with the example of food supply chain in rural areas of Hubei Province, China, testify adaptability, effectiveness and superiority of the new prediction technique and provide effective data support for the food logistics planning and construction by government and related functional department.
Data source and explanation
Determination of statistical index
Scale of Supply Chain is a comprehensive and obscure idea. For the reasons of economy, politics and policies in different countries and areas, there is certain deviation in understanding and implementation of the idea. But in different circumstances, three statistical indexes can be used to measure it: cost of supply chain, volume of freight and GDP ratio. The relevant statistic data of food supply chain in rural areas of China was badly missing, so we could not obtain specific values of the above three statistical indexes. And therefore, according to relevant literature, suggestions from experts and scholars and availability and authenticity of statistic data, and combining the real situation of food supply chain development in rural areas in Hubei, we assumed: the food supply chain development in rural areas in Hubei followed the general rules of supply chain system and total food consumption could show the food demand and scale of food supply chain system of this area in a certain period.
Data source and processing
We picked annual consumption of regular food (including meat, fish, egg products, vegetables, fresh and dried fruit, dairy product and pastry, etc.) in rural areas in statistical yearbook of Hubei Province from 2000 to 2009, formulated total food consumption in rural areas of Hubei Province from 2000 to 2009 by accumulating, and the statistical data was the scale of food supply chain in rural areas of Hubei Province. We took the circumstances of paroxysmal missing of data of some products in statistical yearbook as abnormal data in statistics, which would be obtained by interpolation method of statistics. See Table 1 for specific data.
2000–2009 total food consumption of rural residents in Hubei Province (t*10,000)
2000–2009 total food consumption of rural residents in Hubei Province (t*10,000)
BP neural network and its optimization
BP neural network is a kind of feed forward artificial neural network, has typical features of artificial neural network and is a main application form of artificial neural network. But there are two inherent defects in BP artificial neural network, that is, algorithm and structure of artificial neural network. We proposed the following solutions for the two major problems existing in algorithm.
(1) Algorithm of BP neural network
The correcting algorithm of error of BP neural network was linear gradient descent. Connection weight and threshold value of this algorithm was adjusted based on constant study step and inertia factor. When self-correction could not be conducted based on the real error during training, once the optimal point was missed, convergence would slow down, even stop [7]. So, linear gradient descent algorithm had two evident disadvantages: ➀ being easily stuck in local optimum; ➁ convergence slowed down and even stopped.
For the two problems, we optimized algorithm of general BP neural network.
➀ Problem of being easily stuck in local optimum.
Error term (Formula 1) of general BP neural network was a kind of pattern of manifestation of squared error. The error formula dissatisfied positive definite condition in weight space, was not convex quadratic form and had many traps of local optimum, which was against process of optimizing. Therefore, we corrected error term as single-pole function (Formula 2) to basically solve the problem of being easily stuck in local optimum of general BP neural network.
Wherein, O k = (o1, o2, …, o p ) was target vector;
D k is output on output layer;
λ is adjustment coefficient.
➀ Problem of slow convergence.
We introduce the self-weight and adaptive threshold value to solve the two problems of the general BP neural network algorithm. The first is easily falling into partial optimum; the second is slow convergence, even without convergence. In the ordinary BP neural network algorithm, momentum factor α and study step length (study rate) η are two important parameters which determines the convergence speed of neural network and to some extent. Too large or too small of them will cause the problems in the convergence process [8]. Given this, we set the momentum factor α and study step length (study rate) η to the adaptive, real-time variable which can change with the error information. Namely in the relatively flat region of error function space, properly increasing the value of step length η and momentum factor α and speeding up the optimum-seeking speed of BP neural network; while in the relatively steep region of error function space, properly reduce the value of step length η and momentum factor α, and the optimum-seeking speed of BP neural network to avoid the aggravation of concussion. That method contributes to better seeking for the globally optimal solution and speeding up the convergence speed of BP neural network.
The calculation formula of self-adaption weight and threshold values:
Where, w ij , v ij are representatively the connection weights from input layer to middle layer and from middle layer to output layer;
θ j , δ k , are representatively the output threshold values of middle layer and output layer in each unit;
α is momentum factor and η is study step length (study rate).
The specific implementation pattern of momentum factor and study step length is:
Set 0 < α < 1; α0 = 0.6, αmin = 0.1, αmax = 0.9 are set in the region.
(2) Structure of BP neural network
Structure optimization of BP neural network could be regarded as the problem of optimal solution in structure space. The uncertain factor in structure space of BP neural network was the numbers of neurons in hidden layer. The numbers of neuron in hidden layer had great effects on convergence rate and prediction accuracy of BP neural network. If there were too much of it, there would appear too slow convergence rate and over fitting; on the contrary, web-based learning would not be enough and the requirement of training accuracy could not be met. With respect to algorithm optimization of BP neural network, the structure optimization of BP neural network was more difficult and there were no clear methods and rules to follow still. At present, common methods for structure optimization of BP neural network were empirical approach and trying method.
Niche genetic algorithm [9] was a kind of globally optimized heuristic algorithm [10] using living beings to find the positive features in reproduction of the same species and overcoming the disadvantages of general genetic algorithm through balancing chromosome’s structure. The core idea of this algorithm was: divide each generation of chromosome into several “cluster” and each cluster was a little environment; then, individual with larger fitness in each cluster could be selected to be excellent representatives to constitute a population; at last, a new generation of chromosome could be produced by selection, hybridization and variation. Introducing ecological niche technology in basic genetic algorithm could make all chromosomes breed and evolve in designated living environment, ➀ effectively control the procedure of selection, hybridization and variation and lower randomness [11]; ➁ control “inbreeding”, keep diversity of chromosomes at the utmost and avoid the algorithm being stuck in local optimization; ➂ at the same time, improve global searching ability and convergence speed of algorithm.
That nonlinear combined prediction model demanded little for data could reduce the effects of distorted or missing data on prediction results at the utmost and was especially fit for predict system short for statistical data, but it had difficulties in complex nonlinear approach function. Although BP neural network had very strong nonlinear mapping capabilities, it still had inherent disadvantages. Obviously, nonlinear combined prediction model and BP neural network could compensate mutually. Therefore, we proposed to adopt improved BP neural network to nonlinearly combine the three single prediction techniques (conventional regression prediction, three exponential smoothing prediction and grey sequence prediction) and formulate a kind of new prediction model of BP neural network based on nonlinear combination, adaption weight and niche genetic algorithm (hereafter referred as AWNG-BP prediction model based on nonlinear combination).
(1) AWNG-BP prediction model based on nonlinear combination
➀ Numbers of network layers: design a three-layer model structure including input layer, hidden layer and output layer.
➁ Numbers of neurons in layers and sample dimension: assume that there were three neurons in input layer of prediction model and the input data were fitted values (2002–2009) against scale of food supply chain in rural areas of Hubei Province predicted by multiple regression, three exponential smoothing prediction and grey sequence prediction; the numbers of neurons in hidden layer was decided by experiment via genetic algorithm of ecological niche; there was one neurons in output layer, outputting prediction results of scale of food supply chain in rural areas of Hubei Province. See Tables 2 and 3 for model structure and sample dimension.
Model structure of AWNG-BP prediction model based on nonlinear combination
Model structure of AWNG-BP prediction model based on nonlinear combination
Sample dimension of AWNG-BP prediction model based on nonlinear combination
(2) Algorithm flow of AWNG-BP prediction model based on nonlinear combination
➀ Pre-processing of data
For sampling data had different dimension, in order to ensure authenticity and reliability of training results, input and output data should be processed with normalization. At the same time, to quicken the backward transmission of error and avoid slow change space of two tops of Sigmoid function, the data after normalization should be corrected. See Table 4 for preprocessed data.
Data after preprocessing
Data normalization processing:
Data correction processing:
➁ Parameter determination: transmit function: Sigmoid function; weight value and threshold value: initial weight value and threshold value were produced by random function; expectant error: formula (2).
➂ Algorithm formulated two-layer circulation, internal and external, wherein, the internal circulation was to determine the numbers of neuron in hidden layer and external circulation was to obtain the final prediction results by gradual convergence of prediction error.
➃ External circulation adopted AWNG-BP algorithm, see Formula (3–10).
➄ Internal circulation adopted genetic algorithm of ecological niche: Coding: the maximum empirical value of the numbers of neuron in hidden layer of prediction model was coding object, which was coded by binary system. If the gene bit in corresponding binary coding was 0, it indicated that this node was invalid or there were no network connections; if it was 1, there was corresponding node, and the weight value and threshold value were valid and needed to be assigned with starter and this node would join in the training of BP neural network. For example, if the maximum empirical value of the numbers of neuron in hidden layer was 15, chromosome would be initialized based on 15 bits. If one of chromosomes was coded as 110100110110111, it meant 1, 3, 4, 7, 8, 10, 11, 13, 14 and 15 gene bits were selected as neurons in hidden layer, that is, there were 10 neurons in hidden layer. Population initialization: the number of population was assumed as m, m = 20. Chromosome was generated randomly according to the number of population. Fitness function: reciprocal of error term. Selection: strategies of championship + roulette. First, select two chromosomes with maximum values of fitness and enter into filial generation without cross and variation; then, select remaining chromosomes with roulette mechanism according to the value of fitness. Cross and variation: single point crossed randomly with probability of 0.95; single point with random variation with probability of 0.05. Obsolescence in ecological niche: in each generation of group, first, compare hamming distance - code distance - between any two individuals. The formula of hamming distance is:
Wherein, k is gene bit of chromosome, l is length of gene string. For example, two chromosomes 10101 and 00110 were different from the first bit, fourth and fifth bits, and then the hamming distance was 3. When the distance was in the preassigned distance L, that is, x
i
- x
j
≤ L, the individual with lower fitness was implied with a stronger penalty function to change its fitness value into a smaller value, that is:
In two chromosomes with distance in L, the inferior one’s fitness would be worse after processing in ecological niche and the probability obsoleted would be bigger in latter evolutionary process. In this way, there would be one excellent chromosome in distance L, which kept diversity of the group and certain distance among each chromosome to disperse in whole constrained space. If two chromosomes were same totally, either of two was selected to punish to reduce rate of coincident code of chromosomes. Assume: L = 2, penalty = 0.7.
➅ Data post-processing, i.e. inverse unification processing:
Training procedure
Assign initial value: η = 0.3, α = 0.6, wherein η and α are dynamic self-adaptive adjustment. Assume the internal circulation of prediction model is 100 times and 200 times respectively; and the external circulation is 100 times, 200 times and 300 times respectively. Combine the number of times in internal and external circulation to form six groups of training for prediction model and finally determine the number of neuron in hidden based on the results of training. The convergence results of corresponding training were showed as Figs. 1–6.

Convergence situation of 100 times of internal circulation and 100 times of external circulation.

Convergence situation of 100 times of internal circulation and 200 times of external circulation.

Convergence situation of 100 times of internal circulation and 300 times of external circulation.

Convergence situation of 200 times of internal circulation and 100 times of external circulation.

Convergence situation of 200 times of internal circulation and 200 times of external circulation.

Convergence situation of 200 times of internal circulation and 300 times of external circulation.
From the training results, no matter what values were assigned in internal circulation and external circulation, the training results of prediction model could be in stable convergence. From the average value of error and convergence rate, when internal circulation was 200 and external circulation was 300, the training error was minimum and the convergence rate was quickest. Therefore, the number of neuron in corresponding hidden layer of this training model was assigned 15, which was the number of neuron in hidden layer in final prediction model.
The well-trained prediction model was used to predict the scale of food supply chain in rural areas of Hubei Province, and the fitting and error results in prediction results were showed in Fig. 7 and prediction data is showed in Table 5.

Fitness Figure and Error Figure of Prediction Results of AWNG-BP Prediction Model Based on Nonlinear Combination.
Prediction data of AWNG-BP prediction model based on nonlinear combination
There were 3 input values in AWNG-BP prediction model based on nonlinear combination, and we used the same sensitivity analysis method to analyze 3 values’ sensitivity: average divided the variable range of each input value into 8 phases and assign x i to be the values of the 8 phases respectively, that is, the values of x i equalled to (0.1500, 0.2500, 0.3500, 0.4500, 0.5500, 0.6500, 0.7500 and 0.8500). In the change of x i , other two variables remained unchanged, that is, are (0.1500, 0.1500, 0.1500, 0.1500, 0.1500, 0.1500, 0.1500 and 0.1500) all the time. These were input values and were successively input in the trained prediction model, the value of y could be gotten in output layer, whose range should be in [0–l]. In order to compare and analyze sensitivity of three input values directly, we superimposed the corresponding y values of three input variables in one figure. See Table 6 and Fig. 8 for detailed information.
Sensitivity analysis data of AWNG-BP prediction model based on nonlinear combination
Sensitivity analysis data of AWNG-BP prediction model based on nonlinear combination

Sensitivity curve of AWNG-BP prediction model based on nonlinear combination. Note: In Fig. 8, on the edge of horn mouth in the figure showed the values of y from top to bottom respectively, showed as note.
Simulation experiment was used to conduct checking computations of various performance of each single and AWBG-BP prediction model based on nonlinear combination, see Table 7 for detailed results. Obviously, all the prediction models were feasible and efficient and could complete the prediction. Although performance of each single prediction model had own advantages in different statistical indexes, in general, the prediction accuracy was not high, indicating that the results using just single prediction model to predict scale of food supply chain in Hubei Province were not ideal. In comparison, various statistical indexes of performance in AWBG-BP prediction model based on nonlinear combination had improvement of 2–3 orders of magnitude and the prediction accuracy was evidently improved. Sensitivity experiment was conducted against each single prediction technique of AWBG-BP prediction model based on nonlinear combination. The experiment results showed that during the prediction procedure of food supply chain in rural areas of Hubei Province, the order of effect degree of three single prediction techniques was multiple regression, three exponential smoothing prediction and grey sequence prediction, in which the maximum effect degree was 0.53 and the minimum was 0.18. Therefore, the effect of each single prediction technique on scale of food supply chain in rural areas of Hubei Province was unbalanced: effects of multiple regression and three exponential smoothing prediction on the system were far more significant than that of grey sequence prediction. This may be caused by that fluctuation of total food consumption in rural areas was wide, and grey generation in grey sequence prediction fuzzily processed this fluctuation. Although this procedure of grey turning white reduced effect of distorted data on prediction procedure, at same time it reduced change information contained in data. So in techniques combination, the significance of grey sequence prediction was less.
Performance comparison of various prediction model
Performance comparison of various prediction model
This paper took three important optimization measures to general BP neural network: first, optimize error function against the problem of BP neural network easily sticking in local optimum; second, optimize search algorithm of BP neural network with method of dynamic self-adaption against the problem of fluctuation and slow convergence rate in the late iteration of BP neural network; and third, optimize network structure of BP neural network with genetic algorithm of ecological niche against the problem of difficulty in determining network structure of BP neural network. The experiment data showed theses optimization measures could evidently improve the accuracy of prediction models and was suitable for predicting the complex nonlinear system with in missing history statistical data. Food supply chain was a typical complex nonlinear system. In regard to various reasons, the history statistical data for food supply chain in rural areas in China was missing and inaccurate. The prediction result adopted single prediction technique was very unsatisfactory. Combining multiple regression, three exponential smoothing prediction and grey sequence prediction with improved BP neural network, building up AWBG-BP prediction model based on nonlinear combination and making full use of nonlinear combination of each single prediction technique could weaken effect of distortion and lake of data on output results and nonlinear capability of BP neural network, which obtained tremendous prediction effects and adaptability and was deserved to be promoted. The sensitivity analysis method based on image can be more intuitive to discover the impact of the three individual prediction methods on nonlinear combination forecasting methods and easier to adjust the weight of nonlinear combination forecasting method. In this way, the unbiasedness of the nonlinear combination forecasting method will be smaller, and the accuracy will be higher. From food supply chain market in rural areas of Hubei Province, because of effect of average per capita income and living and consumption habits, the scale of food supply chain in rural areas had been stable for late ten years, with smaller fluctuation (except for 2004 and 2005). From prediction data, it was obvious that the scale of food supply chain in rural areas would continuously increase and trend to descend slightly.
Footnotes
Acknowledgments
This work is supported by the National Natural Science Foundation of China (No. 70160376), the Natural Science Foundation of Hubei Province (No. 2016CFB490), the China Postdoctoral Science Foundation (No. 2015M580648), the China Postdoctoral Special Science Foundation (No. 2017T100560) and Hubei Logistic Development Research Center Sponsored Project.
