Water wave optimization algorithm for autonomous underwater vehicle path planning problem

Abstract

In this paper, a water wave optimization (WWO) algorithm is proposed to solve the autonomous underwater vehicle (AUV) path planning problem and obtain an optimal or near-optimal path in the marine environment. Path planning is a prerequisite for submarine reconnaissance, surveillance, combat and other underwater tasks. The WWO algorithm, which is based on shallow water wave theory, is a novel evolutionary algorithm that mimics the wave motions of propagation, refraction and breaking to obtain the global optimal solution. The WWO algorithm not only jumps out of local optima and avoids premature convergence but also has a faster convergence speed and higher calculation accuracy. To verify its effectiveness and feasibility, the WWO algorithm is applied to path planning with randomly generated threat areas and with fixed threat areas. Compared with other algorithms, the WWO algorithm can effectively balance exploration and exploitation to avoid threat areas and reach the intended target with minimum fuel cost. The experimental results demonstrate that the WWO algorithm has better optimization performance and is robust.

Keywords

Water wave optimization (WWO); autonomous underwater vehicle (AUV); path planning; randomly generated threat areas; generated fixed threat areas

1 Introduction

Due to the continuous advancement of science and technology, the discovery and exploitation of marine resources has become increasingly important. As effective pieces of marine equipment, AUVs have been widely used for tasks such as marine hydrographic survey and mine detection, military investigation and rescue, undersea inspection and data collection, ocean exploration, anti-submarine operations, drilling support, subsea construction, and underwater equipment laying and maintenance. AUV path planning is the process of determining an optimal path from the starting point to the target point under specific constraints [1–4]. AUV path planning is not only a manifestation of autonomy and intelligence but also a guarantee of safety and reliability. Using its autonomous navigation and obstacle avoidance capabilities, an AUV can avoid all threat areas and obtain an optimal or near-optimal path. The core concept of AUV path planning is to find the shortest path that satisfies a series of constraints while comprehensively considering factors such as fuel consumption, threat level, threat area and navigation area. In recent years, several meta-heuristic optimization algorithms have been used to solve path planning problems, such as artificial bee colony (ABC) [5], flower pollination algorithm (FPA) [6], bat algorithm (BA) [7], particle swarm optimization (PSO) [8], and moth-flame optimization algorithm (MFO) [9].

Zhuang et al. designed a two-stage cooperative path planner for multiple autonomous underwater vehicles operating in a dynamic environment. The proposed method is capable of reacting quickly to a dynamic ocean environment and can successfully avoid collisions [10]. Lim et al. proposed selectively hybridized particle swarm optimization algorithms to solve the constrained path planning of autonomous underwater vehicles. The method had a low computational requirement and an excellent solution quality, so it is effective and feasible for obtaining the optimal path from the starting point to the target point [11]. Wu addressed the coordinated path planning for an unmanned aerial-aquatic vehicle and an autonomous underwater vehicle in an underwater target strike mission. The coordinated path derived from the proposed method is close to the theoretical optimal solution [12]. Barua et al. described algorithms for the guidance and control of an autonomous underwater vehicle for a specific identification mission. Their approach obtained better optimization results and an optimal or near-optimal path [13]. Taheri et al. applied a closed-loop rapidly exploring random tree algorithm to solve the path planning problem of an autonomous underwater vehicle, which provided a feasible planned path [14]. Pi et al. provided a search-based motion planning algorithm applied to the coordinated manipulation problem of a dual-arm intervention AUV [15]. Li et al. introduced an autonomous underwater vehicle optimal path planning method for seabed terrain matching navigation to avoid problem areas, and the proposed method had higher matching accuracy and lower time consumption [16]. Ding et al. demonstrated the optimal anti-submarine search path of an AUV by maximizing the cumulative detection probability. The optimal path was effective and suggestive for anti-submarine operation [17]. Lin et al. conducted a study on multiobjective particle swarm optimization for the AUV route planning problem, and the proposed method required less sailing time and energy consumption [18]. Zeng et al. designed a quantum-behaved particle swarm optimization algorithm to solve the optimal path planning problem of an AUV, and their proposed method obtained the best path [19]. Khan et al. presented a protocol that utilized AUVs to save energy wasted by sensor nodes during clustering [20]. MahmoudZadeh et al. applied evolutionary algorithms to solve the path planning problem. The proposed algorithms generated an optimal and collision-free path to avoid obstacles [21]. Gan et al. adopted the quantum-behaved particle swarm optimization algorithm to solve the dynamic trajectory tracking problem for AUVs, where the proposed method balanced exploration and exploitation to obtain a global optimal solution [22]. Zhu et al. showed a biologically inspired self-organizing map algorithm to achieve task assignment and path planning of an AUV system, and the effectiveness and feasibility of the algorithm were verified [23]. Wang et al. utilized the improved gray wolf optimizer to solve the monitoring trajectory optimization problem for USVs, and found that the proposed algorithm consumed minimal energy and exhibited obstacle avoidance [24]. Zhou et al. designed an improved flower pollination algorithm to solve the UUV path planning problem, and the proposed method obtained an optimal or near-optimal path [25].

The WWO algorithm based on shallow wave theory simulates propagation, refraction and breaking to perform global optimization [26]. The WWO algorithm accelerates convergence speed and improves calculation accuracy. The WWO algorithm has strong stability and robustness. The WWO algorithm is applied to solve the AUV path planning problem, which not only expands the search space to avoid falling into the local optimum but also balances the global search ability and the local search ability to obtain the optimal solution. The experiments show that the convergence speed and calculation accuracy of the WWO algorithm are better than those of other algorithms, and the WWO algorithm is an effective and feasible method for solving the AUV path planning problem.

The remainder of this article is organized as follows. Section 2 introduces the problem description and formulation. Section 3 reviews the WWO algorithm. The experimental results and analysis are provided in Section 4. Finally, conclusions are drawn and future research is proposed in Section 5.

2 Problem description and formulation

An AUV system principally contains two main propellers, four auxiliary thrusters, a horizontal rudder and a vertical rudder. The two main propellers provide navigation power (surge) and are installed at the stern of the vehicle. Two of the auxiliary thrusters provide sway power and yaw moments and are installed in a transverse layout, and the other two auxiliary thrusters provide heave power and pitch moments and are installed in a vertical layout. The horizontal rudder and vertical rudder are used to change the heading angle in the horizontal direction and vertical direction, respectively. Therefore, surge, sway, heave, pitch and yaw are controllable while roll is not.

The North-East-Down (NED) coordinate system and the body-fixed coordinate system are shown in Fig. 1. The NED coordinate system is used to describe the position of the AUV. Its origin O_E is an arbitrary point at sea level; the ξ axis is always horizontal and points to the north, the η axis is perpendicular to the ξ axis and points to the east, and the ζ axis is perpendicular to the ξη plane and points to the center of the Earth. The origin O_B of the body-fixed coordinate system is fixed to the AUV; the x axis points to the bow along the longitudinal section, the y axis is perpendicular to the longitudinal section and points to starboard (the right side), and the z axis points to the bottom of the vehicle along the longitudinal section. To relate the NED coordinate system to the body-fixed coordinate system, the kinematic model of the AUV is as follows:

Fig. 1

Earth-inertial frames and body-fixed reference frames.

$\begin{bmatrix} \dot{x} \\ \dot{y} \\ \dot{z} \end{bmatrix} = \begin{bmatrix} \cos ψ \cos θ & \cos ψ \sin θ \sin ϕ - \sin ψ \cos ϕ & \cos ψ \sin θ \cos ϕ + \sin ψ \sin ϕ \\ \sin ψ \cos θ & \sin ψ \sin θ \sin ϕ + \cos ψ \cos ϕ & \sin ψ \sin θ \cos ϕ - \cos ψ \sin ϕ \\ - \sin θ & \cos θ \sin ϕ & \cos θ \cos ϕ \end{bmatrix} \cdot \begin{bmatrix} u \\ v \\ w \end{bmatrix}$ (1)

where u, v and w denote the surge, sway and heave velocities in the body-fixed frame, and ϕ, θ and ψ denote the roll, pitch and yaw angles, which are important control parameters. Their value ranges are $- \frac{π}{2} < ϕ < \frac{π}{2}$, $- \frac{π}{2} < θ < \frac{π}{2}$ and $- π < ψ < π$. The angular rates satisfy

$\begin{bmatrix} \dot{ϕ} \\ \dot{θ} \\ \dot{ψ} \end{bmatrix} = \begin{bmatrix} 1 & \sin ϕ \tan θ & \cos ϕ \tan θ \\ 0 & \cos ϕ & - \sin ϕ \\ 0 & \sin ϕ / \cos θ & \cos ϕ / \cos θ \end{bmatrix} \cdot \begin{bmatrix} p \\ q \\ r \end{bmatrix}$ (2)

where p, q and r denote the roll, pitch and yaw rates in the body-fixed frame.

Assuming that the roll movement is uncontrollable and the structure of the AUV is bilaterally symmetrical, we neglect roll (ϕ = 0), take v = 0 and w = 0, and rearrange Equations (1) and (2) to obtain

$\begin{bmatrix} \dot{x} \\ \dot{y} \\ \dot{z} \end{bmatrix} = \begin{bmatrix} \cos ψ \cos θ \\ \sin ψ \cos θ \\ - \sin θ \end{bmatrix} \cdot u$ (3)

$\begin{cases} \dot{θ} = q \\ \dot{ψ} = r / \cos θ \end{cases}$ (4)
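To make Equations (3) and (4) concrete, the following minimal Python sketch advances the reduced model by one time step. The function name `step_reduced_kinematics`, the time step `dt` and the use of forward-Euler integration are assumptions made for illustration and are not taken from the paper.

```python
import math


def step_reduced_kinematics(state, u, q, r, dt):
    """Advance (x, y, z, theta, psi) by one forward-Euler step of Eqs. (3)-(4).

    u is the surge speed; q and r are the pitch and yaw rates (v = w = 0).
    """
    x, y, z, theta, psi = state
    # Eq. (3): position rates in the NED frame.
    x_dot = u * math.cos(psi) * math.cos(theta)
    y_dot = u * math.sin(psi) * math.cos(theta)
    z_dot = -u * math.sin(theta)
    # Eq. (4): attitude rates; only valid while |theta| < pi/2.
    theta_dot = q
    psi_dot = r / math.cos(theta)
    return (x + x_dot * dt, y + y_dot * dt, z + z_dot * dt,
            theta + theta_dot * dt, psi + psi_dot * dt)


# Level motion along the x axis at speed 2 for 0.5 time units.
print(step_reduced_kinematics((0.0, 0.0, 0.0, 0.0, 0.0), u=2.0, q=0.0, r=0.0, dt=0.5))
```

A smaller `dt` or a higher-order integrator would reduce the integration error; Euler is used here only to keep the correspondence with the equations visible.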

As an active sonar, the forward-looking sonar (FLS) is necessary for safe AUV navigation. The FLS not only detects and perceives information about the environment surrounding an AUV but also identifies threat areas and sends out danger signals, which helps the AUV avoid threat areas and reach the target point safely. Based on the characteristics of sound waves propagating in water, the FLS performs underwater target detection and underwater communication through signal conversion and processing. The FLS outputs and stores its detections of underwater obstacles as discrete data points. During navigation, the FLS is used to perceive the underwater environment ahead of the AUV. The data of the detected obstacles are expressed relative to the sonar position, which serves as the reference point for processing the stored echo information.

For distributed threat areas, the goal of AUV path planning is to obtain an optimal or near-optimal path under certain constraints. The transformation of the coordinate system is given in Fig. 2.

Fig. 2

Transformation of the coordinates system.

The coordinate system O_xy is transformed to the coordinate system O_x′y′. The positive direction of the horizontal axis is the line from the starting point to the target point. The transformation relationships can be defined as: $γ = arcsin \frac{y_{2} - y_{1}}{| \vec{AB} |}$ (5)

$\begin{bmatrix} x \\ y \end{bmatrix} = \begin{bmatrix} \cos γ & - \sin γ \\ \sin γ & \cos γ \end{bmatrix} \cdot \begin{bmatrix} x^{'} \\ y^{'} \end{bmatrix} + \begin{bmatrix} x_{1} \\ y_{1} \end{bmatrix}$ (6)

where A = (x_1, y_1) denotes the starting point and B = (x_2, y_2) denotes the target point in O_xy, point (x, y) denotes the coordinate in O_xy, point (x′, y′) denotes the new coordinate in O_x′y′, and γ denotes the rotation angle of the coordinate system. With this transformation, the starting point maps to the origin of O_x′y′ and the target point lies on the positive x′ axis. On the horizontal axis x′, the line segment from the starting point to the target point is divided into D equal segments. Lines parallel to the y′ axis are drawn through the division points, and their intersections with the x′ axis are represented by small rectangles in Fig. 2, which gives D abscissas. A point is then selected on each line to obtain D ordinates. Finally, nodes with horizontal and vertical coordinates are obtained, and a path is formed by connecting these nodes in order.
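The path encoding described above can be sketched in Python as follows. This is an illustrative sketch only: the helper names are hypothetical, the rotation is taken in the sense that places the target on the positive x′ axis, and the D lines are assumed to be equally spaced strictly between the starting point and the target point. Because Equation (5) uses arcsin, the sketch also assumes that the target lies to the right of the starting point.

```python
import math


def rotation_angle(start, target):
    """Eq. (5): angle gamma between the x axis and the line from start to target."""
    dx, dy = target[0] - start[0], target[1] - start[1]
    return math.asin(dy / math.hypot(dx, dy))


def to_original(point_rot, start, gamma):
    """Map (x', y') in the rotated frame back to (x, y) in the original frame."""
    xr, yr = point_rot
    x = math.cos(gamma) * xr - math.sin(gamma) * yr + start[0]
    y = math.sin(gamma) * xr + math.cos(gamma) * yr + start[1]
    return (x, y)


def path_nodes(start, target, ordinates):
    """Build a path in the original frame from D ordinates chosen on the D lines."""
    gamma = rotation_angle(start, target)
    length = math.hypot(target[0] - start[0], target[1] - start[1])
    d = len(ordinates)
    # Assumption: the D lines are equally spaced strictly between start and target.
    abscissas = [length * (i + 1) / (d + 1) for i in range(d)]
    inner = [to_original((xa, ya), start, gamma)
             for xa, ya in zip(abscissas, ordinates)]
    return [tuple(start)] + inner + [tuple(target)]


# A candidate solution is just the list of D ordinates; zeros give a straight line.
print(path_nodes((0.0, 0.0), (3.0, 4.0), [0.0, 0.0, 0.0, 0.0]))
```

With this encoding an optimizer only searches over the D ordinates, since the abscissas are fixed by the start and target points.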

Threat cost J_t and fuel cost J_f are the main evaluation criteria for AUV path planning, and they can be defined as: $J_{t} = \int_{0}^{L} w_{t} dl$ (7) $J_{f} = \int_{0}^{L} w_{f} dl$ (8) where w_t and w_f denote the threat cost and fuel cost along each path segment, respectively, and L denotes the total length of the path. The integrals are evaluated approximately. Each path segment is divided into five equal subsegments, and the threat cost of each subsegment is concentrated at a single sample point of that subsegment. The model of the AUV threat cost is given in Fig. 3.

Fig. 3

Model of the AUV threat cost.

If the distance between the threat point and the end of the subsegment is less than the radius of the threat area, the threat cost of a subsegment can be defined as:

$w_{t, L_{i}} = \frac{L_{i}}{5} \cdot \sum_{k = 1}^{N_{t}} t_{k} \cdot \left( \frac{1}{d_{0.1, i, k}^{4}} + \frac{1}{d_{0.3, i, k}^{4}} + \frac{1}{d_{0.5, i, k}^{4}} + \frac{1}{d_{0.7, i, k}^{4}} + \frac{1}{d_{0.9, i, k}^{4}} \right)$ (9)

where N_t denotes the number of threat regions, t_k denotes the threat level of the kth threat, L_i denotes the length of the ith segment, and d_0.1,i,k denotes the distance from the 1/10 point on the ith segment to the kth threat center. Assuming that the speed of the AUV is constant, the fuel cost J_f is proportional to the path length. The total cost of the path planning can be defined as: $J = k J_{t} + (1 - k) J_{f}$ (10) where k ∈ (0, 1) denotes the weighting coefficient. As k approaches 0, the planned path becomes shorter. As k approaches 1, the planned path has a higher safety factor. In this paper, k is set to 0.5.
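A minimal Python sketch of the cost evaluation in Equations (9) and (10) is given below. The function names and the tuple layout `(cx, cy, radius, level)` are hypothetical. The sketch assumes that a sample point contributes to the threat cost only when it lies inside the threat radius, and that the fuel cost equals the path length (proportionality constant 1).

```python
import math


def segment_threat_cost(p0, p1, threats):
    """Eq. (9) for one segment; each threat is (cx, cy, radius, level)."""
    length = math.hypot(p1[0] - p0[0], p1[1] - p0[1])
    total = 0.0
    for cx, cy, radius, level in threats:
        for frac in (0.1, 0.3, 0.5, 0.7, 0.9):
            sx = p0[0] + frac * (p1[0] - p0[0])
            sy = p0[1] + frac * (p1[1] - p0[1])
            dist = math.hypot(sx - cx, sy - cy)
            # Assumption: a sample point contributes only inside the threat radius.
            if dist < radius:
                # Guard against a sample point that coincides with the center.
                total += level / max(dist, 1e-9) ** 4
    return length / 5.0 * total


def total_cost(path, threats, k=0.5):
    """Eq. (10): J = k * J_t + (1 - k) * J_f, with J_f taken as the path length."""
    pairs = list(zip(path, path[1:]))
    j_t = sum(segment_threat_cost(a, b, threats) for a, b in pairs)
    j_f = sum(math.hypot(b[0] - a[0], b[1] - a[1]) for a, b in pairs)
    return k * j_t + (1.0 - k) * j_f


# One straight segment of length 10 passing a threat of radius 2 and level 1.
print(total_cost([(0.0, 0.0), (10.0, 0.0)], [(5.0, 1.0, 2.0, 1.0)]))
```

In this example only the midpoint sample lies inside the threat circle, so the threat term stays small compared with the fuel term.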

3 WWO

The WWO algorithm mainly simulates wave motions including propagation, refraction and breaking for effective searching. The process of wave movement is from deep regions to shallow regions. For a wave object, wave height h and wavelength λ are two important attributes, and the fitness value varies with the unevenness of the seabed. In deep regions, a wave obtains lower fitness value, lower wave height, and longer wavelength. In shallow regions, the wave obtains higher fitness value, higher wave height, and shorter wavelength, as illustrated in Fig. 4. The correspondence between problem space and population space is given in Table 1.

Fig. 4

Different wave shapes in deep and shallow water.

Table 1

Correspondence between problem space and population space

Problem space F	Population space
The solution space of F	The seabed area
Each solution of the problem	A water wave
The fitness value of each solution	It is inversely proportional to vertical distance to seabed

3.1 Propagation

Each wave performs a propagation operation and generates a new wave x′. The location update in dimension d can be defined as: $x^{'} (d) = x (d) + rand (- 1, 1) \cdot λ L (d)$ (11) where L (d) denotes the length of the dth dimension of the solution space and rand (-1, 1) denotes a uniformly distributed random number in [-1, 1]; the new wave is kept within the search boundary. If f (x′) > f (x), the new wave x′ replaces the original wave x, and the wave height is reset to h_max. Otherwise, wave x is retained and its wave height is decreased by one to represent the energy loss. Here f is a fitness value to be maximized; in the procedure of Table 3, where the path cost is minimized, the comparison is reversed. The wavelength is updated as: $λ = λ \cdot α^{- (f (x) - f_{min} + ɛ) / (f_{max} - f_{min} + ɛ)}$ (12) where α denotes the wavelength attenuation coefficient, and f_min and f_max denote the minimum and maximum fitness values, respectively. ɛ denotes a very small positive number that prevents the denominator from being zero.
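The propagation step and the wavelength update can be sketched in Python as below. The function names are hypothetical, and the handling of out-of-range positions (redrawing the coordinate uniformly inside the range) is an assumption, since the text only states that the new wave must stay within the search boundary. The wavelength update follows Equation (12) for a fitness that is maximized and behaves as described when α > 1.

```python
import random


def propagate(x, wavelength, lower, upper, rng=random):
    """Eq. (11): shift each dimension by rand(-1, 1) * lambda * L(d)."""
    new_x = []
    for d, value in enumerate(x):
        span = upper[d] - lower[d]  # L(d), the length of dimension d
        moved = value + rng.uniform(-1.0, 1.0) * wavelength * span
        # Assumption: a position that leaves the range is redrawn uniformly inside it.
        if not lower[d] <= moved <= upper[d]:
            moved = rng.uniform(lower[d], upper[d])
        new_x.append(moved)
    return new_x


def update_wavelength(wavelength, f, f_min, f_max, alpha, eps=1e-12):
    """Eq. (12): fitter waves (larger f) end up with shorter wavelengths."""
    exponent = -(f - f_min + eps) / (f_max - f_min + eps)
    return wavelength * alpha ** exponent


rng = random.Random(0)
print(propagate([5.0, 0.0], 0.5, [0.0, -5.0], [10.0, 5.0], rng))
print(update_wavelength(0.5, f=10.0, f_min=0.0, f_max=10.0, alpha=2.0))
```

The fittest wave has its wavelength divided by α, while the least fit wave keeps almost the same wavelength, so good solutions search in progressively smaller neighborhoods.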

3.2 Refraction

If the location of wave x has not improved after several propagations, its wave height decays to zero. The wave then performs refraction to avoid search stagnation. The location update can be defined as: $x^{'} (d) = N (\frac{(x^{*} (d) + x (d))}{2}, \frac{| x^{*} (d) - x (d) |}{2})$ (13) where x^* denotes the optimal wave in the entire population, and N (μ, σ) denotes a Gaussian random number with mean μ and standard deviation σ. After refraction, the wave height of the new wave x′ is reset to h_max. The updated wavelength can be defined as: $λ^{'} = λ \frac{f (x)}{f (x^{'})}$ (14)
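A short Python sketch of the refraction step follows. The function names are hypothetical; the code simply evaluates Equations (13) and (14) as written and assumes that f (x′) is nonzero.

```python
import random


def refract(x, best, rng=random):
    """Eq. (13): sample each dimension from a Gaussian centered between x and x*."""
    return [rng.gauss((b + v) / 2.0, abs(b - v) / 2.0) for v, b in zip(x, best)]


def refracted_wavelength(wavelength, f_old, f_new):
    """Eq. (14): lambda' = lambda * f(x) / f(x'); f(x') must be nonzero."""
    return wavelength * f_old / f_new


rng = random.Random(0)
print(refract([0.0, 4.0], [2.0, 0.0], rng))
print(refracted_wavelength(0.5, f_old=4.0, f_new=8.0))
```

Because the standard deviation is half the distance to the optimal wave, a stagnant wave that is already close to x^* is resampled in a tight neighborhood, while a distant wave is moved substantially toward x^*.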

3.3 Breaking

As the energy of a water wave continues to increase, the wave crest continues to steepen. When the velocity of the wave crest exceeds the propagation velocity of the water wave, the wave breaks into a series of solitary waves. In the WWO algorithm, the optimal wave x^* is used to perform the breaking operation. Specifically, k dimensions are randomly selected from the D dimensions, and a solitary wave is generated along each selected dimension. The updated location can be defined as: $x^{'} (d) = x (d) + N (0, 1) \cdot β L (d)$ (15) where β denotes the breaking coefficient. If all solitary waves are worse than the optimal wave x^*, x^* is retained. Otherwise, wave x^* is replaced by the best solitary wave.
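The breaking operation can be sketched in Python as follows. The function name, the choice of k as a random integer between 1 and k_max, and the clamping of solitary waves to the search range are assumptions for illustration. The sketch treats the objective as a cost to be minimized, as in the path planning problem, so a solitary wave is better when its cost is lower.

```python
import random


def break_wave(best, cost, lower, upper, beta, k_max, rng=random):
    """Eq. (15): return the better of x* and its solitary waves (cost is minimized)."""
    dims = len(best)
    k = rng.randint(1, min(k_max, dims))
    best_pos, best_cost = list(best), cost(best)
    for d in rng.sample(range(dims), k):
        solitary = list(best)
        step = rng.gauss(0.0, 1.0) * beta * (upper[d] - lower[d])
        # Assumption: a solitary wave is clamped to the search range.
        solitary[d] = min(max(best[d] + step, lower[d]), upper[d])
        c = cost(solitary)
        if c < best_cost:
            best_pos, best_cost = solitary, c
    return best_pos, best_cost


def sphere(p):
    """Toy cost used only for this demonstration."""
    return sum(v * v for v in p)


print(break_wave([1.0, -2.0, 0.5], sphere, [-5.0] * 3, [5.0] * 3,
                 beta=0.01, k_max=2, rng=random.Random(0)))
```

Each solitary wave differs from x^* in a single dimension, which makes breaking a cheap local refinement around the current optimal wave.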

In general, the propagation operation modulates the fitness values of different regions to effectively balance exploration and exploitation; the refraction operation removes the exhausted wave from the population and obtains the optimal wave, which can avoid search stagnation and accelerate convergence speed; and the breaking operation enhances the intensive search near the optimal solution and improves calculation accuracy. These three operations provide a good search mechanism for the WWO algorithm to obtain the global optimal solution. The correspondence between AUV path planning and the WWO algorithm is given in Table 2. For the swarm intelligence optimization algorithm, the search process and optimization process simulate the evolution and foraging process of individuals, the points in the search space simulate individuals in nature, the objective function of the problem is analogous to the individual’s ability to adapt to the environment, and the individual foraging process or the survival of the fittest is analogous to the iterative process of replacing the less feasible solutions with more feasible solutions in the search and optimization process. The solution procedure of the AUV path planning problem is given in Table 3. The solution methodology flow chart of the AUV path planning problem is provided in Fig. 5.

Table 2

The correspondence between the AUV path planning and the WWO algorithm

The AUV path planning space	The WWO algorithm space
A collection contains all the optimization schemes to solve AUV path planning problem	A wave population with
An optimal or near-optimal path	An optimal wave
The total cost of the AUV path planning problem	The fitness function of the WWO algorithm

Table 3

The solution procedure of the AUV path planning problem

Step 1. According to the environmental model, initialize the detailed information about the AUV path planning task: starting point coordinates, target point coordinates, threat centers, threat radii and threat levels. Transform the coordinate system according to Equations (5) and (6).

Step 2. Initialization: Randomly initialize a wave population P of n waves (solutions), the wavelength λ, the wave height h_max, the wavelength reduction coefficient α, the breaking coefficient β, the maximum number k_max of breaking directions, and the maximum number of iterations iter_max.

Step 3. Calculate the total cost of each water wave by Equation (10) based on water wave parameters. The smaller the cost value is, the better performance the AUV path maintains.

Step 4. Obtain the optimal wave

While stop criterion is not satisfied do

Step 5. for each wave x ∈ P do

Step 5.1. Propagate wave x to a new wave x′ by Equation (11).

Step 5.2. if f (x′) < f (x) then

Step 5.2.1. if f (x′) < f (x^*), then wave x′ performs the breaking operation by Equation (15), and wave x^* is updated to wave x′.

Step 5.2.2. water wave x is replaced by wave x′ in the population.

Step 5.3. else, the wave height of wave x is decreased by one to express the energy loss. If x.h = 0, then wave x performs the refraction operation by Equations (13) and (14).

Step 5.4. Update the wavelength of each wave by Equation (12).

Step 6. Calculate the total cost of each water wave by Equation (10).

Step 7. Set iter = iter + 1. If iter < iter_max, go to Step 5.

Step 8. Inversely transform the coordinates in final path into the original coordinate and output the optimal solution.
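The procedure in Table 3 can be sketched as a compact Python loop. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name `wwo_minimize` and all default parameter values are placeholders, positions are clamped to the search range, and Equations (12) and (14) are adapted to a cost that is minimized (lower cost plays the role of higher fitness). The cost function is passed in, so the total cost of Equation (10) can be supplied by the caller.

```python
import random


def wwo_minimize(cost, lower, upper, n_waves=20, iter_max=200, h_max=6,
                 lam0=0.5, alpha=1.01, beta=0.01, k_max=3, seed=0):
    """Minimize `cost` over a box; returns (best_position, best_cost)."""
    rng = random.Random(seed)
    dims = len(lower)
    span = [upper[d] - lower[d] for d in range(dims)]
    eps = 1e-12

    def clip(value, d):
        # Assumption: out-of-range coordinates are clamped to the boundary.
        return min(max(value, lower[d]), upper[d])

    # Steps 2-4: random population, cost of each wave, initial optimal wave.
    waves = []
    for _ in range(n_waves):
        pos = [rng.uniform(lower[d], upper[d]) for d in range(dims)]
        waves.append({"x": pos, "f": cost(pos), "h": h_max, "lam": lam0})
    first = min(waves, key=lambda w: w["f"])
    best_x, best_f = list(first["x"]), first["f"]

    for _ in range(iter_max):
        for w in waves:
            # Step 5.1: propagation, Eq. (11).
            new_x = [clip(w["x"][d] + rng.uniform(-1, 1) * w["lam"] * span[d], d)
                     for d in range(dims)]
            new_f = cost(new_x)
            if new_f < w["f"]:
                if new_f < best_f:
                    # Step 5.2.1: x' becomes x*, then breaking by Eq. (15).
                    best_x, best_f = list(new_x), new_f
                    k = rng.randint(1, min(k_max, dims))
                    for d in rng.sample(range(dims), k):
                        sol = list(new_x)
                        sol[d] = clip(new_x[d] + rng.gauss(0, 1) * beta * span[d], d)
                        sol_f = cost(sol)
                        if sol_f < best_f:
                            best_x, best_f = sol, sol_f
                # Step 5.2.2: x' replaces x and the wave height is reset.
                w["x"], w["f"], w["h"] = new_x, new_f, h_max
            else:
                # Step 5.3: energy loss; refraction once the height reaches zero.
                w["h"] -= 1
                if w["h"] == 0:
                    old_f = w["f"]
                    w["x"] = [clip(rng.gauss((best_x[d] + w["x"][d]) / 2,
                                             abs(best_x[d] - w["x"][d]) / 2), d)
                              for d in range(dims)]
                    w["f"], w["h"] = cost(w["x"]), h_max
                    if old_f > 0:
                        # Eq. (14) adapted to costs: a lower new cost shrinks lambda.
                        w["lam"] *= w["f"] / old_f
                    if w["f"] < best_f:
                        best_x, best_f = list(w["x"]), w["f"]
        # Step 5.4: Eq. (12) adapted to costs: low-cost waves get shorter lambda.
        c_min = min(w["f"] for w in waves)
        c_max = max(w["f"] for w in waves)
        for w in waves:
            w["lam"] *= alpha ** (-(c_max - w["f"] + eps) / (c_max - c_min + eps))
    return best_x, best_f


def sphere(p):
    """Toy cost used only for this demonstration."""
    return sum(v * v for v in p)


print(wwo_minimize(sphere, [-5.0] * 3, [5.0] * 3, iter_max=50, seed=1))
```

For path planning, each position would hold the D ordinates of a candidate path, `lower` and `upper` would bound those ordinates, and the final position would be transformed back to the original coordinate system as in Step 8.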

Fig. 5

Solution methodology flow chart of AUV path planning.

4 Experimental results and analysis

To verify its effectiveness and feasibility, the WWO algorithm was applied to path planning with randomly generated threat areas and with fixed threat areas. The WWO algorithm has a faster convergence speed and higher calculation accuracy, so it can effectively obtain an optimal or near-optimal path in solving the path planning problem. The numerical experiments were run on a computer with an Intel Core i7-8750H 2.2 GHz CPU, a GTX 1060 GPU, and 8 GB of memory running Windows 10.

4.1 Simulation experiments of AUV path planning in random threat areas

The AUV sails in the deep sea, and the threat regions encountered are randomly distributed. The generated threat regions are as follows:

Threat point coordinates: $x = rand (n, 1) \cdot λ + m$ (16) $y = rand (n, 1) \cdot λ + m$ (17) where n denotes the number of the threat regions, λ denotes the path length, and m denotes the adjustment parameter to ensure the generation of the starting point.

The threat radius: $r = rand (n, 1) / 4 \cdot λ$ (18) where r denotes the randomly generated radius. In this paper, the radius of the randomly generated threat regions is not less than 10.

Threat coefficient: $k = round (rand \cdot r)$ (19) where round () rounds its argument to the nearest integer.

The starting point coordinates and the target point coordinates are: $start = [min (x) - r, 0]$ (20) $target = [max (x) + r + Δ x, round (rand \cdot λ) + Δ y]$ (21) where the starting point and target point are guaranteed to be outside the threat regions. Δx and Δy adjust the distance of the target point to the threat regions.
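The scenario generation in Equations (16) to (21) can be sketched in Python as below. The function name and argument names are hypothetical. Two points are assumptions because the text does not specify them: radii below the minimum of 10 are raised to that minimum, and the terms min (x) - r and max (x) + r are read as the leftmost and rightmost edges of all threat circles, which keeps the start and target points outside every threat region when Δx is nonnegative.

```python
import random


def generate_scenario(n, lam, m, dx, dy, r_min=10.0, rng=random):
    """Random threats plus start and target points, following Eqs. (16)-(21)."""
    xs = [rng.random() * lam + m for _ in range(n)]        # Eq. (16)
    ys = [rng.random() * lam + m for _ in range(n)]        # Eq. (17)
    # Eq. (18); assumption: radii below r_min are raised to r_min.
    radii = [max(rng.random() / 4.0 * lam, r_min) for _ in range(n)]
    levels = [round(rng.random() * r) for r in radii]      # Eq. (19)
    threats = list(zip(xs, ys, radii, levels))
    # Assumption: min(x) - r and max(x) + r are the outermost threat edges.
    left = min(x - r for x, r in zip(xs, radii))
    right = max(x + r for x, r in zip(xs, radii))
    start = (left, 0.0)                                    # Eq. (20)
    target = (right + dx, round(rng.random() * lam) + dy)  # Eq. (21)
    return threats, start, target


threats, start, target = generate_scenario(6, 100.0, 20.0, 5.0, 10.0,
                                           rng=random.Random(42))
print(start, target)
for threat in threats:
    print(threat)
```

Passing a seeded `random.Random` instance makes a scenario reproducible, which is convenient when the same threat layout has to be reused across runs.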

In this section, the WWO algorithm is used to plan paths through randomly generated threat areas, and the number of threat points is 6. Since the threat areas differ from run to run, comparison experiments cannot be performed, so the WWO algorithm is run in five independent simulation tests. The population size is 20, the maximum number of iterations is 200, and the dimension of the problem is 14. Table 4 gives the threat center, threat radius and threat level of each threat point. The starting point and the target point coordinates are given in Table 5. The experimental results for random threat areas are given in Table 6.

Table 4

A randomly generated parameter table with six threat points

Test	Parameter	Threat 1	Threat 2	Threat 3	Threat 4	Threat 5	Threat 6
1	Threat center	(98,35)	(73,68)	(28,72)	(92,99)	(101,69)	(27,55)
1	Threat radius	23	12	13	17	10	13
1	Threat level	9	5	5	7	4	5
2	Threat center	(30,48)	(109,56)	(113,79)	(61,90)	(55,27)	(79,55)
2	Threat radius	10	21	25	10	17	25
2	Threat level	5	10	12	5	8	12
3	Threat center	(106,75)	(56,46)	(79,75)	(51,81)	(85,33)	(71,60)
3	Threat radius	10	10	10	10	24	10
3	Threat level	9	9	9	9	21	9
4	Threat center	(52,33)	(28,29)	(82,49)	(54,85)	(24,76)	(21,64)
4	Threat radius	20	10	15	16	16	10
4	Threat level	9	4	6	7	7	4
5	Threat center	(109,52)	(81,53)	(101,64)	(49,107)	(109,109)	(38,60)
5	Threat radius	10	12	10	17	10	10
5	Threat level	4	5	4	7	4	4

Table 5

The starting point and the target point coordinates

Parameters	1	2	3	4	5
Starting point	(14,0)	(20,0)	(40,0)	(7,0)	(27,0)
Target point	(121,103)	(139,108)	(116,108)	(98,101)	(119,106)

Table 6

Experimental results for random threat areas

Threat numbers	Population	Iteration	1	2	3	4	5
6	20	200	75.6242	86.4332	71.9365	70.5012	71.1435

For five randomly generated sets of threat areas, we conducted five independent experiments on AUV path planning. Figures 6–10 show the planned paths. The WWO algorithm can effectively avoid the threat areas and obtain an optimal path from the starting point to the target point for the randomly generated threat regions. Figure 11 gives the convergence curves of the five independent experiments, which indicate that the WWO algorithm converges quickly in path planning. Since the randomly generated threat regions differ between experiments, the optimal paths obtained also differ, and the WWO algorithm cannot be compared with other optimization algorithms on these instances. Therefore, we select three sets of fixed threat regions for AUV path planning in the following section.