Abstract
In recent years humanoid robots have been widely used in toy, performance, education and other service industries, but most biped robots walk slowly and have poor stability. The reason is that the driver parameters of the robot cannot properly match the walking gait algorithm, and the insufficient performance of the robot driver leads to the poor motion capability of the robot. In this paper, the optimization design process of biped robot parameters is studied and expounded, and its motion capability is improved by optimizing the driving parameters of the robot. Firstly, the contradiction between walking speed, stability and driver performance of biped robot is analysed. The performance evaluation functions of the three are further established, and the optimal parameter design to a certain extent is realized based on the multi-objective optimization method. Finally, combining with the physical simulation engine, the design parameters are simulated and checked, and the robot design process is completed through the guidance of simulation results.
Introduction
Humanoid robots are designed with reference to human body structure and motion characteristics, which is an advanced platform for studying human intelligence. The research of humanoid robot brings together many disciplines such as machinery, electronics, computer, artificial intelligence, bionics, etc. Humanoid robots have shown good application prospects in service, medical treatment, education and other fields, as well as in anti-terrorism and anti-explosion and anoxic and high temperature environments that are not suitable for manual operation. In order to realize the extensive application of humanoid robots, how to optimize the performance of humanoid robots and make them more efficiently and stably in complex dynamic environment has become the key problem [1–5].
In the walking process of biped robots, to ensure good stability, it is often necessary to limit the walking speed. If the walking speed needs to be increased, the speed and range of joint movement need to be increased, and the stability of the robot will deteriorate. When the joint drive parameters do not meet the requirements, that is, the torque and rotational speed of the drive motor are lower than expected, the robot is difficult to reach a larger walking speed and stability is not easy to be guaranteed. The driving ability of the motor is related to the weight of the motor. The larger the driving ability, the greater the weight of the motor, resulting in the greater weight of the robot, which in turn increases the demand for driving [6–8].
There is a contradiction between the stability and speed of the robot walking and the performance of the driver, a contradiction between the safety, working efficiency and cost of the robot. At present, the research at home and abroad is mainly to ensure the speed and stability of the robot walking by improving the structure and the performance of the driver. It is often only aimed at some problems and does not pay attention to the contradictory relationship between various parameters. In the actual application of large-scale humanoid robots, due to the limitations of device performance and cost, it is necessary to select the relatively optimal parameters among the conflicting parameters under the existing conditions. This paper studies the multi-objective optimization of the conflicting objectives such as stability, speed and motor cost to realize the balance among the various performances, maximize the utilization of the various performances of the robot, and optimize the selection of the proportion of each objective according to different needs.
Scholars have also conducted some research on how to introduce multi-objective optimization methods in the field of robot control to solve the compromise solution that satisfies several constraint problems [9, 10]. Azhar et al. of Tokyo Metropolitan University proposed a three-dimensional motion control system based on biological methods for human-like bipedal walking robots. The neuromotor system was extended to establish a multi-neuronal system model in which motor neurons represent muscles. Systematic, sensory neurons represent the sensing system in the human body. To establish a motion pattern, they applied a multi-objective evolutionary algorithm to solve synaptic weights between motor neurons and validated them on a 12-DOF biped robot [11, 12].
Canadian scholar Philippe et al. proposed a multi-objective genetic algorithm based on NSGA-II to solve the contradiction between the two goals of maximizing dynamic stiffness and minimizing system noise sensitivity during the grinding process of underwater metal grinding robots. The control parameters of the robot discrete time observer are optimized and verified by the underwater grinding robot controlling the hydroelectric dam [13–16].
In addition, Andrej Gams et al. based on the priority task control mechanism to support vector machine estimation and construct the humanoid robot centroid stability model, and achieve synchronization stability and motion tracking control on the robot COMAN [17]; Nierhoff T. et al. The -RRT algorithm incrementally samples the trajectory mimicry process in the clustering environment and uses the quadratic term distance to describe the underlying distance metric for the velocity and acceleration deviation on the reference trajectory. The nearest neighbour algorithm is used to extend the RRT* tree in the task space. The convergence speed is improved, and the motion simulation effect is verified on the humanoid robot HRP-4 [18]; Okamoto T. et al. extended the target motion abstract model in the observation learning paradigm, so that the humanoid robot self-extracts and self-adjusts the constraints. Imitate the different feature movements in the human body demonstration circle game [19]; Hamed et al. perform multi-objective optimization on the natural kinematics adjustment and kinematic redundancy solution for the periodic task execution problem of the redundant serial manipulator to output Minimum torque [20]; Miguel G. et al. to improve the robustness of the robot system and achieve the nominal design of uncertain parameters The object has the lowest sensitivity, multi-objective optimization of mechanical and control nominal design objects and verification on planar parallel robots [21], and so on.
At present, researchers at home and abroad have made many achievements in the optimization of humanoid robot balance stability, such as applying probabilistic reinforcement learning algorithm ∖cite{c4}, constructing a centroid stability model [7], and optimizing motion posture through learning algorithms [22]; A lot of researches have been done on the optimization of robot energy consumption, such as tracking the human kMPs trajectory by resonance mechanism [8], analyzing the parametric parameters of human gait [23], and some scholars applying multi-objective optimization algorithms to control humanoid robot control parameters. Optimize and improve the performance of certain aspects of the robot [9, 10]. However, how to optimize the stability, speed and motor cost of humanoid robots in a dynamic time-varying environment to achieve a balance between robot safety, work efficiency and cost, there is no general method in the existing literature; The existing dynamic multi-objective optimization algorithm [24, 25] is analysed, and it is found that there is no dynamic multi-objective optimization algorithm suitable for humanoid robots with high robustness, high real-time and high reliability [26–30].
This article is divided into six parts. The second section analyzes the influence factors of robot gait and stability. The third section gives the objective function of robot optimization. The fourth section is the optimization method of robot parameter design. The fifth section is the simulation design verification of biped robot.
Robot gait and stability criterion
Robot kinematics analysis is the basis of various motion control. The main content is to study the relationship between the joint angle of the robot and the position and attitude of the connecting rod, and to express these relationships through mathematical modelling. Most humanoid robots drive the joint angle through the motor. In order to realize the control of the robot, it is necessary to determine the position of each link and joint of the robot. Therefore, the reference coordinate system should be established. The humanoid robot is divided into five parts, each part is regarded as a connecting rod, which is a structural model of two hip joints, two knee joints and two joints to form a robot; the joint of the left calf and the ground is the coordinate origin. The robot advances in the X-axis, the horizontal axis is the Y-axis perpendicular to the X-axis, and the Z-axis in the height direction establishes the coordinate system, as shown in Fig. 1, the parameter definition is shown in Table 1.

Humanoid robot structure model.
Humanoid robot model parameters
After the coordinate system and structural model are established, the kinematics analysis of the humanoid robot can be performed. The kinematics can be divided into two parts: positive kinematics and inverse kinematics. Based on the parameters such as the length and mass of each link of the robot, positive kinematics is based on known joint angles, angles and other parameters to solve the position and attitude of the connecting rod. It can be used to describe the robot state and obstacles. The judgment of the object collision; the inverse kinematics is to solve the joint angle according to the known robot link attitude. When the robot is required to complete the specified motion, the inverse kinematics is needed to calculate the joint angle, and then the motor is driven to control the robot motion.
The kinematic derivation can plan the motion posture of the humanoid robot in an ideal environment, but there are external force disturbances such as gravity and friction in the actual motion, which will cause the robot to move in accordance with the planned posture, even shaking and falling. Industrial robots can fix the base on the ground to maintain stability; wheeled, crawler and other robots can maintain balance by stable contact with the ground and low centre of gravity; while humanoid robots stand through the foot, the contact area with the ground is small and unstable. At the same time, its shape is like that of human beings, and the centre of gravity is higher and easy to fall. Therefore, judging the stability of the humanoid robot is a key issue for achieving normal walking and accomplishing the target operation. The zero-torque point is an important physical quantity for judging the stability of the humanoid robot.
The zero-moment point was first proposed by Vukobratovic et al. in a paper on humanoid robot control, often abbreviated as ZMP, which means: all the forces on the robot’s sole, if when replaced by a resultant force, the point of action of the foot through which the resultant force passes is called ZMP; as shown in Fig. 2. The ground force of the robot’s sole can be replaced by the resultant force f and the torque around the ZMP τ
p
, as shown in the following equation:

ZMP of Robot.
The intersection of the resultant force F and the sole is ZMP, and the ZMP coordinates are calculated by Equation 2, the parameter definition is shown in Table 2.
Force analysis related parameter parameter definition table
In the actual situation, the specific distribution of the force of the plantar is difficult to measure, and other parameters of the robot need to be used for calculation and derivation. The commonly used physical parameters are as follows: M is the quality of the humanoid robot (kg);
Where
The ground force of the humanoid robot’s sole can be replaced by the resultant force f and the torque around the ZMP τ
p
, which is obtained by the formula 1. The ground force around the origin is as follows:
τ
p
is the resultant moment of the ground force around point p. Substituting the formula 3 into the formula 4 solving τ
p
gives:
Decompose the vector in the formula 5 and substitute it into τ
px
= 0 and τ
py
= 0 in the formula 1. The results of the first two lines are:
Solving the formula 6 yields the Z-axis X-axis p
x
and the Y-axis coordinate p
y
and substitutes it into the formula 2 for p
z
= 0:
The ZMP is approximated by the robot linkage model. The linkage model is considered to be composed of N mass points. The inertia tensor of each link around the centroid is ignored. The angular momentum of the humanoid robot around the coordinate origin can be expressed as:
Where N is the number of particles that make up the humanoid robot;
If the humanoid robot is treated as a particle, the formula 9 becomes:
The gait planning of the humanoid robot is usually based on the requirements of the walking operation, and the trajectory of the hip joint and the ankle joint during the walking process is planned. When the humanoid robot is walking forward in the horizontal plane, the height of the hip joint is relatively stable, and the highest point of the hip joint starting moment is very small from the lowest point height during the movement, and no additional trajectory planning is required, so the gait planning is usually for the trajectory of the ankle joint, the commonly used planning method is the three-point planning method.
When three-point planning is performed on the ankle joint, the gait planning is usually selected as the key point by selecting the starting time of the swinging foot, the highest point of the swinging ankle joint, and the moment of swinging the foot. After setting three key points, the entire cycle trajectory curve can be derived through the cubic spline interpolation function, and then the position and attitude of the humanoid robot link during gait movement can be determined. Then, other joint angles can be calculated by inverse kinematics. Curve to determine the walking gait of the robot.
Set the humanoid robot to walk in the horizontal plane. If the obstacle is crossed and the upper and lower steps are not considered, the highest point of the swinging ankle joint should be reduced as much as possible, thereby reducing the hip and knee joint swing angle during the movement and reducing the joint. Motor torque for maximum protection of the motor. Therefore, when performing the gait planning of the humanoid robot, it is only necessary to determine the distance between the starting position of the swinging foot and the landing position of the swinging foot, that is, the step distance, and the motion posture of the robot in the walking period can be determined; in addition, a walking cycle consumption is set. At the time, you can determine the gait planning of the humanoid robot. The step and walking cycle are the variables of the gait planning, and the robot linkage model is shown in the Fig. 3.

Gait planning variable model.
Set D to the step distance of the gait walking of the humanoid robot, T is the time consumed by the walking cycle. When the swinging foot is landing, the thigh and the lower leg of the swinging leg can be approximated as the same straight line, then let L be the approximation of the robot leg. The total length, θ is the approximate angle between the legs at the time of landing. According to this model, the performance function model of the gait progression speed, walking stability and motor cost of the humanoid robot is established.
The gait forward speed of the humanoid robot is determined by the step distance and the walking cycle time. The larger the step distance and the shorter the walking cycle time, the faster the robot walks faster. The expression is as follows:
Where D is the gait walking step of the humanoid robot; T is the humanoid robot walking cycle; Q1(D,T) is the humanoid robot walking speed; j1 is the adjustment parameter.
The stability of the humanoid robot walking is related to the centre of gravity and the position of the ZMP. When the distance between the centre of gravity of the humanoid robot and the ZMP is large or the ZMP position tends to the tip of the toe, the robot is likely to fall; if the humanoid robot is regarded as a mass point, the formula 10 is available.
The larger the centroid coordinate of the humanoid robot, the more the ZMP coordinates increase, that is, the ZMP position tends to the toes, and the stability of the robot becomes worse. When the humanoid robot reaches the limit point of the single-leg support period, the mass centre is farthest from the support foot, and the distance value can be approximated as half of the step distance D. That is, the larger the step distance D, the worse the stability of the humanoid robot.
Adjust the formula 12, and the distance between the centre of gravity of the humanoid robot and the ZMP can be expressed as follows:
It can be seen that the centre of gravity of the humanoid robot and the ZMP distance are related to the acceleration of the centre of gravity. The greater the acceleration of the centre of gravity of the robot, the larger the distance between the centre of gravity and the ZMP, and the worse the stability of the humanoid robot. Analyze the relationship between the acceleration of the centre of gravity of the robot and the cycle time. The formula for the reference acceleration and time is as follows:
The comprehensive formula 13 and 14 show that the longer the walking cycle time of the humanoid robot, the smaller the acceleration of the center of gravity projection point, and the better the stability of the robot, the walking stability and step and walking cycle of the humanoid robot. The relationship of time can be expressed by the following formula:
Where D is the gait walking step of the humanoid robot; T is the humanoid robot walking cycle; Q2 (D,T) is the humanoid robot walking hazard; j2 is the adjustment parameter.
Joint performance mainly has two parameters, namely speed and torque. During walking of biped robot, both speed and torque are required to meet the requirements. Joint speed is related to walking speed. Joint motor torque depends on robot link mass and posture. Robot link mass is constant and will not change with walking motion, so motor torque is simplified to be related to acceleration. From the angle of torque work, the power is equal to the product of force and speed, and the force and acceleration are directly proportional. Therefore, the demand function for joint performance is written as follows:
Random values of step distance and period are substituted into equations 11, 15 and 16 to obtain the relationship between demand functions as shown in Figs. 4 and 5. With the increase of walking speed, the walking stability of the robot decreases and the demand for joint torque increases. For the same walking speed, the walking stability and joint performance are different for different step distances and periodic parameters, so the joint stability can be improved and the performance requirements for joints can be reduced under the condition of ensuring the walking speed through the relationship of demand functions. In Figs. 4 and 5, random point 1 is in the area between point sets, which is not the optimal choice. Random point 2 is located at the edge of the point set, but it is the worst choice. Only the edge of the point set where the random point 3 is located is the optimal choice, that is, the walking speed and stability, walking speed and performance requirements are optimized simultaneously.

Simulation results with walking speed and stability.

Simulation results with walking speed and performance.
In summary, there are three optimization objectives in the dynamic multi-objective optimization problem of humanoid robot gait planning: the speed of gait advancement of humanoid robot, the stability of walking and the cost of motor; two decision variables: The step of the humanoid robot’s gait walking D and the time spent in a walking cycle is T. To facilitate the optimization calculation, use the decision vector X = [x1, x1]
T
instead of the decision variables D and T to set x1 has a value range of [a1, b1], x2 has a range of [a2, b2], Combining the formulas 11, 15, and 16, the expression for this dynamic multi-objective optimization problem is as follows:
Where f1(X) is the humanoid robot walking speed objective function; f2(X) is the stability objective function of the humanoid robot; f3(X) is the cost objective function of the humanoid robot hip motor; X = [x1, x1] T is the system decision variable; x1 is the gait walking step variable of the humanoid robot; x2 is the humanoid robot walking cycle variable; L is the approximate total length of the humanoid robot legs; j1, j2, j3 are adjustment parameters.
Multi-objective optimization algorithm
In order to optimize the performance of the humanoid robot during walking, it is necessary to analyze the relationship between the speed of the gait, the stability of the walking and the performance of the motor, and classify the performance into the objective function to determine the relevant decision vector. Then, the optimized gait is obtained through simulation experiments to verify whether the performance of the gait walking of the humanoid robot is optimized. The superiority of the improved dynamic multi-objective optimization algorithm and the feasibility of the gait optimization system are proved.
The steps of the dynamic multi-objective optimization algorithm are as follows: Establish an initial parent population; cloning and expanding the initial population; Perform particle swarm variability and non-uniform variation on cloned populations; Perform fast non-dominated sorting on all populations; Perform individual crowding distance calculation; Conduct elite strategy optimization to get a new parent population If the current number of iterations reaches the set requirement, the solution set population is output, otherwise step 8 is performed; Substitute the time variable and perform the next round of dynamic optimization.
Multi-objective optimization algorithm
The walking distance and walking period of biped robot are taken as variables, the walking speed, walking stability and motor cost of robot are taken as optimization objectives, and optimization is carried out under different population numbers and iteration times. The smaller the optimization objective value is, the better the representative performance is. The number of selected populations is 100 and the number of iterations is 100. The simulation results are shown in Figs. 6 and 7. The red dot part in the figure is on the boundary of the data point set. When the walking speed is fixed, the stability is the highest and the requirement for joint performance is the lowest.

Simulation results with walking speed and stability.

Simulation results with walking speed and performance.
Structural strength and space check
The first consideration in the structural design of the robot is to reduce the weight. The heavier the robot is, the stronger the driving capability of the joint motor is required, and the stronger the driving capability of the joint will also lead to the increase of the joint weight. The development of material science and technology has promoted the lightweight of robots, such as lighter and higher strength materials, while multi-axis NC machining technology can realize the lightweight of hollow structures. The direct purpose of these designs is to reduce the weight, reduce the design difficulty of the driving part, reduce the moment of inertia of the structural parts, and improve the walking flexibility of the robot. As little material as possible can lead to insufficient structural strength of the structural members, and when the structural members deform, the foot end position control error increases. Therefore, after the mechanical structure design is completed, the structural strength shall be checked. As shown in Fig. 8, based on ensuring the structural strength, the weight of the structural parts shall be reduced as much as possible and the deformation shall be as small as possible. Before the simulation, check the motion space of the structural members. As shown in Fig. 9, check the motion range and interference of the components during the motion process. After the completion of this part of work, the data of structure size, weight and centre of gravity can be used by simulation software for further gait simulation.

Structural strength check.

Structural strength check.
V-REP is a flexible and expandable universal robot simulator, as shown in Fig. 10. The simulator supports a variety of control methods and programming methods and can easily import the robot models built by developers themselves. The biped robot model is imported into the V-REP environment to generate the robot model in the simulation environment, and the robot linkage, steering gear and other parameters are set to obtain a robot model similar to the “real object". In addition, the V-REP environment allows various sensors to be added to track the angle value of the robot so as to facilitate real-time tracking of the robot’s motion state and analysis of problems in the robot simulation environment, and facilitate analysis and improvement at the algorithm level.

V-REP simulation interface.
The gait planner transmits the planned centroid trajectory and foot end trajectory to the joint angle planner, and the joint angle planner solves a total of 12 joint angle values of the left leg motion chain and the right leg motion chain through inverse kinematics and transmits the 12 joint angle values to the V-REP simulator. Finally, the V-REP simulator controls the biped robot motion in the simulation environment according to the received joint values, thus realizing real-time control of the biped robot.
The virtual prototype adopts a biped robot model built under a V-REP simulator, each leg has six degrees of freedom, the trunk is provided with an electronic gyroscope and an electronic accelerometer, and the foot end is provided with six-dimensional force, so that the robot speed and acceleration can be monitored in real time, each joint position and the centroid position of the robot can be output in real time, the running state of the robot can be judged, and the motion curves of important nodes such as the centroid and the foot end of the robot under different algorithms can be observed.
Based on the trajectory of the centre of mass and the trajectory of the foot end, the position of each joint is planned out through inverse kinematics to realize the normal walking of the robot. Given the positions and postures of the trunk and ankles of the left and right feet, the positions of 12 joints of the left and right legs are solved by analytical method, and the angles of each joint are calculated by analytical method. The simulation results are shown in the Figs. 11, 12 and 13.

Simulation curves of joint angle.

Simulation curves of joint speed.

Simulation curve of biped robot’s joint torque.
V-rep gait simulation can be used to obtain joint speed and torque demand data. If the current drive performance does not meet the requirements, re-parameter design and repeat the above design process until the simulation verification experiments meet the requirements.
By analysing the gait planning during the robot’s walking, it was found that the speed, stability, and motor cost performance of the walking gait conflicted with each other, and there was a conflict between the path length of the path planning and the safety. To solve these problems with multiple optimization goals and mutual constraints between the goals. This paper proposes an improved dynamic multi-objective optimization algorithm to optimize the problem, summarizes the gait speed, walking stability, and motor cost performance as the objective function, performs dynamic multi-objective optimization, and finally performs simulation experiments to obtain a relatively ideal The result of performance optimization is to replace the hardware iteration by simulation iteration, and finally to achieve the performance optimization of humanoid robot walking.
