Abstract
Dielectric elastomer actuators (DEAs) enable to create soft robots with fast response speed and high-energy density, but the fast optimization design of DEAs still remains elusive because of their continuous electromechanical deformation and high-dimensional design space. Existing approaches usually involve repeating and vast finite element calculation during the optimization process, leading to low efficiency and time consuming. The advance of deep learning has shown the potential to accelerate the optimization process, but the high-dimensional design space leads to challenge on the accuracy and generality of the deep learning model. In this work, we propose a deep learning-based automatic design framework for DEAs, capable of rapidly generating high-dimensional distributed electrode patterns based on different design objects. This framework is developed as follows: (1) a dataset construction strategy combining with a finite element model is developed to optimize the data distribution within the high-dimensional design space; (2) a neural network-embedded physical information is designed and trained to achieve accurate prediction of the continuous deformation within
Introduction
Soft robots, capable of safely interacting with humans and manipulating objects in unstructured environments, represent a burgeoning technology in the field of robotics. Those bioinspired mechanical features of soft robots mainly come from the muscle-like materials (known as artificial muscles), such as shape memory polymers,1–6 pneumatic and fluid actuators,7–11 and dielectric elastomer actuators (DEAs).12–15 Due to the advantages of electric actuation, fast response speed, high-energy density, and ease of integration, DEAs have been widely used to develop advanced soft robots.
In general, DEAs mainly consist of a dielectric elastomer membrane coated with compliant electrodes on both sides. When a high voltage is applied, the Maxwell stress between the electrodes squeezes the membrane, leading to expansion in area and decrease in thickness. Based on this working principle, DEAs with various configurations have been proposed to generate different actuations, such as elongation,16–18 bending,19–22 and contracting.23,24 Furthermore, there are a lot of mechanical achievements in the design of dielectric elastomer-actuated soft robots, such as the flying robot,25–27 deep sea swimming robot,28,29 wall-climbing robot,30–32 soft pocket pump, 33 and wearable haptic device,34–37 becoming an important trend in the development of soft robots. As the application of DEAs is emerging, the automatic design of DEAs is increasingly being desired. However, due to the nonlinear electromechanical coupling, continuous deformation, and high-dimensional design space, the automatic design of DEAs faces huge challenge.
Most of the previous works mainly focus on establishing physical models38–43 to explain the experimental phenomena (such as large deformation, mechanical instability, electric breakdown, and viscoelasticity) of DEAs. However, it is difficult to obtain the inverse analytical models for these physical models, limiting their application in the design of DEAs. As a result, most existing DEAs mainly adopt an intuitive or empirical design paradigm for prescribed object, limiting their output potential for larger actuation capability. To overcome the above drawback, some gradient-descent optimization approach-embedded finite element method have been proposed to improve the performance of DEAs, such as pairs of Bezier curve,44,45 fat Bezier curve, 46 0–1 layout, 47 the level set-based topology optimization method, 48 and the Solid Isotropic Material with Penalization method.45,49 In general, those approaches usually involve three steps: (1) calculating the continuous deformation under series of random electrode patterns based on finite element model (FEM); (2) calculating the gradient of the object function; and (3) updating the electrode patterns based on gradient-descent topology optimization methods and repeating the above process. By taking the advantage of gradient-descent iteration, the electrode patterns would be updating during the optimization process. However, vast finite element calculations of the optimization process lead to low efficiency and time consuming for one single prescribed target. Moreover, FEM could not achieve the convergence results as the design parameters increase, limiting their applications for designing DEAs with high-dimensional design space.
The advance of deep learning is capable of describing complex nonlinear mechanical systems with high accuracy and efficiency,50,51 showing potential application in solving the optimization problems, such as mechanical design52,53 and chemical synthesis.54,55 Recently, some deep learning-based design methods have been proposed in the field of soft robotics. For example, Bertoldi et al. have proposed a deep learning-based inverse design approach that can obtain an optimal 2D elastomer membrane and the inflation pressure for target 3D shape. Renaud et al. have adopted the convolutional neural network and the Bezier curve-based genetic algorithm (GA) to optimize the inner structure of pneumatic actuators. 56 Li et al. have proposed a graph neural network-based methods to optimize the supporting frame of DEA with a minimum energy structure. 57 Different from those finite element-based optimization approaches, deep learning-based design method usually starts from constructing a proper dataset and train a neural network model to predict the continuous deformation under arbitrary design parameters, achieving fast optimization design. In general, the key of deep learning-based design methods depends on the accuracy and generality of neural network model trained on constructed dataset. Usually, the dataset consists of randomly generated design parameters and the continuous deformation calculated by FEM. To construct the dataset for training a deformation prediction model, a common approach is to randomly sample the design space (which is the electrode pattern). However, due to the high-dimensional design space of DEAs, randomly selecting the electrode patterns to construct the deformation dataset exits bias, as the increase of random parameters of the high-dimensional design space leads to the small deformation due to the fragmental electrodes. While the large deformation is usually demanded for the applications of DEAs. And the dataset is difficult to cover the whole deformation space of DEAs with randomly sampling of high-dimensional design space. As a result, the accuracy and generality of the deep learning model may deteriorate, limiting their applications for high-dimensional design spaces. To date, existing deep learning-based methods mainly focus on optimizing a small number of design parameters. How to rapidly design the distributed electrodes of DEAs still remains elusive.
In this work, we propose a deep learning-based design framework to design the distributed electric field of DEAs fast and automatically. To this end, we first adopt the neo-Hookean constitutive model to describe the nonlinear electromechanical effect of dielectric elastomer material and establish a membrane element through a user-defined material model (UMAT) in ABQUAS. Then, a dataset evaluation method and dataset construction strategy are developed to optimize the electrode sampling for the high-dimensional design space. Furthermore, a neural network is trained on the dataset-embedded physical information to describe the relationship between distributed electric field and continuous deformation of DEAs. Lastly, by using the fast deformation prediction neural network as a surrogate model, a GA is adopted to automatically optimize the distributed electric field (Supplementary Movie S1). To validate the effectiveness, we employ the framework to automatically optimize the distributed electric field according to different design objects, such as specific displacement and maximum displacement of single point or multiple points. Both the calculated results (FEM and Neural Network predicted) and the experimental data demonstrate that: (1) the Neural Network (NN) model trained on the constructed dataset can predict the deformations of DEAs precisely compared with the FEM results (
Method
Working principle of the design framework
Figure 1 shows the working principle of the automatic design framework for DEAs based on different design objects. In general, a planar DEA mainly consists of a prestretched dielectric elastomer membrane (made of acrylic, 3M VHB 4910, thickness of 1.0 mm size of 50 mm × 50 mm, prestretch of

Automatic design framework for DEAs.
A neo-Hookean FEM is first established to obtain the continuous deformation under distributed electrodes. In our work, we focus on the static responses of DEAs instead of dynamic responses. Then, we adopt a neo-Hookean constitutive model without considering the viscosity of the material (see Supplementary Data S1 and Supplementary Fig. S2 for more details about the development of the FEM). Moreover, a dataset evaluation method and dataset construction strategy are developed to optimize the electrode sampling for high-dimensional design space, paving the way for training a deep learning model.
Based on the dataset, a neural network-embedded physical information is designed and trained, which can accurately predict the continuous deformation within
The trained neural network is working as the surrogate model and a GA-based optimization method is adopted to automatically design distributed electrodes of DEAs. For proof-of-concept testing, a series of case studies (including maximum displacement, specific displacement, multiplicity of solutions, multiple degree-of-freedom actuations, and complex actuations) have been conducted. Both simulation results and experimental data demonstrate that our design framework can automatically design the electrode pattern within 2 min. Supplementary Movie S1 shows one example of the design process of the planar DEA. It can be observed that it only takes about 120 s to obtain the desired electrode pattern. Compared with existing design methods (Supplementary Table S1), our design framework can accelerate the design process by three orders. In addition, with the designed electrode pattern, the displacement of the central point is
Based on the above working principle of our design framework, the key is to establish an accurate surrogate model. To this end, we first need to construct a proper dataset that satisfies the following conditions: (1) its deformation distribution should cover the deformation space of the planar DEA under any electrode pattern and (2) the size of the dataset should be affordable. However, due to the high-dimensional design space of the distributed electrodes, the deformation space is difficult to figure out. Increasing the size of the dataset may contribute to enlarging the deformation space of the dataset, but it leads to time-consuming calculation. In this work, we propose a dataset evaluation and augmentation approach to construct the dataset.
Dataset construction
In general, the dataset consists of a series of distributed electrode patterns and corresponding continuous deformations. The first step is to generate distributed electrode patterns. To this end, the dielectric elastomer membrane is meshed into small squares, and we use “0” and “1” to represent without and with electrodes of each small square, respectively. It should be noted that the performance of the planar DEAs relies on the total area of the electrode. Without loss of generality, we first set the total area of the electrode pattern as half of the dielectric elastomer membrane and expand it to changing area. In addition, the prediction accuracy of the complex deformation field can be improved by reducing the size of the small square, but it will enlarge the design space and lead to fabrication problems. Therefore, this work adopts 1.0 mm as the size of the square, and the dielectric elastomer membrane is meshed into
Specifically, we first define the range of the maximum displacement under arbitrary electrode as the deformation space. Based on our experience, the deformation space of planar DEAs is selected as
Based on the above evaluation method, the dataset is constructed by the following four steps. The first step is to randomly generate electrode patterns. We adopt a random algorithm (randomly select 1250 units out of 2500 and set as 1) to generate 10,000 electrodes patterns (called small grid) and calculate the continuous deformation through the FEM. Based on the dataset with small grid electrode patterns, we can obtain the probability density of the small grid-based dataset (Fig. 2a). It can be seen that most of the data are concentrated on the small deformation area. Based on the dataset, a neural network model is also trained, which is working as the surrogate model for optimizing the design of planar DEAs. However, for the designed planar DEA, its maximum deformation predicted by the surrogate model is much smaller than that of the FEM with same electrode pattern (see Supplementary Fig. S5 for more details about the surrogate model based on the random electrode-based dataset). It is worth noting that the prediction RMSE of fragmental electrode patterns increases for the model trained without the small grid-based dataset predicted. The main reason depends on the fact that the small grid-based dataset is concentrated on the small deformation area due to fragmental electrodes. As a result, the trained surrogate model cannot predict large deformation.

Construction of the dataset.
To reduce the fragmentation, the second step is to enlarge the size of the unit (called large grid-based dataset). We adopt another nine kinds of size to mesh the dielectric elastomer membrane, including
To overcome the above drawback, the third step is to adopt a Bezier curve method to generate continuous electrodes. We choose two arbitrary points along the edge and middle point as the through points, then two hyperparameters are introduced to generate four control points of the cubic Bezier curve. By randomly sampling the two arbitrary points and two hyperparameters, we obtain 10,000 Bezier curve-based electrode patterns. The probability density distribution of the Bezier curve-based dataset (Fig. 2c) demonstrates that it can significantly expand the maximum displacement distribution of the dataset. Based on the above three steps, we obtain three datasets with three different probability density distributions. Furthermore, we use them to construct a bigger dataset and analyze its probability density distribution, shown in Figure 2e. It can be observed that the displacement in the dataset can cover different displacement levels, including small, middle, and large deformation. However, as the distribution is still ununiform, the sample size under different deformation is unbalanced and the trained surrogate model will tend to be high probability area. To further improve the generality of the dataset, the fourth step is to augment the dataset. Due to the geometric symmetry property of planar DEA, the displacement field of the electrode after transformation (rotation or mirroring) is the same transformation of previous displacement field, paving a way to augment the data. As shown in Figure 2d and Supplementary Fig. S3d, the process of data augmentation include the following three steps: (1) evaluate the maximum displacement distribution of all the electrodes in the original dataset; (2) split the electrodes by the uniform distribution curve; (3) for the electrode with a large sample size (maximum displacement above the uniform distribution), we calculate how many times it is larger than the uniform distribution (e.g.,
Finally, we construct a dataset with physical information that consists of
Neural network-embedded physical information-based surrogate model
With the physical prior knowledge of DEAs, we build a specialized convolutional neural network to model the continuous deformation of the planar DEA under distributed electric field.
Design of the neural network
To describe the relationship between the distributed electric field and the continuous deformation, we define it as a nonlinear mapping problem. The input is distributed electric field

Design and validation of the neural network.
Encoder module is the feature extraction part, which is used to extract spatial features from electrode pattern distribution. To sufficiently extract local and global information, we utilize a five-layer sequential neural network. Considering that DEA has mechanical and electrical continuity, and strong interactions in adjacent electric nodes, we take large kernels in the early layer to increase the receptive fields and extract features, which can depict the spatial correlations, and fuse local information to generate effective features. Each layer can perform nonlinear transformation. Five-layer neural network can effectively extract local and global features from raw data and mapping them into high-dimensional embedding space. The bottleneck layers are used to map extracted spatial features to high-dimensional displacement feature space for successive decoding and reconstruction process.
Decoder module is the reconstruction part, which is utilized to decode the high-dimensional features into displacement field. Here, we utilize three convolutional neural network layers, which gradually enlarges the spatial resolution and reduces the feature dimension, and finally reconstructs the displacement field. Considering the displacement on X and Y directions are strongly coupled and implied in the spatial features of electric pattern, we build two parallel branches to reconstruct horizontal displacement simultaneously (Supplementary Fig. S4). Furthermore, to increase efficiency and stability of the training process, we add an assistive branch to help the neural network converge. The branch takes 1/2X as a reconstruct target and can guide network modeling process. Due to the symmetrical data, 1/2X branch can act as a 1/2Y branch in some extend and only design one 1/2 branch for network efficiency. In the whole network, 1/2 branch is acting as coarse scale, which can reconstruct global and coarse displacement, while full-scale branch can further provide details and high-resolution displacement field. With multiscale branch structure and corresponding supervisions, the training process would be more stable and converge fast, as well as higher performance.
Loss function of the neural network
The target of the proposed network is estimating the accurate displacement under distributed electric field. To introduce the physical mechanism into our predicting model, we design the loss function to guide the network leaning process and minimize the output error. We first choose the L1 smooth loss to make the output displacement fiddle close to the ground truth. L1 smooth loss can provide stable loss sign for the network without big gradient backpropagation. The maximum displacement output represents the motion capacity of DEAs, which is important to constrain the noise and outlier that may affect the training process. Thus, we introduce the maximum displacement error into loss function that is used to constrain the error and abnormal points. Besides, a hyperparameter
Based on the above loss function, the neural network is trained to predict the continuous deformation field. We train five prediction models based on the datasets in Figure 2, including small grid, large grid, Bezier, augmented, and without small grid datasets. Figure 3b shows one example of the finite element analysis of DEAs and the prediction result based on models trained on five datasets. It can be seen that with the augmented dataset, the surrogate model can precisely predict the continuous deformation. Compared with the FEM simulation results, the RMSE is <0.06, which is considered an acceptable prediction error. Furthermore, we adopt three electrode patterns to validate the effectiveness of those surrogate models (Fig. 3b and Supplementary Fig. S5). Based on the simulation results, the maximum displacement of each electrode pattern of the surrogate model is plotted as a function of that of the FEM. Linear coefficient of determination
We then trained a model without small grid dataset and test the predicted maximum displacement of each electrode pattern of the dataset. The results in Figure 3d show that the model trained without small grid dataset cannot predict the deformation in small scale of deformation, while the model trained by the augmented dataset could predict the deformation during all deformation space (Fig. 3e). We then predict the displacement field of several electrode pattern on various datasets (Supplementary Fig. S5). The model trained without small grid dataset has twice as much error PV as the model trained on augmented dataset when the maximum displacement of input electrode pattern is at a small level. In addition, for the neural network-based surrogate model, it only takes
GA-based inverse design
Based on the surrogate model, many kinds of optimization approaches can be used. Considering the serious nonlinearity of DEAs and high-dimensional design space, we adopt a GA to optimize the distributed electrodes of the planar DEAs in this work.
To optimize the electrode pattern, we first need to define a design object. Although the DEAs can generate continuous deformations, we usually only care about the performance of a few points. Without loss of generality, the design object is defined as maximum displacement of specific points, which can be expressed as:
With the loss function, we employ the GA to achieve the inverse design of DEAs, as shown in Figure 4a. To this end, we first mesh the dielectric elastomer membrane into

Schematics of inverse design with genetic algorithm embedded neural network.
Case I: Maximum displacement of a single point
Based on the above GA-based inverse design method, we first set the design object as maximum displacement of a single point toward an arbitrary direction. And the experimental setup is shown in Supplementary Fig. S7. Without loss of generality, the selected points and directions are illustrated in Supplementary Table S2. For the convenience of comparison, we also need eight intuitive designs (Fig. 5a). To construct the intuitive designs, we adopted a same rule for all cases instead of selecting specifical electrode pattern to deteriorate the output displacement of the intuitive design. We separate the dielectric elastomer membrane into two regions by a line that crosses the object point and is perpendicular to its desired motion direction, then the region behind the motion direction is selected as the electrode pattern. According to the above eight design objects, we separately employ the GA-based inverse design method by taking the coordinate and motion direction of the target point as an input to automatically design the electrode pattern. Based on the designed electrode pattern, corresponding experiments are conducted and evaluated. Figure 5 and Supplementary Movie S2 show the design results, which demonstrate that:

Designed DEAs of maximum displacement of a single point.
For arbitrary points and directions, our method always can rapidly generate optimized electrode patterns (Fig. 5a), which will accelerate the application of DEAs in the field of soft robotics.
Compared with the intuitive design (Fig. 5b and Supplementary Fig. S8), the performance of optimized electrode pattern is improved by about
Case II: Maximum displacement under different area fraction
In the above analysis, we keep the area fraction of the electrode pattern constant. Of course, the area fraction also influences the performance of the planar DEAs. Therefore, changing area fraction is adopted to evaluate the expandability of the automatic design framework. To this end, the design object is selected as maximum displacement of the central point under different area fraction, which can be described as:
Based on the above design object, the area fraction is changing from
Case III: Maximum displacement of multiple points
Except the single point, we further explore the application for motion of multiple points. To this end, we select two kinds of design objects. The first one is to design an electrode pattern that simultaneously generates the centripetal motion of four points. The second one is to simultaneously generate the rotation motion of four points. Two design objects can be expressed as:
Based on the above deign objects, we adopt the automatic design framework to design the electrode patterns. Figure 6b and Supplementary Fig. S9 show the simulation results and experimental data. It can be seen that with the optimized electrode pattern, the four points can generate centripetal motion with 0.8 mm and the rotation motion by

Experimental results of multiple tasks.
Case IV: Multiple solutions
Due to the high-dimensional design space, multiple solutions may widely exist for the same design object. By taking advantage of the GA, it can easily generate multiple solutions. As shown in Figure 6c, we employ our automatic design framework to generate eight electrode patterns with basically same maximum displacement of the central point (about 2.2 mm). Furthermore, when the displacement of the central point is set as 1.0 mm, our automatic design framework also can generate a series of electrode patterns (such as the eight electrode patterns shown in Fig. 6d) with different shape and area fraction. The capability of multiple solutions of our automatic design framework may contribute to satisfying different application conditions of DEAs.
Conclusions
In this work, we propose a deep learning-based design framework for DEAs to automatically generate distributed electrode patterns based on desired performances. To this end, we first establish a membrane element through UMAT in ABAQUS to generate dataset for training neural network and propose a dataset construction approach to optimize the distribution of the dataset. With the optimized dataset, a neural network-embedded physical information is trained to accurately and rapidly predict continuous deformation under arbitrary electrode patterns within
Footnotes
Acknowledgment
The authors thank openbayes.com for providing high-performance computing.
Author Disclosure Statement
No competing financial interests exist.
Funding Information
This work was supported in part by the National Natural Science Foundation of China under Grant 52275024 and 52025057, in part by the Natural Science Foundation of Shanghai under Grant 23ZR1435500, and in part by the Chenguang Program of Shanghai Education Development Foundation and Shanghai Municipal Education Commission under Grant 22CGA11.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
