Abstract
Accurate segmentation of casting defects plays a positive role in the quality control of casting products, and is of great significance for accurate extraction of the mechanical properties of defects in the casting solidification process. However, as the shape of casting defects is complex and irregular, it is challenging to segment casting defects by existing segmentation methods. To address this, a spectrum domain instance segmentation model (SISN) is proposed for segmenting five types of casting defects with complex shapes accurately. The five defects are inclusion, shrinkage, hot tearing, cold tearing and micro pore. The proposed model consists of three sub-models: the spectrum domain region proposal model (SRPN), spectrum domain region of interest alignment model (SRoIAlign) and spectrum domain instance generation model (SIGN). SRPN uses a multi-scale anchoring mechanism to detect defects of various sizes, where the SSReLU and SCPool functions are used to solve the spectrum domain gradient explosion problem and the spectrum domain over-fitting problem. SRoIAlign uses the floating-point quantization operation and the tri-linear interpolation method to quantize the 3D proposals to the feature values in an accurate manner. SIGN is a full-spectrum domain neural network applied to 3D proposals, generating a segmentation instance of defects in a point-wise manner. In the experiments, we test the effectiveness of the proposed model from three aspects: segmentation accuracy, time performance and mechanical property extraction accuracy.
Introduction
Segmenting casting defects is an essential and practical problem in the field of casting CAE and plays an important role in accurately extracting the mechanical properties of defects. Because accurately segmenting casting defects prepares the ground for the exact extraction of the mechanical properties of defects, it is essential to research defect segmentation methods that are applied to improve the extraction accuracy of the mechanical properties of casting defects. Most of the existing methods can only detect defects or extract simple feature parameters of defects. Even the most advanced method can only segment the casting defect region; that is, the segmentation range is limited to the region around the defect. This coarse-grained segmentation method will lead to inaccurate extraction of the mechanical properties of casting defects. In addition, because the shape of the casting defect is complex and irregular, it is difficult to segment casting defects by using the existing methods. In this paper, a spectrum domain instance segmentation model (SISN) is proposed for segmenting five types of casting defects with complex shapes accurately. The five defects are inclusion, shrinkage, hot tearing, cold tearing and micro pore. SISN extracts the three-dimensional topological characteristics from the input volumes and the micro-structure features of the defects from metallographic images and outputs the segmentation instance of the defect. First, the SRPN sub-model is proposed to extract three-dimensional topological characteristics from the input volumes and output multi-scale three-dimensional proposals of the defects. The three critical parts of the SRPN are the multi-scale anchoring mechanism, the spectrum domain activation function (SSReLU) and the spectrum domain pooling operation (SCPool). Second, the SRoIAlign sub-model uses the floating-point quantization operation at the boundary of the feature volume to transform the multi-scale 3D proposals into fixed-size feature volumes and uses the tri-linear interpolation method to calculate the feature values of the featurse bins, which are aggregated by the following SCPool layer. Finally, the SIGN sub-model is proposed to learn characteristics from the three-dimensional proposals of the defects and generate the segmentation instance for each defect. The segmentation instance of the defect is used to calculate the mechanical properties of the defect in the experiment section. The structure of this paper is as follows: the SISN model is outlined in Section 3. The SRPN, SRoIAlign and SIGN sub-models are introduced in Sections 4–6, respectively. Finally, the experimental results are introduced in Section 7. The innovative aspects of this paper include:
A spectrum domain instance segmentation model is proposed to segment casting defects accurately. A spectrum domain region proposal network (SRPN) is proposed to detect defects of multiple sizes. A spectrum domain RoI alignment model (SRoIAlign) is proposed to quantize the multi-size 3D proposals to the feature values in point-wise manner. A spectrum domain instance generation network (SIGN) is proposed to generate the segmentation instance of the defect.
The segmentation and detection of casting defects have become a popular issue in the field of casting simulation. A large number of scholars and research institutions have carried out in-depth research on segmenting casting defects and achieved many results. In this section, based on the summary of existing articles, combined with some of the latest research results of artificial intelligence methods in the field of casting defect recognition, the related work of this paper is summarized and analyzed as follows.
Since 1922, the United States established the world’s first industrial X-ray laboratory, making X-rays to detect the quality of casting products a reality. The X-ray detection method uses the X-ray absorption principle of casting defects to identify defects through defect ray images with specific grey distributions generated by the absorption of photosensitive materials. The X-ray detection method is still widely used because of its simple, intuitive, and convenient operation [1, 2, 3, 4, 5, 6]. However, X-ray imaging mostly reflects the casting defect morphology above the centimetre level, and manual operation is needed to complete the detection work. Due to the rise of artificial intelligence methods, this manual defect detection method shows disadvantages.
The use of neural networks for detecting casting defects appeared approximately 10 years ago. Because large-scale datasets obtained from foundries can be used to train neural networks, the casting defect recognition method based on an artificial neural network can be realized. Lewis et al. constructed a simple neural network model to predict the influence of casting processing parameters on casting defects [7] and established the relationship between the casting temperature parameters and casting defects. This method can predict whether there are defects in casting products and the possible causes but cannot predict the key geometric information, such as the location, shape, and quantity of defects. Zheng et al. constructed a double-layer BP neural network to predict the relationship between casting defects and casting processing parameters [8]. However, the neural network structure is too simple, and the depth of layers is too shallow, which limits the accuracy of the network, and the network is not used to detect casting defects. To address this, researchers have proposed a variety of improvement strategies to improve the accuracy of neural networks for defect detection. In 2012, Masci et al. used multi-scale pyramid pooling technique to build a neural network to detect defects in steel castings [9, 10]. Liu et al. constructed a deep confidence network model to extract the defect information in high-dimensional feature space and introduced a parameter transfer mechanism to alleviate the over-fitting phenomenon in a neural network [11]. However, these methods are not designed for the segmentation of defects, which makes it impossible for them to extract the mechanical properties of defects. Additionally, a variety of artificial intelligence methods for defect detection have emerged. Hanzaei used the RIMLV operator and the close morphological operator to detect ceramic defects [12], which can extract the geometric features of the defects automatically. Dounias used machine learning method to detect the mechanical defect of washing machine in the production line [13, 14, 15, 16]. Yan et al. [17]. proposed an AASI-GTP model to detect low-contrast defects on micro3D textured surfaces. However, these methods are designed specifically for ceramic defects and low-contrast defects, which are not applicable to the detection or segmentation of casting defects. Therefore, using an artificial neural network to detect defects entered a bottleneck period until the appearance of convolutional neural networks. In 2017, Wang et al. [18]. designed a deep convolutional neural network (CNN) based on the classical convolution neural network benchmark framework [19, 20, 21], which can automatically extract the geometric characteristics from the defect regions. Fang et al. [21]. constructed a deep convolutional neural network based on the traditional Faster R-CNN [23, 24] framework to detect crack defects. Zhang et al. [25]. constructed a category-aware object detection network to classify and detect surface defects. These CNN-based methods make it possible to detect surface defects accurately. However, the traditional CNN has a lower recognition accuracy for inner defects, which limits the extraction accuracy of the characteristic parameters of the inner defects. To address this, in 2018, Lin et al. proposed an improved deep convolution neural network architecture to detect casting inner defects more accurately, and the inner defects included shrinkage, porosity and hot tearing [26]. However, this method still cannot segment casting defects or extract the mechanical properties of castings. In 2019, reference [27] proposed a spatial domain segmentation network to segment the casting defect region. However, this method was proposed to segment defects in a coarse-grained manner, and the segmentation is limited to the area around the defect. Therefore, for defects with complex shapes, this method cannot segment them accurately, and inaccurate segmentation instances will also lead to a decline in the extraction accuracy of the mechanical properties. To improve the segmentation accuracy of casting defects with complex shapes and maintain the time performance of the network at a better level, a spectrum domain instance segmentation network (SISN) is proposed in this paper. SISN is a full spectrum domain training neural network model, from which the mechanical properties can be extracted. The model can improve casting process simulation accuracy and improve the rationality of casting process selection to improve the quality of castings.
The architecture of SISN
In casting process simulation, the segmentation accuracy of casting defects directly affects the extraction accuracy of the mechanical properties of defects. In this paper, a spectrum domain instance segmentation model (abbreviated as SISN) is proposed for segmenting defects accurately. SISN consists of three sub-models: the spectrum domain region proposal network, abbreviated as SRPN; the spectrum domain region of interest alignment network, abbreviated as SRoIAlign; and the spectrum domain instance generation network, abbreviated as SIGN. The SRPN sub-model extracts three-dimensional topological characteristics from the input volumes and output multi-scale three-dimensional defect proposals. In addition, a residual network (ResNet-50-C4) [28] is used to extract the micro-structure features of the defects from metallographic images. SISN uses the ORN training framework [29] to train the SRPN and ResNet-50-C4 jointly and projects the extracted micro-structure features to the three-dimensional defect proposals. SRoIAlign uses a floating-point quantization operation to subdivide the multi-scale three-dimensional proposals of the defects into fixed-size feature volumes and uses the tri-linear interpolation method to calculate the feature values of the feature bins. SIGN learns characteristics from the three-dimensional proposals of the defects and generates the segmentation instance of the defect. The segmentation instance of the defect is a three-dimensional geometric layout of the defect with micro-structure features. The SRPN, SRoIAlign and SIGN sub-models are completely trained and tested in the spectrum domain. The pipeline of the proposed method is shown in Fig. 1. The proposed method is introduced in detail in the following sections.
The pipeline of the proposed method.
The SRPN pipeline. The SRPN is a full-spectrum domain neural network model which includes the Sconv layers, SSReLU layers and SCPool layers. The Sconv layer is a convolution layer in spectrum domain, which uses dot product operation in each convolutional layer instead of the traditional convolutional operation.
The precondition for accurately segmenting casting defects is how to detect defects of various sizes accurately. Therefore, a spectrum domain region proposal network (SRPN) is designed for accurately detecting multi-size defects. The SRPN pipeline is shown in Fig. 2. On the one hand, the SRPN uses the multi-scale anchoring mechanism to detect defects of various sizes. On the other hand, the spectrum domain architecture of SRPN ensures that the SRPN quickly detects defects from hundreds of millions of input volumes. In the rest of this section, we explain the SRPN structure in detail from these two aspects.
First, the SRPN uses a multi-scale anchoring mechanism for detecting defects of various sizes. The essence of the anchor mechanism is to use a sliding window to detect defects in a multi-scale manner [30]. We use a feature pyramid mechanism to generate the multi-size volumes for each defect and use a sliding window to scroll on the multi-size volume to generate a set of candidate regions; that is, we assign a set of rectangular boxes (anchors) with different scales and ratios to each spatial point. The location of these anchors is the coordinates of the central spatial point contained in the rectangular box. Then, the SRPN model proposed in this section is used to classify the defect feature information in the candidate regions. The parameter settings of the anchor mechanism used in this paper are explained as follows.
The calculation process of the SSReLU function and the SCPool operation. We assume that the size of the output volume is 
The camera coordinates of the input volume refer to the position coordinates of the input volume overlapping with the gravity direction. The effective scanning range of our 3D casting scanner is [ The number of anchor boxes is defined as 20 within a sliding window. An anchor box corresponds to a 3D proposal, as shown in Fig. 2. When the anchor aspect ratio is different, we define two anchors with the same volume but different angles. For example, the aspect ratio of an anchor is 1:2:1, and the aspect ratio of another anchor is defined as 2:1:1, which are two independent and different anchors. The aspect ratio of the anchor boxes ranges from 0.5 millimetres (gas porosity defect) to 3 millimetres (crack defect). Because the shape of the defects is complex and diverse, if we use single-scale candidate regions to predict the positions of all anchor boxes, there will be a large number of invalid mappings. Therefore, we use the multi-scale SRPN to detect defects of various sizes. We define the multi-task loss function on each anchor box as
Second, the SRPN sub-model is a whole-spectrum domain training model. The SRPN model uses the SSReLU function to eliminate the gradient explosion phenomenon in the spectrum domain back-propagation pass and uses the spectrum domain down sampling operation (SCPool) to solve the spectrum domain over-fitting problem [31]. Inspired by the spatial domain activation function [32, 33, 34], we design the SSReLU function as follows:
where
where
The pipeline of SRoIAlign sub-model. The floating-point quantization operation is used to quantify the feature volumes, and the tri-linear interpolation method is used to calculate the feature values of the feature bins.
The SRPN sub-model extracts three-dimensional topological characteristics from the input volumes and output multi-scale three-dimensional proposals of the defects. In addition, the residual network (ResNet-50-C4) is used to extract the micro-structure features of the defects from metallographic images. SISN uses the ORN training framework to train the SRPN and ResNet-50-C4 jointly and maps the extracted micro-structure features to the three-dimensional proposals of the defects.
Because the pooling layer only accepts the fixed-size feature volume, it is necessary to transform the multi-scale 3D proposals into fixed-size feature volumes. The traditional RoIPooling method uses truncated rounding quantization at the boundary of the feature volume to transform the multi-scale 3D proposals into fixed-size feature volumes [40, 41]. This quantization method leads to a mismatch between the 3D RoI and the sfeature values. To address this, the SRoIAlign sub-model is proposed to quantize the 3D proposals to the feature values accurately. The SRoIAlign sub-model completes the quantization task in three steps. We describe the three tasks in detail as follows.
The floating-point quantization operation is used to quantize the 3D proposal to the feature volume. We assume that the size of a 3D proposal output by SRPN is The feature volume is subdivided into fixed-size feature bins. We assume that the size of the input feature volume of the SCPool is The tri-linear interpolation method [42] is used to calculate the feature values of the feature bins. The tri-linear interpolation method implements the linear interpolation operation on the tensor product grid of eight discrete sampling points in the feature bin. We perform a tri-linear interpolation on each feature bin to obtain
The pipeline of SIGN sub-model. The head of SIGN is built on the fifth layer of the ResNet50 backbone structure and the feature pyramid network structure. The numbers under the name of each layer indicate the volume resolution and the number of spectrum-domain channels. In addition, the porosity parameters of each defect instance in the solidification process are characterized by the hierarchical visualization method.
An important factor affecting the extraction accuracy of the mechanical properties of casting defects is whether an accurate segmentation instance of the defect can be obtained. However, it is a challenging task to obtain accurate segmentation instances of defects, especially for defects with complex shapes. To address this, a spectrum domain instance generation network (SIGN) is proposed to generate segmentation instances of defects with complex shapes accurately. The SIGN sub-model is a full-spectrum domain convolutional neural network applied to 3D proposal, generating a segmentation instance in a point-to-point way. In this section, we introduce the SIGN sub-model in detail from four aspects: instance representation, backbone framework, head framework, and loss function.
Instance representation. The segmentation instance encodes the geometric layout of the input volume. Therefore, it is different from the classification regression and bounding box regression that are projected into small output volumes by full-connection operations. SIGN extracts the geometric structure of the defect by point-to-point alignment and spectrum domain convolution operations [43]. The spectrum domain pipeline of the SIGN is shown in Fig. 5. SIGN uses a full-spectrum domain convolutional neural network pipeline to generate a segmentation instance from a 3D proposal. This pipeline structure allows a 3D proposal to pass through the pipeline in a complete geometric layout without projecting into a small volume representation that lacks geometric information. It is different from the previous networks that use the full spatial domain convolutional layers to predict the segmentation instances, SIGN uses fewer feature parameters to represent the segmentation instances and is more accurate than the spatial network, which is verified in the experiments in Section 7. Backbone framework. The backbone framework is used to extract features from the input volumes. We use three typical networks as our backbone frameworks: AlexNet7, VGGNet19 and ResNet50. The number after the network name indicates the depth of the network. In addition, the feature pyramid network is used to extract the multi-scale features of the 3D proposal from the multi-level feature pyramid. This feature pyramid network is also used in the SRPN sub-network to generate the multi-size feature volumes for each defect. Therefore, the two models can share the same training process, which is helpful to improve the training speed of the network. Head framework. The head framework is used to generate the classification label, the bounding box and the segmentation instance. The segmentation instance is applied exclusively to 3D proposals that are output by the SRPN sub-model. The head framework pipeline is shown in Fig. 5. The head of SIGN is built on the fifth layer of the ResNet50 backbone structure and the feature pyramid network structure. The size of the convolutional filter is Loss function. We use the multi-task loss function to measure the regression loss of the instance segmentation results. The multi-task loss contains three parts: the classification loss denoted as
To verify that the proposed model plays a positive role in the segmentation of casting defects, we test and analyse the advantages of the SISN model from three aspects: segmentation accuracy, time performance and extraction accuracy. Before giving the experimental data, we first explain the pre-set parameters of the experiment.
Datasets. The in-house EA88 dataset is constructed for training and testing the spectrum domain networks in our experiments. The EA88 dataset consists of two parts: the three-dimensional volume models of EA88 and the metallographic images. The EA88 dataset contains 12,000 pieces of metallographic images, each metallographic image contains 400 attribute labels, and each attribute label contains 8 micro-structure feature parameters; that is, the EA88 dataset contains Casting defect samples. The casting defect samples used in this paper are all from an EA88 cylinder. The EA88 cylinder is made by a sand casting process. The process parameters of EA88 cylinder casting are as follows: 1. the pouring temperature is 1,390
Detection and segmentation performance of DCNN, koCNN and SISN
The experiment streamline. The casting defect samples used in this paper are all from an EA88 cylinder which is made by a sand casting process. A Prisma E scanning electron microscope was used to scan the defect samples to obtain metallographic images of the defects.
In this experiment, we select four typical neural network models to test their detection accuracy and segmentation accuracy for casting defects and then compare the testing results with our models. We use the parameter AP to evaluate the defect detection accuracy and segmentation accuracy of these models. AP is the abbreviation of average precision, and the AP error threshold is set to 0.25 and 0.15. Due to the different microstructures and formation mechanisms of casting defects, we further subdivide the AP parameters into five grades, which are represented by five AP parameters with subscripts:
Classification performance for SISN, koCNN and DCNN
Classification performance for SISN, koCNN and DCNN
Then, we use the spectrum domain model koCNN to detect five kinds of defects. For the five defects, the average detection accuracy of koCNN is 41.08 AP and that of DCNN is 43.44 AP, which is 2.36 points higher than the former. This is because koCNN uses sinc interpolation and Hermitian symmetry to speed up the detection. These two functions use boundary interception to reduce the training time, which also reduces the detection accuracy. Similar results are obtained for the five defect combinations. However, we integrate the proposed SIGN sub-model into the koCNN and the IGN sub-model into the DCNN model (denoted as koCNN-I and DCNN-I) so that koCNN-I and DCNN-I can segment the defects. For the five defects, the average detection accuracy of koCNN-I is 30.82 AP and that of DCNN-I is 32.56 AP, which is 1.74 points higher than the former. Furthermore, for the five defect combinations, the average detection accuracy of koCNN-I is 25.94AP and that of DCNN-I is 28.64AP, which is 2.7 points higher than the former. It can be seen that the complexity of the defect increases, but the segmentation accuracy of koCNN-I and DCNN-I does not decrease, and is even nearly one point higher than that of koCNN and DCNN. This shows that the proposed SIGN framework has good compatibility and high accuracy. It can be integrated into other detection frameworks to segment complex defects accurately.
Finally, we evaluate the detection and segmentation performance of our model in three levels. In the first level, we combine the SRPN sub-model and ResNet50 network to generate the first-level detection framework (denoted as SISN-L1) and calculate the detection accuracy of the first-level framework for five kinds of defects and five kinds of defect combinations. In the second level, we combine the SRoIAlign and SIGN sub-model to generate the second-level segmentation framework (denoted as SISN-L2) and calculate the segmentation accuracy of the second-level framework. In the third level, we combine the SRPN, SRoIAlign and SIGN sub-models to generate the last level segmentation framework (denoted as SISN-L3) and calculate the segmentation accuracy of the SISN for five defects and five defect combinations. As shown in Table 1, the average detection accuracy of SISN-L1 is 45.72 AP, which is 2.28 points higher than that of the DCNN model and 4.64 points higher than that of the koCNN model. This is due to the use of the proposed SRPN sub-model, in which the SSReLU and the SCPool are employed to train the network completely in the spectrum domain. The average segmentation accuracy of SISN-L2 is 41.18 AP, which is 8.62 points higher than that of the koCNN-I model and 10.36 points higher than that of the DCNN-I model. This is due to the proposed SRoIAlign sub-model, in which the floating-point quantization operation is used to quantize the 3D proposal to the feature volume. However, the integrated models (koCNN-I and DCNN-I) use truncated rounding quantization at the boundary of the feature volume, which reduces the segmentation accuracy. In addition, SRoIAlign uses the tri-linear interpolation method to calculate the feature value of the centre point of the feature bin, which also improves the segmentation accuracy. For the five single defects, the average segmentation accuracy of SISN-L3 is 45.40 AP, which is 4.22 points higher than that of SISN-L2. For the five defect combinations, the average segmentation accuracy of SISN-L3 is 42.72 AP, which is 3.84 points higher than that of SISN-L2. The difference in accuracy increment between the two is only 0.38 points. This is attributed to the SIGN sub-model which generates the segmentation instance of the defect in a point-to-point way. The results show that regardless of whether the geometric complexity of defects is consistent, the combination of SRPN, SRoIAlign and SIGN greatly improves the defect segmentation accuracy, and the three sub-models are compatible with each other.
To evaluate the classification accuracy of the proposed model for casting defects, we present another set of experimental data. We select the seven models in Table 1 as the test models and calculate the classification values of the seven models for five types of defects(Type-A, Type-B, Type-C, Type-D and Type-E). The calculation results are shown in Table 2. For Type-A defect, the classification value of the SISN-L1 model and koCNN model are 90.0 AP and 65.8 AP respectively. The mean Average Precision of SISN-L1 is 87.9 mAP and that of koCNN is 62.6 mAP. This shows that the SISN-L1 model has better classification performance than koCNN. This is due to the SSReLU and SCPool functions, which can better maintain the feature extraction precision of the whole-spectrum domain framework. Although koCNN also uses the whole-spectrum domain framework, the sinc interpolation and Hermitian symmetry method used by koCNN increases the feature extraction error of defects. Furthermore, the mean Average Precision of DCNN is 75.4 mAP, which is 12.5 mAP lower than that of SISN-L1 and 12.8 mAP higher than that of koCNN. This indicates that the classification precision of koCNN is not better than that of DCNN without integrating the proposed SIGN. For Type-E defect, the classification values of the SISN-L2 model and koCNN-I model are 88.0 AP and 67.1 AP respectively. The mean Average Precision of SISN-L2 is 90.4 mAP and that of koCNN-I is 68.9 mAP, with a difference of nearly 21.5 mAP. This is due to the adoption of the SRoIAlign sub-model in SISN-L2. The SRoIAlign sub-model implements the floating-point quantization operation on feature volumes, which eliminates the decline in feature extraction accuracy caused by the traditional quantization method. Furthermore, The mean Average Precision of DCNN-I is 79.8 mAP, which is 10.9 mAP higher than that of koCNN-I. This shows that the SIGN sub-model has better compatibility with koCNN than the DCNN. In addition, the classification error of DCNN-I is 4.4 mAP lower than that of DCNN, and the classification error of koCNN-I is 6.3 mAP lower than that of koCNN. This shows that SIGN performs well on the whole spectrum-domain framework, and it also plays a positive role in improving the classification performance of the spatial domain framework. Finally, the mean Average Precision values of SISN-L3 and koCNN-I are 92.1 mAP and 68.9 mAP, with a difference of 23.2 mAP. This is due to the SRPN and SRoIAlign sub-models which are used in SISN-L3. The SRPN sub-model can detect defects with complex shapes and accurately generate multi-scale 3D proposals. Even if SRPN gives rise to classification errors in the forward propagation pass, SRoIAlign can also correct the errors to a certain extent. Note that when choosing different backbone frameworks, the classification performance of koCNN and DCNN will be greatly affected. When the accuracy of backbone frameworks is low, the classification error of koCNN and DCNN will also significantly increase. However, because the proposed SISN adopts the spectrum domain training mechanism, the training accuracy of the SISN model is less affected by the backbone frameworks, which indicates that SISN is robust.
The multiplication arithmetic complexity of koCNN, DCNN, LCNN, fbFFT and SISN
We select four typical models as the Ref Models to test the time performance of the SISN. The four Ref Models are koCNN, DCNN, LCNN and fbFFT. We design a complexity measurement unit to calculate the time performance of the SISN model. The complexity measurement unit is the average arithmetic complexity reduction unit (abbreviated as AACr). We use ResNet50, VGGnet19 and AlexNet7 as the baseline framework to calculate the AACr values of the SISN under the four Ref Models.
The four AACr values of SISN with respect to the three baseline frameworks. The four AACr values of SISN with respect to the three baseline frameworks are computed.
AACr is a rate coefficient designed for estimating the training time of the model under a particular baseline framework. As we choose koCNN as the Ref Model, the AACr value of SISN is calculated by dividing the training time of koCNN by the training time of SISN. The AACr value of SISN calculated by using koCNN as the Ref Model is denoted as AACr
where
Accurate extraction of the mechanical properties of casting defects is the key factor in evaluating the quality of casting products. However, due to the low segmentation accuracy for casting defects, it is not easy to calculate the mechanical properties of casting defects accurately. To address this, the SISN model is proposed in this work. To measure the influence of the SISN on the mechanical property extraction accuracy, we present five typical casting defects segmented by the SISN in this section and use the stress finite element method [50] to calculate the mechanical properties of these defects. Then, the calculation results are compared with the two representative models: DCNN-I and koCNN-I.
Casting defects segmented by the SISN model. We select five representative defect instances as test samples (denoted as Type-A, Type-B, Type-C, Type-D and Type-E), calculate their mechanical properties, and characterize the porosity parameters of each defect instance in the solidification process by the hierarchical visualization method.
The mechanical properties of Type-B, Type-C, Type-D and Type-E defect instances. Note that the mechanical properties of Type-A defect is analyzed in the next experiment.
First, we use the in-house EA88 dataset to train the SISN model and select the EA88 cylinder block casting as the segmentation object. The segmentation results are shown in Fig. 8. The EA88 model contains 101 defect instances, and the maximum diameter of each defect instance is in the range of 0.02 to 1.5 cm. We select five representative defect instances as test samples (denoted as Type-A, Type-B, Type-C, Type-D and Type-E), calculate their mechanical properties, and characterize the porosity parameters of each defect instance in the solidification process by the hierarchical visualization method, as shown in Fig. 8. According to the solidification sequence of Type-A instances, the porosity value of Type-A fluctuates from 250 seconds to 400 seconds and tends to be stable after 400 seconds. The fluctuation process in the solidification sequence is characterized by the colour hybrid transition at the intersection of the segmentation lines. The porosity of Type-B defects fluctuates during the solidification process from 440 seconds to 700 seconds and tends to be stable after 700 seconds. The porosity of Type-C defects fluctuates from 490 seconds to 610 seconds and tends to be stable after 610 seconds. The porosity of Type-D defects fluctuates during the solidification process from 130 seconds to 250 seconds and tends to be stable after more than 250 seconds. The solidification sequence of the five types of defects is consistent with their porosity distribution curve (see Fig. 9). The results show that the proposed model can accurately segment the defects and characterize the solidification sequence of the segmented defects, which lays the foundation for the next step of the calculation of the mechanical properties of defects.
Second, we calculate the mechanical properties of the four defect instances (Type-B, Type-C, Type-D and Type-E; Note that the Type-A defect is analyzed in the next experiment), and select 15 sampling points on each type of defect instances to calculate the mechanical properties of each point in the solidification process, as shown in Fig. 9. We use the characters Pt1, Pt2,
The porosity error, tensile strength error and hardness error of the three models at 15 sampling points
List of validation parameters and functions
Mechanical property extraction pipeline for a Type-A defect instance. Fifteen sampling points are selected from Type-A defect to calculate their mechanical properties during solidification process.
At last, to verify the superiority of the proposed model in accurate calculation of mechanical properties of defects, we select two representative models: DCNN-I and koCNN-I as the comparison model. We use the Type-A defect to test the extraction performance of DCNN-I, koCNN-I and SISN. We select 15 sampling points on the Type-A defect instance to calculate the mechanical properties of each point in the solidification process, as shown in Fig. 10. Then, we use the stress finite element method to calculate the porosity, tensile strength and hardness of the three models at 15 sampling points. Note: the sampling time of the solidification sequence is from 0 to 800 seconds. Finally, we calculate the average values of the porosity, tensile strength and hardness of the three models at 15 sampling points and calculate the difference between these average values and the truth values of the mechanical properties; that is, the difference values are the porosity error values, tensile strength error values and hardness error values of the three models at 15 sampling points, as shown in Table 4. We use Ep to represent the porosity error, Et to represent the tensile strength error and Eh to represent the hardness error. The Ep value of SISN model at Pt1 is equal to 1.1010. Compared with other sampling points, the Ep value of the SISN model at Pt1 is the smallest. The Ep values of DCNN and koCNN are 7.1030 and 6.2341 at Pt1, respectively. Therefore, the error values are six to seven times that of the SISN. This shows that the SISN model can extract the porosity of defects with complex shapes more accurately than the other two models. In addition, the maximum tensile strength error of the SISN model appears at Pt4, where Et is equal to 1.3055; the maximum tensile strength error of the DCNN model appears at Pt8, where Et is equal to 9.4114; and the maximum tensile strength error of the koCNN model appears at pt8, where Et is equal to 11.3345. The results show that in the worst case, the tensile strength error of the SISN model is still lower than that of the other two models, and the error value of SISN is only 1/11 that of the koCNN model. Furthermore, the average porosity error of the SISN model at all points is equal to 1.1666, the average tensile strength error is equal to 1.2203, and the average hardness error is equal to 1.3257. Compared with the DCNN model and koCNN model, the average error is at least 6 orders of magnitude lower. The results show that the mechanical properties of 15 sampling defects in the solidification process can be extracted more accurately by using SISN to segment casting defects. Note that we list the main parameters used in this paper in Table 5.
Taking SRPN, SRoIAlign and SIGN as three crucial sub-models, a spectrum domain instance segmentation model is proposed for accurately segmenting casting defects. The recognition accuracy and segmentation accuracy of the proposed model for casting defects are at least 3.4AP and 11.1AP higher than those of the state-of-the-art models. However, in our spectrum domain training architecture, the spectrum domain activation function is limited to the stochastic rectified linear unit operation, which needs to be implemented in the high-performance GPU-CUDA unit. This limitation restricts the wide-spread use of spectrum domain models in the field of casting CAE. Furthermore, our model is designed for segmenting five types of casting defects: inclusion, shrinkage, hot tearing, cold tearing and micro pores exclusively. In the future, we intend to probe the potential segmentation ability of the spectrum domain model for various types of casting defects. We also try to extend our idea to 3D data simplification [51] and 3D data exchange [52] and other powerful supervised machine learning/classification algorithms [53, 54, 55]. The above generality exploration and recent references will be interesting to the readers of ICAE journal.
Footnotes
Acknowledgments
The authors would like to thank the casting metallographic Research Institute of FAW Foundry Co., Ltd. This work was supported in part by the National Natural Science Foundation of China under Grant 62006027, in part by the National High-tech Research and Development Program under Grant 2014AA7031010B, and in part by the Science and Technology Research Project of Jilin Provincial Department of Education (13th five year plan) under Grant JJKH20200681KJ, in part by the enterprise cooperation project under Grant 1834003XXXX.
