Abstract
Abstract
1. Introduction
In recent years, RNA interference (RNAi) and differential interference contrast (DIC) imaging techniques are emerging as essential tools to assist biologist in analyzing embryonic development, morphology, and phenotypes in single-cell level. The large number of images produced in these studies makes manual analysis intractable. Hence, automatic cellular image processing and analysis becomes an important task. To automatically analyze the nuclear development in a four-dimensional (4D) way based on a series of three-dimensional (3D) DIC images obtained at different time points, image segmentation and registration are the two necessary steps. For 3D embryonic images, the registration is usually based on the orientation of anteroposterior (AP) axes. Principal component analysis (PCA) is the mainly applied method to compute AP axes. Before using PCA, it is needed to segment embryonic boundaries in images.
In this article, we present a novel scheme on cellular boundary segmentation. Furthermore, an ellipsoid-fitting scheme is used to build models for embryonic boundaries according to the segmented incomplete boundaries. Based on the fitted ellipsoid models, the AP axes are computed and the registration is carried out.
Based on the shape index (SI), the proposed segmentation method focuses on the detection of cytoplasm granules inside the cellular regions in each slice of a 3D embryonic image. Because the size of the cytoplasm granules is usually too small compared with the thickness of focal planes in the z-stack, we could not calculate the SI by using the method of constructing the intensity isosurface in 3D images. Consequently, we regard intensity as Z coordinate and compute the SI within each slice. An SI value is computed to represent the shape of intensity surface near to a target pixel. With several geometric postprocessing techniques, the SI thresholding results are integrated into a boundary in each slice. Experimental results show that the proposed algorithm has higher accuracy than existing schemes despite the existence of egg shells with different shapes and fluctuated intensities.
On the computation of AP axes, PCA works well when the contour of an embryo is obtained entirely. However, because of the existence of blur in marginal slices of 3D DIC images, using image segmentation methods usually could not obtain correct boundaries in these slices. Therefore, these slices are often discarded, which will make the final segmented boundaries incomplete and missing on the margins of z-stack. When the segmented boundaries are incomplete, the AP axis calculated from PCA may shift away from the correct orientation. Therefore, we used the ellipsoid-fitting method to compute AP axes in this article, which can calculate the correct value even when the segmented boundaries are incomplete. A preliminary result of segmentation was presented in an international conference (Yang et al., 2017).
The remainder of this article is organized as follows. In Section 2, the proposed segmentation algorithm based on the SI is described in detail. Experimental results on segmentation algorithms to be compared are presented in Section 3. Section 4 demonstrates the ellipsoid-fitting method on the calculation of AP axes. In Section 5, we experimentally compared the computed AP axes using the ellipsoid-fitting method and PCA. Finally, Section 6 concludes the article.
2. DIC Embryonic Image Segmentation based on the SI
In DIC images, it is difficult to obtain robust and accurate cellular boundaries by automatic image segmentation methods. First, microscopy images are often with many blurs, nonuniform intensity backgrounds. The contrast also varies between the foreground and the background. Second, the size, shape, and location of cells are usually different. Finally, the existence of egg shells interferes the detection of correct boundaries.
A large number of algorithms have been developed for automatic or semi-automatic cellular segmentation (Irshad et al., 2013; Xing and Yang, 2016). These methods are based on a few basic approaches: intensity thresholding, feature detection, morphological filtering, and deformable model fitting (Meijering, 2012). Learning-based schemes are also proposed in cell segmentation tasks. A notable work is the five categories of segmentation system in cellular regions (Ning et al., 2005). However, the learning-based methods require labeling samples manually for training, and it is a time-consuming and labor-intensive task. Consequently, we focused on the nonlearning-based schemes in this study.
Although some adaptive thresholding techniques are presented (Bradley and Roth, 2007), thresholding methods are prone to be influenced by nonuniform background. Methods based on morphological filtering, especially watershed techniques, are widely used in cell segmentation (Adiga and Chaudhuri, 2001; Lin et al., 2003). Nevertheless, it performs well in the segmentation of touching cells, and the existence of egg shells makes it tricky in our case. Deformable models and the related algorithms are also widely applied in biomedical image segmentation (McInerney and Terzopoulos, 1996). However, when they are used in raw images, their performances are restricted. Schemes based on the feature detection are different and flexible, but they lack the overall control of the boundaries in general.
Kyoda et al. (2013) proposed an effective segmentation method based on entropy filter and the classic active contour method. They also applied the segmented nuclei to trace the development of nuclei. Although their method performs well in nuclear segmentation, its accuracy still suffers the nonuniform of background intensity when applying to cellular boundary segmentation.
2.1. The segmentation method based on the SI
The SI is a descriptor to quantify the pure shape of a two-dimensional (2D) surface. It was proposed by Koenderink and Doorn (1992). Its computational formula is shown in Equation (1). Here, κ1 and κ2 are two principal curvatures of the surface.
Based on their original work, researchers developed many techniques. Cantzler and Fisher (2001) exhaustively compared the properties of the Gaussian curvature, mean curvature, and SI. Dorai and Jain (1997) established a linkage between the SI and Gaussian mapping and creatively proposed shape spectrum to accumulate morphological information. Huibin et al. (2011) studied the histogram of SI values and used it as a descriptor to identify facial expression. Yang et al. (2016) showed an equivalent formula on the SI and deduced the counterpart formula when there are three principal curvatures to be considered based on spherical coordinate system. Yoshida and Näppi (2001) applied SI to the detection of colonic polyps in 3D images based on the intensity isosurface. Their work is adopted in many biomedical diagnostic software packages. As shown in Figure 1a, there are many cytoplasm granules in cellular regions. For each cytoplasm granule in a 2D slice, the intensity surface (Fig. 1b) is close to a “cap,” whereas for a pixel surrounded by cytoplasm granules, the intensity surface is similar to a “cup.” Therefore, the SI can be used to detect these featured local regions.

A sample cell slice.
2.2. The proposed algorithm
To obtain the segmented embryonic boundaries, the following steps are applied on each 4D DIC image:
Step 1. For each 2D slice It, n (n: slice number and t: time), use a 11 × 11 Gaussian kernel to smooth the slice.
Step 2. For each pixel in It, n , compute the Gaussian curvature K and mean curvature H in its 7 × 7 neighborhood and then calculate the two principal curvatures κ1, κ2, the SI value S, and curvedness C.
Step 3. Set S = 0 for pixels with C < THC, then threshold It, n based on |S| ≥ THS.
Step 4. Use geometric postprocessing techniques to obtain the final segmentation results.
In the rest of this section, the four steps are demonstrated in detail. Figure 2 shows the scheme of the proposed algorithm.

The procedure of the presented algorithm.
2.2.1. Initializing
In Step 1, t is the order of time points and n is the order of z-stack. A pair of t and n defines a 2D slice. The data set and its format are introduced in Section 3.1. For reducing noise, we used the Gaussian kernel to smooth each slice before computing S.
2.2.2. The computation of SI
In the presented algorithm, Steps 2–3 are related to the computation of SI. When detecting colonic polyps, Yoshida and Näppi (2001) used an SI value S to represent the local shape of the intensity isosurface of a voxel based on the approach of Thirion and Gourdon (1995). From this point of view, for each voxel in a 3D image, we can determine only one isosurface and calculate two principal curvatures within its neighborhood. An SI value S thus can be calculated. This method is appropriate to detect voxels located on a surface, which is distinguishable from background on space and intensity. In their study, the size of colonic polyps to be detected ranges from 5 to 30 mm and the interval of rebuilt 3D images is 1.5–2.5 mm (Yoshida and Näppi, 2001). Thus, their scheme is an appropriate way to characterize the shape of colonic polyps.
However, when detecting cytoplasm inside cellular boundaries, using z coordinate is futile because the radii are about 0.4–0.6 μm, whereas the interval of the acquired 3D images is 0.5 μm on z coordinate. As a result, it is hard to observe isosurface on z coordinate for most of the cytoplasm granules.
For using the SI in each slice to describe the variation of local intensity, we directly regard the intensity as z coordinate and calculate the two principal curvatures from the intensity surface for a neighborhood region of a pixel (Peet and Sahota, 1985). For the computation of principal curvatures, the method described by Besl and Jain (1986) was used. Suppose f (x,y) is the intensity surface over image planes (x,y), based on the first and second partial derivatives fx, fy, fxx, fyy, fxy, we calculate the Gaussian curvature and mean curvature with Equation (2) at first and then obtain Equation (4) according to Equation (3) to compute the SI value S and the curvedness value C. In Step 3, THS and THC are thresholds of S and C, respectively, here are 0.68 and 0.05. To improve the computational accuracy, the computation is based on an interpolated neighborhood as discussed later in this section.
2.2.3. SI images
Figure 3 shows the SI computational results under this viewpoint. After computing the SI for each pixel of a slice, we obtained the corresponding SI image (Figure 3a, b). Then, a threshold will be set to screen pixels with small SI value (Fig. 3c, d). From the result, we observed that when choosing an appropriate threshold, SI can simultaneously detect the cytoplasm granules and remove most of points on egg shells.

SI image.
2.2.4. Geometric postprocessing techniques
After thresholding, we need to build masks from the detected points, as Step 4 shows. We used two steps to reach the goal.
First, along the direction from each boundary pixel Pb of a 2D slice to the central of the whole detected points by SI, we used a disk model to count the number of detected points (Fig. 4a) within the disk. If the number is less than a threshold, we removed these detected points. Once the detected points Pd within the disk is larger than a threshold, we constructed the convex hull of Pd (Fig. 4b).

Boundary construction based on SI detection result.
Second, we filled the image and then detected and removed occasional protrusions. Although choosing proper threshold of SI can mainly remove the interference caused by an egg shell, there may exist few detected points at its position and occasionally not be removed by the disk-searching step, which will bring protrusions, as shown in Figure 4c. To detect them, for each cellular boundary point P1 generated by the filled image, let P2 is another boundary point along a given direction, for example clockwise, when move P2 and make the boundary length L between P1 and P2 ranging from Lmin to Lmax, a D/L ratio is calculated, here D is the Euclidean distance between P1 and P2. Once the ratio is less than a threshold, the piece of boundary is judged as a protrusion and a circular arc with radius 100 with the same x, y shifts is used to replace the boundary from P1 to P2.
3. Experiments on Embryonic Image Segmentation
3.1. Data set
The performance of our proposed scheme is evaluated on Worm Developmental Dynamics Database (WDDD) (Kyoda et al., 2013). Using 4D DIC microscopy, WDDD has 50 sets of quantitative data from wild type (WT) embryos. After silencing embryonic genes on chromosome III individually, 72 genes with 136 sets of quantitative data from RNAi experiments were also built (http://so.qbic.riken.jp/wddd). For each set, there are 180 time points (40 s/time point) and a 3D image is stored at each time point. In each 3D image, the z-stack includes 66 focal planes (0.5 μm/plane), and there are 600 × 600 pixels (0.105 μm/pixel) in each focal plane.
3.2. Evaluation of segmentation performance
To evaluate the performance of the proposed scheme, we chose 123 typical slices from 3 embryos (15th–55th in 66 slices) and manually segmented them as the ground truth. Then, we used entropy filter (Kyoda et al., 2013) and the distance regularized level set algorithm (EL algorithm; Li et al., 2010) as the comparison object. We also tested the scheme using the adaptive thresholding method according to Bradley and Roth (2007) and the morphological operators to segment cellular boundaries (TM algorithm). Besides, Watershed algorithm is also considered here. Table 1 shows the experimental results.
The Comparison of the Proposed Method with Other Algorithms
EL, entropy filter and the distance regularized level set algorithm; TM, adaptive thresholding method and morphological operators.
3.3. Experimental results on different thresholds of SI
To evaluate the performance of the proposed method under different SI thresholds, we computed the segmentation results and the precision, recall, and dice ratio when SI changing from 0.55 to 0.90 with 0.05 as the step. Figure 5 shows the result. As shown in Figure 5, choosing 0.65–0.7 can obtain a balanced result in this study.

The comparison result when setting different SI thresholds.
3.4. Computational accuracy of the SI
In the computation of SI, the discretization of image may make the calculated value deviate from the correct value of the corresponding desire shape. To evaluate the accuracy of SI in practical computation, we build 5 ideal models of intensity shapes and the corresponding images. Then, we computed the SI from these images under different parameters and observed the difference between the calculated value and the theoretical one. In practical computation, the computational accuracy of SI is mainly related to the “resolution” of intensity shape, marked as RIS. When computing partial derivatives, the computational domain can be chosen as the same size of RIS.
Suppose there is a perfect intensity shape and it is a semi-sphere, when the intensity shape is discretized as RIS = 7 × 7, 9 × 9, … (having different “resolution”), the calculated values are slightly different. To observe the accuracy of computed SI under different resolutions, we build ideal models for SI = −1, −0.5, 0, 0.5, 1 (see Equation 5) and then discretized these models making RIS = 5 × 5, 7 × 7, …, as shown in Figure 6. After that, we computed the SI value at the center of these images.

The built models with different RIS. RIS, “resolution” of intensity shape.
By evaluating typical slices, we found the size of cytoplasm granules mainly ranging from 7 × 7 to 17 × 17. In comparison, we found that using larger RIS will increase the computational accuracy of SI. However, the real size of cytoplasm granules confines the maximal size we could use. Therefore, we used interpolation to enhance the resolution of both built models and real neighborhoods. After that, we used large RIS to compute the SI. Figure 7a shows the computational result when directly using the built model to compute the SI. Figure 7b shows the computational result when using cubic interpolation to obtain the interpolated models.

Computed SI before and after interpolation.
3.5. Nuclear segmentation results
Using thresholding of the SI image with different thresholds and similar geometric methods, we segmented nuclear regions in DIC images. Figure 8 shows the segmentation result of a 3D image both on boundary and on nuclei.

Segmentation results on both the boundary and the nuclei of a 3D image. 3D, three-dimensional.
4. Estimation of AP Axes and 3D Embryonic Registration based on Ellipsoid-Fitting
In DIC imaging, usually the position of embryos is different. However, for each embryo, its egg shell is always oval and the main axis is AP axis. Furthermore, the AP axis will later develop to the body of Caenorhabditis. elegans, and one side is the head and the other side is tail. Therefore, usually, we used this characteristic to implement the registration.
4.1. The AP axes of embryos
PCA is widely used in computing AP axes. The eigenvector corresponding to the largest eigenvalue can be viewed as the main axis (AP axis). This method works well when the contour of an embryo is extracted entirely. However, when the segmented contours are incomplete, the AP axis calculated from PCA may shift away from the correct orientation (see Section 5.1). As we have mentioned, the DIC images are blur on the two marginal slices of z-stack, it is very difficult to automatically segment the correct boundary of embryos in these slices. Therefore, to facilitate the whole computation, we chose slices 15–53 to run the segmentation algorithm, whereas other slices are left blank.
To avoid the computational deviation of AP axes when using incompletely segmented boundaries, a reasonable scheme is to build a model for the whole boundary of an embryo according to the incomplete segmentation. Then, the model was used to calculate its AP axis. Based on empirical observation, we applied ellipsoids as the fitted model of embryonic boundaries.
There are two categories of ellipsoid-fitting methods. One is geometric fitting, which searches for the ellipsoid minimizing the sum of the square of the Euclidean distances between the given data set and the ellipsoid to be fitted. Another is algebraic fitting, which looks for the parameters of the ellipsoid f to minimize the sum of the Frobenius norm of f (
The ellipsoid-fitting methods are the generalization of ellipse-fitting methods. Since the works of Bookstein (1979), Fitzgibbon and Fisher (1995), and Fitzgibbon et al. (1999), the computation of ellipse-fitting has been translated to the solving of a generalized eigenvalue system and there is a fixed procedure. However, the solving of ellipsoid-fitting in general cases is still a suspended problem. Qingde and Griffiths (2004) proposed an ellipsoid-fitting method, which is effective when the ratio of the major axis and other axes is not too large. The computation of this method is also fairly stable and fast. In this study, we used this method to compute the fitted ellipsoid and the corresponding AP axes.
4.2. The ellipsoid-fitting method
The procedure of fitting an ellipsoid is introduced in Qingde and Griffiths (2004). The steps for computing AP axes in our case are as follows:
Segment original 3D images to obtain embryonic boundaries. Build matrices Compute the correlation matrix Compute the generalized inverse solution Perform eigen-decomposition of Build matrix Perform eigen-decomposition of
In the above procedure, suppose the coordinates of voxels on the boundaries are [

The ellipsoid-fitting method based on incomplete boundaries.
4.3. Interpolation
The fitting of ellipsoids can use segmented boundaries directly, and we do not need interpolation in the computation of AP axes. However, when constructing the registered images, we should conduct it by isotropic way. As we have mentioned in Section 3.1, the spans on x, y, and z stacks are 0.105 μm/pixel, 0.105 μm/pixel, and 0.5 μm/plane. Therefore, the ideal interpolation is constructing 100 slices for every 21 planes. However, this operation is hard to handle for common number of slices occupied by a nucleus. In addition, it destroys a large part of original slices in interpolation. Consequently, in this study, we applied an approximate ratio—five slices for each focal plane. The interpolation among slices in this study applies Herman's method (Herman et al., 1992). Between every two adjacent original slices, four slices are inserted into them.
4.4. Registration
To build registered 3D images, a 600 × 600 × 600 image is built for each 3D image. According to the coordinates of voxels within each interpolated nucleus, their transformed coordinates are calculated and then mounted into the built image. In our study, the embryonic registration consists of one translation and two rotations.
4.4.1. Translation
After ellipsoid fitting, the center of the fitted ellipsoid can be calculated by the mean of coordinates of points located on it. To meet the requirement of accuracy, a large number of these points are calculated when setting a small span on x, y, and z and then these points are searched according to the distance between the candidate points and the fitted ellipsoid. Once a center is calculated, it is set 300 × 300 × 300 in the built image.
4.4.2. Rotation
The rotation includes two steps. The first rotation makes the main axis of the fitted ellipsoid in line with the x-axis in the 3D image space. Then, the second rotation ensures the middle point of two centrals of P1 and AB located on the z-axis. The reason why we need two steps of rotation can be explained by the registration of human bodies. The first rotation registers the direction of torsos, whereas the second rotation determines the face orientation.
In this study, we used the main axis of a fitted ellipsoid to approximate the AP axis of an embryo. However, the AP axis is oriented while the main axis of an ellipsoid is not. To give the orientation of the main axis, we chose a time point with P1 and AB nuclei for each embryo and then calculated the projections for both P1's center and AB's center. We stipulated the direction from the projection of AB's center to P1's center as the direction of the main axis, which is also the positive direction of the y-axis after the first rotation. After calculating the two rotation matrices, they are applied to all time points of the same embryo.
Once the registration of the main axes is finished, we calculated the middle point of P1's center and AB's center at the first time point they formed from P0. Then, the rotation matrix is determined by rotating the middle point to the z-axis. Once fixed, the rotation matrix is also applied to all time points of the same embryo.
5. Experimental results on AP Axes Estimation and Registration
To check the accuracy of computed AP axes, we compared the PCA method with the ellipsoid-fitting method using the segmented boundaries from slices 15–53. In Table 2, the two methods are marked as “SI seg. PCA: 15–53” and “Ellipsoid-fitting: 15–53,” respectively. We also manually segmented the whole boundaries (slices 0–65) and used PCA to compute the AP axes, which were used as the ground truth of the embryonic AP axes, marked as “Manual PCA: 0–65” in Table 2. From the result, we observed that the computed AP axes from the ellipsoid-fitting method using incomplete boundaries are quite close to the result computed from manually segment images using complete boundaries.
The Computed Anteroposterior Axes Using Two Ways
PCA, principal component analysis; SI, shape index.
5.1. The influence of the degree of missing slices
To evaluate the influence of the missing slices on the computation of AP axes, we built an ellipsoid whose main axes locate right on x-, y-, and z-axes and the lengths of semi-major axes are 130, 130, and 260 respectively. The ellipsoid was put into a 3D 600 × 600 × 600 image centralized at [300,300,300]. Then, we made the ellipsoid rotate α on the x-axis and then truncated on the z-axis. The PCA and ellipsoid-fitting methods were applied to the truncated ellipsoid to compute AP axes.
When set α as π/6, π/4, and π/3 and chose the truncated z coordinates on one side as 20, 40, …, 260 (thus, the total truncated slices are 40, 80, …, 520), we obtained the angle deviation computed from the PCA and ellipsoid-fitting methods. In Figure 10, the x-axis is the number of truncated slices in z coordinates and the y-axis is the computed deviation radians. From the result, we observed that the deviation of PCA result is quite large, whereas the AP axes computed from the ellipsoid-fitting method hold high accuracy till the missing slices are severe.

The calculation accuracy of the ellipsoid-fitting method.
5.2. Results on registration
Based on the ellipsoid-fitting method, we registered all the 186 embryos. Figure 11 shows the registration result of 50 WT embryos. For each embryo, we chose the middle time point, during it has two daughters P1 and AB. For the 3D DIC images corresponding to the chosen time point, we selected 11, 16, 21, …, 56 slices from the total 66 ones to show the original embryo. The registered images followed the registration scheme described in Section 4.4 and based on the interpolated DIC images. From the registered DIC image (with size 600 × 600 × 600), we chose 192, 216, 240, …, 408 to show the registration result. In addition, the variation range of intensity in Figure 11 is set to [0,200] to obtain a better observation. In Figure 12, we also show the registered boundaries and the registered fitted ellipsoid.

The original images and registered images for WT embryos. The row 1, 3, 5 is original DIC images from three WT embryos, respectively:

The registered segmented boundaries and the registered fitted ellipsoids for the three WT embryos (the same as Fig. 11). In the figure, blue lines are segmented boundaries; red points are fitted ellipsoids; pink bold lines are along the main axes of the ellipsoids. WT, wild type.
6. Conclusion
The emergence of fully automated phenotyping system will allow very large-scale exploratory experiments in functional genomics. Cellular boundary segmentation is a basic and indispensable step in studying morphological embryonic development. This study presented a cellular boundary segmentation algorithm based on the SI and geometric postprocessing techniques. Instead of detecting cellular boundaries themselves, our method focuses on the detection of cytoplasm granules. Experimental results show high accuracy of the method. As we have shown, the proposed method can also be applied to nuclear segmentation with slight modification. As a second-order descriptor of intensity, SI is hardly influenced by the zero-order and first-order variation of intensity, which makes it a desired tool working on images with nonuniform illumination.
As we have demonstrated, using PCA to compute AP axes based on DIC images is ventured. The blur in marginal slices may cause incomplete boundaries, and this will make the calculated AP axes deviate from the correct direction. Using the ellipsoid-fitting method, however, can achieve higher accuracy even when the missing is severe. Based on our experimental results, we recommend using the ellipsoid-fitting method instead of PCA to compute AP axes when the boundaries are similar to ellipsoids.
Footnotes
Acknowledgments
This work was supported in part by the Grant-in Aid for Scientific Research from the Japanese Ministry for Education, Science, Culture and Sports (MEXT) under the Grant No. 18H04747. The work was also supported in part by the Grant-in Aid for Scientific Research from the Huaqiao University under the Grant No. 09HZR13. The authors thank Prof. Nishigawa of Ritsumeikan University, Dr. Onami, Dr. Tohsato, and Dr. Kyoda of RIKEN for their advice in this research.
Author Disclosure Statement
The authors declare that no competing financial interests exist.
