Edge-oriented dual-dictionary guided enrichment (EDGE) for MRI-CT image reconstruction

Abstract

In this paper, we formulate the joint/simultaneous X-ray CT and MRI image reconstruction. In particular, a novel algorithm is proposed for MRI image reconstruction from highly under-sampled MRI data and CT images. It consists of two steps. First, a training dataset is generated from a series of well-registered MRI and CT images on the same patients. Then, an initial MRI image of a patient can be reconstructed via edge-oriented dual-dictionary guided enrichment (EDGE) based on the training dataset and a CT image of the patient. Second, an MRI image is reconstructed using the dictionary learning (DL) algorithm from highly under-sampled k-space data and the initial MRI image. Our algorithm can establish a one-to-one correspondence between the two imaging modalities, and obtain a good initial MRI estimation. Both noise-free and noisy simulation studies were performed to evaluate and validate the proposed algorithm. The results with different under-sampling factors show that the proposed algorithm performed significantly better than those reconstructed using the DL algorithm from MRI data alone.

Keywords

MRI-CT Image reconstruction dictionary learning (DL)dual-dictionary learning (DDL)multimodality

1 Introduction

For diagnosis and therapy, it is highly desirable to integrate different modalities for comprehensive analysis, which is called multimodality imaging [1]. As the first generation of medical multimodality technology, PET-CT has had a major impact especially on cancer treatment [2]. With an integrated gantry, CT and PET images are sequentially acquired within a relatively short time period. These two types of images are thus in a good registration. The PET-CT system effectively solves the attenuation correction and image registration problems, being widely used in oncological settings [3]. After PET-CT, other multimodality imaging instruments such as SPECT-CT and PET-MRI were developed and accepted in various healthcare applications. For example, PET-MRI, a latest multimodality system, performs well in neurosurgery planning and cardiac examination, such as localization of epileptic foci and stroke [4 –7].

Nowadays, MRI has become an indispensable medical modality. However, a typical MRI procedure usually lasts ten minutes or even longer. It is uncomfortable for the patient to keep motionless surrounded by the huge and noisy MRI gantry in such a long time. Moreover, some organs movements such as heartbeat and spasm would cause motion artifacts. It has been demonstrated that the average displacement is over 0.35mm within 100 seconds when a healthy young adult is lying on the table, and this displacement would increase to 2.5mm for a patient [8, 9]. Therefore, it is significant to shorten MRI scanning time for better image quality. On the other hand, it is well known that the CT scanner only takes a few seconds to scan most parts of the human body, which is much faster than MRI. In addition, the spatial resolution of CT image is better than MRI as well. Therefore, although both CT and MRI are good at structural imaging, simultaneous MRI and CT imaging are an under-investigated yet interesting multimodality mode which can provide complementary information for healthcare benefits. Usually, MRI offers better soft tissue information while CT depicts sharper interfaces between air and tissue or tissue and bone. Hence, MRI-CT hybrid imaging in one machine was expected to provide superior image quality in both soft and hard tissues [10, 11]. Moreover, it is possible to greatly shorten the scanning time of MRI by reconstructing an MRI image from highly under-sampled data and CT data from the same subject [12, 13]. Actually, before the conception of the MRI-CT scanner Fahrig et al. worked to put the X-ray radiography devices into an MRI scanner [14, 15]. They found that the magnetic field influenced on the electron beam in the X-ray tube and proposed a method for handling the deflection of the electron beam. It was suggested that paramagnetic and ferromagnetism materials must be replaced by other non-magnetic material in the integrated device.

This paper focuses on the joint CT-MRI image reconstruction. A new algorithm is proposed to reconstruct an MRI image from highly under-sampled data with a good initial MRI image generated using a dual-dictionary learning (DDL) method from a well-registered MRI and CT image training set. In the next section, the dictionary learning (DL) method is introduced for medical image reconstruction. Then, an edge-oriented dual-dictionary guided enrichment (EDGE) scheme is proposed to reconstruct an MRI image from highly under-sampled MRI k-space data. The numerical results are presented in the third section. In the last section, the discussions are presented to inspire further efforts.

2 Method

2.1 Background

CT and MRI have different imaging principles. The CT image reconstruction is to solve the inverse problem of the integrals of the X-ray attenuation coefficient distribution in the human body. X-ray projections or integrals of linear attenuation coefficients is usually described as $\int μ (l) dl = ln (\frac{I_{0}}{I})$ (1) where I ₀ is the incident intensity of an X-ray beam, I is the transmitted intensity, and μ (l) is the X-ray attenuation coefficient along an X-ray path l. During a CT scan, a number of X-ray projections are acquired. Then, the relationship of Equation (1) can be discretized in the matrix form: $Ax = p$ (2) where the vector $x \in ℝ^{N \times 1}$ stands for a CT image, N is the total number of pixels, $A \in ℝ^{M \times N}$ is a system matrix which is determined by the geometry of the CT system, and the vector $p \in ℝ^{M \times 1}$ stands for projection data with M being the product of the number of angle views and the number of detector elements. The CT reconstruction problem is to solve the inverse problem of the linear systemEquation (2).

Different from X-ray CT, each pixel value of an MRI image represents a property relevant to the resonance nuclear density (hydrogen protons) and two relaxation times, with the details depending on a specific pulse sequence. The scan signals, known as k-space data, are recorded by phase and frequency encoding. An image can be reconstructed by the inverse Fourier transform, $x = F^{- 1} y$ (3) where x is an MRI image, and y is its k-space data.

Although Equations (2-3) represent different models, the two reconstruction problems can be both transfered into similar mixed-norm regularization problems according to compressed sensing (CS) theory [16]. Moreover, though the physical principles and quantities are different for these imaging modalities, the human structural information in these images are correlated [17, 18]. Inspired by the correlative nature of CT and MRI datasets, we are motivated to study whether an MRI image can be reconstructed from less k-space data when the prior CT information of the same patient is available.

2.2 Dictionary Learning (DL)

Recently, CS theory has served as a powerful tool in producing high quality image restoration and reconstruction from under-sampled data which may be much less than the requirements of the traditional Shannon/Nyquist sampling theory [19 –24]. According to the CS theory, a highly under-sampled image reconstruction problem is to solve an underdetermined system of linear equations F _u x = y by minimizing the l ₀ norm of a sparsified transformΨx. The corresponding optimization problem may be described as $min_{x} {∥ Ψ x ∥}_{0} s . t . F_{u} x = y$ (4)

This problem is also known as a sparse coding problem which is an NP-hard problem. Usually, the l ₀ norm can be replaced by the l ₁ norm because the l ₁ norm minimization is a proxy for the l ₀ sparsity constraint when the restricted isometry property (RIP) holds [19]. Then, the problem can be efficiently solved by linear programming in the real domain or second order cone programming in the complex domain.

DL is a promising implementation of CS theory by training a dictionary for efficient sparse representation [25, 26]. It aims to find a proper representation of data in adaptive low-dimensional subspaces. DL related methods are widely used, already giving significant improvements in the field of medical image reconstruction [27 –29].

Given an image of size $\sqrt{N} \times \sqrt{N}$ , it can be decomposed into small patches of size $\sqrt{b} \times \sqrt{b}$ , b << N. Each patch can be expressed as a vector $x_{p} \in ℝ^{b \times 1}$ . All the patches are extracted from the image according to a designed partition. A dictionary $D \in ℝ^{b \times K}$ is a matrix consisting of k atoms (columns). An initial dictionary generated from sample images is usually redundant or over-complete. With this dictionary D, each vector x _p of an image can be sparsely represented as ${∥ x_{p} - D α ∥}_{2}^{2} < ε$ (5) where the error bound ε > 0. $α \in ℝ^{K \times 1}$ is a sparse representation vector which has few none-zero elements, i.e., ∥α ∥ ₀ << N. Finding a sparse representation of the vector x _p is to solve the following l ₀ norm minimization problem $min_{α} {∥ α ∥}_{0} s . t . {∥ x_{p} - D α ∥}_{2}^{2} < ε$ (6)

If an image contains S patches, DL is to find a dictionary that all the patches can be sparsely represented by $min_{D, α} \sum_{s = 1}^{S} ({∥ x_{p_s} - D α_{s} ∥}_{2}^{2}) s . t . {∥ α_{s} ∥}_{0} < T_{0}$ (7) where T ₀ is a required sparsity level.

Many algorithms were proposed to solve the dictionary learning problem Equation (7) [25 , 30]. As one of the popular algorithms, the k-means singular value decomposition (K-SVD) algorithm iteratively updates the dictionary atoms to better fit data. It does SVD on the errors, and refines the current dictionary atoms and coefficients simultaneously. As a widely-used method, K-SVD enjoys an excellent performance in convergence and sparsity [25]. The DL method was reported to perform well for MRI reconstruction from under-sampled k-space data.

However, image features may be in large disparities across multimodality images or even within one modality, for example, dual-energy CT. These different images cannot be directly integrated into one dictionary. Thus, two or more dictionaries are required to describe image features in different domains. A number of dual-dictionary learning (DDL) methods were proposed to solve this problem, such as MRI-CT reconstruction, multi-energy CT reconstruction, image reconstruction from sparse data [12 , 31–33]. DDL enables us to establish a connection between different types of images using the prior information of two modalities.

In multimodality image reconstruction, the existing DDL algorithms usually use the Euclidean distances among different patches to choose the dictionary atoms, which work well for the multimodality images in consistent contrasts, or from a high contrast image to a low contrast destination; for example, estimating or reconstructing a CT image of the human brain from an MRI image using DDL [12]. However, these algorithms would perform poorly in reconstructing a high contrast image from a low contrast counterpart; for example, reconstructing an MRI image of the human brain from a CT image. Therefore, we are motivated to investigate an edge-oriented dual-dictionary guided enrichment (EDGE) alternative to solve the MRI reconstruction problem from the training dataset consisting of well-registered MRI and CT images subject to highly under-sampled k-space data and a complete CT dataset. The key is to use the relative positions guided by the edges and contours in the images to establish the connections among the patches instead of the Euclidean distance that can be misled by inconsistent contrast and grey-scalebias.

2.3 EDGE for MRI image reconstruction

The EDGE algorithm has two steps. First, an initial MRI image is generated by an edge-oriented DDL algorithm from a training dataset consisting of well-registered MRI and CT images of selected previous patients, and the CT image of a current patient. The EDGE mechanism enables the two dictionaries well mapped and self-adaptive. Then, the estimated MRI image will be used as an initial value to facilitate the final MRI image reconstruction from highly under-sampled k-space data.

Step 1. Initial MRI image estimation using the EDGE strategy.

First, to establish two dictionaries D ^MR and D ^CT, we need to collect a series of MRI and CT images of patients. As a training dataset, these images may be gathered from different CT and MRI scanners. The dual-modality images of each patient require to be well-registered by structure similarity. For ahigh-resolution CT image $u^{CT} \in ℝ^{\sqrt{N} \times \sqrt{N}}$ , a patch vector $u_{i}^{CT} \in ℝ^{b \times 1}$ (b is the patch size) can be represented by a specific dictionary $D^{CT} = [d_{1}^{CT}, d_{2}^{CT}, . . ., d_{K}^{CT}] \in ℝ^{b \times K}$ according to DL theory as [27] ${∥ u_{i}^{CT} - D^{CT} α_{i}^{CT} ∥}_{2}^{2} < ε$ (8)

Using the orthogonal matching pursuit (OMP) algorithm, the patches in Equation (8) can be represented as [34], $u_{i}^{CT} = D^{CT} α_{i}^{CT} + η_{i}$ (9)

When MRI and CT data of an object are obtained simultaneously on a hybrid CT-MRI scanner or sequentially in a consistent status, u ^MR has a corresponding CT image u ^CT. We assume that the patches $u_{i}^{CT}$ and $u_{i}^{MR}$ of these two modal images in registration satisfy $u_{i}^{MR} = {Qu}_{i}^{CT} + ε_{i}^{CT}$ (10) where ε ^CT is the error and Q is the linear transform operator. Combining Equations (9) and (10), we have $u_{i}^{MR} = {QD}^{CT} α_{i}^{CT} + (Q η_{i} + ε_{i}^{CT})$ (11)

Equation (11) may be simplified to ${∥ u_{i}^{MR} - {QD}^{CT} α_{i}^{CT} ∥}_{2}^{2} < δ$ (12)

Therefore, through sparse representation and dictionary learning, a high-quality patch $u_{i}^{MR}$ can be sparsely coded by the same vector $α_{i}^{CT}$ in the relationship of D ^MR = Q · D ^CT. It other words, the patch of MRI image $u_{i}^{MR}$ can be approximately recovered by multiplying D ^MR by the sparse representation $α_{i}^{CT}$ obtained from the corresponding CT image and the dictionary D ^CT if the atom mapping from the dictionary D ^CT to D ^MR is provided. That is, $u_{i}^{MR} = D^{MR} α_{i}^{CT}$ (13)

Next, the key is to establish an atom-to-atom mapping between these two dictionaries of different imaging modalities. Here we propose an edged-orientated method to establish such a mapping. It was proved that the public edge information between the well-registered MRI and CT images was useful to improve the MRI image quality in MRI-CT hybrid imaging [35]. Different from that method, here the edge information in the low contrast CT images is used to choose the atoms of the CT dictionary to sparsely code a given CT image, instead of the traditional Euclidean distance measure.

For a given CT image u ^CT, some sample CT images are first chosen from the initial dataset whose structure information is close to the target image. Then, a specific dictionary D ^CT may be generated from the patches of these sample images. To represent each patch $u_{i}^{CT}$ of the given CT image, we perform an edge-based searching to select the closest atoms in D ^CT.

As shown in Fig. 1, (a) is the MRI image of a patient’s brain to be reconstructed, (b) is the corresponding CT image at the same slice, (d) and (e) are the MRI and CT sample images selected from the training dataset. Without loss of generality, let us consider a patch $u_{i}^{CT}$ of the CT image (b) whose center locates at N ₀ (x ₀, y ₀). To find the patches in Fig. 1(e) similar to $u_{i}^{CT}$ , we use their location from the boundaries instead of the Euclidean distance of the grayscale values because the contrast of this CT image of the human brain is rather low. Thus, the high contrast skull edges in Fig. 1 (b) and (e) are first segmented and extracted. Two contours can be respectively outlined by single-pixel-wide edges as shown in Fig. 1(c) and (f). The four purple points in Fig. 1(c) are the intersections of the contour and the horizontal and vertical lines through the point N ₀ (x ₀, y ₀). The distances between N ₀ and these four intersections can be calculated by ${\begin{matrix} d_{T} = y_{T} - y_{0} \\ d_{B} = y_{0} - y_{B} \\ d_{L} = x_{0} - x_{L} \\ d_{R} = x_{R} - x_{0} \end{matrix}$ (14) where x _R, x _L, x _T and y _B are the x- and y-coordinates of these four intersections, respectively.

We can compute two distance ratios of the above distances along the x- and y-axes respectively, ${\begin{matrix} k_{x_N_{0}} = \frac{d_{L}}{d_{L} + d_{R}} = \frac{x_{0} - x_{L}}{x_{R} - x_{L}} \\ k_{y_N_{0}} = \frac{d_{T}}{d_{T} + d_{B}} = \frac{y_{T} - y_{0}}{y_{T} - y_{B}} \end{matrix}$ (15)

Assume that in the sample image Fig. 1(e) N ₁ (x ₁, y ₁) is the closest patch or atom to N ₀ (x ₀, y ₀). As shown in Fig. 1(f), which is the contour image of (e), the distances between N ₁ and the four intersections on the edge can be calculated using the equation similar to Equation (14). Then, two distance ratios of these distances with respect to N ₁ may be given by ${\begin{matrix} k_{x_N_{1}} = \frac{d_{L}^{'}}{d_{L}^{'} + d_{R}^{'}} = \frac{x_{1} - x_{L}^{'}}{x_{R}^{'} - x_{L}^{'}} \\ k_{y_N_{1}} = \frac{d_{T}^{'}}{d_{T}^{'} + d_{B}^{'}} = \frac{y_{T}^{'} - y_{1}}{y_{T}^{'} - y_{B}^{'}} \end{matrix}$ (16)

Let k _{x_N
₁} = k _{x_N
₀} and k _{y_N
₁} = k _{y_N
₀}, we have ${\begin{matrix} x_{1} = k_{x_{N_{0}}} \cdot (x_{R}^{'} - x_{L}^{'}) + x_{L}^{'} \\ y_{1} = y_{T}^{'} - k_{y_{N_{0}}} \cdot (y_{T}^{'} - y_{B}^{'}) \end{matrix}$ (17)

For any given patch $u_{i}^{CT}$ of the CT image Fig. 1(b), a corresponding patch can be determined in the sample image Fig. 1(e) using Equation (17). That is, Equation (17) establishes an edge-oriented atom mapping from the patient’s CT image to the sample CT image, which selects the atoms having the same distance ratios defined by Equation (15).

However, because the values of $x_{R}^{'}, x_{L}^{'}, x_{T}^{'}, and x_{B}^{'}$ related to the location of N ₁, (x ₁, y ₁) cannot be directly determined by one forward calculation of Equation (17), generally we need to use an iterative procedure in Fig. 2 to solve this problem, which is a crucial DDL step in our EDGE algorithm.

In this paper, the central pixel of this atom was determined by being rounded to the nearest integer from the output (x ₁, y ₁). Furthermore, as shown in Fig. 1, once the patch N ₁ (x ₁, y ₁) is selected from the sample CT image Fig. 1(e), its corresponding patch M ₁ (x ₁, y ₁) is also determined at the same position in the sample MRI image Fig. 1(d) because these two sample images are presumably well-registered.

To each patch of the known corresponding CT image Fig. 1(b), $u_{i}^{CT}$ , its dictionary $D_{i}^{CT}$ consists of the atoms selected by the procedure in Fig. 2 from the sample CT images. To improve the accuracy, the atom in each dictionary $D_{i}^{CT}$ triples its length by adding the first-order gradient vector of each patch along x- and y- directions respectively [32]. Then, the sparse representation $α_{i}^{CT}$ of each $u_{i}^{CT}$ with respect to $D_{i}^{CT}$ can be calculated using the OMP algorithm. Given on the coherence between the CT and MRI images, a correlative dictionary $D_{i}^{MR}$ can be generated from the corresponding well-registered MRI image. Thus, each patch of the reconstructed MRI image $u_{i}^{MR}$ can be approximately recovered by Equation (13) from the corresponding spare representation $α_{i}^{CT}$ . Once all the patches are obtained, the final MRI image can be reconstructed by combining these patches. Figure 3 is a general workflow of the EDGE algorithm to reconstruct an initial MRI image u ^MR.

Note that the above EDGE algorithm requires that the single-pixel contour should be a convex and closed curve as shown in Fig. 1. Otherwise, there may have more than four intersections of the contour and those horizontal and vertical lines through the point N ₀ (x ₀, y ₀). Figure 4 shows the general process to get this convex contour. First, the high contrast bone edges, Fig. 4(b), can be extracted using the common threshold method from the original CT image. Then, the largest connected domain is selected with a certain area threshold as shown in Fig. 4(c). Figure 4(d) is the minimum convex set containing the connected domain of Fig. 4(c). Finally, we can extract the convex set and its single-pixel contour shown in Fig. 4(d) and (e) respectively.

Step 2. High-resolution MRI image reconstruction using DL from highly under-sampled k-space data and an initial image.

Once an MRI image is obtained by Step 1 from its counterpart CT image and two dictionaries D ^CTand D ^MR, it will be used as an initial input in the following high-resolution MRI image reconstruction from highly under-sampled k-space data. Based on the CSMRI model described in [27], the image reconstruction problem can be approximately described as $min_{x} \sum_{ij} {∥ R_{ij} x - D^{MR} α_{ij} ∥}_{2}^{2} + ν {∥ Fx - y^{MR} ∥}_{2}^{2}$ (18) where the matrix R _ij represents the operator extracting the patch x _ij from x, the weight ν is the Lagrange multiplier which depends on the standard deviation of the noise in the measured k-space data y ^MR of MRI, F is the Fourier transform matrix. Thus, Fx represents the k-space data calculated from x.

Equation (18) is a common l ₂ norm minimization problem which can be solved directly using a classical iterative optimization technique, such as Gauss-Seidel and conjugate gradient (CG). The detailed solution procedure is summarized in Fig. 5.

In Equation (19), y (k _x, k _y) denotes the updated value at the location (k _x, k _y) in the k-space, S ₀ represents the original k-space data which are highly under-sampled, Ω is a subset of k-space that has been sampled, and S = FFT (x ⁽ⁿ⁾) represents the k-space data generated from the current iterative image x ⁽ⁿ⁾.

3 Simulation results

In this section, the EDGE algorithm was evaluated by using the well-registered clinical MRI and CT images on human brain provided by the Visible Human Project of the National Library of Medicine (http://www.nlm.nih.gov). The MRI images have 256×256 pixels with the size of 1.01562×1.01562 mm², while the CT images have 512×512 pixels with the size of 0.527344×0.527344 mm². The highly under-sampled k-space data was simulated by subsampling the Fourier transform of original (test) images using a corresponding under-sampling mask.

In the simulation, 5 pairs of CT and MRI images were selected to construct two dictionaries. First, CT images were down-sampled into 256×256 pixels and re-registered with the same pixel size as MRI images. Thus, for each image patch to be represented, its corresponding dictionaries D ^CT and D ^MR have 135 atoms by selecting the nearest 9 patches to the each patch center. The patch size was 9×9 with the sliding distance of 2 pixels. After the initial MRI image was reconstructed by the Algorithm 1 described as Fig. 3 (Step 1 of EDGE), it is used as an initial value of the Algorithm 2 described as Fig. 5 (Step 2 of EDGE). Moreover, in the implementation of Fig. 5, the iteration number of K-SVD in step 1) was set 15. The block size is 8, and the sparseness level is 9. In step 2), all the atoms were used in OMP iteration. And, its convergence condition ε _OMP= 0.023.

The results of the EDGE algorithm were compared with the reconstruction of the traditional DL-MRI method [27]. In addition, we used the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) for quantitative image quality assessment [16]. The PSNR is computed as the ratio between the peak intensity value of the reference image and the root-mean-square of the reconstruction error. The SSIM between 0 and 1 compares local patterns of pixel intensities that have been normalized for luminance and contrast. All implementations were coded in Matlab 7.11.0 (R2010b).

3.1 Noiseless simulation

The sample CT and MRI images in Fig. 6 were selected to generate the dictionaries of these two modalities. The images in each column have the same size and are well-registered. The images in the 3rd row are the closed single-pixel edges of the skulls in the CT images of the 1st row. Figure 7 shows the test MRI and CT images of the same patient. (a) is the test CT image. (b) is the closed single-pixel edge of the skull extracted from (a). (c) is the target MRI image to be reconstructed. (d) is the initial MRI image reconstructed by the step 1 of EDGE (Algorithm 1) from the training data set consisting of the five pairs of well-registered MRI and CT images from other patients shown in Fig. 6, and the CT image of this same imaged object shown in Fig. 7(a). (e) is the reconstructed MRI image by the traditional DDL algorithm without edge-oriented mapping [12]. According to the true value of the target MRI image, calculate the MSE (mean squared error) and SSIM of these two reconstructed results. The MSE and SSIM of Fig. 6(d) are 8720.3 and 0.8003, respectively. However, the MSE and SSIM of Fig. 6(e) are 9909.5 and 0.7043, respectively. Enhanced by the adaptive edge information, EDGE algorithm can provide much better image quality than the traditional DDL algorithm.

Figure 8 shows the final MRI reconstruction results from the 20 folds under-sampling k-space measurement data. (a) is the test MRI image. (b) is the 20 folds under-sampling matrix in k-space. (c) is the reconstruction by the Fourier transform algorithm with zero-filling of measured data. (d) is the initial MRI image reconstructed by the Step 1 of EDGE. (e) is the reconstruction by the traditional DL-MRI algorithm with the initial image (c) [27]. (f) is the reconstruction by the EDGE algorithm with the initial image (d). The MSE of the reconstructed image (f) is 0.0065 which is much lower than the MSE of the reconstructed image (e), 0.0564. It can also be seen that the EDGE reconstruction has much cleaner background and less errors than the results of DL-MRI.

Figure 9 represents the PSNR as a function of the number of iterations for the EDGE and traditional DL-MRI algorithms respectively. After 15 iterations, for the DL-MRI reconstruction the PSNR and SSIM are 12.5128 dB and 0.6292, respectively. In contrast, the PSNR and SSIM of EDGE reconstruction are 27.7277 and 0.9322. Moreover, according to the same iterative convergence condition, our algorithm can stop when the iteration number equaled 6, while DL-MRI method stopped more than 15 iterations. Figure 10 shows the PSNR curves for different under-sampling folds in k-space measurements. We can find that the result of EDGE has a slow decline. However, the result of DL-MRI has a rapid fall after 14 folds.

Benefiting from the edge-oriented patch mapping between D ^CT and D ^MR, similar structural information are preserved within these two dictionaries. The initial MRI image reconstructed by EDGE can be regard as a highly under-sampling MRI image whose under-sampling factor is about 14. Therefore, as shown in Fig. 10, when the under-sampling factor is higher than 13, EDGE keep a much better PSNR than DL-MRI. The mean PSNR of the EDGE is about 18 dB higher than the DL-MRI result when the k-space is 25 under-sampled.

3.2 Noisy data simulation

In the noisy data simulation, the k-space data were added by the Gaussian noise with zero mean and standard deviation σ. And, the k-space data was 20 folds under-sampled. Figure 11 shows the results with the noise level σ = 10 reconstructed by DL-MRI and EDGE respectively. It may be found that the reconstruction of EDGE has a higher PSNR and cleaner background than that of DL-MRI. Table 1 shows the mean PSNR values with different noise levels for EDGE and DL-MRI reconstructions. It can be seen that the PSNR values for EDGE are much better than the results of DL-MRI from either noise-free or noisy data. In the case of high level noise, increasing the number of dictionary atoms, such as 10 or 15 (each image generates two or more atoms), may provide a better reconstruction result.

4 Discussions and conclusion

In conclusion, we proposed a new edge-oriented DDL MRI image reconstruction algorithm from highly under-sampled k-space data in order to decrease the MRI data acquisition time in the MRI-CT scanner. The key step is to establish a one-to-one mapping relationship between the training CT images and MRI images by using the edge-oriented mapping. Without any measured MRI data an initial MRI image can be generated within this new algorithm from the CT image of the target patient and the training CT and MRI data sets, which proved to be useful to improve the MRI image reconstruction from high under-sampled k-space data in this paper. This method is expected to be applied in fast MRI imaging where requires high temporal resolution such as dynamic cardiac imaging. Though the algorithm in this paper was derived and simulated in MRI-CT simultaneous imaging, we expect it can be used in other multi-modality imaging, e.g., the Omni-tomography including CT, MRI, PET, SPECT and optical image [10].

As mentioned above, the dictionary mapping in the EDGE algorithm decides the quality of the initial MRI image. Thus, the performance of EDGE is highly related to the mapping accuracy between the training CT and MRI images. The mismatch of the pairs of atoms in the two dictionaries D ^CT and D ^MR may damage their mapping relationship. Therefore, the MRI and CT images in the training data set should have the same pixel size and be well registered in data pre-processing stage. Moreover, because the edge information is extracted and matched from the training CT images and the patient’s CT image, we need to ensure that these two kinds of CT images have the similar and high image qualities, which means that the MRI-CT scanner should provide the similar CT image quality to the current clinical X-ray CT.

Footnotes

Acknowledgments

This work was partly supported by the grants from NSFC 61571256 and 81427803, Beijing Excellent Talents Training Foundation (2013D009004000004), and Beijing Municipal Science & Technology Commission (Z151100003915079).

References

Townsend

D.W.

, Multimodality imaging of structure and function, Physics in Medicine and Biology 53 (2008), R1–R39.

Townsend

D.W.

, Beyer

, Kinahan

, Meltzer

C.C.

, Brun

and Nutt

, The SMART scanner: A combined PET/CT tomograph for clinical oncology, Radiology 209P (1998), 169–170.

Beyer

, Townsend

D.W.

, Brun

, Kinahan

P.E.

, Charron

, Roddy

, et al., A combined PET/CT scanner for clinical oncology, Journal of Nuclear Medicine 41 (2000), 1369–1379.

Judenhofer

M.S.

, Wehrl

H.F.

, Newport

D.F.

, Catana

, Siegel

S.B.

, Becker

, et al., Simultaneous PET-MRI: A new approach for functional and morphological imaging, Nature Medicine 14 (2008), 459–465.

Cherry

S.R.

, Multimodality Imaging: Beyond PET/CT and SPECT/CT, Seminars In Nuclear Medicine 39 (2009), 348–353.

Bybel

, Brunken

R.C.

, DiFilippo

F.P.

, Neumann

D.R.

, Wu

G.Y.

and Cerqueira

M.D.

, SPECT/CT imaging: Clinical utility of an emerging technology, Radiographics 28 (2008), 1097–1113.

Ehrhardt

M.J.

, Thielemans

, Pizarro

, et al., Joint reconstruction of PET-MRI by exploiting structural similarity, Inverse Problems 31 (2015), 015001 (23pp).

, Chen

Z.Q.

, Jin

, Yu

H.Y.

and Wang

, Experimental measurement of human head motion for high-resolution computed tomography system design, Optical Engineering (2010) 49(6) (0150), 063201.

Wagner

, Schicho

, Kainberger

, Birkfellner

, Grampp

and Ewers

, Quantification and clinical relevance of head motion during computed tomography, Invest Radiol (2003), 733–741.

10.

Wang

, Zhang

, Gao

, Weir

, Yu

H.Y.

, Cong

W.X.

, Towards omni-tomography-grand fusion of multiple modalities for simultaneous interior tomography, PLoS ONE 7(6), e39700. doi:10.1371/journal.pone.0039700

11.

Yelleswarapu

V.R.

, Liu

, Cong

, et al., Top-level system designs for hybrid low-field MRI–CT with potential of pulmonary imaging, Sensing and Imaging 15(1) (2014), 1–9.

12.

, Zhao

, Wang

, et al., Unified dual-modality image reconstruction with dual dictionaries, Proc of SPIE, Vol. 8506, 85061V.

13.

Wang

, Li

and Wu

, CT-MRI image reconstruction with mask-enhanced dual-dictionary learning, Proc IEEE NSS/MIC, Seattle, 2014.

14.

Fahrig

, Butts

, Rowlands

J.A.

, et al., A truly hybrid interventional MR/X-ray system: Feasibility demonstration, Journal of Magnetic Resonance Imaging 13(2) (2001), 294–300.

15.

Fahrig

, Ganguly

, Lillaney

, et al., Design, performance, and applications of a hybrid X-Ray/MR system for interventional guidance, Proceedings of the IEEE 96(3) (2008), 468–480.

16.

Donoho

D.L.

, Compressed sensing, IEEE Transactions on Information Theory 52 (2006), 1289–1306.

17.

Link

T.M.

, Vieth

, Stehling

, Lotter

, Beer

, Newitt

, et al., High-resolution MRI vs multislice spiral CT: Which technique depicts the trabecular bone structure best?, European Radiology 13 (2003), 663–671.

18.

Schoettle

P.B.

, Zanetti

, Seifert

, Pfirrmann

C.W.A.

, Fucentese

S.F.

and Romero

, The tibial tuberosity-trochlear groove distance; a comparative study between CT and MRI scanning, Knee 13 (2006), 26–31.

19.

Candes

E.J.

, Romberg

and Tao

, Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information, IEEE Transaction on Information Theory 52 (2006), 489–509.

20.

Sidky

E.Y.

and Pan

X.C.

, Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization, Physics in Medicine and Biology 53 (2008), 4777–4807.

21.

H.Y.

and Wang

, Compressed sensing based interior tomography, Physics in Medicine and Biology 54 (2009), 2791–2805.

22.

Chen

G.H.

, Tang

and Leng

S.H.

, Prior image constrained compressed sensing (PICCS): A method to accurately reconstruct dynamic CT images from highly undersampled projection data sets, Medical Physics 35 (2008), 660–663.

23.

, Kang

K.J.

, Chen

Z.Q.

, et al., A general region-of-interest image reconstruction approach with truncated Hilbert transform, Journal of X-ray Science and Technology 17(2) (2009), 135–152.

24.

Chang

, Li

, Chen

Z.Q.

, Xiao

Y.S.

, Zhang

and Wang

, A few-view reweighted sparsity hunting (FRESH) method for CT image reconstruction, Journal of X-Ray Science And Technology 21 (2013), 161–176.

25.

Aharon

, Elad

and Bruckstein

, K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation, IEEE Trans Signal Process 54(11) (2006), 4311–4322.

26.

Yaghoobi

, Blumensath

and Davies

, Dictionary learning for sparse approximations with the majorization method, IEEE Trans Signal Process 57(6) (2009), 2178–2191.

27.

Ravishankar

and Bresler

, MR image reconstruction from highly undersampled k-space data by dictionary learning[J], Medical Imaging, IEEE Transactions on 30(5) (2011), 1028–1041.

28.

, Yu

H.Y.

, Mou

X.Q.

, et al., Low-dose X-ray CT reconstruction via dictionary learning, IEEE Trans Med Imag 31(9) (2012), 1682–1697.

29.

Tosic

, Jovanovic

, Frossard

, Vetterli

and Duric

, Ultrasound tomography with learned dictionaries, in Proc IEEE Int Conf Acoust, Speech Signal Process 2010, pp. 5502–5505.

30.

Rubinstein

, Bruckstein

A.M.

and Elad

, Dictionaries for sparse representation modeling, Proc IEEE 98(6) (2010), 1045–1057.

31.

Zhao

, Ding

H.J.

, Lu

, et al., Dual-dictionary learning-based iterative image reconstruction for spectral computed tomography application, Phys Med Biol 57 (2012), 8217–8229.

32.

, Zhao

and Wang

, Few-view image reconstruction with dual dictionaries, Phys Med Biol 57 (2012), 173–189.

33.

Song

, Zhu

, Lu

, et al., Reconstruction of magnetic resonance imaging by three-dimensional dual-dictionary learning, Magnetic Resonance in Medicine 71(3) (2014), 1285–1298.

34.

Tropp

J.A.

, Greed is good: Algorithmic results for sparse approximation, IEEE Trans Inf Theory 50(10) (2004), 2231–2242.

35.

, Zhao

and Wang

, Edge-guided dual-modality image reconstruction, IEEE Access 2 (2014), 1359–1363.