In recent years, many applications in the entertainment industry adopted the so-called super resolution approaches, which aim to reconstruct a high-resolution high quality image from input low-resolution low quality image. For instance, the low-resolution image may be quickly transferred through the network and, at the user side, reconstructed into its high-resolution version in real time. In the domain of medical images this area is not as adopted, but it bears potential for a wide set of applications. Namely, if we could reconstruct a high-resolution magnetic resonance (MR) image from low-resolution MR image acquisition, it would reduce the time and associated costs of MR imaging. Furthemore, the adoption of such high-resolution images could improve image-based diagnostics and the overall patient treatment. In applications of super resolution on medical images, compared to those used in the entertainment industry, the quality of the resulting image content needs to be reconstructed in a very precise and sensible manner for practical use.
Our approach to image super resolution was based on the use of convolutional neural networks. The networks were designed as generative adversarial networks (GANs), which represent a very efficacious training strategy. The basic principle is to simultaneously train two networks: a generator G and discriminator D. The first aims to generate as realistic images as possible, while the second aims to classify between the real and artificially generated images. This type of adversarial learning is mimicking a competition between the two networks and was shown to produce very realistic images. In this work, we applied several generator architectures, such as EDSR, mDCSRN, MRDG and RWMAN, where the EDSR was designed for natural images, and several discriminator architectures such as PPD, SRGAN and MedSRGAN. We have systematically evaluated combinations of generator and discriminator architectures, learning strategies and image preprocessing approaches, optimized critical components like the loss function of G and D networks, and tuned hyperparameters. Next, the networks were extended for use with 3D images and the practical value of super resolution for improving brain MR structure segmentation was verified.
In the experiments involving 2D images, obtained as slices of 3D MR images, we verified the impact of super resolution on the quality of reconstruction based on PSNR, SSIM and NRMSE quality metrics. The super resolution approaches generally improved all the metrics. We found that it is very important to train with real images, because we could not reproduce similar results with synthetically generated low- and high-resolution MR images. Next, image intensity normalization was crucial, with [0, 1] interval normalization giving the best results. Applying normalization in hidden layers of G was found to be unnecessary, whereas it was crucial in D. In GAN training the use of regularization with some sort of gradient penalty gave best results. Using pre-trained G networks as initial network weights in the entry point of GAN training showed very good quality super resolution results. Namely, avoiding the local optima shortened the training time. By far the most important component was the loss function, which critically determined the quality of our reconstructed high-resolution image. We also noticed the difference with the applied architectures of network G, where EDSR and RWMAN, both belonging to a family of residual networks, gave best results.
In the experiments on 3D MR images, we observed improved values of the PSNR and NRMSE quality metrics, but not SSIM. Next, we compared the automated brain segmentations on reconstructed high-resolution and real low resolution image. The comparison was based on the DSC and SurfDSC segmentation quality metrics, where the segmentation of real high-resolution image was used as a reference. We found that in general both metrics improved with the use of reconstructed high-resolution image by 2-4%. The Wilcoxon test showed that, compared to the segmentation of low-resolution image, the segmentation of the reconstructed high-resolution MR image was significantly better for nearly all brain structures.
|