NAS-PRNet: Neural Architecture Search generated Phase Retrieval
Net for Off-axis Quantitative Phase Imaging
Xin Shu1, Mengxuan Niu1, Yi Zhang1, and Renjie Zhou1, *
1Department of Biomedical Engineering, the Chinese University of Hong Kong, Shatin, New Territories,
Hong Kong, China
*Corresponding author: rjzhou@cuhk.edu.hk
Abstract: Single neural networks have achieved simultaneous phase retrieval with aberration
compensation and phase unwrapping in off-axis Quantitative Phase Imaging (QPI). However, when
designing the phase retrieval neural network architecture, the trade-off between computation latency and
accuracy has been largely neglected. Here, we propose Neural Architecture Search (NAS) generated
Phase Retrieval Net (NAS-PRNet), which is an encoder-decoder style neural network, automatically
found from a large neural network architecture search space. The NAS scheme in NAS-PRNet is
modified from SparseMask, in which the learning of skip connections between the encoder and the
decoder is formulated as a differentiable NAS problem, and the gradient decent is applied to efficiently
search the optimal skip connections. Using MobileNet-v2 as the encoder and a synthesized loss that
incorporates phase reconstruction and network sparsity losses, NAS-PRNet has realized fast and accurate
phase retrieval of biological cells. When tested on a cell dataset, NAS-PRNet has achieved a Peak Signal-
to-Noise Ratio (PSNR) of 36.1 dB, outperforming the widely used U-Net and original SparseMask-
generated neural network. Notably, the computation latency of NAS-PRNet is only 31 ms which is 12
times less than U-Net. Moreover, the connectivity scheme in NAS-PRNet, identified from one off-axis
QPI system, can be well fitted to another with different fringe patterns.
1. INTRODUCTION
Quantitative Phase Imaging (QPI) has been widely applied to biomedical imaging [1, 2] and material
metrology [3, 4]. In off-axis interferometry-based QPI [5] (off-axis QPI), the Optical Path Difference
(OPD) or phase distribution of an object is encoded in a fringe pattern or interferogram by interfering the
object field with a tilted reference field. To reconstruct the phase map from a recorded interferogram, the
conventional approach contains three key steps: (i) retrieving the wrapped phase (valued between 0 to 2𝜋)
from the complex object field (e.g., using the Fourier transform method [6]); (ii) unwrapping the phase
(e.g., using the Goldstein algorithm [7]); and (iii) calibrating the phase or compensating the phase
aberration by using an additional interferogram captured in a sample-free region [8]. Among the three
steps, phase unwrapping is the most time-consuming; moreover, one may fail to obtain the calibration
interferogram when imaging a dense sample. To expedite phase retrieval in off-axis QPI, parallel
computation using sophisticated Parallel Graphics Processing Units (GPUs) or Field Programmable Gate
Arrays (FPGAs) has been applied to accelerate phase unwrapping [9, 10], while the calibration
interferogram is still required. In recent years, neural networks have become attractive alternatives for
achieving phase retrieval with aberration compensation with/without phase unwrapping in off-axis QPI,
such as the widely used U-Net models [11, 12] and the Y-Net [13]. Despite realizing significant
simplification of the imaging system and reduced cost, furthering applying those methods for real-time
phase imaging in off-axis QPI is potentially limited by the relatively large computation latency and phase
retrieval accuracy. It is known that the network inference accuracy and efficiency heavily depend on its
architecture. Therefore, a strategy to identify an optimal network architecture is needed to minimize the
computation latency, while keeping the phase retrieval accuracy high.