
Input Reflectance Illumination Amplification
Figure 2. Image decomposition via RetinexNet. It can be observed
clearly that the illumination of a low-light image contains noise.
many visual applications for processing the low-light im-
ages with poor visibility, for instance object detection, im-
age recognition and segmentation in the dark [12]. Early
methods solve this task by designing minimal reconstruc-
tion models [11,21], which generally have limited capa-
bilities for restoring detailed information of the low-light
images. In recent years, deep neural networks have been
widely applied to LLIE tasks [15,17,28,33]. These deep
models are mostly trained on dark images with relatively
low noise, achieving good performance on recovering local
textures and global structures of the images [6,13,30,37].
Among previous deep LLIE methods, some tackle image
enhancement jointly with denoising, such as [17,27,28,35]
that are aimed at removing noise in some specific layers
while enhancing image in other components, or [6,19,32]
that are aimed at removing noise and enhance image jointly.
Although these methods achieve promising effectiveness,
they still tend to give inferior results with speckle noise
and blur when applied to real-world low-light sRGB im-
ages, as shown in Figure 1. We hold that such performance
gap mainly roots in two challenges of real-world LLIE. (1)
The noise issue in real-world LLIE is rather complex. Due
to insufficient lighting or hardware limitations, the images
captured in real-world dark environments inevitably contain
varying degrees of noise, which is often with unknown dis-
tribution. Moreover, some noise is somewhat covered up by
the low-lighting, which makes it hard to be separated com-
pletely with traditional methods. Besides, the unobserved
noise in the image would be amplified and lead to speckle
noise and blur during the process of enhancement. (2) The
existing LLIE methods trained on raw data are unsuitable
for sRGB images due to the difference in the domains of
raw and sRGB data. The low-light sRGB image contain
less information compared with a raw one, and the conver-
sion of raw data to sRGB may also produce more noise.
We conduct a pilot study to investigate the separation of
noise from the low-light image [27] and provide visualiza-
tions in Figure 2. It can be observed that both reflectance
and illumination contain noise, and the final enhanced re-
sult (Figure 1(b)) contains much speckle noise, which indi-
cates that the noise is hard to be separated completely by the
delicately designed layers, and that the hidden noise indeed
causes inaccurate representation.
We are then motivated to design a new Low-light En-
hancement & Denoising Network for enhancing real-world
low-light images in the sRGB color space, shorted as
RLED-Net. Instead of removing the noise in some spe-
cific layers, we propose to suppress the noise in the low-
rank/low-dimensional subspaces which can be used to char-
acterize the real-world images. In RLED-Net, a low-rank
subspace recovery block (LSRB) is used to directly embed
the noisy dark image into the low-rank subspaces to make
our network more suitable for real-world low-light image
denoising. Furthermore, to reduce the speckle noise and
blur caused by the hidden noise in the low-light environ-
ment, we propose to borrow some self-attentions to build
transformer layers for recovering the important features.
Hence we design a Crossed-channel & Shift-window Trans-
former (CST) layer with two branches. Specifically, CST
can simultaneously maintain more accurate local features
(e.g., edge/texture) and global features (e.g., color/shape),
by shift-window attention and crossed channels attention in
two parallel branches. Through the ablation studies, We find
this structure can preserve more useful features.
We conduct extensive experiments to prove the effec-
tiveness of the proposed RLED-Net. The obtained results
demonstrate that our RLED-Net can significantly improve
the performance when employed for real-world low-light
sRGB images enhancement and denoising. An example can
be found in Figure 1. To sum up, the main contributions of
this paper are described as follows:
• We propose a plug-and-play and differentiable low-
rank subspace recovery block (LSRB), which can sup-
press the noise in real-world low-light images effec-
tively by representing them in low-rank subspaces.
• To reduce the speckle noise and blur caused by hid-
den noise, we design a two-branch transformer (CST)
layer, which can simultaneously recover more accurate
local features and global features by self-attention.
• We develop a Low-light Enhancement & Denoising
Network (RLED-Net) for enhancing real-world low-
light images in the sRGB color space, which is a more
widely used color space in reality. SOTA performance
of RLED-Net on both well-designed and real low-light
images is verified through extensive experiments.
2. Related Work
2.1. Low-light image enhancement
Low-light image enhancement (LLIE) explores how to
enhance the illumination to improve the visibility of low-
light images [10,12,20,30,37]. Recently, some LLIE meth-
ods have been proposed to exploit the restoration of global
and local information for more natural restored results. For
example, DCC-Net [37] decomposes a color image into a
gray image and a color histogram, so as to retain the color
consistency between the enhanced image and ground truth;
2