2 M. Pistellato et al.
field, we can mention enhanced visualisation [6,21], documentation and preser-
vation [16,13,15] as well as surface analysis [3]. Moreover, RTI techniques can
be effectively paired with other tools as 3D reconstruction [36,23,24,25] or mul-
tispectral imaging [8] to further improve the results.
In the majority of the cases, the acquisition of RTI data is carried out with
specialised hardware involving a light dome and other custom devices that need
complex initial calibration. Since the amount of processed data is significant,
several compression methods have been proposed for RTI data representation
to obtain efficient storage and interactive rendering [27,9]. In addition to that,
part of the proposals focus on the need of low-cost portable solutions [12,38,28],
including mobile devices [31] to perform the computation on the field.
In this paper we first propose a low-cost acquisition pipeline that requires a
couple of ordinary smartphones and a simple marker printed on a flat surface.
During the process, both smartphones acquire two videos simultaneously: one
device acting as a static camera observing the object from a fixed viewpoint,
while the other provides a trackable moving light source. The two videos are
synchronised and then the marker is used to recover the light position with
respect to a common reference frame, originating a sequence of intensity images
paired with light directions. The second contribution of our work is an efficient
and accurate neural-network model to describe per-pixel reflectance based on
PCA-compressed intensity data. We tested the proposed relighting approach
both on a synthetic RTI dataset, involving different surfaces and materials, and
on several real-world objects acquired on the field.
2 Related Work
The literature counts a huge number of different methods for both acquisition
and processing of RTI data for relighting. In [22] the authors give a compre-
hensive survey on Multi-Light Image Collections (MLICs) for surface analysis.
Many approaches employ the classical polynomial texture maps [14] to (i) define
the per-pixel light function, (ii) store a representation of the acquire data, and
(iii) dynamically render the image under new lights. Similar techniques are the
so-called Hemispherical Harmonics coefficients [17] and Discrete Modal Decom-
position [26]. In [9] the authors propose a new method based on Radial Basis
Function (RBF) interpolation, while in [27] a compact representation for web
visualisation employing PCA is presented. The authors in [18] present the High-
light Reflectance Transformation Imaging (H-RTI) framework, where the light
direction is estimated by detecting its specular reflection on one or more spherical
objects captured in the scene. However, such setup involves several assumptions
such as constant light intensity and orthographic camera model, that in practice
make the model unstable. Other techniques that have been proposed to estimate
light directly from some scene features are [1,2], while in the authors [9] propose
a novel framework to expand the H-RTI technique.
Recently, neural networks have been employed successfully in several Com-
puter Vision tasks, including RTI. In particular, the encoder-decoder architec-