2 JOE KILEEL AND KATHL ´
EN KOHN
computational algebra geometry community. However, in the last decade, algebro-
geometric papers and workshops on 3D reconstruction have been appearing, leading
to novel results in multiple view geometry while motivating developments in applied
algebraic geometry.
The present article provides a survey of algebraic vision. No previous knowledge
of computer vision is assumed, and the prerequisites for computational algebraic
geometry are kept mostly to the level of undergraduate texts [51]. Due to space lim-
itations, the article makes no attempt to be comprehensive in any way, but instead
it focuses narrowly on the role of projective varieties and systems of polynomial
equations in 3D vision. An outline of the sections is as follows:
•In Section 1, we introduce the problem of 3D scene reconstruction from
unkown cameras and its algebro-geometric nature.
•In Section 2, we discuss a variety of usual models for cameras.
•In Section 3, we study multiview varieties which characterize feasible im-
ages of points under fixed cameras. Their defining equations play a key
role in 3D reconstruction algorithms, and their Euclidean distance degrees
measure the intrinsic complexity of noisy triangulation (i.e., the task of
recovering the 3D coordinates of a point observed by known cameras).
•In Section 4, we consider the space of all cameras. We explain how tuples
of cameras – up to changes of world coordinates – can be encoded via
multifocal tensors [94].
•In Section 5, we overview the most popular algorithmic pipeline to solve
3D scene reconstruction, highlighting minimal problems that are the algebro-
geometric heart of the pipeline.
•In Section 6, we describe polynomial solvers for minimal problems, focus-
ing on Gr¨obner basis methods using elimination templates and homotopy
continuation. Those method applies to zero-dimensional parameterized
polynomial systems in general.
•In Section 7, we discuss algebro-geometric approaches to understand de-
generate world scenes and image data, where uniqueness of reconstruction
breaks down and algorithms can encounter difficulty.
After reading Sections 1 and 2, the other sections are essentially independent;
only Section 6 builds on Section 5. We provide specific pointers to earlier sections
in case of partial dependencies.
Some important topics in algebraic vision that are omitted include group syn-
chronization (e.g., [161, 127]), uses of polynomial optimization (e.g., [104, 50, 6,
190, 44]), and approaches based on differential invariants (e.g., [42, 30]). Readers
may consult [147] for a survey that covers numerical and large-scale optimization
aspects in 3D reconstruction.
Acknowledgements. We thank Sameer Agarwal, Paul Breiding, Luca Car-
lone, Tim Duff, Hongyi Fan, Fredrik Kahl, Anton Leykin, Tomas Pajdla, Jean
Ponce, Kristian Ranestad, Felix Rydell, Elima Shehu, Rekha Thomas, Matthew
Trager and Uli Walther for their comments on earlier versions of the manuscript.
1. Computer vision through the algebraic lens
One of the main challenges in computer vision is the structure-from-motion
(SfM) problem: given many 2D images, the task is to reconstruct the 3D scene