
Leveraging Structure from Motion to Localize Inaccessible Bus Stops
Indu Panigrahi1, Tom Bu2, and Christoph Mertz2
Abstract— The detection of hazardous conditions near pub-
lic transit stations is necessary for ensuring the safety and
accessibility of public transit. Smart city infrastructures aim
to facilitate this task among many others through the use
of computer vision. However, most state-of-the-art computer
vision models require thousands of images in order to perform
accurate detection, and there exist few images of hazardous
conditions as they are generally rare.
In this paper, we examine the detection of snow-covered
sidewalks along bus routes. Previous work has focused on
detecting other vehicles in heavy snowfall or simply detect-
ing the presence of snow. However, our application has an
added complication of determining if the snow covers areas
of importance and can cause falls or other accidents (e.g.
snow covering a sidewalk) or simply covers some background
area (e.g. snow on a neighboring field). This problem involves
localizing the positions of the areas of importance when they
are not necessarily visible.
We introduce a method that utilizes Structure from Motion
(SfM) rather than additional annotated data to address this
issue. Specifically, our method learns the locations of sidewalks
in a given scene by applying a segmentation model and SfM
to images from bus cameras during clear weather. Then, we
use the learned locations to detect if and where the sidewalks
become obscured with snow. After evaluating across various
threshold parameters, we identify an optimal range at which
our method consistently classifies different categories of side-
walk images correctly. Although we demonstrate an application
for snow coverage along bus routes, this method can extend
to other hazardous conditions as well. Code for this project
is available at https://github.com/ind1010/SfM_for_
BusEdge.
Index Terms— Computer Vision for Transportation, Intelli-
gent Transportation Systems, Localization, Segmentation and
Categorization
I. INTRODUCTION
Smart city infrastructures aim to use fields like computer
vision to facilitate city management, part of which involves
overseeing transportation systems. As transportation systems
become more intelligent, an increasing amount of public
transit vehicles are equipped with cameras that capture thou-
sands of images of the city per day along with geographic
positioning information. City infrastructures can use this
immense amount of raw data to monitor the conditions of
public transit stations and the surrounding areas.
Our application focuses on detecting snow-covered side-
walks along bus routes; snow-covered sidewalks are one
1Indu Panigrahi is with Robotics Institute Summer Scholars Program at
Carnegie Mellon University, Pittsburgh, PA 15213, USA and also with the
Department of Computer Science at Princeton University, NJ 08544, USA
indup@princeton.edu
2Tom Bu and Christoph Mertz are with the Robotics Institute
at Carnegie Mellon University, Pittsburgh, PA 15213, USA tomb,
cmertz@andrew.cmu.edu
type of hazardous condition that can limit the safety and
accessibility of public buses as pedestrians can lose access
to bus stops and/or slip (Fig. 1). We use images that are
captured on-board a public bus as data. However, instead
of annotating this data, we leverage the fact that the bus
travels around a set route and apply Structure from Motion
and a segmentation model to learn the locations of the
sidewalks in clear weather. Then, in future rounds, when
the bus encounters snowfall, we compare the detected snow
coverage to the learned locations of the sidewalks. If the
coverage exceeds a set threshold, we generate an alert, and
the bus company can contact the city to clear the sidewalk.
Fig. 1: Snow-covered sidewalk leading to a bus stop.
When evaluating on a few categories of sidewalk images,
we identify a set of thresholds at which our method per-
forms well across all categories for this bus route. Though
we demonstrate an application for detecting snow-covered
sidewalks, our method can generalize to detecting other
conditions such as snow on roads or bike lanes.
Our contributions are as follows:
•We present a method that combines Structure from
Motion with a segmentation model to learn the expected
locations of sidewalks and detect whether or not the
learned sidewalk locations become covered by snow.
•Although we demonstrate by detecting snow-covered
sidewalks, our method can easily generalize to other
problems.
•We collect a small dataset of images depicting sidewalks
in clear and snowy weather that we use for evaluation.
Additionally, we compile other categories of images that
may be relevant for other works.
II. RELATED WORK
A. Existing Municipal Infrastructures
Many American cities use the telephone number 311 that
allows anyone to report issues for the city to fix, such
as snow-covered sidewalks. However, this process can be
inefficient as it is decentralized and relies on the motivation
of people.
arXiv:2210.03646v1 [cs.CV] 7 Oct 2022