
3
TABLE I: Statistics of existing logo detection datasets.
#Scale #Datasets #Logos #Brands #Images #Objects #Public #Year
Small
BelgaLogos [12] 37 37 10,000 2,695 Yes 2009
FlickrLogos-27 [19] 27 27 1,080 4,671 Yes 2011
FlickrLogos-32 [13] 32 32 2,240 5,644 Yes 2011
MICC-Logos [10] 13 720 - No 2013
Logo-18 [14] 18 10 8,460 16,043 No 2015
Logos-32plus [20] 32 32 7,830 12,302 No 2017
Top-Logo-10 [21] 10 10 700 - No 2017
Video SportsLogo [22] 20 20 2,000 - No 2017
VLD 1.0 [23] 66 66 25,189 - No 2019
SportLogo [24] 31 31 2,836 - Yes 2020
VLD-45 [25] 45 45 45,000 - No 2020
Medium
Logo-160 [14] 160 100 73,414 130,608 No 2015
Logos-in-the-Wild [26] 871 871 11,054 32,850 Yes 2017
QMUL-OpenLogo [15] 352 352 27,083 - Yes 2018
PL2K [27] 2,000 2,000 295,814 - No 2019
FoodLogoDet-1500 [16] 1,500 - 99,768 145,400 Yes 2021
Large
Open Brands [28] 1,216 559 1,437,812 3,113,828 No 2020
LogoDet-3K [17] 3,000 2,864 158,652 194,261 Yes 2020
PL8K [18] 7,888 7,888 3,017,146 - No 2022
varies greatly. For example, the Food, Clothes, and Necessities
contain more images and objects than other super-classes.
The imbalanced distribution across different logo classes of
LogoDet-3K poses a challenge to effectively detecting logos
with few samples. PL8K [18] is a large logo detection dataset
constructed semi-automatically. The dataset consists of 7,888
logo brands and 3,017,146 images, and at least 20 images per
class.
In addition, there are also some datasets built for logo
classification, e.g. WebLogo-2M [29] and Logo 2K+ [30].
Weblogo-2M [29] is obtained by automatic web data ac-
quisition and processing. The dataset excludes images with
small widths and/or heights and duplicate images. Compared
with other datasets, the Weblogo-2M presents three unique
properties inherent to large-scale data exploration for learning
scalable logo models: (1) Weak Annotation. (2) Noisy (False
Positives). (3) Class Imbalance. Logo-2K+ [30] is a large-scale
high-quality logo dataset. It covers a variety of logo classes
from the real world, and different types of logo images have
various logo appearances, scales, and backgrounds since they
are collected from different websites. The dataset has high
spatial coverage of categories including Food, Clothes, Insti-
tutions, Accessories, Transportation, Electronics, Necessities,
Cosmetics, Leisure, and Medical. The number of images is
imbalanced among different categories. For instance, Food has
769 logo classes, while Medical has only 48.
As with common object detection, mAP [31] is the most
commonly used evaluation metric to measure logo detectors.
III. LOGO DETECTION
In logo detection, logo classification is also an essential part.
Therefore, we briefly summarize logo classification before
introducing logo detection.
A. Logo Classification
As one of the most critical tasks in computer vision, logo
classification aims to recognize the logo name corresponding
to the input image. According to different feature extraction
strategies, existing classification methods are divided into two
categories: traditional machine learning-based methods and
deep learning-based methods. Some representative methods
will be described briefly in the following.
1) Traditional Machine Learning-based Methods: The tra-
ditional logo classification methods extract features through
manual features, such as SIFT and HOG, and then classify
them by a classifier. Support Vector Machine (SVM) [32]
is a classifier that performs binary data classification in a
supervised learning manner. In traditional logo classification,
SVM also shows excellent performance [33–36]. Carvalho
et al. [35] proposed a self-learning and automatic detection
method that performs detection without any prior data. The
scheme automatically identifies the candidate regions of the
localization logo, uses the HOG features extracted from the
localization to train the object detector and some sub-detectors
and uses the SVM to recognize the logo image.
K-Nearest Neighbor (KNN) is a supervised learning algo-
rithm commonly used in classification [37, 38]. Gopinathan et
al. [38] proposed a vehicle logo recognition system in 2018.
They used Euclidean distance on the initial training dataset
to summarize the pixel intensity distribution by applying each
channel’s mean and standard deviation and used the K-means
algorithm to cluster the color histogram features to group
different logos. Then HOG and KNN algorithms extract logo
features and classify logos.
2) Deep Learning-based Methods: In recent years, with
the continuous development of deep learning, deep learning
based solutions have also been successfully applied to logo
classification [39–46]. Karimi et al. [43] used the DCNN
logo recognition algorithm, which conducted a pre-trained
model for feature extraction and then used SVM for logo
classification. They also used transfer learning to improve
existing pre-trained models for logo recognition. Finally, the
fine-tuned deep model is applied to the parallel structure to
obtain a more efficient deep model for logo recognition.
The latest trend in logo classification is to design efficient
networks with limited resources [18, 30, 47, 48]. Wang et
al. [30] proposed a Discriminative Region Navigation and
Augmentation Network (DRNA-Net), which is capable of
discovering more informative regions and expanding these