NON-ITERATIVE OPTIMIZATION OF PSEUDO-LABELING THRESHOLDS FOR TRAINING OBJECT DETECTION MODELS FROM MULTIPLE DATASETS Yuki Tanaka Shuhei M. Yoshida Makoto Terao
2025-05-02
0
0
3.09MB
5 页
10玖币
侵权投诉
NON-ITERATIVE OPTIMIZATION OF PSEUDO-LABELING THRESHOLDS FOR
TRAINING OBJECT DETECTION MODELS FROM MULTIPLE DATASETS
Yuki Tanaka, Shuhei M. Yoshida, Makoto Terao
Visual Intelligence Research Laboratories
NEC Corporation
Kawasaki, Kanagawa, Japan
ABSTRACT
We propose a non-iterative method to optimize pseudo-
labeling thresholds for learning object detection from a col-
lection of low-cost datasets, each of which is annotated for
only a subset of all the object classes. A popular approach
to this problem is first to train teacher models and then to
use their confident predictions as pseudo ground-truth labels
when training a student model. To obtain the best result,
however, thresholds for prediction confidence must be ad-
justed. This process typically involves iterative search and
repeated training of student models and is time-consuming.
Therefore, we develop a method to optimize the thresholds
without iterative optimization by maximizing the Fβ-score
on a validation dataset, which measures the quality of pseudo
labels and can be measured without training a student model.
We experimentally demonstrate that our proposed method
achieves an mAP comparable to that of grid search on the
COCO and VOC datasets.
Index Terms—Non-iterative optimization, pseudo label-
ing, object detection, weakly supervised learning
1. INTRODUCTION
Object detection [1, 2, 3, 4] has achieved significant progress
in deep learning with a tremendous number of images and an-
notations, but it becomes quite expensive to collect them. This
creates a significant barrier when it comes to moving from the
research stage to practical application. Recently, research on
how to train a model with low-cost datasets has become more
active.
There are several paradigms to learn from low-cost
datasets. Examples include semi-supervised learning [5, 6, 7,
8] and weakly supervised learning [9, 10]. In semi-supervised
learning, models are trained from a limited amount of labeled
data and a lot of unlabeled data (Fig. 1(a)), while in weakly
© 2022 IEEE. Personal use of this material is permitted. Permission
from IEEE must be obtained for all other uses, in any current or future media,
including reprinting/republishing this material for advertising or promotional
purposes, creating new collective works, for resale or redistribution to servers
or lists, or reuse of any copyrighted component of this work in other works.
(a) Semi-supervised learning (b) Weakly supervised learning
(c) Learning from multiple datasets with different class sets
Fig. 1. Examples of learning paradigms for object detection
using low-cost datasets. In these examples, the goal is to
train a model that detects people and bicycles in images by
using training datasets with annotations as illustrated above.
The two images are contained in COCO [14] and VOC [15]
datasets, respectively.
supervised learning, models are trained from only image-
level annotations and no bounding boxes (Fig. 1(b)). By
contrast, we aim at training a single object detection model
for all classes from multiple datasets that have different class
sets without additional annotations [11, 12, 13]. This set-
ting (Fig. 1(c)) is important for practical applications, because
we can add object categories simply by combining datasets
that are made for different purposes.
Typically, pseudo labeling is used to train an object de-
tection model in the current problem setting (Fig. 2). Specif-
ically, we first train one teacher model from each dataset and
then use them to predict locations of unlabeled objects. A
prediction is used as a pseudo label if its confidence score
is higher than a predetermined threshold. Finally, we train a
single student model for all classes by using both the ground-
truth labels and the pseudo labels.
To achieve the best performance with pseudo labeling, it is
imperative to decide this threshold properly, but the optimiza-
© IEEE 2022
arXiv:2210.10221v1 [cs.CV] 19 Oct 2022
摘要:
展开>>
收起<<
NON-ITERATIVEOPTIMIZATIONOFPSEUDO-LABELINGTHRESHOLDSFORTRAININGOBJECTDETECTIONMODELSFROMMULTIPLEDATASETSYukiTanaka,ShuheiM.Yoshida,MakotoTeraoVisualIntelligenceResearchLaboratoriesNECCorporationKawasaki,Kanagawa,JapanABSTRACTWeproposeanon-iterativemethodtooptimizepseudo-labelingthresholdsforlearning...
声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
相关推荐
-
公司营销部领导述职述廉报告VIP免费
2024-12-03 4 -
100套述职述廉述法述学框架提纲VIP免费
2024-12-03 3 -
20220106政府党组班子党史学习教育专题民主生活会“五个带头”对照检查材料VIP免费
2024-12-03 3 -
20220106县纪委监委领导班子党史学习教育专题民主生活会对照检查材料VIP免费
2024-12-03 6 -
A文秘笔杆子工作资料汇编手册(近70000字)VIP免费
2024-12-03 3 -
20220106县领导班子党史学习教育专题民主生活会对照检查材料VIP免费
2024-12-03 4 -
经济开发区党工委书记管委会主任述学述职述廉述法报告VIP免费
2024-12-03 34 -
20220106政府领导专题民主生活会五个方面对照检查材料VIP免费
2024-12-03 11 -
派出所教导员述职述廉报告6篇VIP免费
2024-12-03 8 -
民主生活会对县委班子及其成员批评意见清单VIP免费
2024-12-03 50
分类:图书资源
价格:10玖币
属性:5 页
大小:3.09MB
格式:PDF
时间:2025-05-02


渝公网安备50010702506394