NON-ITERATIVE OPTIMIZATION OF PSEUDO-LABELING THRESHOLDS FOR
TRAINING OBJECT DETECTION MODELS FROM MULTIPLE DATASETS
Yuki Tanaka, Shuhei M. Yoshida, Makoto Terao
Visual Intelligence Research Laboratories
NEC Corporation
Kawasaki, Kanagawa, Japan
ABSTRACT
We propose a non-iterative method to optimize pseudo-labeling
thresholds for learning object detection from a collection of
low-cost datasets, each of which is annotated for only a subset
of all the object classes. A popular approach to this problem is
first to train teacher models and then to use their confident
predictions as pseudo ground-truth labels when training a student
model. To obtain the best result, however, the thresholds for
prediction confidence must be tuned. This tuning typically
involves iterative search and repeated training of student
models, which is time-consuming. We therefore develop a method
that optimizes the thresholds without iterative optimization by
maximizing the Fβ-score on a validation dataset. This score
measures the quality of the pseudo labels and can be computed
without training a student model. We experimentally demonstrate
that the proposed method achieves an mAP comparable to that of
grid search on the COCO and VOC datasets.
Index Terms: Non-iterative optimization, pseudo labeling,
object detection, weakly supervised learning
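As an illustration of the idea stated in the abstract, the following is a minimal sketch of choosing a confidence threshold by maximizing the Fβ-score of teacher predictions on a validation set. It is not the authors' implementation. The function name `best_fbeta_threshold`, the inputs `scores`, `is_tp`, and `n_gt`, and the assumption that each prediction has already been matched against the validation ground truth are all hypothetical. With precision P = TP / k and recall R = TP / n_gt for the k highest-scoring predictions, Fβ = (1 + β²) P R / (β² P + R) simplifies to (1 + β²) TP / (β² n_gt + k), so one pass over the sorted predictions suffices.

```python
from typing import List, Tuple


def best_fbeta_threshold(
    scores: List[float],
    is_tp: List[bool],
    n_gt: int,
    beta: float = 1.0,
) -> Tuple[float, float]:
    """Return (threshold, f_beta) maximizing F-beta when predictions with
    score >= threshold are kept as pseudo labels.

    Assumptions (not from the paper): is_tp[i] says whether prediction i
    matches a validation ground-truth box, and n_gt is the number of
    ground-truth boxes of the class in the validation set.
    """
    if n_gt <= 0:
        raise ValueError("n_gt must be positive")
    if len(scores) != len(is_tp):
        raise ValueError("scores and is_tp must have the same length")

    # Visit predictions from the most to the least confident.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    b2 = beta * beta
    best_thr, best_f = float("inf"), 0.0  # inf means "keep nothing"
    tp = 0
    for k, i in enumerate(order, start=1):
        tp += int(is_tp[i])
        # A threshold cannot separate predictions with equal scores,
        # so evaluate only at the end of each run of ties.
        if k < len(order) and scores[order[k]] == scores[i]:
            continue
        f = (1.0 + b2) * tp / (b2 * n_gt + k)
        if f > best_f:
            best_thr, best_f = scores[i], f
    return best_thr, best_f


# Toy validation predictions for one class (made-up numbers).
scores = [0.9, 0.8, 0.7, 0.6, 0.3]
is_tp = [True, True, False, True, False]
thr_f1, f1 = best_fbeta_threshold(scores, is_tp, n_gt=4, beta=1.0)
thr_f05, f05 = best_fbeta_threshold(scores, is_tp, n_gt=4, beta=0.5)
```

In this toy example, β = 1 selects the threshold 0.6, while β = 0.5, which weights precision more heavily, selects the higher threshold 0.8. No student model is trained at any point; only teacher predictions on the validation set are needed.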
1. INTRODUCTION
Object detection [1, 2, 3, 4] has made significant progress
through deep learning on very large numbers of images and
annotations, but collecting them is expensive. This cost is a
significant barrier to moving from the research stage to
practical application. Recently, research on how to train a
model with low-cost datasets has become more active.
There are several paradigms to learn from low-cost datasets.
Examples include semi-supervised learning [5, 6, 7, 8] and
weakly supervised learning [9, 10]. In semi-supervised learning,
models are trained from a limited amount of labeled data and a
lot of unlabeled data (Fig. 1(a)), while in weakly supervised
learning, models are trained from only image-level annotations
and no bounding boxes (Fig. 1(b)). By contrast, we aim at
training a single object detection model for all classes from
multiple datasets that have different class sets without
additional annotations [11, 12, 13]. This setting (Fig. 1(c)) is
important for practical applications, because we can add object
categories simply by combining datasets that are made for
different purposes.

Fig. 1. Examples of learning paradigms for object detection
using low-cost datasets: (a) semi-supervised learning, (b) weakly
supervised learning, (c) learning from multiple datasets with
different class sets. In these examples, the goal is to train a
model that detects people and bicycles in images by using
training datasets with annotations as illustrated above. The two
images are contained in COCO [14] and VOC [15] datasets,
respectively.

© 2022 IEEE. Personal use of this material is permitted. Permission
from IEEE must be obtained for all other uses, in any current or future media,
including reprinting/republishing this material for advertising or promotional
purposes, creating new collective works, for resale or redistribution to servers
or lists, or reuse of any copyrighted component of this work in other works.
Typically, pseudo labeling is used to train an object
detection model in the current problem setting (Fig. 2).
Specifically, we first train one teacher model from each dataset
and then use them to predict locations of unlabeled objects. A
prediction is used as a pseudo label if its confidence score
is higher than a predetermined threshold. Finally, we train a
single student model for all classes by using both the
ground-truth labels and the pseudo labels.
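The thresholding step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code. The `Box` type, the function `build_student_targets`, the per-class `thresholds` dictionary, and the rule that teacher predictions are used only for classes the image's own dataset does not annotate are all hypothetical choices made for the example.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple


@dataclass(frozen=True)
class Box:
    label: str
    xyxy: Tuple[float, float, float, float]
    score: float = 1.0  # ground-truth boxes carry a score of 1.0


def build_student_targets(
    ground_truth: List[Box],
    teacher_predictions: List[Box],
    annotated_classes: Set[str],
    thresholds: Dict[str, float],
) -> List[Box]:
    """Combine ground-truth boxes with confident teacher predictions.

    A prediction becomes a pseudo label only if its class is not annotated
    in this image's dataset (assumption) and its score is strictly higher
    than the threshold for its class. Classes without a threshold yield
    no pseudo labels.
    """
    pseudo = [
        p
        for p in teacher_predictions
        if p.label not in annotated_classes
        and p.score > thresholds.get(p.label, float("inf"))
    ]
    return list(ground_truth) + pseudo


# An image from a dataset that annotates only "person" (made-up values).
gt = [Box("person", (0.0, 0.0, 10.0, 10.0))]
preds = [
    Box("bicycle", (5.0, 5.0, 20.0, 20.0), 0.9),  # kept: 0.9 > 0.5
    Box("bicycle", (1.0, 1.0, 2.0, 2.0), 0.3),    # dropped: 0.3 <= 0.5
    Box("person", (0.0, 0.0, 9.0, 9.0), 0.95),    # dropped: class is annotated
]
targets = build_student_targets(gt, preds, {"person"}, {"bicycle": 0.5})
```

Here `targets` holds the ground-truth person box plus the single confident bicycle prediction, which is the combined label set a student model would be trained on.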
To achieve the best performance with pseudo labeling, it is
imperative to decide this threshold properly, but the optimiza-
arXiv:2210.10221v1 [cs.CV] 19 Oct 2022