
Cloth Funnels: Canonicalized-Alignment
for Multi-Purpose Garment Manipulation
Alper Canberk 1, Cheng Chi 1, Huy Ha 1,
Benjamin Burchfiel 2, Eric Cousineau 2, Siyuan Feng 2and Shuran Song 1
clothfunnels.cs.columbia.edu
Abstract— Automating garment manipulation is challenging
due to extremely high variability in object configurations.
To reduce this intrinsic variation, we introduce the task
of “canonicalized-alignment” that simplifies downstream
applications by reducing the possible garment configurations.
This task can be considered as “cloth state funnel” that
manipulates arbitrarily configured clothing items into a
predefined deformable configuration (i.e. canonicalization) at
an appropriate rigid pose (i.e. alignment). In the end, the cloth
items will result in a compact set of structured and highly
visible configurations – which are desirable for downstream
manipulation skills. To enable this task, we propose a novel
canonicalized-alignment objective that effectively guides
learning to avoid adverse local minima during learning. Using
this objective, we learn a multi-arm, multi-primitive policy that
strategically chooses between dynamic flings and quasi-static
pick and place actions to achieve efficient canonicalized-
alignment. We evaluate this approach on a real-world ironing
and folding system that relies on this learned policy as the
common first step. Empirically, we demonstrate that our
task-agnostic canonicalized-alignment can enable even simple
manually-designed policies to work well where they were pre-
viously inadequate, thus bridging the gap between automated
non-deformable manufacturing and deformable manipulation.
I. INTRODUCTION
Why has garment manipulation proved more difficult to
automate than more typical rigid and articulated objects? We
argue that two key factors are severe self occlusion, which is
present in the large set of possible crumpled states, and the
infinite degrees of freedom inherent to clothing. As a result,
it is impractical to manually define manipulation policies that
achieve reliable manipulation — a cornerstone of current auto-
mated non-deformable manufacturing pipelines. In this work,
we explore bridging the gap between existing approaches to
automation and the challenging domain of clothing. We show
that when a robot first manipulates arbitrarily configured
clothing items into a predefined configuration (i.e. canonical-
ization) at an appropriate pose (i.e. alignment), downstream
manipulation skills work significantly more reliably.
Recently, real-world cloth manipulation has received
significant attention. Some of the earliest cloth manipulation
work explored manually designed heuristics which worked
well for specific clothing types, configurations, and tasks,
such as cloth unfolding [
1
–
4
], smoothing [
5
,
6
], folding [
2
,
7
–
9
], but their strong assumptions initial states, fiducial markers,
specialized tools, or cloth type/shape do not generalize. More
recently, learning-based approaches have shown success in
more general cloth manipulation behavior. One line of work
1Columbia University
2Toyota Research Institute
Fig. 1. Canonicalized-Alignment funnels the large space of possible cloth
configurations into a much smaller and better structured set of highly-visible
states that greatly simplifies downstream tasks such as ironing or folding.
has explored supervised-learning from human demonstrations
for smoothing [
10
] and folding [
7
], but those methods required
costly human demonstrations/annotations. Another recent line
of work employs fully self-supervised learning and has shown
success in learning to unfold [
11
] (but doesn’t generalize
to other tasks) and in tackling visual goal-conditioned
manipulation of a single square cloth instance [12].
Instead of learning arbitrary monolithic cloth manipulation
tasks, we hypothesize that it is more efficient to learn a
robust task-agnostic canonicalization and alignment policy
from which other task-specific manipulation skills may be
chained. This is because such a policy funnels unstructured
and self-occluded cloth configurations into structured states
with clearly visible key points (Fig. 1, middle), reducing the
complexity of the task-specific downstream policy, and en-
abling even simple heuristics to work with a high success rate.
To this end, we define a new “canonicalized-alignment” task
for garment manipulation, where the goal is to transform a gar-
ment from its arbitrary initial state into a canonical shape (de-
fined by its category) and align it with a particular 2D transla-
tion and rotation. The end result is a decomposition of garment
manipulation into two factorized parts. The first part funnels
a diverse set of cloth configurations into SE-2 transforms of a
small set of states with high visibility. The second part consists
of downstream behaviors which relies on kinematically
feasible transforms of structured initial configurations and
full keypoint observability to achieve high task success rates.
Our primary contribution is the introduction of
arXiv:2210.09347v1 [cs.RO] 17 Oct 2022