Learning Action Duration and Synergy in Task Planning for
Human-Robot Collaboration
Samuele Sandrini1, Marco Faroni1, Nicola Pedrocchi1
Abstract—A good estimation of the actions’ cost is key in
task planning for human-robot collaboration. The duration of
an action depends on agents’ capabilities and the correlation
between actions performed simultaneously by the human and
the robot. This paper proposes an approach to learning actions’
costs and coupling between actions executed concurrently by
humans and robots. We leverage the information from past
executions to learn the average duration of each action and a
synergy coefficient representing the effect of an action performed
by the human on the duration of the action performed by the
robot (and vice versa). We implement the proposed method in
a simulated scenario where both agents can access the same
area simultaneously. Safety measures require the robot to slow
down when the human is close, denoting a bad synergy of tasks
operating in the same area. We show that our approach can
learn such bad couplings so that a task planner can leverage this
information to find better plans.
Index Terms—Human-Robot Interaction; Task And Motion
Planning; Task planning; Learning for task planning.
I. INTRODUCTION
Human-Robot Collaboration (HRC) often requires the system
to make decisions based on human users’ observed or predicted
behavior. In such a context, planning and allocating tasks to the
human and robot agents are two complex problems, even for
tasks composed of a few activities. Two modelling issues are
essential. First, human behavior is intrinsically unpredictable
and partially uncontrollable [
1
]. Second, the coupling between
the human and the robotic agents is characterized by a large
variability. For example, the feasibility and the duration of
an action may vary because of the interference between the
human and the robot (e.g., safety stops of the robot or even
due to path-switch in motion re-planning [2]).
In recent years, task planning and allocation problems for
HRC have been investigated. Existing works tried to model
human preferences explicitly and ergonomics into planning
[
3
]. For example, [
4
], [
5
] proposed a hierarchical agent-based
task planner, where complex tasks can be decomposed into
simpler actions. This approach can improve the collaborative
experience by considering human preferences as social costs [
6
],
[
7
]. Manufacturing-oriented works focus on process throughput
by minimizing the expected duration [
8
] or planning contingent
plans to reduce process errors [
9
], [
10
]. A promising approach
to deal with uncertainty on the task duration is timeline-based
planning [
11
]. Timeline-based planning explicitly models the
duration variability of tasks and finds plans that are robust with
This work was partially supported by ShareWork project (H2020, European
Commission – G.A. 820807).
1
STIIMA-CNR - Institute of Intelligent Industrial Tech-
nologies and Systems, National Research Council of Italy
{name.surname}@stiima.cnr.it
respect to it. Timeline-based planning was used in HRC with
pre-computed robot motions [
12
], and online motion planning
[
13
], demonstrating robust plans allow for less frequent re-
planning and shorter average task duration.
Most planning-based methods assume a guess of the action
cost (e.g., the duration) is available from a domain expert.
If this information is unreliable, the task planner reasons on
wrong assumptions so that the resulting plans become sub-
optimal or even infeasible (leading to frequent re-planning
when using a motion re-planner such as [
2
]). Moreover, the
duration guess does not consider possible couplings between
the tasks executed concurrently by the human and the robot.
Therefore, the task planner neglects each agent’s positive or
negative effects on the others. For example, if concurrent tasks
require the human and the robot to work in the same area, the
robot would probably slow down because of safety. If the task
planner knew this effect, it could favour pairs of tasks that
avoid robot safety slowdowns.
In this work, we propose a method to learn the expected
duration of a joint plan. We leverage a minimum-time formu-
lation of the task planning/allocation problem for HRC. Then,
we estimate the expected duration of each task from previous
executions and learn a synergy coefficient that represents the
effect of one task over the other. A synergy coefficient greater
than one means that the human task causes a slow down of the
robot task. We show that it is possible to cast the estimation
of the synergy coefficients into a set of linear regression
problems (one for each task). The resulting synergy coefficients
can be exploited in task planning and allocation method to
select advantageous task couplings. The proposed approach
can be used offline – to obtain a guess of the task duration
and synergies – or iteratively as the number of executions
grows to refine the initial guess over time. We demonstrate the
proposed approach in an HRC scenario where a human and a
collaborative robot shall perform a sequence of pick-and-place
operations. Experiments show that pairs of tasks that drive the
nearby agents lead to high synergy coefficients, recognizing
favorable and unfavourable pairs.
The paper is organized as follows. Section II defines the
task planning problem for HRC systems. Section III builds on
that model to formulate the task expected duration in terms of
average duration and synergy coefficients. Then, it casts the
estimation of such coefficients into a set of linear regression
problems. Section IV describes the software architecture to
collect and process the data from task executions. Section
V applies the proposed methodology to the manufacturing
example and shows that the resulting coefficients reflect the
good and bad coupling effects of robot safety stops. Finally,
arXiv:2210.11660v1 [cs.RO] 21 Oct 2022