2
real world [13], [14]. However, for quadrupeds, the previous
model-based methods which require accurate modeling of
the ball and the robot will be hard to utilize due to the
complexity of the dynamics models, while previous model-free
RL methods have not been applied to control such dynamic
legged robot for manipulation tasks.
2) Dynamic Locomotion Control for Quadrupeds: In re-
cent years, there have been considerable advances in legged
robot hardware and control algorithms that enable quadrupedal
robots to preform highly dynamic locomotion maneuvers, such
as jumping [1], [15]–[19] or running [2], [3], [20], in the real
world. One approach is to utilize an optimal control framework
with the robot’s dynamics models, which can be the robot’s
full-order models and optimized offline [15], [16], [18], or
simplified models and deployed online [17], [20]. Another
approach is to leverage model-free deep RL to train the
quadrupedal robots through trail-and-error in simulation first
and then transfer to the real robot [1]–[3], [19]. However, most
previous work only focuses on a specific dynamic locomotion
skill without attaining a more diverse repertoire of maneuvers
based on learned skills to achieve a longer horizon task, such
as jumping while tracking different swing leg trajectories to
intercept a ball.
3) Legged Robot Soccer: Developing robots that can one
day compete with humans in soccer games has been an
enduring goal in the robotics community, and a notable soccer
robot game is RoboCup [21]. Related to the goalkeeping
problem of this work, there are some efforts to develop an
intelligent goalkeeper using holonomic wheeled robots [22]–
[24]. However, most previous work only consider the robot
moving in 2D plane to intercept a ball rolling on the ground
at low speeds [22], [23]. Intercepting balls in a 3D and at
high speeds, like a flying ball with a speed up to 8 m/s, as
in this work, has not been studied in robot soccer. Legged
robots, such as humanoid robots and quadrupedal robots, are
also used in RoboCup, but most presented soccer skills by
legged robots, such as shooting [25], kicking [26], and goal-
keeping [27], are based on rule-based motion primitives due to
their challenging dynamics. Most recently, by leveraging deep
RL, a quadrupedal robot demonstrates the capacity to dribble
a soccer ball to a target at a low walking speed [28], and a
quadruped is also trained to precisely shoot a soccer ball to a
random given target while the robot is standing with a single
shooting skill [29]. However, enabling legged robots to play
soccer while performing multiple highly dynamic locomotion
skills, such as using jump and dive skills, and precise ball
manipulation has not yet been demonstrated.
B. Contributions
The core contribution of this work is the creation of an
agile and dynamic quadrupedal goalkeeper for robot soccer.
This work presents one of the first solutions that combines
both highly dynamic locomotion and precise object intercep-
tion (manipulation) on real quadrupedal robots by using a
hierarchical reinforcement learning framework. The proposed
method allows quadrupeds to track parametric trajectories
with its end-effector(s) while engaging in dynamic locomotion
maneuvers. The hierarchical framework is used to learn and
compose a diverse set of low-level locomotion skills, and to
select the most appropriate skill and motion for the robot to
intercept a flying ball. We show that our system can be used
to directly transfer dynamic maneuvers and goalkeeping skills
learned in simulation to a real quadrupedal robot, with an
87.5% successful interception rate of random shots in the real
world. We note that human soccer goalkeepers average around
a 69% save rate, [30]. Although, this is against professional
players shooting towards regulation sized goals, we hope this
paper takes us one step closer to enabling robotic soccer
players to compete with humans in the near future.
II. HIERARCHICAL RL FRAMEWORK FOR GOALKEEPING
TASK WITH MULTI-SKILLS
In this section, we introduce the Mini Cheetah robot which
is the experimental platform for this work. We also provide a
brief overview of the framework for developing goalkeeping
skills as illustrated in Fig. 2.
A. The Mini Cheetah Quadrupedal Robot
As shown in Fig. 1, Mini Cheetah [20] is a quadrupedal
robot having a weight of 9kg and height of 0.4m when it
is fully standing. It has 12 actuated motors qm∈R12 and
a6degree-of-freedoms (DoFs) floating base, representing its
translational qx,y,z (sagittal, lateral, and vertical) positions and
orientation qψ,θ,φ (roll, pitch, yaw), respectively.
B. Locomotion Skills for Goalkeeping
Inspired by human goalkeepers, we propose a collection
of skills for intercepting a ball flying to different regions of
the goal, as illustrated in Fig. 3. The main concern underlying
the design of goalkeeping locomotion skills is that the robot
needs to react very quickly, since the total timespan of a ball’s
ballistic trajectory is typically under 1sec. Therefore, from
an initial standing pose in the middle of the goal, the robot
needs to perform very dynamic maneuvers to intercept the ball.
To accomplish this, our system uses three locomotion skills:
sidestep,dive, and jump to cover different goal regions.
1) Sidestep: During a sidestep, the robot takes a quick step
in the lateral direction to intercept the ball when it is rolling
on the ground or flying toward the goal at a low attitude.
Depending on the size of the step, the robot may only need
to swing up one of its front leg while the rest can remain in
the stance phase. But for larger steps, the stance legs may also
need to leave the ground, resulting in a small sideways hop.
However, the sidestep skill may not be able to cover regions
that are farther away from the robot, such as the lower corners
of the goal or the upper regions.
2) Dive: The dive skill is based on quadrupedal jumping
behaviors [16], which allows the robot to cover a larger area
of the goal. Using the dive skill, the robot should first pitch its
body up onto the rear legs, then turn to the lateral side towards
the direction that the ball is traveling, extend its two swing legs
to reach the ball, and finally land back on its feet. This skills
enables the robot to quickly block the lower corners of the
goal. During the dive, the rear legs may or may not leave the
ground, depending on how far the robot needs to travel.