motor cortex despite the lack of motion production. After the onset of the trigger signal,
the motor cortex exhibits dynamics which produce downstream muscle activations leading
to the desired motion.
In order to combine multiple motor primitives into complex sequences, there must be
additional structure to support these longer sequences. Recent work studying the neural
dynamics of rhesus macaques performing skilled, practiced compound reaches has indi-
cated that conjunctive movement consists of separate independent chains of motor pro-
cesses sharing the same underlying neural substrate Zimnik and Churchland (2021). In
order to skillfully and smoothly execute a sequence of movements, the motor cortex needs
to simultaneously prepare for an upcoming movement while current motor commands are
being executed. This multi-tasking capability requires functional segregation preparatory
activity and execution within the underlying motor cortical circuitry. Existing research
suggests that this is implemented in biological systems by separating preparatory activity
and execution into orthogonal subspaces.
We demonstrate that a task-optimized neural network is able to implement arbitrary
motor subroutines in a flexible and reusable manner. Our model demonstrates remark-
able generalization capabilities due to its structure. Through learning, our model exhibits
emergent self-organization of its latent representation which facilitates robust production
of movement patterns. Furthermore, our model is able to continuously produce sequences
of motion interference, demonstrating properties of functional segregation exhibited in bio-
logical neural circuits.
2. Methods
Our model architecture resembles the sequence-to-sequence models used in natural language
processing Sutskever et al. (2014). The input data (from the observed “guide”) consists of
procedurally generated movements consisting of up to two straight segments, represented by
a sequence of x,y coordinates. The agent constructs an embedding by sequentially ingesting
the x,y coordinates of the guide sequence into an encoder implemented as a recurrent
neural network (RNN) consisting of continuous-time neurons. The encoder produces a single
(static) readout, or embedding, at the end of each segment. These embeddings are fed into
the motor RNN at pre-specified times by a scheduler, which modulates the embedding by
a ramping function such that motion onset is triggered by the falling edge of the ramping
signal Hennequin et al. (2014). The beginning of the ramp signal was timed such that the
embedding signal was provided 7 steps before the end of the previous segment such that
the go signal aligns with the end of the previous segment. This temporal overlap forced the
network to simultaneously process both the future motor command while completing the
current segment.
The motor RNN takes in the ramp-modulated static encoder embedding and produces
a sequence of muscle activation commands which are implemented by a realistic simula-
tion of an over-actuated two-link arm moving in two dimensional space Kalidindi et al.
(2021), Lillicrap and Scott (2013). The entire system is trained end-to-end on one or two
segment motions using stochastic gradient descent and supervised learning with additional
biologically relevant loss terms penalizing neural activity and simultaneous activation of
antagonistic muscle pairs.
2