
to train a network to learn the latent rules of the assembly
sequence and automatically generate a feasible assembly
sequence according to a design template.
The main contribution of this work can be summarized as
follows.
1) A self-collected ASP dataset using user-defined models
from LEGO Studio and feasible assembly sequences
by brute force method and manual adjustment as
ground truth;
2) A graph-transformer based framework for ASP prob-
lem, with a heterogeneous graph attention network
to encoder the models, which are decoded with the
attention mechanism to generate assembly sequences;
3) A series of experiment to evaluate the effectiveness of
the proposed framework and offer the benchmark for
this ASP problem.
II. RALATED WORK
Assembly sequence planning ASP problems are usu-
ally solved by assembly-by-disassembly planning and the
combinatorial optimization algorithm. The assembly-by-
disassembly method finds a feasible disassembly order via
motion planning [12]. Being NP-complete, it requires numer-
ous search operations even to find a feasible disassembly
path. Thus researchers compress the search space using a
tree structure, such as ”Rapidly-exploring random tree” [13]
and Expansive Vornoi tree [14]. Nevertheless, the combinato-
rial optimization algorithm always implements the heuristic
methods, such as Ant Colony optimization algorithm [15],
Gray Wolf optimization algorithm [16], and Artificial Neural
Networks [17].
ASP problems have not been extensively studied by deep
learning methods. To the best of our knowledge, the only
method related to deep learning is proposed in [18]. We
believe there is still much room for progress in this research.
LEGO model Since LEGO models are complex enough
to approach real-world design, it is common for researchers
and designers to regard the LEGO model as an abstraction
of a physical object. By studying the LEGO model, we can
increase our cognition of the real world and be inspired
to solve practical problems. For example, LEGO models
are used to approximate actual 3D shapes while ensuring
that the final products have some good properties such as
connectivity and stability [19] [20]. Furthermore, the LEGO
models abstracted from physical objects are applied to the
deep generative models for 3D shape generative tasks [10]
[11].
Besides, as an abstraction of real-world objects, the LEGO
model is a powerful tool for promoting and testing human
cognitive abilities[21]. Moreover, cognitive loads are usually
studied during LEGO assembly tasks [22].
In the above scenario, the LEGO model is abstracted
into LEGO Model Representation graph [23] [10], with a
single vertex in the graph representing a LEGO brick (regular
brick, plate, or tile) and an edge representing the linkage
between two LEGO bricks. This is consistent with real-
Fig. 2. Overview of the LEGO models.
world assembly problems. For example, furniture and cars
are abstracted to graph structures [7] [1].
Graph-transformer The transformer has became the stan-
dard architecture for a wild variety of fields. In particular,
its variants have been modified to explore the latent feature
of graph-structured data and fulfill different tasks for natural
language processing [24], biological molecular structure [25]
[26], social networks [27], and academic network [28].
In a heterogeneous graph, different types of objects are
mapped to different types of nodes, and the same is true
for edges [29] [30]. This abstraction is more appropriate for
containing more complete information. Accordingly, LEGO
models are similarly devised to heterogeneous graphs in
our data set. Different node types represent different LEGO
bricks and different edge types for different relations between
LEGO bricks.
Almost all of the tasks of the graph-transformer framework
are node classification, graph classification, link prediction,
and text generation, but barely see its application in sequence
planning. To the best of our knowledge, this is the first
attempt to utilize graph transformers in an assembly sequence
planning task.
III. LEGO-ASP DATASET
A. Data set summary
We collect 100 LEGO animal models created and up-
loaded by individual users in LEGO Studio, among which
the simplest one is composed of 3 bricks and the most
complex one is composed of 44 bricks. The median number
of brick numbers in a LEGO model is 19. As shown in Fig.
2, we present four models with different number of bricks
as examples. And the statistics of the number of bricks in a
individual LEGO model is demonstrated in the histogram.
The total number of bricks in all LEGO models is 2127,
of which 23 types of bricks are frequently used. The rest are
used only a few times or customized, but they take up a non-
negligible percentage as a whole. According to the role of
bricks, the frequently-used bricks can be roughly categorized
into the following three types: regular brick, plate, and tile.
In addition, they can also be categorized into four types
according to their shape: cubic, curve, slope, and round. At
the same time, the bricks are of different sizes. With different
combinations of these features, a large variety of bricks exist.
We present different types of LEGO bricks in Fig. 3. For
example, the first brick is a regular brick as well as round
type. The second is a customized brick, the seventh is a plate