
sign can be written as a sequence of symbols (box
markers, graphemes, and punctuation marks) and
their location on a 2-dimensional plane.
To our knowledge, this work is the first to create
automatic SLT systems that use SignWriting. Our
main contributions are as follows: (a) we propose
methods to parse (§3.3), factorize (§3.4), decode
(§4.3), and evaluate (§4.3) SignWriting sequences;
(b) we report experiments on multilingual machine
translation systems between SignWriting and spo-
ken language text (§4); (c) we demonstrate that
common techniques for low-resource MT are bene-
ficial for SignWriting translation systems (§5).
2 Background
2.1 Sign language processing (SLP)
SLP (Bragg et al.,2019;Yin et al.,2021;Moryossef
and Goldberg,2021) is an emerging subfield of
both NLP and CV, which focuses on automatic
processing and analysis of sign language content.
Prominent tasks include pose estimation from sign
language videos (Cao et al.,2017,2021;Güler
et al.,2018), gloss transcription (Mesch and Wallin,
2012;Johnston and Beuzeville,2016;Konrad et al.,
2018), sign language detection (Borg and Camilleri,
2019;Moryossef et al.,2020), sign language identi-
fication (Gebre et al.,2013;Monteiro et al.,2016),
and sign language segmentation (Bull et al.,2020;
Farag and Brock,2019;Santemiz et al.,2009).
Besides, tasks including sign language recogni-
tion (Adaloglou et al.,2021), translation, and pro-
duction involve transforming one sign language rep-
resentation to another or from/to spoken language
text, as shown in Figure 2
4
. We find that exist-
ing works cover gloss-to-text (Camgöz et al.,2018;
Yin and Read,2020) (where “text” denotes spo-
ken language text), text-to-gloss (Zhao et al.,2000;
Othman and Jemni,2012), video-to-text (Camgöz
et al.,2020b,a), pose-to-text (Ko et al.,2019), and
text-to-pose (Saunders et al.,2020a,b,c;Zelinka
and Kanis,2020;Xiao et al.,2020).
2.2 Motivation
Our work is the first to explore translation between
spoken language text and sign language content
represented in SignWriting
5
. We focus on a sign
language writing system for the following reasons:
4
In the paper, we distinguish between a phonetic “writing
system” (e.g., SignWriting) and “glosses” (lexical notation,
marking the semantics of each sign with a distinct category).
5
Related work based on HamNoSys: Morrissey (2011);
Sanaullah et al. (2021); Walsh et al. (2022)
https://raw.githubusercontent.com/sign-language-processing/sign-language-processing.github.io/eddb4ac50ffc7698d4b2b9c8c34d6397721…
https://raw.githubusercontent.com/sign-language-processing/sign-language-processing.github.io/eddb4ac50ffc7698d4b2b9c8c34d63977211602c/src/assets/tasks/…
Video Text
Pose Glosses
Writing System
Figure 2: SLP tasks. Every edge on the left side rep-
resents a task in CV (language-agnostic). Every edge
on the right side represents a task in NLP (language-
specific). Every edge crossing both sides represents a
task requiring a combination of CV and NLP. Figure
taken from Moryossef and Goldberg (2021).
(a) currently an end-to-end (video-to-text/text-to-
video) approach is not feasible. State-of-the-art
systems either have a BLEU score lower than 1
(Müller et al.,2022a) or work only on a very nar-
row linguistic domain, e.g., Camgöz et al. (2020b,a)
work on the RWTH-PHOENIX-Weather T data set
which covers only 1,231 unique signs from weather
reports (less than what we use in Table 2); (b) a
writing system is lower-dimensional than videos
(not all parts of a video are relevant in a linguistic
sense), while adequate to encode information of
signs; (c) written sign language is a closer fit to
current MT pipelines than videos or poses; (d) a
phonetic writing system is a more universal solu-
tion than glossing since glosses are semantic and
therefore language-specific, and are an inadequate
representation of meaning (Müller et al.,2022b).
2.3 SignWriting, FSW, and SWU
SignWriting (Sutton,1990) is a featural and vi-
sually iconic sign language writing system (intro-
duced extensively in Appendix A). Previous work
explored recognition (Stiehl et al.,2015) and ani-
mation (Bouzid and Jemni,2013) of SignWriting.
SignWriting has two computerized specifica-
tions, Formal SignWriting in ASCII (FSW) and
SignWriting in Unicode (SWU). SignWriting is
two-dimensional, but FSW and SWU are written
linearly, similar to spoken languages. Figure 3
gives an example of the relationship between Sign-
Writing, FSW, and SWU
6
. We use FSW in our
research instead of SWU to explore the potential
of factorizing SignWriting symbols and utilizing
numerical values of their position (§3.3, §3.4).
6
Online demonstration:
https://slevinski.github.
io/SuttonSignWriting/characters/index.html.