has also helped to substantially improve re-
sults (Kojima et al., 2022; Wang et al., 2023;
Gatto, Sharif, and Preum, 2023).
Despite their high capabilities, the appli-
cation of LLMs still faces several challenges,
such as dealing with cases of implicit stance
or avoiding hallucinations, even when em-
ploying advanced prompting strategies such
as CoT reasoning (Gatto, Sharif, and Preum,
2023). To address these limitations, Stance
Reasoner (Taranukhin, Shwartz, and Milios,
2024) improves the CoT method by includ-
ing examples and reasoning as background
knowledge to achieve generalizable predic-
tions across different targets. However, these
approaches are only focused on English.
Additionally, most stance detection re-
search and datasets released do not in-
clude interaction data, despite being col-
lected from social media sources such as
Twitter. K¨u¸c¨uk and Can (2020) lists stance-
annotated datasets for 11 languages, whereas
recent work on cross-domain and cross-
lingual stance provide experimentation for
16 datasets and 15 languages (Hardalov et
al., 2021; Hardalov et al., 2022). The fo-
cus, however, remains on the textual con-
tent of the tweets. This trend has recently
changed with the release of, to the best of our
knowledge, two datasets which, in addition to
the stance labeled tweets, include interaction
data such as retweets and friends: SardiS-
tance (Cignarella et al., 2020) and VaxxS-
tance (Agerri et al., 2021).
The winner (Espinosa et al., 2020) of
the SardiStance shared task (Cignarella et
al., 2020) used a weighted voting ensem-
ble that combined two inputs: (a) psycho-
logical, sentiment and friends distances as
features used to learn an XGBoost (Fried-
man, 2001) model, with (b) text classifiers
based on the Transformer architecture (De-
vlin et al., 2019). Other systems combined
textual data (emoticons, special characters,
and word embeddings) with 2 dimensions ex-
tracted from the interactions distance ma-
trix using Multidimensional Scaling (MDS)
(Ferraccioli et al., 2020), or friendship-based
graphs created with DeepWalk (Perozzi, Al-
Rfou, and Skiena, 2014) and various types of
textual embeddings (Alkhalifa and Zubiaga,
2020).
The VaxxStance shared task (Agerri et al.,
2021) provided textual and interaction data
(friends and retweets) to study stance detec-
tion on vaccines in Basque and Spanish. The
one system that systematically outperformed
the baselines (Lai et al., 2021) manually en-
gineered a large number of features based on
stylistic, tweet, and user data, lexicons, de-
pendency parsing, and network information,
which were specifically developed for these
datasets and languages.
The most recent approaches tackling un-
supervised stance detection using social me-
dia interactions as features use the force-
directed algorithm (Fruchterman and Rein-
gold, 1991) or UMAP (McInnes et al.,
2018). These algorithms transform inter-
action frequency vectors into features, re-
ducing huge interaction matrices into low-
dimensional features. Darwish et al. (2020)
use both the force-directed algorithm and
UMAP for unsupervised stance detection of
Twitter users. UMAP is also used to get
interaction-based features for automatically
tagging Twitter users’ stance (Stefanov et al.,
2020) and to explore political polarization in
Turkey (Rashed et al., 2021).
Other works are based on node2vec
(Grover and Leskovec, 2016) for user pro-
filing and extracting user features for abuse
detection (Mishra et al., 2018) and also for
sentiment, stance and hate speech detection
(Del Tredici et al., 2019). Commonly used al-
gorithms for building interaction-based mod-
els like DeepWalk (Perozzi, Al-Rfou, and
Skiena, 2014) and node2vec are based on gen-
erating Random Walks. However, those ran-
domly generated walks create artificial inter-
actions that may not occur in the gathered
interaction pairs. Furthermore, selecting the
structure of the random walks and deciding
the number of context users to be predicted
needs to be manually modeled and adapted.
In contrast to previous work based on
in-context learning with LLMs, supervised
text classification or interaction-based meth-
ods such as DeepWalk or node2vec, our Re-
lation Embeddings method provides dense
interaction-based representations of users, fo-
cusing on real interaction pairs. The training
process is designed to predict a target user
receiving a retweet or a follow from a source
user, each instance an item-to-item predic-
tion instead of context-to-item (CBOW) or
item-to-context (Skip-gram) prediction. Ad-
ditionally, we focus on all the interaction
pairs, without generating artificial random
interactions to train the model or manually