
ARTIFICIAL ASMR: A CYBER-PSYCHOLOGICAL APPROACH
Zexin Fang⋆Bin Han⋆C. Clark Cao†Hans D. Schotten⋆‡
⋆RPTU Kaiserslautern-Landau, †Lingnan University, ‡German Research Center of Artificial Intelligence
ABSTRACT
The popularity of Autonomous Sensory Meridian Response
(ASMR) has skyrockted over the past decade, but scientific
studies on what exactly triggered ASMR effect remain few
and immature, one most commonly acknowledged trigger is
that ASMR clips typically provide rich semantic information.
With our attention caught by the common acoustic patterns
in ASMR audios, we investigate the correlation between the
cyclic features of audio signals and their effectiveness in trig-
gering ASMR effects. A cyber-psychological approach that
combines signal processing, artificial intelligence, and exper-
imental psychology is taken, with which we are able to quan-
tize ASMR-related acoustic features, and therewith synthe-
size ASMR clips with random cyclic patterns but not deliver-
ing identifiably scenarios to the audience, which were proven
to be effective in triggering ASMR effects.
Index Terms—ASMR, auditory, cyclostationary, GAN
1. INTRODUCTION
Autonomous Sensory Meridian Response (ASMR), a term
coined in the 2010s, is widely used to describe an intrigu-
ing phenomenon in which specific visual and auditory stimuli
trigger tingling sensations accompanied by positive emotions
as well as a feeling of deep relaxation [1]. With its bloom-
ing cultural popularity and growing commercial market [2],
ASMR has attracted emerging research interest [3], and its
cognitive effect has been verified by significant behavioral
and neurological evidences [4,5]. However, to the best of our
knowledge, existing work has only identified some semantic
elements that trigger the ASMR effect [6], while the acous-
tic features of auditory triggers remain poorly understood.
Though there is a generic claim that low-frequency sounds are
widely observed from effectively triggering ASMR audios, it
does not capture the repetitive characteristic.
This work is supported in part by the German Federal Min-
istry of Education and Research within the project Open6GHub
(16KISK003K/16KISK004), in part by the European Commission within the
Horizon Europe project Hexa-X (101015956), in part by the Network for
the Promotion of Young Scientists at RPTU Kaiserslautern-Landau within
the project A-SIREN (Individual Research Funding 2022-2), and in part by
the Lam Woo Research Fund at Lingnan University (F871223). B. Han
(bin.han@rptu.de) is the corresponding author.
Inspired by a study that reveals the correlation between
the trypophobia-triggering effect and cyclostationary features
of images [7], we suspect that the time-frequency and cyclic
features of audios may also play an important role in the
triggering of ASMR experience. In this paper, we prove this
correlation in a cyber-psychological approach that combines
various techniques of signal processing, artificial intelligence,
and experimental psychology. More specifically, we apply
short-time Fourier transform (STFT) and cyclic spectral anal-
ysis on recorded ASMR audio clips to extract their acoustic
features. Combining generative advertisal networks (GANs)
and a post-processing module, we are able to synthesize
ASMR clips based on generated cyclic patterns. We also
design a psychological survey to evaluate the effectiveness of
both the recorded and synthesized ASMR audios in triggering
ASMR effect on humans, and verify the correlation between
this effect and the selected acoustic features.
2. AUDIO DATA PROVISIONING
As the object of our study, we obtained off-the-shelf ASMR
audios from non-commercial online open sources. We col-
lected four audios recorded from real sounds of different ori-
gins, including: i) breathing, ii) mixing soft cream, iii) puff-
ing a spray, and iv) clicking a keyboard. Every audio was
recorded 16 bit stereo at the sampling rate of 22.05 kHz, last-
ing about 1 h. We intentionally selected these four sound
types as our experiment objects because they are largely dis-
tinct in cyclic and spectral features, for instance, clicking a
keyboard clips generally have dense cyclic patterns and more
high frequency components, meanwhile breathing clips man-
ifests sparse cyclic patterns with low pass characteristic.
3. ANALYSIS AND FEATURE EXTRACTION
Spectrograms of the provisioned ASMR clips generally ex-
hibit a low-pass characteristic and cyclic patterns of borad-
band bursts, as exampled in Fig. 1. To further investigate the
periodic patterns within the PSD of an audio signal x, the
spectral correlation density (SCD) and cyclic coherence func-
tion (CCF) can be applied as suggested by [8]:
arXiv:2210.14321v3 [eess.AS] 5 Jul 2023