Image Segmentation Semantic Communication over
Internet of Vehicles
Qiang Pan, Haonan Tong, Jie Lv, Tao Luo, Zhilong Zhang,
Changchuan Yin, and Jianfeng Li
Beijing Laboratory of Advanced Information Network,
Beijing Key Laboratory of Network System Architecture and Convergence,
Beijing University of Posts and Telecommunications, Beijing, China 100876.
Emails: {pq569375378, hntong, lvj, tluo, ccyin, lijf}@bupt.edu.cn, zhilong.zhang@outlook.com
Abstract—In this paper, the problem of semantic-based efficient
image transmission is studied over the Internet of Vehicles (IoV).
In the considered model, a vehicle shares massive amount of
visual data perceived by its visual sensors to assist other vehicles
in making driving decisions. However, it is hard to maintain a
high reliable visual data transmission due to the limited spectrum
resources. To tackle this problem, a semantic communication
approach is introduced to reduce the transmission data amount
while ensuring the semantic-level accuracy. Particularly, an image
segmentation semantic communication (ISSC) system is proposed,
which can extract the semantic features from the perceived images
and transmit the features to the receiving vehicle that reconstructs
the image segmentations. The ISSC system consists of an encoder
and a decoder at the transmitter and the receiver, respectively.
To accurately extract the image semantic features, the ISSC
system encoder employs a Swin Transformer based multi-scale
semantic feature extractor. Then, to resist the wireless noise and
reconstruct the image segmentation, a semantic feature decoder
and a reconstructor are designed at the receiver. Simulation results
show that the proposed ISSC system can reconstruct the image
segmentation accurately with a high compression ratio, and can
achieve robust transmission performance against channel noise,
especially at the low signal-to-noise ratio (SNR). In terms of mean
Intersection over Union (mIoU), the ISSC system can achieve
an increase by 75%, compared to the baselines using traditional
coding methods.
Index Terms—Image segmentation, semantic communication,
Swin Transformer.
I. INTRODUCTION
In the emerging Internet of Vehicles (IoV), there is a massive
amount of visual information required to be transmitted among
the vehicles to assist driving decisions, which consumes a
certain amount of bandwidth resources [1], [2]. However,
due to the restricted spectrum resources and the complicated
communication conditions in the traffic environment, it is
difficult to maintain a reliable connection to transmit large
amounts of visual data. To this end, semantic communication
[3], [4], with transmitting the data at a semantic level, rather
than the symbol level, is becoming a viable solution to solve
the above challenges. In current IoV system, vehicles mainly
use visual data for locating and obstacle avoidance, it is
available for the receiving vehicle to only reconstruct the image
segmentation for making driving decisions. According to our
research, although semantic communication can reduce the data
transmission amount thus reducing the occupied spectrum, the
image segmentation semantic communication (ISSC) system
has not been well designed.
Existing works [5]–[9] have systematically studied semantic
communication. In [5], the authors introduced a preliminary
theory of semantic information based on logical probabilistic
ranking. The work in [6] presented a model theory based tech-
nique for semantic data compression and trustworthy seman-
tic communication with quantitative measurements. Besides,
the authors in [7] modeled the communication problem as a
Bayesian game to minimize the end-to-end average semantic
metric in a dynamic communication scenario. Furthermore,
the work in [8] developed a deep learning based semantic
communication system for text transmission, which recovered
the meaning of sentences, rather than the transmitted bits
or symbols. Recently, a semantic communication system for
transmitting audio in the Internet of Things (IoT) is proposed,
and federated learning is employed to improve the precision of
semantic data extraction [9]. However, these aforementioned
works did not explore image semantic communication, and
image semantic communication is challenging due to the com-
plexity of image structure.
Along with the development of deep learning, the prior arts
have investigated image semantic communication systems [10]–
[13]. In [10], the authors proposed an end-to-end joint source-
channel coding (JSCC) image system on the structure of au-
toencoder, which provided a smooth performance degradation
with the decrease of signal-to-noise ratio (SNR). Based on
JSCC, the work in [11] exploited the channel feedback for
image transmission, and provided considerable improvements
in terms of the end-to-end image reconstruction quality. Be-
sides, the work in [12] developed a framework for semantic
communication with artificial intelligence tasks and built a
system for inspecting surface defects of workpieces, which
realized a visual semantic communication. Moreover, The au-
thors in [13] extended the semantic communication for image
classification tasks to UAV aerial photography and achieved the
tradeoff between transmission delay and classification accuracy.
However, the works in [10]–[13] were mainly concerned with
how to reconstruct or classify the images accurately but did not
propose a system to reconstruct the image segmentation from
semantic features at the receiver directly.
In this paper, a novel ISSC system is proposed for the IoV.
arXiv:2210.05321v1 [cs.NI] 11 Oct 2022