2
NNs, which can be viewed as controlled weighted sums, the
equivalent weights in MIMO systems are determined by the
channel matrices, which are determined by the environment
and hence uncontrollable. To control the equivalent weight in
such systems, the typical precoding and combining operations
in a MIMO system can be properly exploited. Both procedures
are controllable linear transformations on the signals and are
inherent in MIMO systems. As a result, we can control the
parameters in the equivalent weighted sum of the overall
system by controlling the precoding and combining matrices.
However, implementing the MIMO OAC in split ML still
faces two fundamental issues: the forward and backward
channels are different and may not be accurately known, and
the analog transmission in OAC results in unavoidable noise.
To deal with the uncertainty issue of the MIMO channel, we
find that the forward-backward propagation of a NN and the
channel reciprocity of a wireless channel are mathematically
related (see Section II-B for details). This can be exploited to
deploy a NN through MIMO-based OAC, which can still lead
to correct gradients even without any prior knowledge about
the MIMO channel as long as the channel reciprocity and
quasi-stability are assumed. Moreover, most NNs can work
well after proper training, even when the intermediate results
cannot be fully and accurately interpreted. Such unexplainably
in NNs inspires us that casting a deterministic linear transfor-
mation, i.e., multiplying an implicit matrix determined by the
MIMO channel on the intermediate results of a NN through the
whole training and test process, may not intensively deteriorate
the performance.
The other issue about the noise in wireless communication
is not fatal in NNs. From the perspective of information theory,
both digital and analog communication can be optimal in wire-
less communications such as sensor networks [6]. However,
the digital communication system can reduce transmission
errors by employing error-detecting and correcting codes,
whereas transmission errors can only be restricted but never
eliminated in analog communication. Moreover, it is found that
noise is tolerable and sometimes even becomes a training trick
in NNs [7], which can also be viewed as implicit dropout [8].
Since the devices transmit unexplainable, intermediate results
of the NN in split ML systems, the advantage of progressive
error-free is insignificant in such applications. For instance,
Jankowski et al. [2] show that analog communication performs
better than digital communication in the deep learning-based
JSCC problem, which is a particular case of split ML.
B. Related Works
Split ML is a method where multiple computation nodes
cooperatively execute an ML application. In such a system,
the ML model is split into multiple parts allocated to different
computation nodes. Each node executes a part of the ML
model in order and transmits the intermediate results to the
next one. When training the NN, the nodes also execute
backward propagation in backward order. Most of the existing
works [9]–[11] on split ML focus on how to distribute the
model on the nodes in a way to minimize the total communi-
cation and computation delay. There are also some other works
bringing up specialized NN structures for split ML [12]–[14].
Recent work also tries to deploy a proper NN architecture
on a given communication network [15]. Their framework
utilizes the neural architecture search method to meet latency
and accurate requirements. However, the above works [9]–[15]
all consider ideal communication among the computing nodes,
which ignores the communication scheme design.
On the other hand, in communication systems, OAC is
usually deployed in multiple access systems to compute the
weighted sum or some easy mathematical operations such
as geometric mean, polynomial, and Euclidean norm [4].
Hence most OAC works are mainly used in FL, where the
weighted sum is widely deployed. For example, the authors
in [5] optimize the number of simultaneous accesses, which
may improve the efficiency of FL. Besides, Zhu et al. apply
broadband analog aggregation to improve OAC in FL with
multiple bands [16], while Shao et al. consider FL with
misaligned OAC [17].
OAC for multiple access systems still has significant draw-
backs. Firstly, OAC requires strict synchronization among all
transmitters, which is hard to realize. Moreover, OAC does not
support backward communication, which is rarely considered
in previous works as far as we know. Furthermore, the above
works [4], [5], [16], [17] do not consider the MIMO system,
which is widely used in practical scenarios. The multiple
antennas of the transmitter in a MIMO system can be viewed
as a group of transmitters in the multiple access scheme, which
overcomes the drawback of synchronization. The authors of
[18] apply MIMO OAC to multimodal sensing. However, in
[18], the scenario is still a multiple access system where
all channels are MIMO channels, and the task of OAC is
still the weighted sum, which can be regarded as a direct
expansion of previous designs in [4], [5], [16], [17]. Besides,
the implementation of [18] is also strictly limited to FL
applications, which is not suitable for split ML.
In other fields, OAC has also provided an alternative to
traditional NNs by realizing parts of NNs with acoustic [19],
optical [20], and radio frequency [21] signals. The authors in
[19]–[21] use the characteristics of the target systems similar
to NNs, and employ the target systems as part of a NN. OAC
systems calculate aggregated results of multiple inputs from
different transmitters or time slots by moderating the envi-
ronments or some parts of the system. Recently, in wireless
communications, Sanchez et al. also realize NNs with the
help of multiple paths, and reconfigurable intelligent surfaces
[22]. Their system transmits the intermediate output of NNs
via time-sequential signals and uses the delay of multiple
paths to realize one-dimensional convolution. However, in this
paper, we use MIMO channels to realize fully connected layers
through multiplexed signals.
C. Contributions
The main contributions of this paper are summarized as
follows:
•A split ML framework is proposed for wireless MIMO
networks by exploiting the MIMO’s OAC capability,
which not only enables high-throughput and efficient