Actor-Critic Network for O-RAN Resource
Allocation: xApp Design, Deployment, and Analysis
Mohammadreza Kouchaki and Vuk Marojevic
Electrical and Computer Engineering, Mississippi State University, USA
mk1682@msstate.edu, vm602@msstate.edu
Abstract—Open Radio Access Network (O-RAN) has intro-
duced an emerging RAN architecture that enables openness,
intelligence, and automated control. The RAN Intelligent Con-
troller (RIC) provides the platform to design and deploy RAN
controllers. xApps are the applications which will take this
responsibility by leveraging machine learning (ML) algorithms
and acting in near-real time. Despite the opportunities provided
by this new architecture, the progress of practical artificial intel-
ligence (AI)-based solutions for network control and automation
has been slow. This is mostly because of the lack of an end-
to-end solution for designing, deploying, and testing AI-based
xApps fully executable in real O-RAN network. In this paper we
introduce an end-to-end O-RAN design and evaluation procedure
and provide a detailed discussion of developing a Reinforcement
Learning (RL) based xApp by using two different RL approaches
and considering the latest released O-RAN architecture and
interfaces.
Index Terms—O-RAN, Near Real-Time RIC, xApp, Resource
Allocation, Reinforcement Learning, Actor-Critic, AI.
I. INTRODUCTION
Telecommunication networks will soon provide wireless
connectivity and facilitate tiny and huge data transactions
among 10s of billions of smart devices. Resource, data,
and network management is becoming more challenging and
artificial intelligence (AI) solutions are being researched for
facilitating future wireless network operations. The major ob-
stacles are the resource restrictions and the lack of a proficient
platform to handle AI solutions completely independently
from the network hardware, decreasing the cost of changes to
new third-party software solutions [1]. The hardware, software,
and interfaces of traditional radio access networks (RANs) are
tightly coupled. Recent advancements of RAN technology can
help breaking such closed designs and vendor monopoly [2].
A new architecture introduced by the Open-RAN (O-RAN)
Alliance can bring this idea into reality and change the future
of RAN deployment, operation, and maintenance [3].
The O-RAN Alliance defines specifications to facilitate AI
integration and allow machines and software to function intel-
ligently in a cellular network. The O-RAN architecture enables
intelligence and openness by providing an infrastructure for
integrating RANs on open hardware with embedded AI-based
software [4]. This architecture supports the Third Generation
Partnership Project (3GPP) and other industry standards. 3GPP
defines the radio and network protocols for the user equipment,
RAN, and core network. O-RAN leverages those and the Radio
Unit (RU), Distributed Unit (DU), and Centralized Unit (CU)
that 3GPP defines and specifies particular RAN functional
splits and open interfaces facilitating practical disaggregation
of functionalities and integration from different vendors. O-
RAN also introduces new architectural components: the Near-
Real Time (RT) RAN Intelligent Controller (RIC), the Non-RT
RIC, and additional interfaces which pave the way for insert-
ing intelligent network control and optimization applications
called xApps [5].
The challenges for developing xApps and deploying them
on real networks include: finding the most efficient AI models
suitable for very large real-world networks, adopting the most
proficient network parameters, and testing the AI models in an
environment that accurately represents the behavior of real-
world networks. I light of these concerns, this paper details
the development and testing flow of a reinforcement learning
(RL) based xApp based on O-RAN architecture. We discuss
a O-RAN architectural components, interfaces, and workflow
to design an xApp. We investigate and simulate different AI
solutions to analyze the performance of various RL methods
for designing the xApp.
The rest of this paper is organized as follow: Section II
introduces the O-RAN architecture and the main components
that are relevant to the AI controller design. Section III
presents the related work. Section IV discusses the design
procedure and the challenges for developing xApps. Section
V analyzes different AI models to select the most efficient
solution. Sections VI describes the xApp development and
Section VII the deployment and results. Section VIII draws
the conclusions.
II. O-RAN ARCHITECTURE AND KEY COMPONENTS FOR
XAPP DEVELOPMENT
The network architecture needs to provide a platform for
deploying AI/ML-based applications and provide the required
infrastructure for data transactions from the RAN nodes to the
AI model, data storage, transmission of the model decision and
control commands to the network, and the AI model training
process. The O-RAN architecture shown in Fig. 1 is based on
open interfaces to enable interactions between the RAN and
the RAN controller. The RAN is split into three logical units:
CU, DU, and RU. The CU is a centralized unit developed
to handle the higher layer RAN protocols, such as the radio
resource control (RRC), the service data adaptation protocol
(SDAP), and the packet data convergence protocol (PDCP).
It interfaces with the DUs through the mid-haul. The DU is
a logical node that handles the lower protocol layers, which
arXiv:2210.04604v1 [cs.NI] 26 Sep 2022