UTAR Institutional Repository

Development of deep reinforcement learning based resource allocation techniques in cloud radio access network

Amjad, Iqbal (2022) Development of deep reinforcement learning based resource allocation techniques in cloud radio access network. PhD thesis, UTAR.

[img]
Preview
PDF
Download (2752Kb) | Preview

    Abstract

    Nextgeneration networks are envisioned to support management to maximize the user s ’ dynamic and agile network quality of service (QoS). Cloud radio access network (CRAN) emerges as a promising candidate since the limited network resources can be virtualized and shared among distributed remote radio heads (RRHs). Conventional approaches formulate resource allocation as an optimization problem and solve it with instantaneous environment knowledge without considering the consequences of actions. A step towards long network performance optimization is theterm use of deep reinforcement learning (DRL), which can learn the best policy via interaction with the environment. This thesis proposes three DRLbased resource allocation algorithms that optimize the CRAN performance in terms of energy efficiency (EE), spec tral efficiency (SE), and total power consumption. The first proposed algorithm aims to optimize the EE by controlling the on/off status of RRH via a deep Q network (DQN) and subsequently solving a power optimization problem. To capture the spatiotemporal channel state information (CSI), the second proposed algorithm adopts machine learning techniques with anchor graph hashing to extract generalized features before feeding them into the DQN. The goal here is to optimize the longterm tradeoff between EE and SE. In the last proposed scheme, additional EE savings are facilitated by designing and integrating a convolution al neural network (CNN), which can better learn the feature of environment states. Simulation results show that all proposed DRL algorithms outperform 2025% compared to existing techniques while achieving faster convergence. All performance benchmarking was carried out based on 100 testing episodes after properly training the DRL agent with 1000 episodes.

    Item Type: Final Year Project / Dissertation / Thesis (PhD thesis)
    Subjects: T Technology > T Technology (General)
    Divisions: Institute of Postgraduate Studies & Research > Lee Kong Chian Faculty of Engineering and Science (LKCFES) - Sg. Long Campus > Doctor of Philosophy in Engineering
    Depositing User: Sg Long Library
    Date Deposited: 26 Aug 2022 02:31
    Last Modified: 29 Dec 2022 22:19
    URI: http://eprints.utar.edu.my/id/eprint/4606

    Actions (login required)

    View Item