Transfer restless multi-armed bandit policy for energy-efficient heterogeneous cellular network
Abstract This paper proposes a learning policy to improve the energy efficiency (EE) of heterogeneous cellular networks.The combination of active and inactive base stations (BS) that allows for maximizing EE is identified as a combinatorial learning problem and requires high computational complexity as well as a large signaling overhead.This paper