TY - GEN
T1 - Deep Reinforcement Learning for Time Scheduling in RF-Powered Backscatter Cognitive Radio Networks
AU - Anh, Tran The
AU - Luong, Nguyen Cong
AU - Niyato, Dusit
AU - Liang, Ying Chang
AU - Kim, Dong In
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/4
Y1 - 2019/4
N2 - In an RF-powered backscatter cognitive radio network, multiple secondary users communicate with a secondary gateway by backscattering or harvesting energy and actively transmitting their data depending on the primary channel state. To coordinate the transmission of multiple secondary transmitters, the secondary gateway needs to schedule the backscattering time, energy harvesting time, and transmission time among them. However, under the dynamics of the primary channel and the uncertainty of the energy state of the secondary transmitters, it is challenging for the gateway to find a time scheduling mechanism which maximizes the total throughput. In this paper, we propose to use the deep reinforcement learning algorithm to derive an optimal time scheduling policy for the gateway. Specifically, to deal with the problem with large state and action spaces, we adopt a Double Deep-Q Network (DDQN) that enables the gateway to learn the optimal policy. The simulation results clearly show that the proposed deep reinforcement learning algorithm outperforms non-learning schemes in terms of network throughput.
AB - In an RF-powered backscatter cognitive radio network, multiple secondary users communicate with a secondary gateway by backscattering or harvesting energy and actively transmitting their data depending on the primary channel state. To coordinate the transmission of multiple secondary transmitters, the secondary gateway needs to schedule the backscattering time, energy harvesting time, and transmission time among them. However, under the dynamics of the primary channel and the uncertainty of the energy state of the secondary transmitters, it is challenging for the gateway to find a time scheduling mechanism which maximizes the total throughput. In this paper, we propose to use the deep reinforcement learning algorithm to derive an optimal time scheduling policy for the gateway. Specifically, to deal with the problem with large state and action spaces, we adopt a Double Deep-Q Network (DDQN) that enables the gateway to learn the optimal policy. The simulation results clearly show that the proposed deep reinforcement learning algorithm outperforms non-learning schemes in terms of network throughput.
KW - ambient backscatter
KW - Cognitive radio networks
KW - deep reinforcement learning
KW - RF energy harvesting
KW - time scheduling
UR - https://www.scopus.com/pages/publications/85074770874
U2 - 10.1109/WCNC.2019.8885426
DO - 10.1109/WCNC.2019.8885426
M3 - Conference contribution
AN - SCOPUS:85074770874
T3 - IEEE Wireless Communications and Networking Conference, WCNC
BT - 2019 IEEE Wireless Communications and Networking Conference, WCNC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 IEEE Wireless Communications and Networking Conference, WCNC 2019
Y2 - 15 April 2019 through 19 April 2019
ER -