Abstract
In this letter, to improve data rate over wireless communication channels, we propose a deep Q network (DQN)-based adaptive modulation scheme by using Markov decision process (MDP) model. The proposed algorithm makes the reinforcement learning agent to select rate region boundaries as the states, which divide signal-to-noise ratio (SNR) range into rate regions. The simulation results show that spectral efficiency can be improved on the average by 0.5395 bps/Hz in wide SNR range.
| Original language | English |
|---|---|
| Article number | 9025237 |
| Pages (from-to) | 1289-1293 |
| Number of pages | 5 |
| Journal | IEEE Communications Letters |
| Volume | 24 |
| Issue number | 6 |
| DOIs | |
| State | Published - Jun 2020 |
| Externally published | Yes |
Keywords
- Adaptive modulation
- deep learning
- deep Q network
- reinforcement learning