TY - JOUR
T1 - Area-Selective Deep Reinforcement Learning Scheme for Wireless Localization
AU - Sun, Young Ghyu
AU - Kim, Soo Hyun
AU - Kim, Dong In
AU - Kim, Jin Young
N1 - Publisher Copyright:
Copyright © 2025 The Institute of Electronics, Information and Communication Engineers.
PY - 2025/6
Y1 - 2025/6
N2 - In this paper, an area-selective deep reinforcement learning scheme is proposed to achieve high-quality wireless localization. The conventional localization schemes based on deep learning face several challenges that are labor-intensive for data collection and labeling, lack of adaptability to large-scale, etc. To address these issues, the proposed scheme incorporates deep reinforcement learning (DRL) with a reward-setting mechanism The localization problem is modeled as a dynamic decision process to leverage the capabilities of DRL. The device location is determined by an iterative decision-making procedure, which is an area-selective process. The proposed scheme consists of two distinct modes, namely train and deployment modes. During the train mode, an agent learns the optimal actions for a given environment. The learned agent estimates the position of device in deployment mode. Simulations were conducted to demonstrate the advantages of the proposed scheme and the results showed that it offers better localization performance, adaptability, and time complexity than conventional schemes.
AB - In this paper, an area-selective deep reinforcement learning scheme is proposed to achieve high-quality wireless localization. The conventional localization schemes based on deep learning face several challenges that are labor-intensive for data collection and labeling, lack of adaptability to large-scale, etc. To address these issues, the proposed scheme incorporates deep reinforcement learning (DRL) with a reward-setting mechanism The localization problem is modeled as a dynamic decision process to leverage the capabilities of DRL. The device location is determined by an iterative decision-making procedure, which is an area-selective process. The proposed scheme consists of two distinct modes, namely train and deployment modes. During the train mode, an agent learns the optimal actions for a given environment. The learned agent estimates the position of device in deployment mode. Simulations were conducted to demonstrate the advantages of the proposed scheme and the results showed that it offers better localization performance, adaptability, and time complexity than conventional schemes.
KW - area-selective scheme
KW - deep learning
KW - reinforcement learning
KW - wireless localization
UR - https://www.scopus.com/pages/publications/105008407961
U2 - 10.1587/transfun.2024EAL2069
DO - 10.1587/transfun.2024EAL2069
M3 - Article
AN - SCOPUS:105008407961
SN - 0916-8508
VL - E108.A
SP - 883
EP - 887
JO - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
JF - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
IS - 6
ER -