TY - GEN
T1 - DLPNet
T2 - 4th International Conference on Artificial Intelligence and Pattern Recognition, AIPR 2021
AU - Kang, Junhyung
AU - Woo, Simon S.
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/9/24
Y1 - 2021/9/24
N2 - Object Detection on aerial imagery is a challenging task due to the following unique characteristics of the aerial imagery data: the large image size and the massive volume of data. Since the original image size from remote sensing sources is generally more colossal than the typical natural images, training detection models with a small mini-batch size are expected for the many practical, real-world applications. Furthermore, using a small mini-batch size enable the model to have better generalization performance. However, it causes an unstable learning process due to high gradient noise. In this case, reducing the learning rate enable the model to train reliably. However, a small learning rate can cause slow learning and convergence to the local minimum point also. Therefore, in this work, we propose a novel method, DLPNet, to enable robust and stable training with a small mini-batch size and various learning rates. Our model composes of an object detection model and a reinforcement learning agent. Our reinforcement learning agent extracts features from training mini-batch data and determines optimal parameters to the loss function of the object detection model. This dynamic loss function with adaptive parameters can achieve a more robust and stable learning process than the original baseline model. We demonstrate the effectiveness of our approach with a challenging object detection dataset, DOTA-v2.0. In addition, we release our code for reproducibility and to promote further research in this area1.
AB - Object Detection on aerial imagery is a challenging task due to the following unique characteristics of the aerial imagery data: the large image size and the massive volume of data. Since the original image size from remote sensing sources is generally more colossal than the typical natural images, training detection models with a small mini-batch size are expected for the many practical, real-world applications. Furthermore, using a small mini-batch size enable the model to have better generalization performance. However, it causes an unstable learning process due to high gradient noise. In this case, reducing the learning rate enable the model to train reliably. However, a small learning rate can cause slow learning and convergence to the local minimum point also. Therefore, in this work, we propose a novel method, DLPNet, to enable robust and stable training with a small mini-batch size and various learning rates. Our model composes of an object detection model and a reinforcement learning agent. Our reinforcement learning agent extracts features from training mini-batch data and determines optimal parameters to the loss function of the object detection model. This dynamic loss function with adaptive parameters can achieve a more robust and stable learning process than the original baseline model. We demonstrate the effectiveness of our approach with a challenging object detection dataset, DOTA-v2.0. In addition, we release our code for reproducibility and to promote further research in this area1.
KW - Aerial imagery
KW - Object detection
KW - Parameter optimization
KW - Reinforcement learning
UR - https://www.scopus.com/pages/publications/85125857020
U2 - 10.1145/3488933.3489031
DO - 10.1145/3488933.3489031
M3 - Conference contribution
AN - SCOPUS:85125857020
T3 - ACM International Conference Proceeding Series
SP - 191
EP - 198
BT - AIPR 2021 - 2021 4th International Conference on Artificial Intelligence and Pattern Recognition
PB - Association for Computing Machinery
Y2 - 17 September 2021 through 19 September 2021
ER -