UAV first view landmark localization with active reinforcement learning

Wang, Xinran and Li, Chao and Yu, Leijian and Han, Lirong and Deng, Xiaogang and Yang, Erfu and Ren, Peng (2019) UAV first view landmark localization with active reinforcement learning. Pattern Recognition Letters, 125. pp. 549-555. ISSN 0167-8655 (https://doi.org/10.1016/j.patrec.2019.03.011)

[thumbnail of Wang-etal-PRL-2019-UAV-first-view-landmark-localization-with-active-reinforcement]
Preview
Text. Filename: Wang_etal_PRL_2019_UAV_first_view_landmark_localization_with_active_reinforcement.pdf
Accepted Author Manuscript
License: Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 logo

Download (6MB)| Preview

Abstract

We present an active reinforcement learning framework for unmanned aerial vehicle (UAV) first view landmark localization. We formulate the problem of landmark localization as that of a Markov decision process and introduce an active landmark-localization network (ALLNet) to address it. The aim of the ALLNet is to locate a bounding box that surrounds the landmark in a first view image sequence. To this end, it is trained in a reinforcement learning fashion. Specifically, it employs support vector machine (SVM) scores on the bounding box patches as rewards and learns the bounding box transformations as actions. Furthermore, each SVM score indicates whether or not the landmark is detected by the bounding box such that it enables the ALLNet to have the capability of judging whether the landmark leaves or re-enters a first view image. Therefore, the operation of the ALLNet is not only dominated by the reinforcement learning process but also supplemented by an active learning motivated manner. Once the landmark is considered to leave the first view image, the ALLNet stops operating until the SVM detects its re-entry to the view. The active reinforcement learning model enables training a robust ALLNet for landmark localization. The experimental results validate the effectiveness of the proposed model for UAV first view landmark localization.

ORCID iDs

Wang, Xinran, Li, Chao, Yu, Leijian, Han, Lirong, Deng, Xiaogang, Yang, Erfu ORCID logoORCID: https://orcid.org/0000-0003-1813-5950 and Ren, Peng;