Q -learning approach to automated unmanned air vehicle (UAV) demining

Silvia Ferrari, Greyson Daugherty, Grant R. Gerhart, Douglas W. Gage, Charles M. Shoemaker
2010 Unmanned Systems Technology XII  
This paper develops a Q-learning approach to Unmanned Air Vehicle (UAV) navigation, or path planning, for sensing applications in which an infrared (IR) sensor or camera is installed onboard the UAV for the purpose of detecting and classifying multiple, stationary ground targets. The problem can be considered as a geometric sensor-path planning problem, because the geometry and position of the sensor's field of view (FOV) determines what targets can be detected and classified at any given time.
more » ... The advantage of this approach over existing path planning techniques is that the optimal guidance policy is learned via the Q-function, without explicit knowledge of the system models and environmental conditions. The approach is demonstrated through a demining application in which a UAV-based IR sensor is capable of determining the optimal altitude for properly detecting and classifying targets buried in a complex region of interest.
doi:10.1117/12.850135 fatcat:exflfljljfg4risfrf3wrst76a