Species classification on thermal video using a convolutional recurrent neural network [article]

Christopher David Carr, University Of Canterbury
2021
This paper proposes a new approach to species surveying, utilising convolutional recurrent neural networks (CRNNs). By using breakthroughs in neural network architectures and designs, as well as modern hardware, new approaches are possible that have not yet been investigated. Analysing thousands of hours of footage allows for more accurate, timely, and interesting surveying footage, far surpassing current approaches used by conservation programs. Prior to this research, a reliable dataset of
more » ... rmal images did not exist, much less a dataset that records motion. Further, the data has been labelled, and categorised by location and time. While the creation of this dataset alone is a contribution, the CRNN has a high performance and reliable detection for all trained classes, which increases as more data is gathered. This puts this neural network approach ahead of any other extant method, as those that do exist either use static images, infrared illumination, or perform worse. The proposed approach is much better at detecting animals than current low tech trap or observation based approaches (by over 3 thousand times), such as trapping lines, transects, dog hunting, or observations. Further, it is more accurate than extant trail cameras for detecting small mammals - being about 10-50 times better in experimental trials. Furthermore the net itself performs well on trained classes, with the accuracy of the CRNN reaching up to 87 percent and the catchment includes all night hours (the definition of which can be increased or decreased based on latitude and time of year, or simply ambient light levels) - and the filming technique uses a thermographic passive infrared camera, and requires a cold background. Processing time (per occurrence) is unaffected by total footage (3ms processing time per animal-occurrence), though obviously the more footage captured, the more that needs to be processed, also in- creasing linearly. Finally, the approach described in this paper has the potential to be used internationally, on all cont [...]
doi:10.26021/12027 fatcat:ptd4v4hzkzbshiikrxmpqncbnq