MarsNet: Multi-label Classification Network for Images of Various Sizes

Ju-Youn Park, Yewon Hwang, DukYoung Lee, Jong-Hwan Kim
2020 IEEE Access  
Since the Convolutional Neural Network (CNN) has surfaced and fascinated the world, many researchers have exploited CNN for image classification, object detection, semantic segmentation, etc. However, the conventional CNNs have a pyramidal structure and were designed to process images which have the same size. Although some CNNs can accept images of various sizes, performance is degraded for images smaller than the size of images used for training. In this paper, we propose MarsNet, a CNN based
more » ... arsNet, a CNN based end-to-end network for multi-label classification with an ability to accept various size inputs. In order to allow the network to accept such images, dilated residual network (DRN) is modified to get higher resolution feature maps, and horizontal vertical pooling (HVP) is newly designed to efficiently aggregate positional information from the feature maps. Furthermore, multi-label scoring module and threshold estimation module are employed to serve the purpose of multi-label classification. We verify the effectiveness of the proposed network through two distinctive experiments. We first verify our model by inspecting and classifying multiple types of defects occurred in PCB screen printer using solder paste inspection (SPI) datasets. Secondly, we verify our network using VOC 2007 dataset. Our network is pioneering in that no research has attempted to accomplish multi-label classification for defects in addition to being able to take input images of various sizes in SPI field. INDEX TERMS Convolutional neural networks, images of various sizes, multi-label classification, printed circuit board, solder paste inspection. 21832 This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see
doi:10.1109/access.2020.2969217 fatcat:zo7j4zbqmzfchh4qusnezfagxi