PACL: Piecewise Arc Cotangent Decay Learning Rate For Deep Neural Network Training

Haixu Yang, Jihong Liu, Hongwei Sun, Henggui Zhang
2020 IEEE Access  
Deep neural networks (DNNs) are currently the best-performing method for many classification problems. For training DNNs, the learning rate is the most important hyper-parameter, choice of which affects the performance of the model greatly. In recent years, some learning rate schedulers, such as HTD, CLR, and SGDR, have been proposed. These methods, some of which make use of the cycling mechanism to improve the convergence speed and accuracy of DNN, but performance degradation occurs in the
more » ... ergence process. Others have good accuracy, but their convergence speed is too slow. This paper proposed a new learning rate schedule called piecewise arc cotangent decay learning rate (PACL), which can not only improve the convergence speed and accuracy of DNN but also significantly reduce performance degradation zone caused by the cycling mechanism. It is easy to implement, but almost at no extra computing expense. Finally, we demonstrate the effectiveness of PACL, on training CIFAR-10, CIFAR-100, and Tiny ImageNet with ResNet, DenseNet, WRN, SEResNet, and MobileNet. INDEX TERMS Deep neural networks, learning rate schedulers, arc cotangent, optimization.
doi:10.1109/access.2020.3002884 fatcat:ylf2yyp6cnfhbiuqyqstz7gxxq