Detecting Malicious URLs via a Keyword-based Convolutional Gated-recurrent-unit Neural Network

WenChuan Yang, Wen Zuo, BaoJiang Cui
2019 IEEE Access  
With the continuous development of Web attacks, many web applications have been suffering from various forms of security threats and network attacks. The security detection of URLs has always been the focus of Web security. Many web application resources can be accessed by simply entering an URL or clicking a link in the browser. An attacker can construct various web attacks such as SQL, XSS, and information disclosure by embedding executable code or injecting malicious code into the URL.
more » ... ore, it is necessary to improve the reliability and security of web applications by accurately detecting malicious URLs. This paper designs a convolutional gated-recurrent-unit (GRU) neural network for the detection of malicious URLs detection based on characters as text classification features. Considering that malicious keywords are unique to URLs, a feature representation method of URLs based on malicious keywords is proposed, and a GRU is used in place of the original pooling layer to perform feature acquisition on the time dimension, resulting in high-accuracy multicategory results. The experimental results show that our proposed neural network detection model is very suitable for high-precision classification tasks. Compared with other classification models, the model accuracy rate is above 99.6%. The use of deep learning to classify URLs to identify Web visitors' intentions has important theoretical and scientific values for Web security research, providing new ideas for intelligent security detection. INDEX TERMS Gated recurrent unit (GRU), malicious URL detection, network attack, character-level embedding, convolutional neural network, neural network model.
doi:10.1109/access.2019.2895751 fatcat:ifq3wfm46nd4pe6kqb3narltie