Website Forensic Investigation to Identify Evidence and Impact of Compromise [chapter]

Yuta Takata, Mitsuaki Akiyama, Takeshi Yagi, Takeshi Yada, Shigeki Goto
2017 Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering  
Compromised websites that redirect users to malicious websites are often used by attackers to distribute malware. These attackers compromise popular websites and integrate them into a drive-by download attack scheme to lure unsuspecting users to malicious websites. An incident response organization such as a CSIRT contributes to preventing the spread of malware infection by analyzing compromised websites reported by users and sending abuse reports with detected URLs to webmasters. However, abuse reports that contain only URLs are not sufficient for cleaning up the websites, so webmasters cannot respond appropriately to them. In addition, it is difficult to analyze malicious websites across different client environments, i.e., those of a CSIRT and a webmaster, because these websites change their behavior depending on the client environment. To expedite compromised website clean-up, it is important to provide fine-grained information such as the precise position of compromised web content, malicious URL relations, and the target range of client environments. In this paper, we propose a method of constructing a redirection graph with context, such as which web content redirects to which malicious websites. Our system, which implements the proposed method, analyzes a website in a multi-client environment to identify which client environments are exposed to threats. We evaluated our system using crawling datasets of approximately 2,000 compromised websites. Our system successfully identified compromised web content and malicious URL relations, and it reduced the amount of web content and the number of URLs that incident responders need to analyze to 0.8% and 15.0%, respectively. Furthermore, it identified the target range of client environments in 30.4% of websites and, by leveraging this target information, a vulnerability that had been used in malicious websites. The fine-grained information identified by our system would make the daily work of incident responders dramatically more efficient.
doi:10.1007/978-3-319-59608-2_25 fatcat:g3birlugyneslaax6ba5z74lbm