A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Out of Sight, Out of Mind: Detecting Orphaned Web Pages at Internet-Scale
2021
Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security
Security misconfigurations and neglected updates commonly lead to systems being vulnerable. Especially in the context of websites, we often find pages that were forgotten, that is, they were left online after they served their purpose and never updated thereafter. In this paper, we introduce new methodology to detect such forgotten or orphaned web pages. We combine historic data from the Internet Archive with active measurements to identify pages no longer reachable via a path from the index
doi:10.1145/3460120.3485367
fatcat:b4gtskmlzrbidmyp4zrkkzrqde