A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
How to Search the Internet Archive Without Indexing It
[chapter]
2016
Lecture Notes in Computer Science
Significant parts of cultural heritage are produced on the web during the last decades. While easy accessibility to the current web is a good baseline, optimal access to the past web faces several challenges. This includes dealing with large-scale web archive collections and lacking of usage logs that contain implicit human feedback most relevant for today's web search. In this paper, we propose an entity-oriented search system to support retrieval and analytics on the Internet Archive. We use
doi:10.1007/978-3-319-43997-6_12
fatcat:ryv5ur3plvbdzoka5ufmwejhay