Using Events for Content Appraisal and Selection in Web Archives

Thomas Risse, Stefan Dietze, Diana Maynard, Nina Tahmasebi
2011 International Semantic Web Conference  
With the rapidly growing volume of resources on the Web, Web archiving becomes an important challenge. In addition, the notion of community memories extends traditional Web archives with related data from a variety of sources on the Social Web. Community memories take an entity-centric view to organise Web content according to the events and the entities related to them, such as persons, organisations and locations. To this end, the main challenge is to extract, detect and correlate events and
more » ... elated information from a vast number of heterogeneous Web resources where the nature and quality of the content may vary heavily. In this paper we present the approach of the ARCOMEM project which is based on an iterative cycle consisting of (1) targeted archiving/crawling of Web objects, (2) entity and event extraction and detection, and (3) refinement of crawling strategy.
dblp:conf/semweb/0001DMT11 fatcat:kwisosghsbgefkryesqx3shqc4