Evaluating Web Archive Search Systems [chapter]

Miguel Costa, Mário J. Silva
2012 Lecture Notes in Computer Science  
In this work, we propose an evaluation methodology for web archive search systems based on a list of requirements compiled from previous characterizations of web archives and their users.  ...  The test collection enabled, for the first time, to measure the effectiveness of state-of-the-art IR technology employed in web archives.  ...  Acknowledgments This work could not be done without the support of FCCN and its Portuguese Web Archive team. We thank FCT for its Multiannual Funding Programme.  ... 
doi:10.1007/978-3-642-35063-4_32 fatcat:ku3njm6cdrdgbfon4cmuux3yae

Access Patterns for Robots and Humans in Web Archives [article]

Yasmin AlNoamany, Michele C. Weigle, Michael L. Nelson
2013 arXiv   pre-print
In terms of unique archived web pages, there is no overall preference for a particular time, but the recent past (within the last year) shows significant repeat accesses.  ...  Although user access patterns on the live web are well-understood, there has been no corresponding study of how users, both humans and robots, access web archives.  ...  ACKNOWLEDGMENTS This work was supported in part by the NSF (IIS 1009392) and the Library of Congress.  ... 
arXiv:1309.4009v1 fatcat:faz7dh7gr5h4pf66ra3ohng2cy

Access patterns for robots and humans in web archives

Yasmin A. AlNoamany, Michele C. Weigle, Michael L. Nelson
2013 Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries - JCDL '13  
In terms of unique archived web pages, there is no overall preference for a particular time, but the recent past (within the last year) shows significant repeat accesses.  ...  Although user access patterns on the live web are well-understood, there has been no corresponding study of how users, both humans and robots, access web archives.  ...  The only web usage mining research that has been conducted on the usage of web archives is the study of search behavior characterization of web archives based on a quantitative analysis of the Portuguese  ... 
doi:10.1145/2467696.2467722 dblp:conf/jcdl/AlNoamanyWN13 fatcat:v45urmmsrzgx3nix523tyea7vi

Creating a billion-scale searchable web archive

Daniel Gomes, Miguel Costa, David Cruz, João Miranda, Simão Fontes
2013 Proceedings of the 22nd International Conference on World Wide Web - WWW '13 Companion  
The Portuguese Web Archive is the largest full-text searchable web archive publicly available. It supports search over 1.2 billion files archived from the web since 1996.  ...  However, users demand efficient and effective search mechanisms to access the already vast collections of historical information held by web archives.  ...  ACKNOWLEDGMENTS We thank the Internet Archive for supplying us web collections from the .PT domain and for its continuous efforts to preserve and grant access to human knowledge.  ... 
doi:10.1145/2487788.2488118 dblp:conf/www/GomesCCMF13 fatcat:bkl6pofb3bb43iyxir46pinfta

Who and what links to the Internet Archive

Yasmin AlNoamany, Ahmed AlSum, Michele C. Weigle, Michael L. Nelson
2014 International Journal on Digital Libraries  
Most of the links (86%) from websites are to individual archived pages at specific points in time, and of those 83% no longer exist on the live web.  ...  Most human users come to web archives because they do not find the requested pages on the live web. About 65% of the requested archived pages no longer exist on the live web.  ...  Acknowledgment This work was supported in part by the NSF (IIS 1009392) and the Library of Congress.  ... 
doi:10.1007/s00799-014-0111-5 fatcat:rn2ux7gyenaglk5kpqfj5sag6a

Who and What Links to the Internet Archive [chapter]

Yasmin Alnoamany, Ahmed Alsum, Michele C. Weigle, Michael L. Nelson
2013 Lecture Notes in Computer Science  
Most of the links (86%) from websites are to individual archived pages at specific points in time, and of those 83% no longer exist on the live web.  ...  Most human users come to web archives because they do not find the requested pages on the live web. About 65% of the requested archived pages no longer exist on the live web.  ...  Acknowledgment This work was supported in part by the NSF (IIS 1009392) and the Library of Congress.  ... 
doi:10.1007/978-3-642-40501-3_35 fatcat:76xjfgs7onglrbdfgkxfqhd7gq

Searching Digital Libraries [chapter]

Panagiotis G. Ipeirotis
2016 Encyclopedia of Database Systems  
The Open Archives Initiative focused on defining a protocol for exporting metadata about the objects in the collections hosted by each library.  ...  For example, consider the following URL-encoded query: • Use the database characterization to select the most promising databases for evaluating a given query [9, 15]. • Evaluate the queries in the selected  ... 
doi:10.1007/978-1-4899-7993-3_327-2 fatcat:gw5dlna43nhjlks6tfwbh2urmy

An Investigation of the Academic Information Finding and Re-finding Behavior on the Web

Hsiao-Tieh Pu, Xin-Yu Jiang
2014 Journal of Library and Information Studies  
Web information finding and re-finding behavior is an important and novel area of research.  ...  and re-finding behavior.  ...  Characterizations of user web revisit behavior. Paper presented at A comparison of nine search engine transaction logs.  ... 
doi:10.6182/jlis.2014.12(2).117 doaj:bd84086bf6f6474cb60f600458cb2b6e fatcat:4j5ridrwzrc5dpy5j7zrtmijj4

Tools for noise characterization in Virgo

T Accadia, F Acernese, F Antonucci, P Astone, G Ballardin, F Barone, M Barsuglia, Th S Bauer, M G Beker, A Belletoile, S Birindelli, M Bitossi (+160 others)
2010 Journal of Physics, Conference Series  
It is used in the on-line LineMonitor process for high resolution search for lines in the dark fringe spectrum.  ...  and the results are reported on web pages. The goal is to have user-friendly tools to let the commissioning and data analysts easily characterize the data.  ... 
doi:10.1088/1742-6596/243/1/012004 fatcat:kila4o6dffdmxb4mefo7po26va

Characterization of Search Engine Caches [article]

Frank McCown, Michael L. Nelson
2007 arXiv   pre-print
We also examined the overlap of the various caches with the holdings of the Internet Archive.  ...  Although search engine caches are provided only as an aid to the interactive user, we are interested in building reliable preservation services from the aggregate of these limited caching services.  ...  Acknowledgments We would like to thank Giridhar Nandigam (ODU) for writing the search engine querying software. This work is supported in part by NSF Grant IIS-0610841.  ... 
arXiv:cs/0703083v2 fatcat:chnsm2xm2bayrnjbddvr667tym

Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities

Andrew Jackson, Jimmy Lin, Ian Milligan, Nick Ruest
2016 Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries - JCDL '16  
In this paper, we describe initial experiences in providing an exploratory search interface to web archives for humanities scholars and social scientists.  ...  Web archiving initiatives around the world capture ephemeral web content to preserve our collective digital memory.  ...  While our findings are accurately characterized as preliminary, and we are by no means the first to examine the information seeking behavior of scholars (cf.  ... 
doi:10.1145/2910896.2910912 dblp:conf/jcdl/JacksonLMR16 fatcat:ldy6mqflmfcazjl5lkozlsfvqu

User Interactions with Electronic Finding Aids in a Controlled Setting

Christopher Prom
2004 The American Archivist  
Most users stated in their questionnaires that they preferred to scroll and click links when searching a Web site, but their actual behavior was different.  ...  more than a hundred times in the past year AND characterized self as a "highly experienced" computer user.  ... 
doi:10.17723/aarc.67.2.7317671548328620 fatcat:dq5qjefwbjdhfluilhxrplaoeq

System evaluation of archival description and access

Junte Zhang
2012 SIGIR Forum  
Deriving Web Search Behavior from Search Logs We can study Web search behavior by looking at search logs of Web sites.  ...  Search Behavior in Cultural Heritage Besides on the Web, there can be a breakdown of the search behaviors of users in a particular domain, such as in cultural heritage (CH).  ...  So here there is complete agreement among the users for a specific search behavior, i.e. only getting an overview and then stop. In the other EAD finding aids, there are more transition types.  ... 
doi:10.1145/2093346.2093367 fatcat:uoqzc5kavva2hn3j7kxzybhlwu

GeoStream: Spatial Information Indexing Within Textual Documents Supported by a Dynamically Parameterized Web Service [chapter]

Albert Royer, Christian Sallaberry, Pierre Loustau, Mauro Gaio, Thierry Joliveau, Pierre-André Le Ny
2011 Lecture Notes in Geoinformation and Cartography  
Cultural heritage content is everywhere on the web: digital libraries, archives, and portals of museums or galleries.  ...  In this context, the GeoTopia project is supported by the CNRS-TGE-Adonis and focuses on archive data sharing and interpretation.  ...  It is led in partnership with the EVS (umr5600.univ-lyon3.fr/) and LIUPPA (http://liuppa.univpau.fr) laboratories and the Makina Corpus (http://www.makina-corpus.com/) company. Glossary  ... 
doi:10.1007/978-3-642-10595-1_12 fatcat:kshavpfpgre4hcgyisaevi2ckm

Searching Digital Libraries [chapter]

Rodney Topor, Kenneth Salem, Amarnath Gupta, Kazuo Goda, Johannes Gehrke, Nathaniel Palmer, Mohamed Sharaf, Alexandros Labrinidis, John F. Roddick, Ariel Fuxman, Renée J. Miller, Wang-Chiew Tan (+205 others)
2009 Encyclopedia of Database Systems  
The Open Archives Initiative focused on defining a protocol for exporting metadata about the objects in the collections hosted by each library.  ...  For example, consider the following URL-encoded query: • Use the database characterization to select the most promising databases for evaluating a given query [9, 15]. • Evaluate the queries in the selected  ... 
doi:10.1007/978-0-387-39940-9_327 fatcat:ev7t3weinbgc5d3t3wb7hi6wdm