Filters








31 Hits in 1.0 sec

Contextual Suggestion from Wikitravel: Exploiting Community-Based Suggestions

Marijn Koolen, Jaap Kamps, Hugo C. Huurdeman
2012 Text Retrieval Conference  
This paper describes our participation in the TREC 2012 Contextual Suggestion Track. The goal of the track is to evaluate systems that provide suggestions for activities to users in a specific location, at a specific time, taking into account their personal preferences. As a source for travel suggestions we use Wikitravel, which is a community-based travel guide for destinations all over the world. From pages dedicated to cities in the US we extract suggestions for sightseeing, shopping, eating
more » ... and drinking. Descriptions from positive examples in the user profiles are used as queries to rank all suggestions in the US. Our baseline approach merges the per-query rankings of all positive examples of all users. Our userdependent approach merges the per-query rankings of the positive examples of a single user. The rankings suggestions are then filtered based on the location of the user. We ignore the temporal aspects of the context. The user-dependent rankings are more effective for contextual suggestion than user-independent rankings. The two systems show similar perform on the geographical dimension, but the user-dependent system provides more interesting suggestions. Our results show that information on user preferences is valuable for providing appropriate suggestions.
dblp:conf/trec/KoolenKH12 fatcat:oletsgoeqfdz3njobdvvuxe6rm

Comparing Topic Representations for Social Book Search

Marijn Koolen, Hugo C. Huurdeman, Jaap Kamps
2013 Conference and Labs of the Evaluation Forum  
In this paper we describe our participation in the INEX 2013 Social Book Search Track. We compare the impact of different query representations for book search topics derived from the LibraryThing discussion forums, including the title and full narrative provided by the topic creator, the name of the discussion group in which the topic was posted, and a mediated search query provided by a trained annotator. Our findings are that 1) the mediated queries are short and do not improve performance
more » ... er the titles, but combining titles and mediated queries does, 2) the discussion group name adds relevant new terms to the representation and further improves performance, but adding the narrative is not effective, and 3) for the majority of topics retrieval effectiveness is the same across all topic representations. Our findings suggest that writing a good search query for the complex information needs in social book search is far from trivial, even for trained annotators.
dblp:conf/clef/KoolenHK13 fatcat:2lftoyfeozau3anvtbyyptbdq4

Adaptive Search Support for Information Seeking Stages

Hugo C. Huurdeman
2015 Bulletin of IEEE Technical Committee on Digital Libraries  
We use the Web for work, leisure, and research, assisted by various search systems in the task of satisfying our information needs. We utilize these systems to perform our daily tasks, ranging from simple lookup tasks to complex, exploratory and analytical ventures. The more complex tasks may involve multiple information seeking stages, with evolving inherent needs for each stage. Most search systems, however, only support these complex tasks in an elementary manner, and offer a
more » ... ll' interface optimized for shallow lookup search. In addition to the wealth of information available on the live Web, historical Web content is currently available in Web archives, containing snapshots of the Web that once was. These Web archives can enable new opportunities for analytical tasks, serving as data sources for researchers in various fields. At the moment, however, few archives offer full-text search, and the search systems that are available fall short of the rich functionality needed for analytical tasks. This PhD research proposal takes Kuhlthau's ISP model as its framework, and addresses search support for different 'stages' of complex search tasks. It discusses the theoretical implications of multistage information seeking models for the design of search systems. The proposal examines the effects of information seeking stages on the flow of interaction with actual search systems. The understanding on both the theoretical and practical level are used to design and evaluate multistage search systems, firstly in a general Web search setting, and secondly in a Web archive search setting. Finally, this leads to design recommendations for supporting different stages of complex tasks in search systems. This proposal consists of five parts: first, we introduce the topic and research problem. Then, we state the research questions. Subsequently, background literature is briefly discussed, followed by the employed methodology. Next, current progress is discussed, before summarizing this proposal in the conclusion.
dblp:journals/tcdl/Huurdeman15 fatcat:hopnyxjnhjclrecgopmwdh6bsa

Using Collaborative Filtering in Social Book Search

Hugo C. Huurdeman, Jaap Kamps, Marijn Koolen, Justin van Wees
2012 Conference and Labs of the Evaluation Forum  
In this paper we describe our participation in INEX 2012 in the Social Book Search Track and the Linked Data Track. For the Social Book Search Track we compare the impact of query-and userindependent popularity measures and recommendations based on user profiles. Book suggestions are more than just topical relevance judgements and may include personal factors such as interestingness, fun and familiarity and book-related aspects such as quality and popularity. Our aim is to understand to what
more » ... ent book suggestions are related to userdependent and -independent aspects of relevance. Our findings are that evidence that is both query-and user-independent is not effective for improving a standard retrieval model using blind feedback. User-dependent evidence, on the contrary, is highly effective, leading to significant improvements. For the Linked Data Track we compare different methods of weighted result aggregation using the DBpedia ontology relations as facets and values. Facets and values are aggregated using either document counts or retrieval scores. The reason to use retrieval scores for facet ranking is that we want the top retrieved results to be summarised by the top ranked facets and values. In addition, we look at the impact of taking overlap in aggregation into account. Facet values that give access to many of the same documents have high overlap. Selecting facet values that have low overlap may avoid frustrating the user.
dblp:conf/clef/HuurdemanKKW12 fatcat:n3z6ccoktnarzckodvzr3a2awe

Finding pages on the unarchived Web

Hugo C. Huurdeman, Anat Ben-David, Jaap Kamps, Thaer Samar, Arjen P. de Vries
2014 IEEE/ACM Joint Conference on Digital Libraries  
Web archives preserve the fast changing Web, yet are highly incomplete due to crawling restrictions, crawling depth and frequency, or restrictive selection policies-most of the Web is unarchived and therefore lost to posterity. In this paper, we propose an approach to recover significant parts of the unarchived Web, by reconstructing descriptions of these pages based on links and anchors in the set of crawled pages, and experiment with this approach on the Dutch Web archive. Our main findings
more » ... e threefold. First, the crawled Web contains evidence of a remarkable number of unarchived pages and websites, potentially dramatically increasing the coverage of the Web archive. Second, the link and anchor descriptions have a highly skewed distribution: popular pages such as home pages have more terms, but the richness tapers off quickly. Third, the succinct representation is generally rich enough to uniquely identify pages on the unarchived Web: in a known-item search setting we can retrieve these pages within the first ranks on average.
doi:10.1109/jcdl.2014.6970188 dblp:conf/jcdl/HuurdemanBKSV14 fatcat:rya7otftlvdqhlqp7rtb7kp3u4

Information Security Issues in a Digital Library Environment: A Literature Review

Audrey Anday, Enrico Francese, Hugo C. Huurdeman, Muharrem Yılmaz, Dydimus Zengenene
2012 Bilgi Dünyasi  
C.  ...  Patel, A., Qassim, O. and Wills, C. (2010). A survey of intrusion detection and prevention systems. Information Management & Computer Security, 18(4), 277-290 Pearson, D. and Webb, C. (2008).  ... 
doi:10.15612/bd.2012.171 fatcat:fsdwku53dbbathoeln54o73xyi

The Value of Multistage Search Systems for Book Search

Hugo C. Huurdeman, Jaap Kamps, Marijn Koolen, Sanna Kumpulainen
2015 Conference and Labs of the Evaluation Forum  
Often, our exploratory quests for books are highly complex endeavors which feature activities such as exploration, searching, selecting and comparing various books. Current systems for book search may not provide optimal support for this wide range of activities. The interactive Social Book Search Track investigates how users utilize different access interfaces in the context of two types of tasks, and evaluates a streamlined baseline interface and a rich multistage interface, potentially
more » ... ting different stages of search. In this paper, we analyze how these two types of interfaces influence user behavior, in terms of task duration, book selection and interaction patterns. Furthermore, we characterize the use of the different panels of the experimental multistage interface, as well as user engagement. We find initial evidence for the additional value of providing stage-based search support in the context of open-ended and focused book search tasks.
dblp:conf/clef/HuurdemanKKK15 fatcat:tqg33r6qffdbpptfv2x4rykkwu

Dynamic Compositions: Recombining Search User Interface Features for Supporting Complex Work Tasks

Hugo C. Huurdeman
2017 Conference on Human Information Interaction and Retrieval  
Especially in initial stages, Facets and Filters (b) can be useful to delineate resultsets, and adapting Results Ordering (c) may initially help to nd the right items.  ...  Huurdeman et al. [8] used a feature categorization from Wilson [17] to more broadly group di erent types of SUI features, and assessed their value over time using a multistage task design.  ... 
dblp:conf/chiir/Huurdeman17 fatcat:z5i6cu33rzfjrbe6gkt6ewddta

University of Amsterdam at the TREC 2013 Contextual Suggestion Track: Learning User Preferences from Wikitravel Categories

Marijn Koolen, Hugo C. Huurdeman, Jaap Kamps
2013 Text Retrieval Conference  
The average rating ru of examples D E by a user u is given as: ru = 1 |D E | d∈D E r u (d) (1) The average rating of example websites D C in Wikitravel category C by user u is: ru (C) = 1 |D C | d∈D C  ...  i ) (3) The language model score P (d|q i ) is computed as: P (d|q i ) = P (d) • P (q i |d) (4) where P (d) is a document prior probability, which is P (d) = 1 in the baseline system and P (d) = ru (C)  ... 
dblp:conf/trec/KoolenHK13 fatcat:m45lneov25fszkb3p3ladv24ey

Effective Metadata for Social Book Search from a User Perspective

Hugo C. Huurdeman, Jaap Kamps, Marijn Koolen
2014 Conference and Labs of the Evaluation Forum  
In this extended abstract we describe our participation in the INEX 2014 Interactive Social Book Search Track. In previous work, we have looked at the impact of professional and user-generated metadata in the context of book search, and compared these different categories of metadata in terms of retrieval effectiveness. Here, we take a different approach and study the use of professional and user-generated metadata of books in an interactive setting, and the effectivity of this metadata from a
more » ... ser perspective. We compare the perceived usefulness of general descriptions, publication metadata, user reviews and tags in focused and open-ended search tasks, based on data gathered in the INEX Interactive Social Book Search Track. Furthermore, we take a tentative look at the actual use of different types of metadata over time in the aggregated search tasks. Our preliminary findings in the surveyed tasks indicate that user reviews are generally perceived to be more useful than other types of metadata, and they are frequently mentioned in users' rationales for selecting books. Furthermore, we observe a varying usage frequency of traditional and user-generated metadata across time in the aggregated search tasks, providing initial indications that these types of metadata might be useful at different stages of a search task.
dblp:conf/clef/HuurdemanKK14 fatcat:fkuur5oyojad3dmio3kn7457qm

From multistage information-seeking models to multistage search systems

Hugo C. Huurdeman, Jaap Kamps
2014 Proceedings of the 5th Information Interaction in Context Symposium on - IIiX '14  
The ever expanding digital information universe makes us rely on search systems to sift through immense amounts of data to satisfy our information needs. Our searches using these systems range from simple lookups to complex and multifaceted explorations. A multitude of models of the information seeking process, for example Kuhlthau's ISP model, divide the information seeking process for complex search tasks into multiple stages. Current search systems, in contrast, still predominantly use a
more » ... -size-fits-all" approach: one interface is used for all stages of a search, even for complex search endeavors. The main aim of this paper is to bridge the gap between multistage information seeking models, documenting the search process on a general level, and search systems and interfaces, serving as the concrete tools to perform searches. To find ways to reduce the gap, we look at existing models of the information seeking process, at search interfaces supporting complex search tasks, and at the use of interface features over time. Our main contribution is that we conceptually bring together macro level information seeking stages and micro level search system features. We highlight the impact of search stages on the flow of interaction with user interface features, providing new handles for the design of multistage search systems.
doi:10.1145/2637002.2637020 dblp:conf/iiix/HuurdemanK14 fatcat:clbyju2vx5g3fgiqn2y7fcnt4m

Lost but not forgotten: finding pages on the unarchived web

Hugo C. Huurdeman, Jaap Kamps, Thaer Samar, Arjen P. de Vries, Anat Ben-David, Richard A. Rogers
2015 International Journal on Digital Libraries  
70,371 1.28 jpg 30,205 0.58 Table 11 TLD 11 Table 13Top 10 of inner aura categories (rank in archive) distribution Inner aura Count % Outer aura Count % 1 n l 5 ,268,772 95.7 1 c  ... 
doi:10.1007/s00799-015-0153-3 fatcat:f5yhxhrdxjduznbnxamlcjvacm

Uncovering the unarchived web

Thaer Samar, Hugo C. Huurdeman, Anat Ben-David, Jaap Kamps, Arjen de Vries
2014 Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval - SIGIR '14  
Many national and international heritage institutes realize the importance of archiving the web for future culture heritage. Web archiving is currently performed either by harvesting a national domain, or by crawling a pre-defined list of websites selected by the archiving institution. In either method, crawling results in more information being harvested than just the websites intended for preservation; which could be used to reconstruct impressions of pages that existed on the live web of the
more » ... crawl date, but would have been lost forever. We present a method to create representations of what we will refer to as a web collection's aura: the web documents that were not included in the archived collection, but are known to have existed -due to their mentions on pages that were included in the archived web collection. To create representations of these unarchived pages, we exploit the information about the unarchived URLs that can be derived from the crawls by combining crawl date distribution, anchor text and link structure. We illustrate empirically that the size of the aura can be substantial: in 2012, the Dutch Web archive contained 12.3M unique pages, while we uncover references to 11.9M additional (unarchived) pages.
doi:10.1145/2600428.2609544 dblp:conf/sigir/SamarHBKV14 fatcat:2ug2mk4txjhcvfbanmdrrpgh7a

"More than Meets the Eye" - Analyzing the Success of User Queries in Oria

Hugo C. Huurdeman, Mikaela Aamodt, Dan Michael Heggø
2018 Nordic Journal of Information Literacy in Higher Education  
Discovery systems allow academic library users to locate a wider range of resources than previous OPACs. However, actual usage of these systems may still be challenging. The main aim of this research is to get a better understanding of the hurdles users face while searching contemporary library systems.This study utilizes a transaction log analysis approach, using popular and zero result queries datasets gathered from the statistics module of a library discovery system. It explores what types
more » ... queries users perform, how successful the queries are, and examines underlying reasons for unsuccessful queries. To our knowledge, this is the first academic paper to use data originating from built-in transaction logs of the Oria library discovery system.The analysis shows that queries are often curriculum-related: we could pinpoint a relation with curriculum for 58% of the popular queries, and 28% for the zero result searches. A vast majority of popular queries refer to books, databases and journals, and over half of the queries used the title to locate a resource. 20% of the popular queries turned out to be unsuccessful. Zero result queries typically involve long queries, and in many cases consist of pasted reference citations.Our conclusion is that the examined discovery system is rather sensitive. Whilst this suggests the importance of increasing users' information search skills, it also points to the need for enhancing discovery systems and their underlying metadata. Furthermore, due to the prominence of curriculum-related queries, a better integration of curriculum materials ought to be achieved.
doi:10.15845/noril.v10i1.270 fatcat:cxlxtvv3bjc77o4vgu7xwo4e34

Active and Passive Utility of Search Interface Features in Different Information Seeking Task Stages

Hugo C. Huurdeman, Max L. Wilson, Jaap Kamps
2016 Proceedings of the 2016 ACM on Conference on Human Information Interaction and Retrieval - CHIIR '16  
Finally, Huurdeman and Kamps [11] looked at conceptual ways to bridge macro and micro-level information seeking models, and based on changes in gaze behaviour of a small-scale user study involving book  ... 
doi:10.1145/2854946.2854957 dblp:conf/chiir/HuurdemanWK16 fatcat:xzbnhnfxknchfoeqlsudg4w7rm
« Previous Showing results 1 — 15 out of 31 results