The Seventeenth Australasian Document Computing Symposium

Andrew Trotman, Sally Jo Cunningham, Laurianne Sitbon
2012 SIGIR Forum  
The Seventeenth Australasian Document Computing Symposium was held in Dunedin, New Zealand on the 5th and 6th of December 2012. In total twenty-four papers were submitted.  ...  A poster session was held jointly with the Australasian Language Technology Workshop.  ...  Introduction: The Australasian Document Computing Symposium was first run in 1996 and has been the annual regional Information Retrieval and Document Computing conference in Australasia ever since.  ... 
doi:10.1145/2492189.2492193 fatcat:kb2jz2sopjcrflmeha5xxiovyq

Pairwise similarity of TopSig document signatures

Christopher M. De Vries, Shlomo Geva
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
This paper analyses the pairwise distances of signatures produced by the TopSig retrieval model on two document collections.  ...  The distribution of the distances is compared to that of purely random signatures. It explains why TopSig is only competitive with state-of-the-art retrieval models at early precision.  ...  Furthermore, TopSig is likely to be useful for increased computational efficiency of document-to-document comparisons.  ... 
doi:10.1145/2407085.2407103 dblp:conf/adcs/VriesG12 fatcat:gclf73tx5fb6lckuzvcszzfidm

A study in language identification

Rachel Mary Milne, Richard A. O'Keefe, Andrew Trotman
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
Language identification is automatically determining the language that a previously unseen document was written in.  ...  We compared several prior methods on samples from the Wikipedia and the EuroParl collections. Most of these methods work well.  ...  The language of the document was chosen as the language of the largest set.  ... 
doi:10.1145/2407085.2407097 dblp:conf/adcs/MilneOT12 fatcat:imso4hxwfrd5dmeecaaxpl2ezy

Is the unigram relevance model term independent?

Mike Symonds, Peter Bruza, Guido Zuccon, Laurianne Sitbon, Ian Turner
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
The framework is used to classify and compare the query expansion terms produced by the unigram and positional relevance models.  ...  Paradoxically, it was the unigram model that exhibited this association more than the positional relevance model.  ...  is the set of documents pseudo-relevant or relevant to query Q, and D is a document in RQ.  ... 
doi:10.1145/2407085.2407102 dblp:conf/adcs/SymondsBZST12 fatcat:mqc22yodhvcwzegornidy4pn54

Finding additional semantic entity information for search engines

Jun Hou, Richi Nayak, Jinglan Zhang
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
It focuses on retrieving a list of entities or information about the specific entities instead of documents.  ...  free text documents.  ...  F1 is computed based on label precision and recall.  ... 
doi:10.1145/2407085.2407101 dblp:conf/adcs/HouNZ12 fatcat:wk2cxf3dlvcphgmvrnezwegmwe

Multi-aspect group formation using facility location analysis

Mahmood Neshati, Hamid Beigy, Djoerd Hiemstra
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
Our experiments on a real dataset show significant improvement in comparison with the state-of-the-art approaches for the team formation problem.  ...  Each task needs a diverse set of skills and the group of assigned experts should be able to collectively cover all required aspects of the task.  ...  Let … be the set of expertise documents (i.e. each document is the expertise document of one reviewer); the log likelihood of the expertise document collection according to PLSA is given by equation (1).  ... 
doi:10.1145/2407085.2407094 dblp:conf/adcs/NeshatiBH12 fatcat:3i2wilrfoncufobm56klmicbk4

Efficient indexing algorithms for approximate pattern matching in text

Matthias Petri, J. Shane Culpepper
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
Approximate pattern matching is an important computational problem with a wide variety of applications in Information Retrieval.  ...  Our approach relies on a new variant of the Context Bound Burrows-Wheeler Transform (k-BWT), referred to as the Variable Depth Burrows-Wheeler Transform (v-BWT).  ...  This work was supported in part by the  ... 
doi:10.1145/2407085.2407087 dblp:conf/adcs/PetriC12 fatcat:n5wzkfhvxrhwdmaxr7brmc4xxi

Effects of spam removal on search engine efficiency and effectiveness

Matt Crane, Andrew Trotman
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
In this paper we investigate the effect that withholding documents identified as spam has on the resources required to process large collections.  ...  We find that by removing spam at indexing time we are able to decrease the index size without affecting the indexing throughput, and are able to improve search precision for some thresholds.  ...  document is negligible when compared to the time needed to index that document.  ... 
doi:10.1145/2407085.2407086 dblp:conf/adcs/CraneT12 fatcat:rpopemgedfd4xiv6tktp2j335e

An English-translated parallel corpus for the CJK Wikipedia collections

Ling-Xiang Tang, Shlomo Geva, Andrew Trotman
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
This document collection is named CJK2E Wikipedia XML corpus.  ...  Furthermore, the translated CJK articles could be used to further expand the current coverage of the English Wikipedia.  ...  For each orphaned topic article, all possible n-gram substrings from the document were first computed.  ... 
doi:10.1145/2407085.2407099 dblp:conf/adcs/TangGT12 fatcat:rchf2w7imjem3jyk7mkjufxrwa

Exploiting medical hierarchies for concept-based information retrieval

Guido Zuccon, Bevan Koopman, Anthony Nguyen, Deanne Vickers, Luke Butt
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
In Proceedings of the Seventeenth Australasian Document Computing Symposium,  ...  Because the absolute number of judged documents per topic is small, the computation of metrics such as MAP, nDCG, etc. is not meaningful.  ...  Documents are scored according to (1) the weight of query concepts in a document, and (2) the weight of concepts in a document that have been subsumed by a query concept.  ... 
doi:10.1145/2407085.2407100 dblp:conf/adcs/ZucconKNVB12 fatcat:stemnqrljjdjnki2h5sgieteqm

Graph-based concept weighting for medical information retrieval

Bevan Koopman, Guido Zuccon, Peter Bruza, Laurianne Sitbon, Michael Lawley
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
Medical concepts are extracted from free-text documents using a state-of-the-art technique that maps n-grams to concepts from the SNOMED CT medical ontology.  ...  This paper presents a graph-based method to weight medical concepts in documents for the purposes of information retrieval.  ...  The second component, w(t, d), is the weight of the term in the document.  ... 
doi:10.1145/2407085.2407096 dblp:conf/adcs/KoopmanZBSL12 fatcat:dt5nvii5g5aw7mxn2c3tvef224

Reordering an index to speed query processing without loss of effectiveness

David Hawking, Timothy Jones
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
Following Long and Suel, we empirically investigate the importance of document order in search engines which rank documents using a combination of dynamic (query-dependent) and static (query-independent  ...  Since static scores may not be available until all documents have been gathered and indexed, we build a tool for reordering an existing index and show that it operates in less than 20% of the original  ...  The time ratio column shows that achieving the criterion performance level requires twice as much computation for the Reverse and Random orders.  ... 
doi:10.1145/2407085.2407088 dblp:conf/adcs/HawkingJ12 fatcat:2dructr3ivhlfhpsncyaworlee

Comparing scanning behaviour in web search on small and large screens

Jaewon Kim, Paul Thomas, Ramesh Sankaranarayana, Tom Gedeon
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
The reason for this tendency may be difficulties in reading and the relative location of page folds.  ...  The results showed that on a small screen, users spend relatively more time to conduct a search than they do on a large screen, despite tending to look less far ahead beyond the link that they eventually  ...  Even if we do not consider the proportion on the periphery, the proportion of AOIs 1 and 2 was about 4% higher on the small screen, whereas the proportions of all the other AOIs were higher on the large  ... 
doi:10.1145/2407085.2407089 dblp:conf/adcs/KimTSG12 fatcat:7tggkrsjibelxlbhvtpu7a4vli

An attempt to measure the quality of questions in question time of the Australian Federal Parliament

Andrew Turpin
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
A search engine is used to index all answers to questions, and then runs each question as a query, recording the rank of the actual answer in the returned list of documents.  ...  The corpus used for this study is available from the author's web page for further investigations.  ...  The score is the similarity score computed by the Zettair implementation of the Okapi BM25, and has no absolute meaning, but can be compared relatively within a corpus.  ... 
doi:10.1145/2407085.2407098 dblp:conf/adcs/Turpin12 fatcat:lzf3htyqgneppbcxgz3wkwkgzu

Relationship between the nature of the search task types and query reformulation behaviour

Khamsum Kinley, Dian Tjondronegoro, Helen Partridge, Sylvia Edwards
2012 Proceedings of the Seventeenth Australasian Document Computing Symposium (ADCS '12)
One of the important factors that affect a user's query reformulation behaviour is the nature of the search tasks.  ...  In particular, the search task types influenced the manner in which users reformulated the New and Repeat queries.  ...  a participant; Date: the date of the interaction; Time: the duration of the interaction; URL: the URL of the Web site visited; Search Terms: the query terms as entered by the user. This study implemented  ... 
doi:10.1145/2407085.2407091 dblp:conf/adcs/KinleyTPE12 fatcat:mcxp6lic5napxfx43sekngxh6u