3,425 Hits in 4.9 sec

Semiautomatic evaluation of retrieval systems using document similarities

Ben Carterette, James Allan
2007 Proceedings of the sixteenth ACM conference on Conference on information and knowledge management - CIKM '07  
evaluation of retrieval systems.  ...  Taking advantage of the well-known cluster hypothesis that "closely associated documents tend to be relevant to the same request", we can use inter-document similarity to provide more accurate and robust  ...  Acknowledgments This work was supported in part by the Center for Intelligent Information Retrieval and in part by the Defense Ad-  ... 
doi:10.1145/1321440.1321564 dblp:conf/cikm/CarteretteA07 fatcat:dlyn6claynfk7kzebo44xyjwpq

Repeatable evaluation of search services in dynamic environments

Eric C. Jensen, Steven M. Beitzel, Abdur Chowdhury, Ophir Frieder
2007 ACM Transactions on Information Systems  
We propose a semiautomatic evaluation framework to reduce this effort.  ...  In practice it is common to perform shallow evaluations over small numbers of live engines (often pairwise, engine A vs. engine B) without system pooling.  ...  RELATED WORK First, we review evaluation of information retrieval systems on the Web.  ... 
doi:10.1145/1292591.1292592 fatcat:ddaqnhbjfvhlharlukrsi2hqju

Using text search for personal photo collections with the MediAssist system

Neil O'Hare, Cathal Gurrin, Gareth J. F. Jones, Hyowon Lee, Noel E. O'Connor, Alan F. Smeaton
2007 Proceedings of the 2007 ACM symposium on Applied computing - SAC '07  
One mode of user interaction uses automatically extracted features to create text surrogates for photos, which enables text search of photo collections without manual annotation.  ...  The MediAssist system enables organisation and searching of personal digital photo collections based on contextual information, content-based analysis and semi-automatic annotation.  ...  For our future work we plan to conduct user evaluations of this system to test real users ability to formulate text queries and retrieve photos.  ... 
doi:10.1145/1244002.1244195 dblp:conf/sac/OHareGJLOS07 fatcat:zrqu3fev6raxbftgwpsxu3kvha

Ontology based Automatic Module Generation from E-book

Keerthana. R, Jayashree.N.R Jayashree.N.R
2015 International Journal of Computer Applications  
A novel DOM-Sortze is a system that uses natural language processing techniques, heuristic reasoning, and ontology for the semiautomatic construction of the Domain Module from electronic textbooks.  ...  The LDO ontology supports the multilingual representation of the domain topics, and machine translation might be used to get approximate translations of the gathered LOs, used for searching and retrieving  ...  It has been tested with several textbooks written in the Basque language in order to evaluate the automatic construction of Learning Objects [2] .Retrieving and reusing Learning Objects (LOs) can lighten  ... 
doi:10.5120/21271-4044 fatcat:cod5hgbqyvezrko7vliafpuq2a

Cluster Hypothesis in Low-Cost IR Evaluation with Different Document Representations

Kai Hui, Klaus Berberich
2016 Proceedings of the 25th International Conference Companion on World Wide Web - WWW '16 Companion  
Offline evaluation for information retrieval aims to compare the performance of retrieval systems based on relevance judgments for a set of test queries.  ...  Since manual judgments are expensive, selective labeling has been developed to semiautomatically label documents, in the wake of the similarity relationship among retrieved documents.  ...  INTRODUCTION Offline evaluation in information retrieval aims to establish the relative performance of several information retrieval systems based on a set of test queries.  ... 
doi:10.1145/2872518.2889370 dblp:conf/www/HuiB16 fatcat:ruhysprcwnfjplc7h6o3bqr5gu

Recovering traceability links between code and documentation

G. Antoniol, G. Canfora, G. Casazza, A. De Lucia, E. Merlo
2002 IEEE Transactions on Software Engineering  
Software system documentation is almost always expressed informally in natural language and free text.  ...  A premise of our work is that programmers use meaningful names for program items, such as functions, variables, types, classes, and methods.  ...  of the system and the requirement to the code traceability matrix.  ... 
doi:10.1109/tse.2002.1041053 fatcat:44hdgnmz5veqlpiour46e4zsda

A Semantic Model of Selective Dissemination of Information for Digital Libraries

J. M. Morales-del-Castillo, R. Pedraza-Jiménez, E. Peis, E. Herrera-Viedma
2009 Information Technology and Libraries  
Other tools used are fuzzy linguistic modelling techniques (which make possible easing the interaction between users and system) and natural language processing (NLP) techniques for semiautomatic thesaurus  ...  <span>n this paper we present the theoretical and methodological foundations for the development of a multi-agent Selective Dissemination of Information (SDI) service model that applies Semantic Web technologies  ...  22 This tool can also be used with clustering techniques-for example, to group documents of a collection in a set of nodes or clusters, depending on their similarity.  ... 
doi:10.6017/ital.v28i1.3169 fatcat:7iko7ulapnap7owl5pltawpicq

Extraction and visualization of numerical and named entity information from a large number of documents

Masaki Murata, Qing Ma, Kentaro Torisawa, Masakazu Iwatate, Tamotsu Shirado, Koji Ichii, Toshiyuki Kanamaru
2008 2008 International Conference on Natural Language Processing and Knowledge Engineering  
From this perspective, we concluded that our system is useful and convenient for extracting information from a large number of documents. We have constructed a demonstration system.  ...  We have developed a system that can semiautomatically extract numerical and named entity sets from a large number of Japanese documents and can create various kinds of tables and graphs.  ...  about weather or politics from the retrieved documents by using our system.  ... 
doi:10.1109/nlpke.2008.4906795 dblp:conf/nlpke/MurataMTISIK08 fatcat:mcevktdgsreulaugbnouj57b6e

Exploring semi-automatic nugget extraction for Japanese one click access evaluation

Matthew Ekstrand-Abueg, Virgil Pavlu, Makoto Kato, Tetsuya Sakai, Takehiro Yamamoto, Mayu Iwata
2013 Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '13  
Building test collections based on nuggets is useful evaluating systems that return documents, answers, or summaries.  ...  We compare manually-extracted and semiautomatically-extracted Japanese nuggets to demonstrate the coverage and efficiency of the semi-automatic nugget extraction.  ...  The automated system, however, is designed to branch out based on similar information and similar contexts, so it finds a much larger space of information.  ... 
doi:10.1145/2484028.2484153 dblp:conf/sigir/Ekstrand-AbuegPKSYI13 fatcat:u2platilsbfqpk2mvu2uozcgr4

Personalized web search for improving retrieval effectiveness

Fang Liu, C. Yu, Weiyi Meng
2004 IEEE Transactions on Knowledge and Data Engineering  
The user profiles are then used to improve retrieval effectiveness in Web search.  ...  Web search is conducted based on both the user query and the set of categories. Several profile learning and category mapping algorithms and a fusion algorithm are provided and evaluated.  ...  ACKNOWLEDGMENTS A preliminary version of this paper (but not containing the part on how the categories can be used to improve retrieval effectiveness) has been published in the Proceedings of the ACM Conference  ... 
doi:10.1109/tkde.2004.1264820 fatcat:c4ebqptkibhexlv5n3fki4nksm

Capturing and using design rationale

Paul W.H. Chung, René Bañares-Alcántara
1997 Artificial intelligence for engineering design, analysis and manufacturing  
ACKNOWLEDGMENTS The authors thank the authors and reviewers of the papers included in this issue.  ...  types of document as annotations to design objects (similar to the work described in ), • indexing of design and rationale objects with a userdefined set of keywords; this indexing is done semiautomatically  ...  King and Bafiares-Alcantara (1997) propose the (semiautomatic) indexing of design objects for improved retrieval and consistency checking of design rationale structures.  ... 
doi:10.1017/s0890060400001888 fatcat:dl45jllt5va3pgdyaqzooprvra

Ontology based Similarity Measure in Document Ranking

U.K Sridevi., N Nagaveni.
2010 International Journal of Computer Applications  
The retrieval model is based on the importance factors of the structural elements, which are used to re-rank the documents retrieval by the ontology based distance measure.  ...  The relevance concept similarity are combined with the annotation-weighting scheme to improve the relevance measures. The proposed method has been evaluated on USGS Science directory collection.  ...  evaluate the performance of document retrieval.  ... 
doi:10.5120/469-774 fatcat:fvggphmzlrcwpehjttce3opzhy

Composing web services for large-scale tasks

In-Young Ko, R. Neches
2003 IEEE Internet Computing  
Use of document collections.  ...  Using the service broker, the composer suggests appropriate transformations for a document collection set and semiautomatically inserts the next set of services in an application.  ... 
doi:10.1109/mic.2003.1232518 fatcat:4mdoui45lbhcfd7nfphe6rpunq

Nuclear Exports Control System Using Semi-Automatic Keyword Extraction

Uihyun Kim
2014 International Journal of Information and Electronics Engineering  
reasoning system proposed for the retrieval of documents only in the classes where a new export request case is related.  ...  Because the domain of nuclear power is highly specialized and complex, human experts have been utilized to manually evaluate all the documents submitted for export permission, causing the evaluation process  ...  ACKNOWLEDGMENT Research reported in this paper was supported by Korea Institute of Nuclear Nonproliferation and Control (KINAC) and financially supported by Nuclear Safety and Security Commission (NSSC  ... 
doi:10.7763/ijiee.2014.v4.451 fatcat:b6mcmdci5zghzces52zmufo3em

A taxonomy generation tool for semantic visual analysis of large corpus of documents

Belen Carrion, Teresa Onorati, Paloma Díaz, Vasiliki Triga
2019 Multimedia tools and applications  
In this paper, we introduce a semiautomatic taxonomy generation tool for supporting domain experts in building taxonomies that are then used to automatically create semantic visualizations of data.  ...  Lessons learned from this experience will guide the design of a utility evaluation involving domain experts interested in data analysis and knowledge modeling.  ...  I would imagine that most people would learn to use this system very quickly. 7. I found the system very cumbersome to use. 8. I felt very confident using the system. 9.  ... 
doi:10.1007/s11042-019-07880-y fatcat:kimithemjfd5jfsnyae5yzhwny
« Previous Showing results 1 — 15 out of 3,425 results