Integrating clustering and multi-document summarization to improve document understanding

Dingding Wang, Shenghuo Zhu, Tao Li, Yun Chi, Yihong Gong
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Acknowledgements: The work of T. Li is partially supported by National Science Foundation under IIS-0546280, HRD-0317692, and IIP-0450552.  ...  Copyright is held by the author/owner(s). CIKM'08, October 26-30, 2008, Napa Valley, California, USA. ACM 978-1-59593-991-3/08/10.  ...  Figure 1: Overview of our proposed framework. Table 2: One-sentence summaries formed by our method for the top 4 largest topics in TDT2 corpus.  ... 
doi:10.1145/1458082.1458319 dblp:conf/cikm/WangZLCG08 fatcat:sdbphoxq4rbhvckze5igmzzuam

Decomposition of terminology graphs for domain knowledge acquisition

Fidelia Ibekwe-SanJuan, Eric SanJuan, Michael S.E. Vogeley
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
After multiword term extraction, we apply techniques from text mining and visual analytics in a novel way by integrating symbolic and numeric information to build clusters of domain topics.  ...  The graph is then decomposed based on atom graph structure into central (non-decomposable) atom and peripheral atoms.  ...  a graph-based approach using a hierarchical clustering algorithm named CPCL (Classification  ... 
doi:10.1145/1458082.1458334 dblp:conf/cikm/Ibekwe-SanjuanSV08 fatcat:zrwjvyusqncopkakncvj5skwy4

Natural language retrieval of grocery products

Petteri Nurmi, Eemil Lagerspetz, Wray Buntine, Patrik Floréen, Joonas Kukkonen, Peter Peltonen
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
We also compare our system against an off-the-shelf retrieval tool, and show that our system is significantly better for top-ranked retrieval results.  ...  This work was also supported in part by the IST Programme of the European Community, under the PASCAL network of excellence, IST-2002-506778. The publication only reflects the authors' views.  ...  Acknowledgments This work was supported in part by the Finnish Funding Agency for Technology and Innovation TEKES, under the project Personalised Ubiservices in Public Spaces.  ... 
doi:10.1145/1458082.1458308 dblp:conf/cikm/NurmiLBFKP08 fatcat:ulvrk2biwndjnd3mu7iqwexstm

An extension of PLSA for document clustering

Young-Min Kim, Jean-François Pessiot, Massih Reza Amini, Patrick Gallinari
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
In this paper we propose an extension of the PLSA model in which an extra latent variable allows the model to cocluster documents and terms simultaneously.  ...  We show on three datasets that our extended model produces statistically significant improvements with respect to two clustering measures over the original PLSA and the multinomial mixture MM models.  ... 
doi:10.1145/1458082.1458271 dblp:conf/cikm/KimPAG08 fatcat:yumxs52whjanlc7a34wf46qp3y

A survey of pre-retrieval query performance predictors

Claudia Hauff, Djoerd Hiemstra, Franciska de Jong
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
The focus of research on query performance prediction is to predict the effectiveness of a query given a search system and a collection of documents.  ...  In particular, pre-retrieval predictors predict the query performance before the retrieval step and are thus independent of the ranked list of results; such predictors base their predictions solely on  ...  on the terms' semantic relationships.  ... 
doi:10.1145/1458082.1458311 dblp:conf/cikm/HauffHJ08 fatcat:pevhs47wg5ay3ocyqdf2kiu65q

Handling implicit geographic evidence for geographic IR

Nuno Cardoso, Mário J. Silva, Diana Santos
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
In this paper, we analyze the amount of implicit and explicit geographic evidence in newspaper documents, and measure its impact on geographic information retrieval by evaluating the performance of a retrieval  ...  Most geographic information retrieval systems depend on the detection and disambiguation of place names in documents, assuming that the documents with a specific geographic scope contain explicit place  ...  This work was jointly funded by the Portuguese government and the European Union  ... 
doi:10.1145/1458082.1458291 dblp:conf/cikm/CardosoSS08 fatcat:gououkakfjdu7bx26cimocg2bu

Deriving non-redundant approximate association rules from hierarchical datasets

Gavin Shaw, Yue Xu, Shlomo Geva
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Association rule mining plays an important role in knowledge and information discovery.  ...  However, there are still shortcomings with the quality of the discovered rules, and often the number of discovered rules is huge and contains redundancies, especially in the case of multi-level datasets.  ...  reliability, it is important to ensure those rules with a high confidence are kept.  ... 
doi:10.1145/1458082.1458328 dblp:conf/cikm/ShawXG08 fatcat:njxs5mme5bcn5ouzyowwfv6tjq

Yizkor books

Jason J. Soo, Rebecca J. Cathey, Ophir Frieder, Michlean J. Amir, Gideon Frieder
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Prior to our effort, information regarding the content and location of each Yizkor Book volume was limited.  ...  Yizkor Book collections contain firsthand commemorative accounts of events from the era surrounding the rise and fall of Nazi Germany, including documents from before, during, and after the Holocaust.  ...  technology.  ... 
doi:10.1145/1458082.1458266 dblp:conf/cikm/SooCFAF08 fatcat:yaufuib6svatdjgr5dytv42zai

In the development of a Spanish MetaMap

Francisco M. Carrero, José Carlos Cortizo, José María Gómez, Manuel de Buenaga
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Our ongoing research is mainly focused on using biomedical concepts for cross-lingual text classification and retrieval.  ...  In this context the use of concepts instead of bag of words representation allows us to face text classification tasks abstracting from the language [4].  ... 
doi:10.1145/1458082.1458335 dblp:conf/cikm/CarreroCGR08 fatcat:6doerbjz7rd5zgihgryfidkh3e

Mining named entity transliteration equivalents from comparable corpora

Raghavendra Udupa, K. Saravanan, A. Kumaran, Jagadeesh Jagarlamudi
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Two Mining Stages of MINT Method  ...  The lack of timestamp information prevented us from running the CoRanking algorithm on the En-Hi language pair.  ... 
doi:10.1145/1458082.1458313 dblp:conf/cikm/UdupaSKJ08 fatcat:kzxksuoccnc6poaukpyfdcdkvm

Measuring user preference changes in digital libraries

Yang Sun, Huajing Li, Isaac G. Councill, Wang-Chien Lee, C. Lee Giles
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
We present a study that measures the changes of user preferences based on an analysis of access logs of a large scale digital library over one year.  ...  A metric based on the accuracy of predicting future user actions is proposed.  ...  users based on one year's worth of web access logs from a large scale academic digital library.  ... 
doi:10.1145/1458082.1458353 dblp:conf/cikm/SunLCLG08 fatcat:cufp7xgtsfhdjpzuiplvtt43di

Efficient frequent pattern mining over data streams

Syed Khairuzzaman Tanbeer, Chowdhury Farhan Ahmed, Byeong-Soo Jeong, Young-Koo Lee
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
The CPS-tree introduces the concept of dynamic tree restructuring technique in handling stream data that allows it to achieve highly compact frequency-descending tree structure at runtime and facilitates  ...  This paper proposes a prefix-tree structure, called CPS-tree (Compact Pattern Stream tree) that efficiently discovers the exact set of recent frequent patterns from high-speed data stream.  ...  The DSTree for windows 1 and 2 requires  ... 
doi:10.1145/1458082.1458326 dblp:conf/cikm/TanbeerAJL08 fatcat:fwinusdhgvcfxpxva7izmtfcf4

Exploiting context to detect sensitive information in call center conversations

Tanveer A. Faruquie, Sumit Negi, Anup Chalamalla, L. Venkata Subramaniam
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Protecting sensitive information while preserving the shareability and usability of data is becoming increasingly important.  ...  In this work, we address the problem of protecting sensitive information in audio recordings and speech transcripts.  ...  In order to comply with  ... 
doi:10.1145/1458082.1458362 dblp:conf/cikm/FaruquieNCS08 fatcat:ve2oeni52zd45bjrkikpibkj6u

SHOPSMART

Alexander Yates, James Joseph, Ana-Maria Popescu, Alexander D. Cohn, Nick Sillick
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
On a data set collected from Amazon reviews and online technical specifications, rankings produced by this model rank the best product for a user in the 87th percentile of products in its category, on  ...  We first learn a model that can predict the price of a product given automatically-determined features describing technical specifications and users' opinions.  ...  TV? shorter focal length or faster shutter speed?)  ... 
doi:10.1145/1458082.1458355 dblp:conf/cikm/YatesJPCS08 fatcat:5mahkwnitnf75cimtskg5fbqxi

Representative entry selection for profiling blogs

Jinfeng Zhuang, Steven C.H. Hoi, Aixin Sun, Rong Jin
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Many applications on blog search and mining often meet the challenge of handling huge volume of blog data, in which one single blog could contain hundreds or even thousands of entries.  ...  We suggest blog classification for judging the performance of the proposed entry selection techniques and evaluate their performance on a real blog dataset, in which encouraging results were obtained.  ...  We formulated it into a combinational optimization problem based on two principles and proposed an algorithm to solve it by exploiting the theory of submodular functions.  ... 
doi:10.1145/1458082.1458293 dblp:conf/cikm/ZhuangHSJ08 fatcat:gzlwrytuunblfepev3leq4rxca