Integrating clustering and multi-document summarization to improve document understanding
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
Acknowledgements: The work of T. Li is partially supported by National Science Foundation under IIS-0546280, HRD-0317692, and IIP-0450552. ...
Copyright is held by the author/owner(s). CIKM'08, October 26-30, 2008, Napa Valley, California, USA. ACM 978-1-59593-991-3/08/10. ...
Figure 1: Overview of our proposed framework
Table 2: One-sentence summaries formed by our method for the top 4 largest topics in TDT2 corpus. ...
doi:10.1145/1458082.1458319
dblp:conf/cikm/WangZLCG08
fatcat:sdbphoxq4rbhvckze5igmzzuam
Decomposition of terminology graphs for domain knowledge acquisition
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
After multiword term extraction, we apply techniques from text mining and visual analytics in a novel way by integrating symbolic and numeric information to build clusters of domain topics. ...
The graph is then decomposed based on atom graph structure into central (non-decomposable) atom and peripheral atoms. ...
... a graph-based approach using a hierarchical clustering algorithm named CPCL (Classification ...
doi:10.1145/1458082.1458334
dblp:conf/cikm/Ibekwe-SanjuanSV08
fatcat:zrwjvyusqncopkakncvj5skwy4
Natural language retrieval of grocery products
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
We also compare our system against an off-the-shelf retrieval tool, and show that our system is significantly better for top-ranked retrieval results. ...
This work was also supported in part by the IST Programme of the European Community, under the PASCAL network of excellence, IST-2002-506778. The publication only reflects the authors' views. ...
Acknowledgments This work was supported in part by the Finnish Funding Agency for Technology and Innovation TEKES, under the project Personalised Ubiservices in Public Spaces. ...
doi:10.1145/1458082.1458308
dblp:conf/cikm/NurmiLBFKP08
fatcat:ulvrk2biwndjnd3mu7iqwexstm
An extension of PLSA for document clustering
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
In this paper we propose an extension of the PLSA model in which an extra latent variable allows the model to cocluster documents and terms simultaneously. ...
We show on three datasets that our extended model produces statistically significant improvements with respect to two clustering measures over the original PLSA and the multinomial mixture MM models. ...
doi:10.1145/1458082.1458271
dblp:conf/cikm/KimPAG08
fatcat:yumxs52whjanlc7a34wf46qp3y
A survey of pre-retrieval query performance predictors
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
The focus of research on query performance prediction is to predict the effectiveness of a query given a search system and a collection of documents. ...
In particular, pre-retrieval predictors predict the query performance before the retrieval step and are thus independent of the ranked list of results; such predictors base their predictions solely on ...
... on the terms' semantic relationships. ...
doi:10.1145/1458082.1458311
dblp:conf/cikm/HauffHJ08
fatcat:pevhs47wg5ay3ocyqdf2kiu65q
Handling implicit geographic evidence for geographic IR
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
In this paper, we analyze the amount of implicit and explicit geographic evidence in newspaper documents, and measure its impact on geographic information retrieval by evaluating the performance of a retrieval ...
Most geographic information retrieval systems depend on the detection and disambiguation of place names in documents, assuming that the documents with a specific geographic scope contain explicit place ...
This work was jointly funded by the Portuguese government and the European Union ...
doi:10.1145/1458082.1458291
dblp:conf/cikm/CardosoSS08
fatcat:gououkakfjdu7bx26cimocg2bu
Deriving non-redundant approximate association rules from hierarchical datasets
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
Association rule mining plays an important role in knowledge and information discovery. ...
However, there are still shortcomings with the quality of the discovered rules, and often the number of discovered rules is huge and contains redundancies, especially in the case of multi-level datasets. ...
... reliability, it is important to ensure those rules with a high confidence are kept. ...
doi:10.1145/1458082.1458328
dblp:conf/cikm/ShawXG08
fatcat:njxs5mme5bcn5ouzyowwfv6tjq
Yizkor books
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
Prior to our effort, information regarding the content and location of each Yizkor Book volume was limited. ...
Yizkor Book collections contain firsthand commemorative accounts of events from the era surrounding the rise and fall of Nazi Germany, including documents from before, during, and after the Holocaust. ...
doi:10.1145/1458082.1458266
dblp:conf/cikm/SooCFAF08
fatcat:yaufuib6svatdjgr5dytv42zai
In the development of a Spanish MetaMap
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
Our ongoing research is mainly focused on using biomedical concepts for cross-lingual text classification and retrieval. ...
In this context the use of concepts instead of bag of words representation allows us to face text classification tasks abstracting from the language [4]. ...
doi:10.1145/1458082.1458335
dblp:conf/cikm/CarreroCGR08
fatcat:6doerbjz7rd5zgihgryfidkh3e
Mining named entity transliteration equivalents from comparable corpora
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
Two Mining Stages of MINT Method ...
The lack of timestamp information prevented us from running the CoRanking algorithm on the En-Hi language pair. ...
doi:10.1145/1458082.1458313
dblp:conf/cikm/UdupaSKJ08
fatcat:kzxksuoccnc6poaukpyfdcdkvm
Measuring user preference changes in digital libraries
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
We present a study that measures the changes of user preferences based on an analysis of access logs of a large scale digital library over one year. ...
A metric based on the accuracy of predicting future user actions is proposed. ...
... users based on one year's worth of web access logs from a large scale academic digital library. ...
doi:10.1145/1458082.1458353
dblp:conf/cikm/SunLCLG08
fatcat:cufp7xgtsfhdjpzuiplvtt43di
Efficient frequent pattern mining over data streams
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
The CPS-tree introduces the concept of dynamic tree restructuring technique in handling stream data that allows it to achieve highly compact frequency-descending tree structure at runtime and facilitates ...
This paper proposes a prefix-tree structure, called CPS-tree (Compact Pattern Stream tree) that efficiently discovers the exact set of recent frequent patterns from high-speed data stream. ...
... The DSTree for windows 1 and 2 requires ...
doi:10.1145/1458082.1458326
dblp:conf/cikm/TanbeerAJL08
fatcat:fwinusdhgvcfxpxva7izmtfcf4
Exploiting context to detect sensitive information in call center conversations
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
Protecting sensitive information while preserving the shareability and usability of data is becoming increasingly important. ...
In this work, we address the problem of protecting sensitive information in audio recordings and speech transcripts. ...
In order to comply with ...
doi:10.1145/1458082.1458362
dblp:conf/cikm/FaruquieNCS08
fatcat:ve2oeni52zd45bjrkikpibkj6u
On a data set collected from Amazon reviews and online technical specifications, rankings produced by this model rank the best product for a user in the 87th percentile of products in its category, on ...
We first learn a model that can predict the price of a product given automatically-determined features describing technical specifications and users' opinions. ...
... TV? shorter focal length or faster shutter speed?) ...
doi:10.1145/1458082.1458355
dblp:conf/cikm/YatesJPCS08
fatcat:5mahkwnitnf75cimtskg5fbqxi
Representative entry selection for profiling blogs
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
Many applications on blog search and mining often meet the challenge of handling huge volume of blog data, in which one single blog could contain hundreds or even thousands of entries. ...
We suggest blog classification for judging the performance of the proposed entry selection techniques and evaluate their performance on a real blog dataset, in which encouraging results were obtained. ...
We formulated it into a combinational optimization problem based on two principles and proposed an algorithm to solve it by exploiting the theory of submodular functions. ...
doi:10.1145/1458082.1458293
dblp:conf/cikm/ZhuangHSJ08
fatcat:gzlwrytuunblfepev3leq4rxca