Filters








17,680 Hits in 2.9 sec

Exploiting clustering and phrases for context-based information retrieval

Peter G. Anick, Shivakumar Vaithyanathan
1997 Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '97  
This paper explores exploiting the synergy between document clustering and phrasal analysis for the purpose of automatically constructing a corrrex~-busedretrieval system.  ...  The I%zraphrase interface, running over a database of business-related news articles, is used to illustrate the advantages of such a context-based retrieval paradigm.  ...  Acknowledgments The authors wish to thank Bob Travis for his help and encouragement, Gregg Cooke for the use of his tagger, Chris Hill for his support in testing the clustering algorithm on computer troubleshooting  ... 
doi:10.1145/258525.258601 dblp:conf/sigir/AnickV97 fatcat:by74bolnardwrknwdmqeg6s6h4

Similar Document Search and Recommendation

Vidhya Govindaraju, Krishnan Ramanathan
2012 Journal of Emerging Technologies in Web Intelligence  
Our method is based on identifying key phrases from the input document. The key phrases are used to query a search engine and the results are evaluated for similarity to the original document.  ...  Our method is based on identifying key phrases from the input document. The key phrases are used to query a search engine and the results are evaluated for similarity to the original document.  ...  This method is close to ours except that they exploit the Wikipedia information to filter redundant phrases. In [14] , clustering based unsupervised key phrase extraction algorithm is presented.  ... 
doi:10.4304/jetwi.4.1.84-93 fatcat:yom5lbmh25c4bau7d7ux436ky4

Query expansion by mining user logs

Hang Cui, Ji-Rong Wen, Jian-Yun Nie, Wei-Ying Ma
2003 IEEE Transactions on Knowledge and Data Engineering  
Index Terms-Query expansion, user log, probabilistic model, information retrieval, search engine.  ...  In this study, we propose a new method for query expansion based on user interactions recorded in user logs.  ...  Beeferman and Berger [2] exploited "clickthrough data" in clustering URLs and queries using graph-based iterative clustering technique. Wen et al.  ... 
doi:10.1109/tkde.2003.1209002 fatcat:7t4bi3lljjbczaqbyy3pgsixqi

Toward a higher-level visual representation for object-based image retrieval

Yan-Tao Zheng, Shi-Yong Neo, Tat-Seng Chua, Qi Tian
2008 The Visual Computer  
We propose a higher-level visual representation, visual synset, for object-based image retrieval beyond visual appearances.  ...  Second, to bridge the visual appearance difference or to achieve better intra-class invariance power, the approach clusters visual words and phrases into visual synset, based on their class probability  ...  The visual synset is therefore a probabilistic relevance-consistent cluster of visual phrases, which is learned by Information Bottleneck based distributional clustering.  ... 
doi:10.1007/s00371-008-0294-0 fatcat:mtmoj5gysrb43e32jffbcvdsfy

Effective logo retrieval with adaptive local feature selection

Jianlong Fu, Jinqiao Wang, Hanqing Lu
2010 Proceedings of the international conference on Multimedia - MM '10  
Towards building a practical large-scale logo retrieval system, we propose a novel approach to extract and combine local features for effective logo retrieval.  ...  Then we divide logos into several groups according to local feature type based on which feature can model the logo best and naming as "Point-type", "Shape-type" and "Patch-type".  ...  To exploit the shape information, sketches between spatial related strokes [5] and shape context method [6] are employed.  ... 
doi:10.1145/1873951.1874126 dblp:conf/mm/FuWL10 fatcat:7scqcbtptjh4zaeccwxhqtcnpi

Exploiting Neural Embeddings for Social Media Data Analysis

Sadid A. Hasan, Yuan Ling, Joey Liu, Oladimeji Farri
2015 Text Retrieval Conference  
We submitted six runs for two tasks related to real-time filtering by using various Information Retrieval (IR), and Machine Learning (ML) techniques to analyze the Twitter sample live stream and match  ...  In this paper, we describe our microblog realtime filtering system developed and submitted for the Text Retrieval Conference (TREC 2015) microblog track.  ...  We exploited the strength of neural word and phrase embeddings in extending the context of the underlying user interest profiles for our microblog real-time filtering system.  ... 
dblp:conf/trec/HasanLLF15a fatcat:eq2fhpolpzdnxeyl2jyzcsnera

Image Description Mining and Hierarchical Clustering on Data Records Using HR-Tree [chapter]

Cong-Le Zhang, Sheng Huang, Gui-Rong Xue, Yong Yu
2006 Lecture Notes in Computer Science  
Finally, new hierarchical clustering algorithm based on HR-Tree is proposed for users' browsing conveniently. We demonstrate some HR-Trees and clustering results in experimental section..  ...  In this paper, Hierarchical Representation (HR) and HR-Tree are proposed for image description.  ...  Several approaches [1, 2] have been reported for image retrieval and clustering.  ... 
doi:10.1007/11610113_34 fatcat:6omrps7xp5bqbjkolnfk4pj5p4

Exploiting internal and external semantics for the clustering of short texts using world knowledge

Xia Hu, Nan Sun, Chao Zhang, Tat-Seng Chua
2009 Proceeding of the 18th ACM conference on Information and knowledge management - CIKM '09  
semantic knowledge bases -Wikipedia and WordNet.  ...  In this paper, we propose a novel framework to improve the performance of short text clustering by exploiting the internal semantics from the original text and external concepts from world knowledge.  ...  They do not provide sufficient word cooccurrence or context shared information for effective similarity measure [23] , which is the basis of clustering methods [15] .  ... 
doi:10.1145/1645953.1646071 dblp:conf/cikm/HuSZC09 fatcat:je6zmajetbbg7or72m2poajlcu

Using big data to support automatic Word Sense Disambiguation

Giovanni Simonini, Francesco Guerra
2014 2014 International Conference on High Performance Computing & Simulation (HPCS)  
The sense inventory is built extracting insight from Big Data exploiting a community detection algorithm.  ...  Since generate taking into account large corpora of data, the iSC is independent of the domain of application and of predefined target words.  ...  If we were able to build a generic Index of Sense Clusters, where for each word we can retrieve all possible sense clusters from which it belongs and independently of a corpus (e.g. the retrieved document  ... 
doi:10.1109/hpcsim.2014.6903701 dblp:conf/hpcs/SimoniniG14 fatcat:z64cfhmdzbcyheqrjyeioxuhei

CroVeWA: Crosslingual Vector-Based Writing Assistance

Hubert Soyer, Goran Topić, Pontus Stenetorp, Akiko Aizawa
2015 Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations  
provide novel functionality like the visualization of semantic relationships between phrases interlingually and intralingually.  ...  By employing crosslingually constrained vector space models to represent phrases, our system naturally sidesteps several difficulties that would arise from direct word-to-text matching, and is able to  ...  Acknowledgements This work was supported by the Data Centric Science Research Commons Project at the Research Organization of Information and Systems and by the Japan Society for the Promotion of Science  ... 
doi:10.3115/v1/n15-3019 dblp:conf/naacl/SoyerTSA15 fatcat:gnceffmyu5ehbonxnbvdzp5vla

An Efficient Approach For Semantically-Enhanced Document Clustering By Using Wikipedia Link Structure

Iyad AlAgha, Rami Nafee
2014 International Journal of Artificial Intelligence & Applications  
Second, it is more time efficient as it applies an algorithm for phrase extraction from documents prior to matching terms with Wikipedia.  ...  This paper presents a new approach to enhance document clustering by exploiting the semantic knowledge contained in Wikipedia.  ...  F-score combines the information of precision and recall which is extensively applied in information retrieval.  ... 
doi:10.5121/ijaia.2014.5605 fatcat:adhak2f6gbgcnf22wfkflijaom

Impact analysis of keyword extraction using contextual word embedding

Muhammad Qasim Khan, Abdul Shahid, M. Irfan Uddin, Muhammad Roman, Abdullah Alharbi, Wael Alosaimi, Jameel Almalki, Saeed M. Alshahrani
2022 PeerJ Computer Science  
These descriptive phrases make it easier for algorithms to find relevant information quickly and efficiently.  ...  For example, simply indicating the previous or next word of the phrase of interest might be used to describe the context of a phrase.  ...  EmbedRank (Bennani-Smires et al., 2018) retrieves candidate phrases based on POS.  ... 
doi:10.7717/peerj-cs.967 pmid:35721401 pmcid:PMC9202614 fatcat:4jjq3jqtm5hvfpw6wl7eaj66pe

Aligning codebooks for near duplicate image detection

Sebastiano Battiato, Giovanni Maria Farinella, Giovanni Puglisi, Daniele Ravì
2013 Multimedia tools and applications  
., Bags of Visual Phrases) in order to exploit the coherence between different feature spaces.  ...  Also we introduce a novel image database specifically designed for the development and benchmarking of near duplicate image retrieval techniques.  ...  Acknowledgments The authors would like to thank Giuseppe Claudio Guarnera, Tony Meccio and Rosetta Rizzo for their invaluable help.  ... 
doi:10.1007/s11042-013-1470-4 fatcat:pirmvrf4hjbbrjegbcic2c463m

Integrating visual and semantic contexts for topic network generation and word sense disambiguation

Jianping Fan, Hangzai Luo, Yi Shen, Chunlei Yang
2009 Proceeding of the ACM International Conference on Image and Video Retrieval - CIVR '09  
for addressing the issues of polysemes and synonyms more effectively, thus it can significantly improve the precision and recall rates for image retrieval.  ...  similarity contexts between their tags for topic network generation and word sense disambiguation.  ...  By exploiting multiple cross-modal information sources for cross-modal image clustering, our algorithm can address the issue of polysemes more effectively and result in a higher precision rate for image  ... 
doi:10.1145/1646396.1646440 dblp:conf/civr/FanLSY09 fatcat:dvczr67ddnenjhp3w3reurc25u

Text Analytics in Social Media [chapter]

Xia Hu, Huan Liu
2012 Mining Text Data  
The rapid growth of online social media in the form of collaborativelycreated content presents new opportunities and challenges to both producers and consumers of information.  ...  With the large amount of data produced by various social media services, text analytics provides an effective way to meet usres' diverse information needs.  ...  Acknowledgments This work is, in part, supported by the grants NSF (#0812551), ONR (N000141010091) and AFOSR (FA95500810132).  ... 
doi:10.1007/978-1-4614-3223-4_12 fatcat:ynmfabrhpjf6vils663o3rs2za
« Previous Showing results 1 — 15 out of 17,680 results