Block-based web search

Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma
2004 Proceedings of the 27th annual international conference on Research and development in information retrieval - SIGIR '04  
Experiments on block-level query expansion and retrieval are performed. Among the four approaches, the combined method achieves the best performance for web search.  ...  up the performance of current web search engines.  ...  We believe such a block-level analysis of web pages will have the opportunity to significantly enhance the performance of existing commercial search engines.  ... 
doi:10.1145/1008992.1009070 dblp:conf/sigir/CaiYWM04 fatcat:kegscgg5lnfptgjnsfpepjubki

Efficient Browsing of Web Search Results on Mobile Devices Based on Block Importance Model

Xing Xie, Gengxin Miao, Ruihua Song, Ji-Rong Wen, Wei-Ying Ma
Third IEEE International Conference on Pervasive Computing and Communications  
In this paper, a block importance model is employed to assign importance values to different segments of a web page, in order to extract and present more condensed search results to mobile users.  ...  Based on the block importance model, three presentations for displaying the result pages in different levels of detail have been designed to reduce both the number of user interactions and the overall  ...  Search interface other than a web based interface is also an interesting topic to explore. We will continue to investigate these directions in the future.  ... 
doi:10.1109/percom.2005.16 dblp:conf/percom/XieMSWM05 fatcat:2qnhrcy7rzc6zllorsyrv65of4

ISOLATING INFORMATIVE BLOCKS FROM LARGE WEB PAGES USING HTML TAG PRIORITY ASSIGNMENT BASED APPROACH

Rasel Kabir, Shaily Kabir, Shamiul Amin
2015 Zenodo  
Searching useful information from the web, a popular activity, often involves huge irrelevant contents or noises leading to difficulties in extracting useful information.  ...  This assignment process gives a priority value to each block which helps rank the overall search results in online searching.  ...  This calculation assigns a priority value to each block which helps rank the overall search results in online searching.  ... 
doi:10.5281/zenodo.3592312 fatcat:gl3hrmb7fbg2lmfhdaic6xmhzq

Search Optimization using Context based Search

Nidhi Jain, Paramjeet Rawat
2014 International Journal of Computer Applications  
After that will accept the keywords to be searched and make search more concise by pruning the unwanted data and display the results based upon that along with its related context with the help of tree  ...  The future of web is a structured semantic web in place of unstructured information present in the web nowadays. On semantic web, ontology is used to assign meaning to the content of the web.  ...  It is a client-server based architecture that allows a user to initiate search by providing keywords to a search engine, which in turn collects and returns the desired web pages from the web.  ... 
doi:10.5120/15317-3616 fatcat:ucg4mbkcynfl3gv52rkzhczjle

Hierarchical clustering of WWW image search results using visual, textual and link information

Deng Cai, Xiaofei He, Zhiwei Li, Wei-Ying Ma, Ji-Rong Wen
2004 Proceedings of the 12th annual ACM international conference on Multimedia - MULTIMEDIA '04  
By using a vision-based page segmentation algorithm, a web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image  ...  We consider the problem of clustering Web image search results. Generally, the image search results returned by an image search engine contain multiple topics.  ...  Page Layout Analysis for Web Search Most of previous web-based applications [4] [17] [18] regard web pages as information units.  ... 
doi:10.1145/1027527.1027747 dblp:conf/mm/CaiHLMW04 fatcat:kj5ssskrgbdqtaltxt4gby6uzu

Semantic Based Text Block Segmentation Using WordNet

Nyein Myint Myint Aung, Su Su Maung
2013 International Journal of Computer and Communication Engineering  
Web pages are segmented into blocks for getting text around the image, which will later be used in retrieval process based on user query keywords.  ...  Text block segmentation plays an important role in image search systems.  ...  TEXT BLOCK SEGMENTATION Text block segmentation is the basic process of web browsing and image search system.  ... 
doi:10.7763/ijcce.2013.v2.257 fatcat:ukx7xonicnesdgoclcybtozvca

Web Phishing Detection Based on Page Spatial Layout Similarity

Weifeng Zhang, Hua Lu, Baowen Xu, Hongji Yang
2013 Informatica (Ljubljana, Tiskana izd.)  
In particular, we develop two different options to extract the spatial layout features as rectangle blocks from a given web page.  ...  One of the keys of phishing detection is to efficiently search the legitimate web page library and to find those pages that are the most similar to a suspicious phishing page.  ...  In particular, analyzing search logs discloses the search terms commonly used by attackers, which indicate vulnerable web sites. There also exists similarity based anti-phishing.  ... 
dblp:journals/informaticaSI/ZhangLXY13 fatcat:df56kgrapzauvi2okqfbsnc6ne

From relevance to intelligence

Wei-Ying Ma
2005 Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval - MIR '05  
Has Many Other Rich Structures Wei-Ying Ma, Microsoft Research Asia Using Block-level PageRank to Improve Search Vision-based Approach for Web Object Extraction • The Problem • Our Solution based on Extended  ...  PageRank (SIGIR'04) PageRank Block-level PageRank Search = α * IR_Score + (1-α) * PageRank α Web Search 2.0 Web Search 3.0 Current Web Search Page-Level General Search Vertical  ... 
doi:10.1145/1101826.1101827 dblp:conf/mir/Ma05 fatcat:ezgdvrytlzg2zmrjx2opvy5t6m

Clustering and searching WWW images using link and page layout analysis

Xiaofei He, Deng Cai, Ji-Rong Wen, Wei-Ying Ma, Hong-Jiang Zhang
2007 ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)  
By using a vision-based page segmentation algorithm, a Web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image  ...  This article describes iFind, a system for clustering and searching WWW images.  ...  PicASHOW is based on a Web page search engine.  ... 
doi:10.1145/1230812.1230816 fatcat:t4e3mzpgsndulgumqn66pfoxum

A Meta-Heuristic Approach for Dynamic Data Allocation on a Multiple Web Server System

Masaki KOHANA, Shusuke OKAMOTO, Atsuko IKEGAMI
2013 IEICE transactions on information and systems  
An evaluation for a web-based MORPG system with our tabu search shows that it could achieve 420 users capacity while 320 for our previous system. key words: web-based application, dynamic data allocation  ...  It uses multiple web servers and divides the entire game world into small blocks. Each ownership of block is allocated to a web server.  ...  Real System We confirmed the advantage of our tabu search algorithm from the simulation based experiment. Thus, we build this algorithm into our web-based MORPG system.  ... 
doi:10.1587/transinf.e96.d.2645 fatcat:nxmluzkf2zcdxhokz7p2iqoyla

Changing Vision for Access to Web Archives

Zeynep Pehlivan, Anne Doucet, Stéphane Gançarski
2011 The Web Conference  
These queries can not be performed by currently proposed access methods: wayback machine, full-text search and navigation.  ...  A web page, identified with an url, is a set of concrete blocks and a web site is a set of pages. Pages and sites are generated dynamically by manipulating concrete blocks when needed.  ...  Block-Based Search: Today, web pages contain various topics.  ... 
dblp:conf/www/PehlivanDG11 fatcat:xrokgfshevdlrhtyqzp6nuh7x4

Knowledge base Construction using Hidden Web Retrieval Technique

Shrina Patel, Amit Ganatar
2015 International Journal of Computer Applications  
and precisely based on their visual features Which hidden web source do we intend at the information indispensable to access the data at the back web form and the type of interface.  ...  We proposed algorithm narrative vision based page segmentation (NVIPS) and also comparison DOM tree, VIPS.  ...  After we have every separators, we allocate the weights base on visual dissimilarity of neighboring blocks.  ... 
doi:10.5120/20025-2078 fatcat:lyvyso2fnnd7nbne2adelnyvkq

Identifying Informative Web Content Blocks using Web Page Segmentation

Stevina Dias, Jayant Gadge
2014 International Journal of Applied Information Systems  
In the proposed technique, the extraction of informative content blocks and elimination of non informative blocks is based on the idea of Web page Segmentation.  ...  The proposed approach saves significant space and time Keywords Search engine, information extraction, web content mining, web segmentation, repetition detection, Informative blocks, non-informative blocks  ...  As only unique articles are returned this will improve search results. FeatureExtractor uses heuristics based upon the occurrence of certain features to identify content blocks.  ... 
doi:10.5120/ijais14-451129 fatcat:g3cjy4v6lzh7zm6ugmo6nfupfq

Adapted Web Crawler for Mining Offline Web Data using AFHC

S. Amudha
2013 International Journal of Computer Applications  
Search engine using extended cocitation algorithm to retrieve accurate content in the local disk and search based on any word, all word and phrase matching in the local disk.  ...  The download web page information searched in offline browsing and avoids the repeated searches in the web server to give a solution to problem.  ...  based on URL.  ... 
doi:10.5120/14030-1398 fatcat:7ohcy6ff4vczdeatkof7jv6v5m

Discovering informative content blocks from Web documents

Shian-Hua Lin, Jan-Ming Ho
2002 Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '02  
Based on the answer set generated from 13 manually tagged news Web sites with a total of 26,518 Web pages, experiments show that both recall and precision rates are greater than 0.956.  ...  Our system, InfoDiscoverer, first partitions a page into several content blocks according to HTML tag in a Web page.  ...  Extracting Content Blocks from a Page Based on DOM, a Web page can be parsed and represented with a tree structure, in which leaf nodes contains content or anchor texts.  ... 
doi:10.1145/775107.775134 fatcat:rzsfpxpujzhlrknnh2u6h7eyja