Filters








3,127 Hits in 6.3 sec

Enhanced Crawler with Multiple Search Techniques using Adaptive Link-Ranking and Pre-Query Processing

Suchetadevi M. Gaikwad, Sanjay B. Thakare
<span title="2016-08-24">2016</span> <i title="Computer Science Laboratory Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/gjt25qn2areqrcmd5d7espafgy" style="color: black;">Circulation in Computer Science</a> </i> &nbsp;
In second stage, enhanced crawler achieves quick in site browsing by fetching most relevant links with associate degree of reconciling link ranking.  ...  However, because of huge volume and varying nature of deep-web, achieving wide coverage and high efficiency is difficult issue.  ...  ACKNOWLEDGMENTS The authors would like to thank the researchers as well as publishers for making their resources available and teachers of RSCOE, Computer Engineering for their guidance.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.22632/ccs-2016-251-24">doi:10.22632/ccs-2016-251-24</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/nxtgm7lvlfcwdnqkcfkamctiju">fatcat:nxtgm7lvlfcwdnqkcfkamctiju</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180721155126/http://www.ccsarchive.org/articles/volume1/number1/ccs-2016-251-24.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/a3/29/a329729122f63d792ae7b35c953dabd457ade24e.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.22632/ccs-2016-251-24"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Blackhat Search Engine Optimization Techniques (SEO) and Counter Measures

R. D. Gaharwar, D. B. Shah
<span title="2018-11-19">2018</span> <i title="Technoscience Academy"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/zondmso2kjfrzbdtowoynb6yoe" style="color: black;">International Journal of Scientific Research in Science and Technology</a> </i> &nbsp;
Hence web space users or website developers should be well aware of SEO techniques and how to use them in optimal way.  ...  As a service provider who uses internet for digital marketing it becomes mandatory to get high ranks from search engines. Search engines optimization (SEO) techniques are used for this purpose.  ...  URL redirection Some sites contain the web pages with higher ranks just to redirect the user to some other web pages with less rank.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.32628/ijsrst1840117">doi:10.32628/ijsrst1840117</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/z7kg4vk4x5eghbo2gaga3t6zem">fatcat:z7kg4vk4x5eghbo2gaga3t6zem</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200213162439/http://ijsrst.com/paper/4907.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/85/f2/85f25e3f005ab456903df73b290b77c5ff642dc4.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.32628/ijsrst1840117"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Scalable Framework for Locating Deep Web Entry Points

John Onihunwa, Olufade Onifade, Isaac Ariyo, Stephen Omotugba, Deji Joshua
<span title="">2017</span> <i title="IOSR Journals"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vabuspdninc75epczdurccts4u" style="color: black;">IOSR Journal of Computer Engineering</a> </i> &nbsp;
find it, Frontier:it consist of URL and Crawling modules used to crawl the web pages to extract the meta data,surface web: information that common search engines like Google, Yahoo, AltaVista, Ask, and  ...  Crawler: a program that systematically browse the world wide web in order to create an index of data, deep web: information buried far down on dynamically generated sites, and standard search engines never  ...  hidden web.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.9790/0661-1902034555">doi:10.9790/0661-1902034555</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/pil7lsutm5fp3dmn4otqewfgbq">fatcat:pil7lsutm5fp3dmn4otqewfgbq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180602080401/http://www.iosrjournals.org/iosr-jce/papers/Vol19-issue2/Version-3/H1902034555.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/be/6a/be6a6b50f4b2f891b0722c387c729cd7ea9a5811.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.9790/0661-1902034555"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

Towards Continuous Web Archiving

Julien Masanès
<span title="">2002</span> <i title="CNRI Acct"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/ugbiirfvufgcjkx33r3cmemcuu" style="color: black;">D-Lib Magazine</a> </i> &nbsp;
Consequently, there is a growing awareness of the need to track and archive Web content.  ...  Identification of such sites could be used to lower crawling or archiving priority for certain types of sites, even though they are very well ranked.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1045/december2002-masanes">doi:10.1045/december2002-masanes</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ugn77my5vzhvlalip4ot7z6v4m">fatcat:ugn77my5vzhvlalip4ot7z6v4m</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200215165443/http://www.dlib.org/dlib/december02/masanes/12masanes.html" title="fulltext access" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [HTML] </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1045/december2002-masanes"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> dlib.org </button> </a>

Creating and exploring web form repositories

Luciano Barbosa, Hoa Nguyen, Thanh Nguyen, Ramesh Pinnamaneni, Juliana Freire
<span title="">2010</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vxrc3vebzzachiwy3nopwi3h5u" style="color: black;">Proceedings of the 2010 international conference on Management of data - SIGMOD &#39;10</a> </i> &nbsp;
DeepPeep allows users to explore the entry points to hidden-Web sites whose contents are out of reach for traditional search engines.  ...  We present DeepPeep (http://www.deeppeep.org), a new system for discovering, organizing and analyzing Web forms.  ...  We also thank Sumit Tandon for his contributions to an early prototype of DeepPeep.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1807167.1807311">doi:10.1145/1807167.1807311</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigmod/BarbosaNNPF10.html">dblp:conf/sigmod/BarbosaNNPF10</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/wl45ukw4hzf5lh2memiu75lbvy">fatcat:wl45ukw4hzf5lh2memiu75lbvy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20110401184200/http://www2.research.att.com/~lbarbosa/publications/sigmod_demo_2010.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/6f/1d/6f1d614e7d65ef6c9ed64b6dcce9fe283f29a45c.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1807167.1807311"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Downloading textual hidden web content through keyword queries

Alexandros Ntoulas, Petros Zerfos, Junghoo Cho
<span title="">2005</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/kw2apmx5ynfyjf6jhs5gzrrx6e" style="color: black;">Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries - JCDL &#39;05</a> </i> &nbsp;
We experimentally evaluate the effectiveness of these policies on 4 real Hidden Web sites and our results are very promising.  ...  For instance, in one experiment, one of our policies downloaded more than 90% of a Hidden Web site (that contains 14 million documents) after issuing fewer than 100 queries.  ...  Unless users go directly to Hidden-Web sites and issue queries there, they cannot access the pages at the sites. • Improving user experience: Even if a user is aware of a number of Hidden-Web sites, the  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1065385.1065407">doi:10.1145/1065385.1065407</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/jcdl/NtoulasZC05.html">dblp:conf/jcdl/NtoulasZC05</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/tqkr45nb5vbo3nsbowr53qzmgq">fatcat:tqkr45nb5vbo3nsbowr53qzmgq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20070302113733/http://oak.cs.ucla.edu:80/~ntoulas/pubs/ntoulas_hidden_web.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d3/77/d37773304a3b70a49771bee1430093ff50997935.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1065385.1065407"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Creating a billion-scale searchable web archive

Daniel Gomes, Miguel Costa, David Cruz, João Miranda, Simão Fontes
<span title="">2013</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/s4hirppq3jalbopssw22crbwwa" style="color: black;">Proceedings of the 22nd International Conference on World Wide Web - WWW &#39;13 Companion</a> </i> &nbsp;
This study contributes with an overview of the lessons learned while developing the Portuguese Web Archive, focusing on web data acquisition, ranking search results and user interface design.  ...  However, users demand efficient and effective search mechanisms to access the already vast collections of historical information held by web archives.  ...  ACKNOWLEDGMENTS We thank the Internet Archive for supplying us web collections from the .PT domain and for its continuous efforts to preserve and grant access to human knowledge.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2487788.2488118">doi:10.1145/2487788.2488118</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/www/GomesCCMF13.html">dblp:conf/www/GomesCCMF13</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/bkl6pofb3bb43iyxir46pinfta">fatcat:bkl6pofb3bb43iyxir46pinfta</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20131107132237/http://xldb.fc.ul.pt/xldb/publications/Gomes.etal:CreatingABillion-Scale:2013_document.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/a7/8a/a78a9c4efb7b9041257c2b71e2e00e6c92033a93.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2487788.2488118"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

How to Build Google2Google – An (Incomplete) Recipe – [chapter]

Wolfgang Nejdl
<span title="">2004</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
The reader has to be aware, though, that many of these ingredients are research questions rather than solutions, and that it needs quite a few more research papers on these aspects before we can really  ...  This talk explores aspects relevant for peer-to-peer search infrastructures, which we think are better suited to semantic web search than centralized approaches.  ...  directly at the site which provides it, without necessarily crawling all of its content again.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-540-30475-3_1">doi:10.1007/978-3-540-30475-3_1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zmvumxe7szgodh63jeh7qfcf44">fatcat:zmvumxe7szgodh63jeh7qfcf44</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20080424114919/http://www.kbs.uni-hannover.de/Arbeiten/Publikationen/2004/google2google.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/f8/29/f829413e4633e246f32f258a6bc6c8c994494521.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-540-30475-3_1"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Conversion of Website Users to Customers-The Black Hat SEO Technique

Rotimi-Williams Bello, Firstman Noah Otobo
<span title="2018-06-29">2018</span> <i title="Advance Academic Publisher"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/morgzaixj5hsjghb6it2id5lgy" style="color: black;">International Journal of Advanced Research in Computer Science and Software Engineering</a> </i> &nbsp;
Having studied and understood white hat SEO, black hat SEO, gray hat SEO, crawling, indexing, processing and retrieving methods used by search engines as a web software program or web based script to search  ...  SEO helps build brand awareness through high rankings, SEO helps circumvent competition, and SEO gives room for high increased return on investment.  ...  First, search engines crawl the web to see what is there. This task is performed by a piece of software, called a crawler or a spider.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.23956/ijarcsse.v8i6.714">doi:10.23956/ijarcsse.v8i6.714</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/w64tepxltja7fdbkcjxtb2c3re">fatcat:w64tepxltja7fdbkcjxtb2c3re</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190430182953/http://ijarcsse.com/index.php/ijarcsse/article/download/714/376" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/20/c3/20c3909e18aa0ece1b6b15e679a3392a3634f204.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.23956/ijarcsse.v8i6.714"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Influence of Mobile-friendly Design to Search Results on Google Search

David Schubert
<span title="">2016</span> <i title="Elsevier BV"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/trfcnrckdvgftcxiz3i6lginty" style="color: black;">Procedia - Social and Behavioral Sciences</a> </i> &nbsp;
This article discusses the impact of mobile-friendliness of the web on mobile search results in Google search engine.  ...  There are also discussed common mistakes which have a direct impact on the low user-friendliness in terms of mobile experience.  ...  Hidden content can be discounted in ranking, but if the content is visible on the desktop version of your site, we can crawl it and use the information for ranking your mobile site as well since we can  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/j.sbspro.2016.05.517">doi:10.1016/j.sbspro.2016.05.517</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/gm5wigx4cjbpzczw7efyojzaiq">fatcat:gm5wigx4cjbpzczw7efyojzaiq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170926155820/http://publisher-connector.core.ac.uk/resourcesync/data/elsevier/pdf/410/aHR0cDovL2FwaS5lbHNldmllci5jb20vY29udGVudC9hcnRpY2xlL3BpaS9zMTg3NzA0MjgxNjMwNjE4OA%3D%3D.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/0f/e3/0fe32fdd902f960a61c819963eec67e192e69b54.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/j.sbspro.2016.05.517"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> elsevier.com </button> </a>

A focused crawler for Dark Web forums

Tianjun Fu, Ahmed Abbasi, Hsinchun Chen
<span title="">2010</span> <i title="Wiley"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/irrfofzyh5dn5jykwm6u5zvb5e" style="color: black;">Journal of the American Society for Information Science and Technology</a> </i> &nbsp;
Despite the need for tools to collect and analyze Dark Web forums, the covert nature of this part of the Internet makes traditional web crawling techniques insufficient for capturing such content.  ...  The unprecedented growth of the Internet has propagated the escalation of the Dark Web, the problematic facet of the web associated with cybercrime, hate, and extremism.  ...  Focused Crawling of the Hidden Web There has been limited focused crawling work on the hidden web.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1002/asi.21323">doi:10.1002/asi.21323</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vqdvbnapgvhwbkjaucj3yr7mve">fatcat:vqdvbnapgvhwbkjaucj3yr7mve</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20121202125949/http://w.icadl.org:80/intranet/papers/Fu-Abbasi_FocusedCrawler_JASIST_preprint.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/ba/90/ba908cd8ab7c6ffb2025bc791798d4b9885252f4.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1002/asi.21323"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> wiley.com </button> </a>

Web Privacy Census

Chris Jay Hoofnagle, Nathan Good
<span title="">2012</span> <i title="Elsevier BV"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/tol7woxlqjeg5bmzadeg6qrg3e" style="color: black;">Social Science Research Network</a> </i> &nbsp;
Highlights • We repeated a 2012 survey of tracking mechanisms such as HTTP cookies, Flash cookies, and HTML5 storage, used by top 25,000 most popular websites • We found that the top 100 most popular sites  ...  would collect over 6,000 HTTP cookies with 83% being third-party cookies Overall summary of results for shallow and deep crawls for the top 100, 1,000 and 25,000 websites Abstract Most people may believe  ...  Methods To answer the questions above, we use a web crawler, a computer program that systematically browses the Internet, to run a crawl on the top 100, 1,000, and 25,000 sites ranked by Quantcast.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.2139/ssrn.2460547">doi:10.2139/ssrn.2460547</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/kaxxdrl73nbvncdqipea37h2ze">fatcat:kaxxdrl73nbvncdqipea37h2ze</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170222204008/https://www.ftc.gov/system/files/documents/public_events/776191/ialtaweelwebpriv_0.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/95/f4/95f40d91c44d7e8eecb4b4542bb74902d75ca7a0.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.2139/ssrn.2460547"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ssrn.com </button> </a>

Efficient Deep Web Crawling Using Reinforcement Learning [chapter]

Lu Jiang, Zhaohui Wu, Qian Feng, Jun Liu, Qinghua Zheng
<span title="">2010</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
Deep web refers to the hidden part of the Web that remains unavailable for standard Web crawlers.  ...  Experimental results show that the method outperforms the state of art methods in terms of crawling capability and breaks through the assumption of full-text search implied by existing methods.  ...  Introduction Deep web or hidden web refers to World Wide Web content that is not part of the surface Web, which is directly indexed by search engines.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-642-13657-3_46">doi:10.1007/978-3-642-13657-3_46</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/3tzz7rwb5na7be62hhpxkaa2ae">fatcat:3tzz7rwb5na7be62hhpxkaa2ae</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170829174817/http://www.cs.cmu.edu/~lujiang/camera_ready_papers/PAKDD_2010.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/92/db/92dbaf21b19ce14dd236993455ddc9a387b66a52.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-642-13657-3_46"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

A Novel Design of Hidden Web Crawler using Ontology
English

Ma nvi, Komal Kumar Bhatia, Ashutosh Dixit
<span title="2015-08-25">2015</span> <i title="Seventh Sense Research Group Journals"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/wczsapkvorbcbfmnmu4k6wonju" style="color: black;">International Journal of Engineering Trends and Technoloy</a> </i> &nbsp;
Deep Web is content hidden behind HTML forms.  ...  Since it represents a large portion of the structured, unstructured and dynamic data on the Web, accessing Deep-Web content has been a long challenge for the database community.  ...  if a user is aware of a number of Hidden-Web sites, the user still has to waste a significant amount of time and effort, visiting all of the potentially relevant sites, querying each of them and exploring  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14445/22315381/ijett-v26p204">doi:10.14445/22315381/ijett-v26p204</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/au6vrmggojczlmnnd3tvoa3yde">fatcat:au6vrmggojczlmnnd3tvoa3yde</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170810073513/http://www.ijettjournal.org/2015/volume-26/number-1/IJETT-V26P204.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/57/3a/573a66d192e6fe24a4eb3c9c28573523923f48d4.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14445/22315381/ijett-v26p204"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

AJAXSearch

Cristian Duda, Gianni Frey, Donald Kossmann, Chong Zhou
<span title="2008-08-01">2008</span> <i title="VLDB Endowment"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p6rqwwpkkjbcldejepcehaalby" style="color: black;">Proceedings of the VLDB Endowment</a> </i> &nbsp;
very numerous events, scalability in the number of events, duplicate elimination of states, result presentation and aggregation, ranking.  ...  They are increasingly frequent on the Web (in YouTube, Amazon, GMail, Yahoo!  ...  Transitions and states are a part of movie retrieval [6] . Searching and Ranking based on structural properties are addressed by XRank [10] or [5] in the context of XML.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/1454159.1454195">doi:10.14778/1454159.1454195</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/dot5gv4xtzayvgorh6qj2fh5rq">fatcat:dot5gv4xtzayvgorh6qj2fh5rq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190309093314/http://pdfs.semanticscholar.org/f71d/f0613ff10d684cbb1b5291a40a1a707df1fa.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/f7/1d/f71df0613ff10d684cbb1b5291a40a1a707df1fa.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/1454159.1454195"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>
&laquo; Previous Showing results 1 &mdash; 15 out of 3,127 results