Filters








5,063 Hits in 9.3 sec

Mirror, mirror on the Web: a study of host pairs with replicated content

Krishna Bharat, Andrei Broder
1999 Computer Networks  
The main aim of this paper is to present a clearer picture of mirroring on the Web. As input we used a set of 179 million URLs found during a Web crawl done in the summer of 1998.  ...  Our technique for detecting mirrored hosts from large sets of collected URLs depends mostly on the syntactic analysis of URL strings, and requires retrieval and content analysis only for a small number  ...  Acknowledgments We would like to thank Jeff Dean, Steve Glassman, Monika Henzinger, Allan Heydon, Puneet Kumar, Mark Manasse, Hannes Marais, Mark Najork, and Jim Pitkow for help in conducting the study  ... 
doi:10.1016/s1389-1286(99)00021-3 fatcat:2zvuewzrqfhcbewri4y5t7pbxe

Improving Internet archive service through proxy cache

Hsiang‐Fu Yu, Yi‐Ming Chen, Shih‐Yong Wang, Li‐Ming Tseng
2003 Internet Research  
Accordingly, users can find archives on WWW and FTP servers through the Archie, and they can directly download archives from the proxy server. Thus, the reuse of cached archives is improved.  ...  A system was implemented and operated on a real environment to evaluate the approach. 2 Empirical results indicate that the reuse rate of cached objects increased by 18% to 37%.  ...  Acknowledgement The authors would like to thank the National Science Council of the Republic of 20 China for financially supporting this research under Contract No. NSC 91-2213-E-008-016.  ... 
doi:10.1108/10662240310458387 fatcat:zx57enzoofao7dprpe344ack64

Ejournals in Education: Just Generating Excitement or Living up to the Promise?

Tirupalavanam G. Ganesh
2017 Education Libraries  
Are these ejournals merely poor electronic imitations of print journals? Granted, the use of the Internet to publish peer-reviewed scholarship has the potential of democratizing access.  ...  The American Educational Research Association Special Interest Group, Communications among Researchers (AERA SIG CR) lists over one hundred electronic journals i n the field of education that are scholarly  ...  Once mirror hosts are found, mirroring can be achieved using software.  ... 
doi:10.26443/el.v26i1.181 fatcat:pq4j7mcmp5hhtlo35ouwkmf7sy

Tracking Web spam with HTML style similarities

Tanguy Urvoy, Emmanuel Chauveau, Pascal Filoche, Thomas Lavergne
2008 ACM Transactions on the Web  
We also propose a flexible algorithm to cluster a large collection of documents according to these measures.  ...  We present an evaluation of our algorithm on the WEBSPAM-UK2006 dataset.  ...  Some spam techniques like honey pots are based on text plagiarism: the honey pot technique consists of mirroring a reputed Web site to introduce sneaky links in its HTML code.  ... 
doi:10.1145/1326561.1326564 fatcat:kxdqptkd7fexjjtxmk2bxxv46y

PRINTING IN HETEROGENEOUS COMPUTER ENVIRONMENT AT DESY

Z. JAKUBOWSKI
1996 Computing in High Energy Physics '95  
The number of registered hosts at DESY reaches 3500 while the number of print queues approaches 150.  ...  The number of registered hosts at DESY reaches 3500 while the number of print queues approaches 150.  ...  It is impossible here to give here more technical details in the frame of this article. Larger article on printing is in preparation. A lot of details can be found on the WWW citehepix95.  ... 
doi:10.1142/9789814447188_0130 fatcat:duudtsrubncbdov23aue6rsdva

DNP3 network scanning and reconnaissance for critical infrastructure

Nicholas R. Rodofile, Kenneth Radke, Ernest Foo
2016 Proceedings of the Australasian Computer Science Week Multiconference on - ACSW '16  
The Distributed Network Protocol v3.0 (DNP3) is one of the most widely used protocols to control national infrastructure.  ...  In this paper we present a series of intrusive techniques used for reconnaissance on DNP3 critical infrastructure.  ...  Port Mirror A technique that can be used to analyse network traffic, is through the utility of an port mirror 1 .  ... 
doi:10.1145/2843043.2843350 dblp:conf/acsc/RodofileRF16 fatcat:qcvra33qbbb65m37fwc46oinzi

Sampling the National Deep Web [chapter]

Denis Shestakov
2011 Lecture Notes in Computer Science  
We propose the Host-IP clustering sampling method to address the drawbacks of existing approaches for deep Web characterization and report our findings based on the survey of Russian Web.  ...  In this paper, we revisit a problem of deep Web characterization: how to estimate the total number of online databases on the Web?  ...  The idea of grouping hosts based on their IP addresses was used by Bharat et al. [8] to identify host aliases (or mirrored hosts according to Bharat's terminology).  ... 
doi:10.1007/978-3-642-23088-2_24 fatcat:dgb4cmjx4vgcndap4gfyxd55pq

Diabetes Information Technology & WebWatch

Eldon D. Lehmann
1999 Diabetes Technology & Therapeutics  
AIDA is a diabetes-computing program freely available from www.2aida.org on the Web.  ...  One aspect of learning as much as possible about diabetes Website visitors and users may be to apply techniques that do not necessitate any visitor or user interaction.  ...  Depending on site traffic, and server status, visitors to the main http://www.2aida.org site may be re-routed to the U.S. mirror site (or vice versa).  ... 
doi:10.1089/152091599317297 pmid:11475281 fatcat:x4ulqbpivffcfop2rfbsaowctu

The development of digital libraries in Taiwan

Hao‐Ren Ke, Ming‐Jiu Hwang
2000 Electronic library  
This article first quotes a definition of a digital library, and based on this definition, an overview of some of the digital library programs in Taiwan is presented.  ...  Digital libraries are organizations that provide the resources, including the specialized staff, to select, structure, offer intellectual access to, interpret, distribute, preserve the integrity of, and  ...  It is also possible for a particular member willing to act as a consortium host to take charge of the installation and maintenance of databases.  ... 
doi:10.1108/02640470010354590 fatcat:gtehdkmoo5eq3o4xzuswpp2fxi

Analyzing stability in wide-area network performance

Hari Balakrishnan, Mark Stemm, Srinivasan Seshan, Randy H. Katz
1997 Proceedings of the 1997 ACM SIGMETRICS international conference on Measurement and modeling of computer systems - SIGMETRICS '97  
Despite this heterogeneity, we find (using best-fit linear regression techniques) that we can express the throughput for Web transfers to most hosts as a random variable with a log-normal distribution.  ...  We find that Internet hosts that are close to each other often have almost identically distributed probability distributions of throughput.  ...  We would also like to thank the anonymous SIG-METRICS reviewers for their detailed comments, suggestions, and criticisms, that led to significant improvements in the quality of this paper.  ... 
doi:10.1145/258612.258631 dblp:conf/sigmetrics/BalakrishnanSSK97 fatcat:35h7z4d5pbg63mfuyw6epb2lvi

Analyzing stability in wide-area network performance

Hari Balakrishnan, Mark Stemm, Srinivasan Seshan, Randy H. Katz
1997 Performance Evaluation Review  
Despite this heterogeneity, we find (using best-fit linear regression techniques) that we can express the throughput for Web transfers to most hosts as a random variable with a log-normal distribution.  ...  We find that Internet hosts that are close to each other often have almost identically distributed probability distributions of throughput.  ...  We would also like to thank the anonymous SIG-METRICS reviewers for their detailed comments, suggestions, and criticisms, that led to significant improvements in the quality of this paper.  ... 
doi:10.1145/258623.258631 fatcat:6mjcoqjkobgqxlh2vpz2aicka4

PhysioBank, PhysioToolkit, and PhysioNet : Components of a New Research Resource for Complex Physiologic Signals

A. L. Goldberger, L. A. N. Amaral, L. Glass, J. M. Hausdorff, P. Ch. Ivanov, R. G. Mark, J. E. Mietus, G. B. Moody, C.-K. Peng, H. E. Stanley
2000 Circulation  
evaluation and comparison of analysis methods, and the analysis of nonstationary processes.  ...  PhysioToolkit is a library of open-source software for physiological signal processing and analysis, the detection of physiologically significant events using both classic techniques and novel methods  ...  This work was supported by a grant from the National Center for Research Resources of the National Institutes of Health (P41 RR13622).  ... 
doi:10.1161/01.cir.101.23.e215 pmid:10851218 fatcat:owg2tdtdczdybefjyaoqlpvjpi

Government mandated blocking of foreign Web content [article]

Maximillian Dornseif
2004 arXiv   pre-print
It will also give some empirical data on the effects of the blocking orders to help in the legal assessment of the orders.  ...  Since fall 2001 the state of North-Rhine-Westphalia very actively tries to mandate such blocking.  ...  Also the actual effects of the blocking orders should be monitored -not only from a technical point of view but also from a criminological perspective.  ... 
arXiv:cs/0404005v1 fatcat:jbajwhnegrh6pdjd3lkhuons3e

Designing Interfaces for Distributed Electronic Collections: The Lessons of Traditional Librarianship

Nicholas Joint
2001 Libri  
Nevertheless, this is what today's academic libraries increasingly have to do -they have to put a single local library WWW interface over a host of disparate leased commercial interfaces.  ...  One type of information retrieval technique familiar to the traditional library user is the technique of browsing (O'Connor 1988) .  ... 
doi:10.1515/libr.2001.148 fatcat:so4djedgxjfp5lbp6as6y3q3ye

LIStEN: L′ band Imaging Survey for Exoplanets in the North

Arianna Musso Barcucci, Ralf Launhardt, André Müller, Grant M. Kennedy, Roy van Boekel, Thomas Henning, Henrik L. Ruh, Sebastian Marino, Tim D. Pearce, Stefan S. Brems, Steve Ertel, Eckhart A. Spalding
2021 Astronomy and Astrophysics  
We combined the derived mass detection limits with information on the disc, and on the proper motion of the host star, to constrain the presence of unseen planetary and low-mass stellar companion around  ...  The direct imaging technique allows simultaneous imaging of both a companion and the circumstellar disc it resides in, and is thus a valuable tool to study companion-disc interactions.  ...  This research made use of Astropy (http://www. astropy.org), a community-developed core Python package for Astronomy (Astropy Collaboration 2013.  ... 
doi:10.1051/0004-6361/202039541 fatcat:isk3mzuidzel7mf63adt4e3qum
« Previous Showing results 1 — 15 out of 5,063 results