Removal of Image Advertisement from Web Page

Hetal R. Parmar, Jayant Gadge
2011 International Journal of Computer Applications  
With the phenomenal growth of the web, there is an ever increasing volume of data and information published in numerous web-pages. It is said that web is noisy. A web page typically contains a mixture of many kind of information e.g. main contains, advertisements, navigational panels, copy right blocks etc... for a particular application only part of information is useful and the rest are noise. These all seriously harm web mining. Advertisements and Sponsor images are not much important in
more » ... ing. As there is a need of technique that keep common navigation structure as it is but removes image advertisement and improve surfing efficiency. In this paper a small application HTML Tag Differentiator is created which removes image advertisement using rule based classifier.
doi:10.5120/3316-4555 fatcat:wpckzghnizhfzinjwntrcssq2i