Filters








1,376 Hits in 4.1 sec

A web page usage prediction scheme using sequence indexing and clustering techniques

Costantinos Dimopoulos, Christos Makris, Yannis Panagis, Evangelos Theodoridis, Athanasios Tsakalidis
2010 Data & Knowledge Engineering  
In this paper we consider the problem of web page usage prediction in a web site by modeling users' navigation history and web page content with weighted suffix trees.  ...  This user's navigation prediction can be exploited either in an on-line recommendation system in a web site or in a web page cache system.  ...  The task of modeling and predicting a user's navigational behavior on a web site or on a web domain can be useful in quite many web applications such as web caching [33, 43] , web page recommendation  ... 
doi:10.1016/j.datak.2009.04.010 fatcat:7oksy5ugxvhfrmd4k4zsn7kbha

A text mining approach to Internet abuse detection

Chen-Huei Chou, Atish P. Sinha, Huimin Zhao
2008 Information Systems and E-Business Management  
We have empirically compared a variety of term weighting, feature selection, and classification techniques for Internet abuse detection in the workplace of software programmers.  ...  As the use of the Internet in organizations continues to grow, so does Internet abuse in the workplace.  ...  Term frequency weights a term for a Web page by the number of times the term appears in the page.  ... 
doi:10.1007/s10257-007-0070-0 fatcat:4ynqj54bxvgezovleqkek56akq

Bidirectional Growth Based Mining and Cyclic Behaviour Analysis of Web Sequential Patterns

Srikantaiah K C, Krishna Kumar N, Venugopal K R, Patnaik L M
2013 International Journal of Data Mining & Knowledge Management Process  
The more accurate the prediction and more satisfying the results of prefetching if we use a highly efficient and scalable mining technique such as the Bidirectional Growth based Directed Acyclic Graph.  ...  Our experimental results show that prefetching rules generated using BGCAP is 5-10 percent faster for different data sizes and 10-15% faster for a fixed data size than TD-Mine.  ...  of the DOM tree structure of the pages to describe HTML or XML tag usage.  ... 
doi:10.5121/ijdkp.2013.3204 fatcat:4l2ofdk4rjdptlx4s6uicnob5y

Commercial Internet filters: Perils and opportunities

Chen-Huei Chou, Atish P. Sinha, Huimin Zhao
2010 Decision Support Systems  
These products mainly rely on black lists, white lists, and keyword/profile matching to filter out undesired web pages.  ...  decision tree, k-nearest neighbor, and neural network.  ...  Thus, we selected TF as the term weighting scheme for indexing web pages and developed a document indexing program based on the WekaIndex tool (http:// www.ainetsolutions.com/eng/soluciones/aplicaciones  ... 
doi:10.1016/j.dss.2009.11.002 fatcat:jacfmkqvrnbcpdbl7466ib4vvm

A Hybrid Attribute Selection Approach for Text Classification

Chen-Huei Chou, Atish Sinha, Huimin Zhao
2010 Journal of the AIS  
The empirical evaluations we conducted using a variety of classification algorithms, indexing schemes, and attribute selection methods demonstrate the utility of the proposed approach.  ...  representative of them), instead of the individual terms, may be used as dimensions of the vector space.  ...  Their input and feedback helped us to improve the quality of the paper significantly.  ... 
doi:10.17705/1jais.00236 fatcat:m5uz6zcztnbb7iqbiruqri6fhm

DSM-PLW: Single-pass mining of path traversal patterns over streaming Web click-sequences

Hua-Fu Li, Suh-Yin Lee, Man-Kwan Shan
2006 Computer Networks  
According to the algorithm, each maximal forward reference of the stream is projected into a set of reference-suffix maximal forward references, and these reference-suffix maximal forward references are  ...  Mining Web click streams is an important data mining problem with broad applications.  ...  Web prefetching and prediction of HTTP requests are important applications of Web usage mining [13, 41] .  ... 
doi:10.1016/j.comnet.2005.10.018 fatcat:cn2aghozazfwnibdvhveigc5yy

Learning implicit user interest hierarchy for context in personalization

Hyoung R. Kim, Philip K. Chan
2003 Proceedings of the 8th international conference on Intelligent user interfaces - IUI '03  
A UIH can represent a user's interests at different abstraction levels and can be learned from the contents (words/phrases) in a set of web pages bookmarked by a user.  ...  To enrich features used in the UIH, we used phrases in addition to words.  ...  The advantage of this approach is that it can predict visited web pages well, but is not good for predicting unvisited web pages.  ... 
doi:10.1145/604050.604064 fatcat:o5ilqbxp7nfnjphtm722ubzkam

Learning implicit user interest hierarchy for context in personalization

Hyoung R. Kim, Philip K. Chan
2003 Proceedings of the 8th international conference on Intelligent user interfaces - IUI '03  
A UIH can represent a user's interests at different abstraction levels and can be learned from the contents (words/phrases) in a set of web pages bookmarked by a user.  ...  To enrich features used in the UIH, we used phrases in addition to words.  ...  The advantage of this approach is that it can predict visited web pages well, but is not good for predicting unvisited web pages.  ... 
doi:10.1145/604045.604064 dblp:conf/iui/KimC03 fatcat:rwwjwh7l3vghtjx3slwz2wkjze

Web mining: Machine learning for web applications

Hsinchun Chen, Michael Chau
2005 Annual Review of Information Science and Technology  
Anchor text shows how other Web page authors annotate a page and can be useful in predicting the content of the target page. Several algorithms have been developed to address this issue.  ...  Based on training examples, learning algorithms can be used to adjust the connection weights in the network so that it can predict or classify unknown examples correctly.  ... 
doi:10.1002/aris.1440380107 fatcat:wdqwbszj7valbnyjfysbb4ap4y

Stemming Text-based Web Page Classification using Machine Learning Algorithms: A Comparison

Ansari Razali, Salwani Mohd, Nor Azan, Faezehsadat Shahidi
2020 International Journal of Advanced Computer Science and Applications  
The research aim is to determine the effect of word-stemming in web pages classification using different machine learning classifiers, namely Naïve Bayes (NB), k-Nearest Neighbour (k-NN), Support Vector  ...  This research uses BBC dataset that has five predefined categories.  ...  WEB PAGE CLASSIFICATION Web page classification, or also called web page categorization, is defined as a task to determine the category of a web page.  ... 
doi:10.14569/ijacsa.2020.0110171 fatcat:j3auxq73pvf3lkenmq2ucqgkvq

Web site personalization based on link analysis and navigational patterns

Magdalini Eirinaki, Michalis Vazirgiannis
2007 ACM Transactions on Internet Technology  
In this work we present UPR, a PageRank-style algorithm which combines usage data and link analysis techniques for assigning probabilities to the web pages based on their importance in the web site's navigational  ...  In the vast majority of related algorithms, however, only the usage data are used to produce recommendations, disregarding the structural properties of the web graph.  ...  WEB PATH PREDICTION USING HYBRID PROBABILISTIC PREDICTIVE MODELS One of the most popular web usage mining methods is the use of probabilistic models.  ... 
doi:10.1145/1278366.1278370 fatcat:fx3tqvdj7zg53nevs3dnrkv34i

Web Site Audience Segmentation Using Hybrid Alignment Techniques [chapter]

Vinh-Trung Luu, Germain Forestier, Frédéric Fondement, Pierre-Alain Muller
2015 Lecture Notes in Computer Science  
In this paper, we present a hybrid approach for clustering visitor sessions, based on a combination of global and local sequence alignments, such as Needleman-Wunsch and Smith-Waterman.  ...  goal is to define very simple approaches able to address about 80% of visitor sessions to be segmented, and which can be easily turned into small pieces of program, to be run in parallel in thousands of web  ...  [7] modeled users navigation history and web page content using weighted suffix trees. Their system was then used for the prediction of web page usage.  ... 
doi:10.1007/978-3-319-25660-3_3 fatcat:hx5i5lrw5jhcpgwiodjuocu5ci

Web Mining Research Issues and Future Directions – A Survey

D. Jayalatchumy
2013 IOSR Journal of Computer Engineering  
This paper is a work on survey on the existing techniques of web mining and the issues related to it. The World Wide Web acts as an interactive and popular way to transfer information.  ...  Due to the enormous and diverse information on the web, the users cannot make use of the information very effectively and easily.  ...  Association rules Weighted Temporal Tree structure. More time than WTARM.  ... 
doi:10.9790/0661-1432027 fatcat:sst2h6njivgizlma5ugwg35lnm

A survey on detection of Phishing Websites using Machine Learning

Anjali P, Revati P, Manorama J, Shubhangi S, Deepali U
2021 International Journal of Engineering in Computer Science  
Hence, we require to utilize a web page feature set to preserve any phishing assault. A Machine Learning approach is implemented to resist these attacks.  ...  The proposed a brilliant version for detecting phishing internet pages primarily predicated on Extreme Learning Machine. Types of internet pages are one of a kind in phrases in their features.  ...  In RF, prediction is achieved the usage of decision trees. during the training phase, a few decision trees are built (defined via the programmer) which are then used for class prediction; this is performed  ... 
doi:10.33545/26633582.2021.v3.i1a.47 fatcat:rtcv3ygbl5futdtx6bh2ez6rai

Innovations in Web Personalization [chapter]

Giovanna Castellano, Anna Maria Fanelli, Maria Alessandra Torsello, Lakhmi C. Jain
2009 Studies in Computational Intelligence  
This chapter presents an overview of the Web personalization in the endeavor of Intelligent systems.  ...  Web personalization offers this invaluable opportunity, representing one of the most important technologies required by an ever increasing number of real-world applications.  ...  An alternative algorithm based on the use of a tree structure has been presented in Pei et al. [2000] . Tree structures have also been used in Menasalvas et al. [2002] .  ... 
doi:10.1007/978-3-642-02794-9_1 fatcat:h7iwkag6zzfn7mbz55gvfigs5m
« Previous Showing results 1 — 15 out of 1,376 results