Event Log Preprocessing for Process Mining: A Review

Heidy M. Marin-Castro, Edgar Tello-Leal
2021 Applied Sciences  
In this paper, we conduct a systematic literature review and provide, for the first time, a survey of relevant approaches of event data preprocessing for business process mining tasks.  ...  An essential element in the three tasks of process mining (discovery, conformance, and enhancement) is data cleaning, used to reduce the complexity inherent to real-world event data, to be easily interpreted  ...  Data Availability Statement: Not applicable. Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/app112210556 fatcat:lls2qf6llnddxbdnego2okk2ya

Preprocessing Big Data for Efficient Storage and Research

2019 International journal of recent technology and engineering  
Thus this paper presents an efficient method for preprocessing data and also partitioning big dataset based on sensitivity parameters.  ...  Data preprocessing techniques, when applied prior to analytics, can substantially improve the overall quality of the patterns mined and/or the time required for the actual mining.  ...  Thus it is necessary to preprocess data for efficient storage and mining.  ... 
doi:10.35940/ijrte.b1003.0782s319 fatcat:molmrnisn5hs5a5icifx7bxcne

Fast and Efficient Cloud Data Utilization with Deduplication

Sunil S, A Ananda Shankar
2018 International Journal of Emerging Research in Management and Technology  
Cloud storage system is to provides facilitative file storage and sharing services for distributed clients. The cloud storage preserve the privacy of data holders by proposing a scheme to manage encrypted  ...  It is an effective approach to verify data ownership and check duplicate storage with secure challenge and big data support.  ...  For applying data mining techniques our data must be preprocessed, after the can be use data mining techniques like classification, clustering and association rule mining etc.  ... 
doi:10.23956/ijermt.v6i6.247 fatcat:zlf7jc5ssjedjbak3lhjueqm44

Adaptive Data Mining Approach for PCB Defect Detection and Classification

P. K. Srimani, Vaddin Prathiba
2016 Indian Journal of Science and Technology  
A genetic algorithm is used for data preprocessing to achieve the feature reduction and confidence measurement. Findings: The system is implemented using MatLab 2013b.  ...  The proposed approach is divided into three main stages: (i) data pre-processing (ii) feature selection and reduction and (iii) Classification.  ...  In order to make the appropriate selection of the data, preprocessing is required.  ... 
doi:10.17485/ijst/2016/v9i44/98964 fatcat:pibrt574dfatnmc5igalsg32se

A Preprocessing Design Scheme for Sequential Pattern Analysis of a Student Database

R. Campagni, D. Merlini, M. C. Verri
2016 Proceedings of the 8th International Conference on Computer Supported Education  
In a data mining project evolved on a relational database often a significant effort needs to be done to construct the data set for the analysis.  ...  In fact, usually the database contains a series of normalized tables that need to be joined, aggregated and processed in an appropriate way to build the data set.  ...  and Ventura, 2013) for recent surveys on the state of the art of educational data mining and on preprocessing educational data).  ... 
doi:10.5220/0005789600990106 dblp:conf/csedu/CampagniMV16 fatcat:orfcynozlnce7ptnycrxvnyewm

Web Recommendation Framework based on Association Rules Coverage to be Applied for Site Modification

M. Maged M. Deghaidy, Khaled Mahmoud Badran, Gouda Ismail Mohamed
2014 International Journal of Computer Applications  
This paper introduces a Web Recommendation Framework based on the usage history to be applied for Site Modification as one of the applications of Web Usage Mining (WUM) that is applicable for online business  ...  Web mining can be defined as the discovery and analysis of useful information from the Web data so it is considered to be an application of data mining to large web data repositories that can be divided  ...  With the huge amount of information available online, the Web is a fertile area for data mining research.  ... 
doi:10.5120/15854-4754 fatcat:wl3n5sixhbcwjjsdaxxnhsvaqi

Improving Customer Relationship Management through Integrated Mining of Heterogeneous Data

I. T. Fatudimu, C. O. Uwadia, C. K. Ayo
2012 Journal of clean energy technologies  
-Association rule mining, customer relationship management, integrated mining, structured data, unstructured data.  ...  For this to be effective, there is need to discover knowledge from the seamless integration of structured and unstructured data for completeness and comprehensiveness which is the main focus of this paper  ...  PREPROCESSING PHASE Filtration Index documents by using the weighting scheme TF-IDF for all keywords in all documents Incoming Textual Documents Stemming Semantic Clustering of XML data  ... 
doi:10.7763/ijcte.2012.v4.523 fatcat:m3pis7d2uzflxi6c2smbfkpeme

The impact of preprocessing on data mining: An evaluation of classifier sensitivity in direct marketing

Sven F. Crone, Stefan Lessmann, Robert Stahlbock
2006 European Journal of Operational Research  
in predictive data mining.  ...  While research in operations research, direct marketing and machine learning focuses on the analysis and design of data mining algorithms, the interaction of data mining with the preceding phase of data  ...  Data preprocessing for predictive classification Current research in data preprocessing The application of each data mining algorithm requires the presence of data in a mathematically feasible format  ... 
doi:10.1016/j.ejor.2005.07.023 fatcat:g7hy2izytja43b25wv4chvbqt4

Heavy Rainfall Prediction using Gini Index in Decision Tree

2019 International journal of recent technology and engineering  
To avoid this, we use data mining algorithms for early warning of climatic conditions such as like maximum temperature, minimum temperature wind speed, rainfall, humidity, pressure, dew point, cloud, sunshine  ...  Hence, we apply the Decision tree algorithm using Gini Index in order to predict the precipitation with accuracy and it is completely based on the historical data.  ...  The present research is focused on using the gini index as an attribute selection measure in an elegant decision tree to predict precipitation for datasets, making data preprocessing and data transformation  ... 
doi:10.35940/ijrte.d8503.118419 fatcat:p4jskeftfrf57llbq67b34pfdu

Analysis of Users' Web Navigation Behavior using GRPA with Variable Length Markov Chains

Ch Bindu Madhuri, J Anand Chandulal, K Ramya, M Phanidra
2011 International Journal of Data Mining & Knowledge Management Process  
prediction model for better Web Usage mining Applications.  ...  With the never-ending growth of Web services and Web-based information systems, the volumes of click stream and user data collected by Web-based organizations in their daily operations has reached enormous  ...  Thus the log file contains an entry for each click and can be preprocessed into time-ordered sessions of sequential clicks.  ... 
doi:10.5121/ijdkp.2011.1201 fatcat:umeb62n2kjdmjeqeq5onpy62ma

Data mining and preprocessing application on component reports of an airline company in Turkey

Feyza Gürbüz, Lale Özbakir, Hüseyin Yapici
2011 Expert systems with applications  
This paper focuses on different preprocessing and feature selection techniques applied on the 15 component reports of an airline company in Turkey to understand and clean the data set.  ...  Also the classification techniques of data mining are used to predict the warning level of the component as the class attribute.  ...  Data preprocessing: This is required to improve the quality of the actual data for mining. This also increases the mining efficiency by reducing the time required for mining the preprocessed data.  ... 
doi:10.1016/j.eswa.2010.11.076 fatcat:gycwt23j5fepngax3nanq7dxhi

YALE

Ingo Mierswa, Michael Wurst, Ralf Klinkenberg, Martin Scholz, Timm Euler
2006 Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '06  
These case studies cover tasks like feature engineering, text mining, data stream mining and tracking drifting  ...  Rapid prototyping is an approach which allows crucial design decisions as early as possible.  ...  In order to guide transformations of the feature space or the automatic search for the best preprocessing, the user can define additional meta data.  ... 
doi:10.1145/1150402.1150531 dblp:conf/kdd/MierswaWKSE06 fatcat:6oz263ifqjdrdbse6dtixb7kza

Eureka!: A Tool for Interactive Knowledge Discovery [chapter]

Giuseppe Manco, Clara Pizzuti, Domenico Talia
2002 Lecture Notes in Computer Science  
In this paper we describe an interactive, visual knowledge discovery tool for analyzing numerical data sets.  ...  The accuracy of clustering results can be validated by using a decision tree classifier, included in the mining tool.  ...  Weka is a Java library defining standard interfaces for data sets loading and preprocessing (e.g., filter definition), mining algorithms and results representation.  ... 
doi:10.1007/3-540-46146-9_38 fatcat:ubsuhrn7qfghvga4kwbscygzvy

Data Mining Application using Association Rule Mining ECLAT Algorithm Based on SPMF

Jason Reynaldo, David Boy Tonara, R.H. Setyobudi, E. Alasaarela, F. Pasila, G. Chan, S.-G. Lee
2018 MATEC Web of Conferences  
Data mining is an important research domain that currently focused on knowledge discovery database.  ...  Association Rule Mining (ARM) has become the core of data mining.  ...  Introduction Data mining is an important research domain is currently focused on knowledge discovery in databases.  ... 
doi:10.1051/matecconf/201816401019 fatcat:erijv2ipfree7ekgvaptic3hfi

The WEKA data mining software

Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, Ian H. Witten
2009 SIGKDD Explorations  
In that time, the software has been rewritten entirely from scratch, evolved substantially and now accompanies a text on data mining [35].  ...  These days, WEKA enjoys widespread acceptance in both academia and business, has an active community, and has been downloaded more than 1.4 million times since being placed on SourceForge in April 2000  ...  Preprocessing Filters Just as the list of learning schemes in WEKA has grown, so has the number of preprocessing tools.  ... 
doi:10.1145/1656274.1656278 fatcat:fwag7fbh7febpnrjlcrth2rpcq