Big Data and Causality

Hossein Hassani, Xu Huang, Mansi Ghodsi
2017 Annals of Data Science  
Causality analysis continues to remain one of the fundamental research questions and the ultimate objective for a tremendous amount of scientific studies. In line with the rapid progress of science and technology, the age of Big Data has significantly influenced the causality analysis on various disciplines especially for the last decade due to the fact that the complexity and difficulty on identifying causality among Big Data has dramatically increased. Data Mining, the process of uncovering
more » ... dden information from Big Data is now an important tool for causality analysis, and has been extensively exploited by scholars around the world. The primary aim of this paper is to provide a concise review of the causality analysis in Big Data. To this end the paper reviews recent significant applications of Data Mining techniques in causality analysis covering a substantial quantity of research to date, presented in chronological order with an overview table of Data Mining applications in causality analysis domain as a reference directory. Specific Tested Subjects&Regions Purpose and Function Entity Extraction [22-25, 27, 29, 31-42, 48-54, 56-59, 61] lexico-syntactic patterns discovery, ambiguous patterns ranking by semantic constraints, Cause Effect Association (CEA)-based feature, distributional similarity methods, discourse relation prediction by Ruby-based discourse extraction system [62], event causality test (ECT), Penn Discourse Treebank (PDTB [55],
doi:10.1007/s40745-017-0122-3 fatcat:y5fixid4ujgsjlgrcfceq5nosu