Query Optimization on Large Scale Nested Data with Service Tree and Frequent Trajectory

Li Wang, Guodong Wang
2021 Journal of Information Processing Systems  
Query applications based on nested data, the most commonly used form of data representation on the web, especially precise query, is becoming more extensively used. MapReduce, a distributed architecture with parallel computing power, provides a good solution for big data processing. However, in practical application, query requests are usually concurrent, which causes bottlenecks in server processing. To solve this problem, this paper first combines a column storage structure and an inverted
more » ... ex to build index for nested data on MapReduce. On this basis, this paper puts forward an optimization strategy which combines query execution service tree and frequent sub-query trajectory to reduce the response time of frequent queries and further improve the efficiency of multi-user concurrent queries on large scale nested data. Experiments show that this method greatly improves the efficiency of nested data query.
doi:10.3745/jips.04.0205 dblp:journals/jips/WangW21 fatcat:jebkod7zjnfuvmx4uyw2td6cou