Analysis of Plant Breeding on Hadoop and Spark

Shuangxi Chen, Chunming Wu, Yongmao Yu
2016 Advances in Agriculture  
Analysis of crop breeding technology is one of the important means of computer-assisted breeding techniques which have huge data, high dimensions, and a lot of unstructured data. We propose a crop breeding data analysis platform on Spark. The platform consists of Hadoop distributed file system (HDFS) and cluster based on memory iterative components. With this cluster, we achieve crop breeding large data analysis tasks in parallel through API provided by Spark. By experiments and tests of Indica
more » ... and Japonica rice traits, plant breeding analysis platform can significantly improve the breeding of big data analysis speed, reducing the workload of concurrent programming.
doi:10.1155/2016/7081491 fatcat:m67r6na6lfc4bloxaf6ywvvvqq