The Exploration of the Approach to Data Preparation for Chinese Text Analysis Based on R Language

Jiang Li
2021 OALib  
This paper explores how to prepare data for analyzing the Chinese texts with R language based on the theory of Welbers, particularly comparing the R package Rwordseg with jiebaR to see the results of Chinese text segmentation at the step of preprocessing.
doi:10.4236/oalib.1107821 fatcat:yvgs3gd3jja5rc3mngove336fq