SF-CNN: Deep Text Classification and Retrieval for Text Documents

R. Sarasu, K. K. Thyagharajan, N. R. Shanker
2023 Intelligent Automation and Soft Computing  
Researchers and scientists need rapid access to text documents such as research papers, source code and dissertations. Many research documents are available on the Internet, but retrieving the exact documents that match a set of keywords takes considerable time. An efficient classification algorithm for keyword-based document retrieval is therefore required. Traditional algorithms perform poorly because they consider neither the polysemy of words nor the relationships among the words in a bag-of-words keyword representation. To solve the above problem, Semantic Featured Convolution Neural Networks (SF-CNN) is proposed to capture the key relationships among the search keywords and to build a structure for matching words so that the correct text documents are retrieved. The proposed SF-CNN is based on a deep semantic bag-of-words representation for document retrieval. Traditional deep learning methods such as Convolutional Neural Networks and Recurrent Neural Networks do not use a semantic representation for bag-of-words. Experiments are performed on different document datasets to evaluate the performance of the proposed SF-CNN method. SF-CNN classifies the documents with an accuracy of 94%, higher than that of the traditional algorithms.
doi:10.32604/iasc.2023.027429 fatcat:r2czwj5p6jdntkr3lgkp23erma