26,482 Hits in 4.2 sec

Analysis of book documents' table of content based on clustering

Liangcai Gao, Zhi Tang, Xiaofan Lin, Xin Tao, Yimin Chu
2009 2009 10th International Conference on Document Analysis and Recognition  
Table of contents (TOC) recognition has attracted a great deal of attention in recent years.  ...  Based on this finding we introduce an automatic TOC analysis method through clustering. This method first detects the decorative elements in TOC pages.  ...  Introduction Most multi-page documents have a built-in table of contents (TOC), which is a collection of references to the different components of the document and naturally reflects the logical structure  ... 
doi:10.1109/icdar.2009.143 dblp:conf/icdar/GaoTLTC09 fatcat:s36r7ryr7fcf5dtwsvrjexcunq

A Hybrid Method for Chinese Entity Relation Extraction [chapter]

Hao Wang, Zhenyu Qi, Hongwei Hao, Bo Xu
2014 Communications in Computer and Information Science  
And we submitted 364944 triples with the precision rate of 46.3% for the competition of Sougou Chinese entity relation extraction and rank the 4th place in the platform.  ...  This kind of method needs a large, high-quality training dataset.  ...  This work was supported by the National Natural Science Foundation of China (NSFC) Grants No.61203281 and No.61303172.  ... 
doi:10.1007/978-3-662-45924-9_32 fatcat:tlq2entwzzc43kia4sfmslqlyu

Marketing Improvement of Chinese Original Picture Books From Dissatisfaction Evaluation - Text Mining Based on LDA Model

Ziyan Tang, Wen Yuan, S. Zhao
2021 SHS Web of Conferences  
Chinese original picture books play an important role in inheriting traditional culture and forming cultural identity, which is very important for children.  ...  We analyzes the dissatisfaction evaluation of Chinese original picture books by using the topic model of Latent Dirichlet Allocation (LDA).  ...  For example, Wang uses content analysis and text analysis method to analysis the content of the online comment of popular picture book Peppa Pig, the result show that the affective factors and derivative  ... 
doi:10.1051/shsconf/202112301005 fatcat:ryvw26qwabab3ecznhl2phry44

Language and Literature Layout Integrating Text Image Preprocessing Algorithm

Lirong Wang, Cui Fu, Gaixia Fu, Hye-jin Kim
2022 Mobile Information Systems  
This paper proposes a method of document image tilt correction based on the content of the document.  ...  In addition, a large number of digital resources exist in the form of images instead of text encoding.  ...  For this reason, this paper proposes a Chinese layout analysis method based on hierarchical extraction.  ... 
doi:10.1155/2022/1429635 fatcat:hjsbusovlzadfahg2xchh734i4

Research on the Relationship Between Chinese Nicknames and Accounts in Social Networks [chapter]

Yi Han, Xiangyu Liu, Yanhui Du, Tianliang Lu
2022 Communications in Computer and Information Science  
Therefore, there are many virtual identities belonging to one person on the Internet, and the similarity analysis of cross-platform network identities is of great significance in the field of network security  ...  This paper studied the Chinese user nicknames and virtual identity recognition in domestic social networking.  ...  application, the content extraction module realizes the automatic collection of nickname information of user accounts in the address book.  ... 
doi:10.1007/978-981-16-9229-1_9 fatcat:rdraaajxxndctjygnwgd2rrv4y

A Study on the Translation of Cultural Classics Based on Deep Learning Methods

Yanqing Zhang, Jianying Lou, Zhiqi Cheng, Muhammad Zakarya
2022 Scientific Programming  
In fact, other countries need to translate cultural classics out of their love for China's cultural classics or academic research. However, there are a large number of cultural classics in China.  ...  China's cultural classics have high artistic and ideological values in the world, which implies China's historical heritage and the inheritance of the cultural situation of the Chinese nation for thousands  ...  techniques such as machine learning, arti cial intelligence, and deep learning can be used for similar data analysis and manipulations. is paper uses a deep learning method to analyze the text data in  ... 
doi:10.1155/2022/1026926 fatcat:emlejzqymjgsdminrsyrcor4c4

CharCNN-SVM for Chinese Text Datasets Sentiment Classification with Data Augmentation

Xingkai Wang, Yiqiang Sheng, Haojiang Deng, Zhenyu Zhao
2019 International Journal of Innovative Computing, Information and Control  
In terms of data augmentation, we construct a synonym replacement thesaurus for text classification.  ...  Through experiments, results demonstrate that our method outperforms several traditional methods, such as Naive Bayes, maximum entropy, support vector machine and bag-of-words, in terms of Chinese sentiment  ...  This work is supported by the Special Fund for Strategic Pilot Technology of Chinese Academy of Sciences under Grant No. XDA06040602.  ... 
doi:10.24507/ijicic.15.01.227 fatcat:usvaobfz45chvciodl3nktdsu4

Design and Implementation of Mongolian Wordnet Management Platform

Hasi, En Bo Tang
2013 Advanced Materials Research  
Aiming at automatic processing of words in machine translation and automatic proofreading, Wordnet mainly provides semantic information in the form of a semantic knowledge database.  ...  With the development of natural language processing technology, a powerful tool containing semantic information is in great need in lexical semantic processing.  ...  The main approach is to search for the corresponding Chinese word for each word of Mongolian Grammatical Information Dictionary in Chinese Wordnet, and mark with the corresponding Chinese synset ID.  ... 
doi:10.4028/ fatcat:67qssdpudnfjbig3jyhs4lyg5e

Page 896 of American Ceramic Society Bulletin Vol. 69, Issue 5 [page]

1990 American Ceramic Society Bulletin  
It is not a book for the casual reader. For those with a serious interest in Chinese ce- ramics, the effort will be worthwhile. Standard or Custom... ...  ...  The wares discussed mostly fall into the Yue family which has a high feld- spar content but is low in quartz and mica, and in the Lung- ch’uan family which has a high quartz and micaceous content but little  ... 

Proficient Readers' Reading Behavior in Taiwan: The Study of Young Chinese Readers

Li-Chun Chang
2015 Universal Journal of Educational Research  
Especially, the roles of phonetic skill and Chinese Character recognition in reading comprehension were explored. 10 kindergartens were recruited to participate in the study.  ...  The results found: (1) Word recognition was a better predictor in reading fluency when comparing with Chinese phonic spelling ability. (2) Reading fluency was positively correlated with reading comprehension  ...  Acknowledgements This project was sponsored by Taiwan's Ministry of Science and Technology. Project number: NSC 99-2410-H-024-016  ... 
doi:10.13189/ujer.2015.030405 fatcat:hbuq5h7plvgyxl5dbcifssa5f4

Ontology Construction Based on Latent Topic Extraction in a Digital Library [chapter]

Jian-hua Yeh, Naomi Yang
2008 Lecture Notes in Computer Science  
The method proposed in this paper combines the statistical correction and latent topic extraction of textual data in a digital library, which produces a semantic-oriented and OWL-based ontology.  ...  This paper discusses the automatic ontology construction process in a digital library.  ...  Figure 5 : 5 A partial page example of Chinese Recorder Index Book Figure 6 : 6 Partial latent topics generated by this experiment Table 1 : 1 Related data statistics in our experimentIn latent topic  ... 
doi:10.1007/978-3-540-89533-6_10 fatcat:7v3zpzhsqnestkczoz7obdfrpe

Chinese Painting Classification Method Based on PCA

Meng-yu WANG
2018 DEStech Transactions on Computer Science and Engineering  
In this paper, a kind of Chinese painting classification method based on PCA is proposed. The image was pretreated using this method, then reduced by Gabor transform and classified based of PCA.  ...  Experiments in the self-built image library show that this method has high classification accuracy.  ...  Conclusion This paper presents a new automatic classification method of Chinese painting.  ... 
doi:10.12783/dtcse/cnai2018/24198 fatcat:b4vch57gbzcphfvrfv4vqxkvle

Impacts towards a comprehensive assessment of the book impact by integrating multiple evaluation sources

Qingqing Zhou, Chengzhi Zhang
2021 Journal of Informetrics  
The surge in the number of books published makes the manual evaluation methods difficult to efficiently evaluate books.  ...  Meanwhile, relying on a single resource for book assessment may lead to the risk that the evaluation results cannot be obtained due to the lack of the evaluation data, especially for newly published books  ...  Users can make a preliminary judgment on the contents of books by browsing the tables of contents (TOCs for short). Therefore, books' TOCs can be used to reflect impacts of books in contents.  ... 
doi:10.1016/j.joi.2021.101195 fatcat:j45b6lbo35buxgvpcnsejegore

Document Analysis Systems for Digital Libraries: Challenges and Opportunities [chapter]

Henry S. Baird, Venugopal Govindaraju, Daniel P. Lopresti
2004 Lecture Notes in Computer Science  
We attempt to specify, in considerable detail, the essential features of document analysis systems that can assist in: (a) the creation of DL's; (b) automatic indexing and retrieval of doc-images within  ...  The state-of-the-art is summarized, including a digest of themes that emerged during the recent International Workshop on Document Image Analysis for Libraries.  ...  Section 6 points out implications for DIA systems of the lack of fully automatic, high-accuracy methods for analyzing doc-image content.  ... 
doi:10.1007/978-3-540-28640-0_1 fatcat:3szb2elcm5amvlhvma3kbwmzza

Public Perception on Healthcare Services: Evidence from Social Media Platforms in China

Guangyu Hu, Xueyan Han, Huixuan Zhou, Yuanli Liu
2019 International Journal of Environmental Research and Public Health  
Neutral disposition was found to be the highest (30.4%) in the contents on appointment-booking services.  ...  Social media has been used as data resource in a growing number of health-related research.  ...  Acknowledgments: We would like to acknowledge Dalu Wang and Hongda Wu from Tencent for their technical support in data analysis. Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/ijerph16071273 pmid:30974729 pmcid:PMC6479867 fatcat:6fupoxpob5a65eakz7lhrucaiq
« Previous Showing results 1 — 15 out of 26,482 results