32,750 Hits in 3.6 sec

Contributing Evidence to Data-driven Ontology Evaluation - Workflow Ontologies Perspective

Hlomani Hlomani, Deborah A. Stacey
2013 Proceedings of the International Conference on Knowledge Engineering and Ontology Development  
This evaluation methodology is then evaluated through statistical means particularly the Kruskal-Wallis test and further post hoc testing using the Mann-Whiteny U test.  ...  The key phrases that were used to search for content are: Workflow modelling, Business Process modelling, Workflow modelling languages, Business process modelling languages.  ...  Corpus Definition and Distance Measure The ontologies considered in this paper pertain to the concept of workflow.  ... 
doi:10.5220/0004543602070213 dblp:conf/ic3k/HlomaniS13 fatcat:5xzjbruc5bcotmc4dwgnjjrpsa

Selecting level-specific specialized vocabulary using statistical measures

Kiyomi Chujo, Masao Utiyama
2006 System (Linköping)  
We conclude that these statistical measures are effective tools for identifying multi-level specialized vocabulary for pedagogical purposes.  ...  school textbook vocabulary coverage, and that these measures produced level-specific words; i.e., beginning-level basic business words were identified using Cosine and the complimentary similarity measure  ...  Acknowledgements This study was funded by a Grant-in-aid for Scientific Research (No. 17520401) from the Japan Society for the Promotion of Science and Ministry of Education, Science, Sports and Culture  ... 
doi:10.1016/j.system.2005.12.003 fatcat:v4qf2ckf5rcvhjx7kjufgoby3i

A Corpus Based Approach to Build Arabic Sentiment Lexicon

Afnan Atiah Alsolamy, Department of Information Systems, King Abdulaziz University, Saudi Arabia, Muazzam Ahmed Siddiqui, Imtiaz Hussain Khan
2019 International Journal of Information Engineering and Electronic Business  
The use of similarity measures depends on the fact that the words that are appearing in the same context will have similar polarity.  ...  To achieve this, we proposed a graph propagation algorithm and compared different similarity measures. The lexicon was evaluated using a manually annotated list of terms.  ...  Corpus Statistic ((The documents in our corpus are drawn from Copyright © 2019 MECS I.J.  ... 
doi:10.5815/ijieeb.2019.06.03 fatcat:pvao5fxt7bd5jn7revybhzwhva

Arabic Documents Classification by a Radial Basis Hybridization

Taher Zaki, Driss Mammass, Abdellatif Ennaji, Stéphane Nicolas
2021 International Journal of Mathematical Models and Methods in Applied Sciences  
We proceed in fact by the calculation of similarity between words using an hybridization of NGRAMs-OKAPI statistical measures and a kernel function in order to identify relevant descriptors.  ...  Terminological resources such as graphs and semantic dictionaries are integrated into the system to improve the indexing and the classification processes.  ...  as a measure of similarity.  ... 
doi:10.46300/9101.2021.15.18 fatcat:6t6y6tlkafc7zihhu3ob2ase6y

Finding model through latent semantic approach to reveal the topic of discussion in discussion forum

Reina Setiawan, Widodo Budiharto, Iman Herwidiana Kartowisastro, Harjanto Prabowo
2019 Education and Information Technologies : Official Journal of the IFIP technical committee on Education  
The model proposes a complete step to reveal the topic of discussion from a thread in a discussion forum, consisting of the pre-processing text document, corpus classification and finding a topic.  ...  The reason for using several course subjects is to observe consistency of the model.  ...  The corpus classification groups documents based on similarity words. This output is then used in third process, i.e. finding topic.  ... 
doi:10.1007/s10639-019-09901-7 fatcat:chvsgpt2yzej3p4x7vb36eg3um

Enhancing Focus Topic Findings of Discussion Forum through Corpus Classifier Algorithm

2019 International journal of recent technology and engineering  
In preparing the paper, the methods used were PLSA and the classifying process, which classifies the documents to become a corpus based on the similarity word approach.  ...  The performance of the result was measured by the f-measure, which was calculated for each thread subject.  ...  For the most part, the documents were written in the Indonesian language, but some words were English, and some words were abbreviations, such as IT for Information Technology or BP for Business Process  ... 
doi:10.35940/ijrte.b2166.078219 fatcat:a3fnk7jstvbgfgpanc7s3gqdki

Comparison Latent Semantic and WordNet Approach for Semantic Similarity Calculation [article]

I Wayan Simri Wicaksana Gunadarma University)
2011 arXiv   pre-print
In this aper, we will evaluate latent semantic and WordNet approach to calculate semantic similarity.  ...  For example, word 'bank' has meaning as economic institution for economy domain, but for ecology domain it will be defined as slope of river or lake.  ...  There are some approach for semantic similarity calculation, for example manual, statistic, latent semantic and WordNet. Semantic similarity is a study about semantic relationship.  ... 
arXiv:1105.1406v1 fatcat:w3dao2dmirfy5by5vxjav2sfoy


2017 Journal of Engineering Science and Technology  
The f-measure was 92.9% in sport category and 89.1% in business category.  ...  In order to evaluate the performance of the classifier, this study used a corpus that consists of 5070 documents independently classified into six categories: sport, entertainment, business, Middle East  ...  Table 6 shows the recall measure for the sport, entertainment, business, Middle-East, switch, and world categories.  ... 
doaj:619ebce8382a41c2bd6efbc0e551bb0f fatcat:jbor6fcknzavzexxessitttnki


Christina Yip Chung, Bin Chen
2002 Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '02  
statistical significance.  ...  The results demonstrate that corpus based smoothing can be used for query expansion by term clustering.  ...  The authors would like to thank Jianchang Mao for his support of this study and for providing valuable feedback on an early draft of the manuscript.  ... 
doi:10.1145/775047.775115 dblp:conf/kdd/ChungC02 fatcat:47x2yk2iqzfgpkbiitb73bcony

A Survey on Similarity Measures in Text Mining

Vijaymeena M.K, Kavitha K
2016 Machine Learning and Applications An International Journal  
The similarity measure process in text mining can be used to identify the suitable clustering algorithm for a specific problem.  ...  This difference is often measured by similarity measure such as Euclidean distance, Cosine similarity etc.  ...  Figure 2 shows the Corpus-Based similarity measures. A Corpus is a large collection of texts and it is used for language research.  ... 
doi:10.5121/mlaij.2016.3103 fatcat:ggqyuuxdqfdkjn3zjw2h46kiuq


Christina Yip Chung, Bin Chen
2002 Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '02  
statistical significance.  ...  The results demonstrate that corpus based smoothing can be used for query expansion by term clustering.  ...  The authors would like to thank Jianchang Mao for his support of this study and for providing valuable feedback on an early draft of the manuscript.  ... 
doi:10.1145/775107.775115 fatcat:2ak44ejh6bfobhwgcqvidpdx7y

Page 2662 of Linguistics and Language Behavior Abstracts: LLBA Vol. 29, Issue 5 [page]

1995 Linguistics and Language Behavior Abstracts: LLBA  
construction experiences, 2- stage tagging project; 9511605 phrasal verbs, natural language processing treatment, computational lexicon construction; 9511614 quantitation/measurement fundamental problems  ...  integration software design, terminology issues; 1181 interactive language pedagogy, e-mail use, student collaborative proj- ects; observations; second-year university French learners; 9510127 language for  ... 

Innovation in the electricity sector in the age of Disruptive Technologies and renewable Energy Sources: A Bibliometric study from 1991 to 2019

G. S. Marques, M. A. P. Dias, J. N. S. Vianna
2020 International Journal of Advanced Engineering Research and Science  
interface, for qualitative, quantitative and statistical data processing.  ...  In tightly regulated markets, regulation has yet to find the point and balance that can foster an enabling environment for sectoral technological innovation.  ...  words in the textual corpus, making the similarity analysis.  ... 
doi:10.22161/ijaers.72.35 fatcat:ij53c4kv25hg3cdxzgccyngn5q

Discovering contextual tags from product review using semantic relatedness

Soon Chong Johnson Lim, Shilong Wang, Ying Liu
2014 Journal of Industrial and Production Engineering  
received his BS, MS, and PhD degrees in Mechanical Engineering from Chongqing University, China, in 1988China, in , 1991China, in , and 1995 ics, intelligent manufacturing, design methodology and process  ...  For this purpose, a few input terms that exist in document text are selected randomly. Prior to the evaluation process, pre-15 processing tasks for MCV1 corpus are performed.  ...  to medium business user… Contextually similar terms screen, adapt, wide aspect, display, size business, performance, travel, notebook, quality, design, price Input query (hits) screen (49) business  ... 
doi:10.1080/21681015.2014.895966 fatcat:53vma3dkorehhozap2gyiq6vyu

A Novel Summarization-based Approach for Feature Reduction Enhancing Text Classification Accuracy

S. Rahamat Basha, J. Keziya Rani, J. J. C. Prasad Yadav
2019 Zenodo  
The summary of every document in the corpus is taken into a new document used for the summarization evaluation process.  ...  Our approach for single document summarization uses two measures for sentence similarity: the frequency of the terms in one sentence and the similarity of that sentence to other sentences.  ...  PROPOSED SUMMARIZATION METHOD FOR EACH DOUMENT OF EVERY CATEGORY Our summarization method uses the concept of neighbor with the measures of statistical evaluation.  ... 
doi:10.5281/zenodo.3566534 fatcat:yalawpiz4feutn53ypzdo46mze
« Previous Showing results 1 — 15 out of 32,750 results