Evaluation of statistical part of speech tagging of Persian text

Samira Tasharofi, Fahimeh Raja, Farhad Oroumchian, Masoud Rahgozar
2007 2007 9th International Symposium on Signal Processing and Its Applications  
Part of Speech (POS) tagging is an essential part of text processing applications. A POS tagger assigns a tag to each word of its input text specifying its grammatical properties.  ...  In case of statistical methods such as TnT, this will have an added practical advantages also. This paper presents creation of a POS tagged corpus and evaluation of TnT tagger on Persian text.  ...  Faili for his helps in gathering and preparing the tagged corpus and Dr. BijanKhan for his valuable work in tagging the Persian texts and providing us with his tagged corpus.  ... 
doi:10.1109/isspa.2007.4555312 dblp:conf/isspa/TasharofiROR07 fatcat:fxup43sqprd2pfluzjjtaj65iu

A hidden Markov model for Persian part-of-speech tagging

Morteza Okhovvat, Behrouz Minaei Bidgoli
2011 Procedia Computer Science  
One of the important actions in the processing of languages is part-of-speech tagging.  ...  In this paper, a part-of-speech tagging system on Persian corpus by using hidden Markov model is proposed. Achieving to this goal, the main aspects of Persian morphology is introduced and developed.  ...  Introduction Part-Of-Speech (POS) tagging is known as a necessary work in many areas Natural Language Processing (NLP) systems like translation machines and text-to-speech (TTS) applications and prosody  ... 
doi:10.1016/j.procs.2010.12.160 fatcat:dd76xwgcyncm3gxofbroywobey

A Comparative Study of Persian Sentiment Analysis Based on Different Feature Combinations [chapter]

Kia Dashtipour, Mandar Gogate, Ahsan Adeel, Amir Hussain, Abdulrahman Alqarafi, Tariq Durrani
2018 Lecture Notes in Electrical Engineering  
The performance of the proposed framework has been evaluated taking into account different features combinations.  ...  In this paper, a novel sentiment analysis framework for Persian language has been proposed.  ...  Acknowledgements The authors are grateful to the anonymous reviewers for their insightful comments and suggestions which helped improved the quality of the paper.  ... 
doi:10.1007/978-981-10-6571-2_279 fatcat:loljfmevmjfybj3i4soawi3j5m

Towards grammar checker development for Persian language

Nava Ehsan, Heshaam Faili
2010 Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering (NLPKE-2010)  
This paper briefly describes the concepts and definition of grammar checkers in general followed by developing the first Persian (Farsi) grammar checker leading to an overview of the error types of Persian  ...  Therefore, existence of automatic systems such as spell and grammar checker/corrector can help in reducing costs and increasing the electronic texts and it will improve the quality of electronic texts.  ...  The system is a modification of a system, developed at Lancaster University, for tagging words in a text with their part-of-speech tag.  ... 
doi:10.1109/nlpke.2010.5587839 dblp:conf/nlpke/EhsanF10 fatcat:mao4xdqlvvcftmc2kvnzjdsftq

A Farsi part-of-speech tagger based on Markov model

Mahdi Mohseni, Hasan Motalebi, Behrouz Minaei-bidgoli, Mahmoud Shokrollahi-far
2008 Proceedings of the 2008 ACM symposium on Applied computing - SAC '08  
This paper describes a method based on morphological analysis of words for a Persian Part-Of-Speech (POS) tagging system.  ...  Using a Markov tagger the method is evaluated on the corpus. The experiments show the efficiency of the method in Persian POS tagging.  ...  ., 2007) presents evaluation of some tagging methods on texts in old version of Peykare (Textual Corpus of the Persian Language).  ... 
doi:10.1145/1363686.1364059 dblp:conf/sac/MohseniMMS08 fatcat:pfvqsxve25fcbnpi62trelh4m4

An Accurate Persian Part-of-Speech Tagger

Morteza Okhovvat, Mohsen Sharifi, Behrouz Minaei Bidgoli
2020 Computer systems science and engineering  
The processing of any natural language requires that the grammatical properties of every word in that language are tagged by a part of speech (POS) tagger.  ...  To present a more accurate POS tagger for the Persian language, we propose an improved and accurate tagger called IAoM that supports properties of text to speech systems such as Lexical Stress Search,  ...  In this paper, we propose a part-of-speech tagging system for the Persian corpus using the Hidden Markov Model (HMM) that is applied to both homogeneous and heterogeneous Persian corpuses.  ... 
doi:10.32604/csse.2020.35.423 fatcat:xj74i77e6zgsrilhfu344hdwpa

Improving Persian Information Retrieval Systems Using Stemming and Part of Speech Tagging [chapter]

Reza Karimpour, Amineh Ghorbani, Azadeh Pishdad, Mitra Mohtarami, Abolfazl AleAhmad, Hadi Amiri, Farhad Oroumchian
2009 Lecture Notes in Computer Science  
In this study, we have used part of speech properties of terms as extra source of information about document and query terms and have evaluated the impact of such data on the performance of the Persian  ...  Our findings indicate that part of speech tags may have small influence on effectiveness of the retrieved results.  ...  Part of Speech Tagging Part of speech tagging selects the most likely sequence of syntactic categories for the words in a sentence.  ... 
doi:10.1007/978-3-642-04447-2_10 fatcat:fqg2pq7ghnfwnpufqf2ut4mlsi

LSCP: Enhanced Large Scale Colloquial Persian Language Understanding [article]

Hadi Abdi Khojasteh, Ebrahim Ansari, Mahdi Bohlouli
2020 arXiv   pre-print
The proposed corpus consists of 120M sentences resulted from 27M tweets annotated with parsing tree, part-of-speech tags, sentiment polarity and translation in five different languages.  ...  This consists of a significant gap in describing the colloquial language especially for low-resourced ones such as Persian.  ...  The research was supported by the grants 19-26934X (NEUREM3) of the Czech Science Foundation. The authors would like to thank Kinal Mehta, for his invaluable help and cooperation in this project.  ... 
arXiv:2003.06499v1 fatcat:yfqax7kb7nf2rj2oy56ttkoa4e

A large vocabulary continuous speech recognition system for Persian language

Hossein Sameti, Hadi Veisi, Mohammad Bahrani, Bagher Babaali, Khosro Hosseinzadeh
2011 EURASIP Journal on Audio, Speech, and Music Processing  
For this purpose, we had to identify the computational challenges of the Persian language, especially for text processing and extract statistical and grammatical language models for the Persian language  ...  To achieve this, we had to either generate the necessary speech and text corpora or modify the available primitive corpora available for the Persian language.  ...  ); bigram statistics of words; trigram statistics of words; unigram statistics of POS tags (for 166 tags); bigram statistics of POS tags; trigram statistics of POS tags; number of assigning one POS tag  ... 
doi:10.1186/1687-4722-2011-426795 fatcat:wm2tzdvtpfealbl2vvxtecueqe

ViraPart: A Text Refinement Framework for Automatic Speech Recognition and Natural Language Processing Tasks in Persian [article]

Narges Farokhshad, Milad Molazadeh, Saman Jamalabbasi, Hamed Babaei Giglou, Saeed Bibak
2021 arXiv   pre-print
In most of the works in Persian, these techniques are addressed individually. Despite that, we believe that for text refinement in Persian, all of these tasks are necessary.  ...  Experimental results show that our proposed approach is very effective in text refinement for the Persian language.  ...  ACKNOWLEDGMENT The authors would like to give big thanks to Part AI Research Center (the biggest AI company in Iran) for supporting and funding this research.  ... 
arXiv:2110.09086v3 fatcat:c5njbcwuurfxrh37oxkrd3kdc4

A Probabilistic Approach to Persian Ezafe Recognition

Habibollah Asghari, Jalal Maleki, Heshaam Faili
2014 Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers  
In this paper, we investigate the problem of Ezafe recognition in Persian language. Ezafe is an unstressed vowel that is usually not written, but is intelligently recognized and pronounced by human.  ...  In this paper, POS tags augmented by Ezafe tags (POSE) have been used to train a probabilistic model for Ezafe recognition.  ...  Ezafe recognition as a POS tagging problem Part Of Speech tagging is an effective way for automatically assigning grammatical tags to words in a text.  ... 
doi:10.3115/v1/e14-4027 dblp:conf/eacl/AsghariMF14 fatcat:k23zunpmc5grpfgtpvgy2ghcpm

A Speech Act Classifier for Persian Texts and its Application in Identifying Rumors [article]

Zoleikha Jahanbakhsh-Nagadeh, Mohammad-Reza Feizi-Derakhshi, Arash Sharifi
2020 arXiv   pre-print
This study presents a dictionary-based statistical technique for Persian SA recognition.  ...  Knowledge of the SA of a text can be helpful in analyzing that text in natural language processing applications.  ...  and negative polarity in the text T. 3) Syntactic Features • Part-of-Speech (POS) tags.  ... 
arXiv:1901.03904v4 fatcat:cshowghcafh47gez4erglchwrm

Lessons from building a Persian written corpus: Peykare

Mahmood Bijankhan, Javad Sheykhzadegan, Mohammad Bahrani, Masood Ghayoomi
2010 Language Resources and Evaluation  
To annotate Peykare, we use EAGLES guidelines which result to have a hierarchy in the part-of-speech tags. To this aim, we apply a semi-automatic approach for the annotation methodology.  ...  For tokenization of Persian, we propose a descriptive generalization to normalize orthographic variations existing in texts.  ...  Acknowledgments This project was funded by the Higher Council for Informatics of Iran and the University of Tehran under the contract number 190/3554.  ... 
doi:10.1007/s10579-010-9132-x fatcat:jq3yh76navb5dkzko4kdhledju

Developing a Persian chunker using a hybrid approach

Soheila Kiani, Tara Akhavan, Mehrnoush Shamsfard
2009 2009 International Multiconference on Computer Science and Information Technology  
Our system exploits a hybrid method for automatic chunking of Persian texts.  ...  Text segmentation is the process of recognizing boundaries of text constituents, such as sentences, phrases and words. This paper focuses on phrase segmentation also known as chunking.  ...  They presented a Support Vector Machine (SVM) based approach to automatically do tokenization, part-of-speech (POS) tagging and annotating base phrases (BPs) in Arabic texts.  ... 
doi:10.1109/imcsit.2009.5352723 dblp:conf/imcsit/KianiAS09 fatcat:ju5zjmrkinbude3pcvvcpmz5oy

PGST: a Polyglot Gender Style Transfer method [article]

Reza Khanmohammadi, Seyed Abolghasem Mirroshandel
2021 arXiv   pre-print
Since different approaches are introduced in our research, we determine a trade-off value for evaluating different models' success in faking our gender identification model with transferred text.  ...  In this research, we introduce PGST, a novel polyglot text style transfer approach in the gender domain, composed of different constitutive elements.  ...  Meaning sentences are still expressive even if specific part-of-speech tags are relocated.  ... 
arXiv:2009.01040v2 fatcat:wnwwsafmkrbbzhhhyjbi3gbhdu