125 Hits in 9.6 sec

Towards Accurate Handwritten Word Recognition for Hindi and Bangla [chapter]

Kartik Dutta, Praveen Krishnan, Minesh Mathew, C. V. Jawahar
2018 Communications in Computer and Information Science  
We outperform the previous lexicon based, state of the art methods on the test set of Devanagari and Bangla tracks of RoyDB by a significant margin.  ...  In this work we focus building state of the art handwritten word recognizers for two popular Indic scripts -Devanagari and Bangla.  ...  Acknowledgement This work was partly supported by IMPRINT scheme, Govt. of India. The authors would also like to thank Oishika, Sounak and Sreya for their help in verifying the results for Bangla.  ... 
doi:10.1007/978-981-13-0020-2_41 fatcat:oaqatsafsjchvkljpzm2kas4vm

Bangla Natural Language Processing: A Comprehensive Review of Classical, Machine Learning, and Deep Learning Based Methods [article]

Ovishake Sen, Mohtasim Fuad, MD. Nazrul Islam, Jakaria Rabbi, MD. Kamrul Hasan, Mohammed Baz, Mehedi Masud, Md. Abdul Awal, Awal Ahmed Fime, Md. Tahmid Hasan Fuad, Delowar Sikder, MD. Akil Raihan Iftee
2021 arXiv   pre-print
The studies are mainly concentrated on the specific domains of BNLP, such as sentiment analysis, speech recognition, optical character recognition, and text summarization.  ...  Therefore, in this paper, we present a thorough review of 71 BNLP research papers and categorize them into 11 categories, namely Information Extraction, Machine Translation, Named Entity Recognition, Parsing  ...  [44] suggested a text summarization technique on Bangla text document. Their model is extraction-based and summarizes a single document at a time.  ... 
arXiv:2105.14875v2 fatcat:kvqmgxpthvh2fj7jza64n6kaiq

Study of Different Features and Classification Techniques for Recognition of Handwritten Devanagari Text

Vijay Vijay, M U Kharat, S V Gumaste
2018 International Journal of Engineering & Technology  
Recognition of handwritten Devanagari word is one of the popular area of research from decades because of its wide scope of applications.  ...  Different features and techniques of classification are the most important steps in the process of recognizing Devanagari handwritten word, are described in this paper.  ...  Dataset is created by collection of selected a paragraph of printed text from the various documents of History, Medical, Arts, Science and Religious.  ... 
doi:10.14419/ijet.v7i4.19.28285 fatcat:5vziba6hgjhchetwyzsrvqclya


2021 Zenodo  
Social media platforms hold a vast volume of raw data that has been posted by people in the forms of texts, images, audio and video. People use this medium to express their thoughts and opinions.  ...  The purpose of this paper is to explain the knowledge gap and the proposed model by using Bangla language sentiment analysis.  ...  Dey et al, 2019 [24] Sentiment analysis on Bengali text using lexicon based approach Depression Detection Text Bangla Linguistic Dictionary in Bangla Language C.  ... 
doi:10.5281/zenodo.5392869 fatcat:226k3nzxcrf6rkzvkkw2fwobya

Recent Progress, Emerging Techniques, and Future Research Prospects of Bangla Machine Translation: A Systematic Review

M. A. H. Akhand, Arna Roy, Argha Chandra Dhar, Md Abdus Samad Kamal
2021 International Journal of Advanced Computer Science and Applications  
Machine Translation (MT), the way of translating texts or documents from a source language to a target language automatically without human intervention, has gained popularity in the growing information  ...  The following subsections briefly discuss the fundamental points of the five basic MT methods to understand different Bangla MT studies easily. 1) Rule-Based MT (RBMT): Based on linguistic information,  ...  Then the translated texts are found by MOSES decoder. On the other hand, Rabbani et al. [75] proposed a hybrid phrase-based E2B MT using the concept of RBMT and SMT.  ... 
doi:10.14569/ijacsa.2021.0120933 fatcat:5v2spta7vzgglix6xq5wfet7wa

Handwritten Optical Character Recognition (OCR): A Comprehensive Systematic Literature Review (SLR)

Jamshed Memon, Maira Sami, Rizwan Ahmed Khan, Mueen Uddin
2020 IEEE Access  
The objective of this review paper is to summarize research that has been conducted on character recognition of handwritten documents and to provide research directions.  ...  Given the ubiquity of handwritten documents in human transactions, Optical Character Recognition (OCR) of documents have invaluable practical worth.  ...  An OCR system depends mainly, on the extraction of features and discrimination/classification of these features (based on patterns).  ... 
doi:10.1109/access.2020.3012542 fatcat:f5bfni5kbfhf3i63lvv3t6pena

Automatic Monitoring Social Dynamics During Big Incidences: A Case Study of COVID-19 in Bangladesh [article]

Fahim Shahriar, Md Abul Bashar
2021 arXiv   pre-print
On the other hand, social media often spread rumors and misleading news to get more traffic and attention.  ...  Bangladesh over a period of time.  ...  One of the most popular topics modeling technique Latent Dirichlet Allocation (LDA) (Blei et al., 2003; Bashar et al., 2020a) discovers topics based on word recurrence in a set of documents.  ... 
arXiv:2101.09667v2 fatcat:obaiifbixbcsrmdn377rzyubcq

Machine Learning Techniques for Sentiment Analysis of Indian Languages

2019 International journal of recent technology and engineering  
A number of machine learning techniques have been applied on this textual data set. Basic concepts of Sentiment analysis shall be discussed with focus on Indian language text in this paper.  ...  With the increase in Indian language text, researchers find it quite fascinating to infer valuable information from this unstructured text data.  ...  Authors in [16] presented an approach for Classification of Bangla text documents based on inverse class frequency.  ... 
doi:10.35940/ijrte.b1456.0982s1119 fatcat:k4uqob44xffjzlw2zw46yfx56q

BANNER: A Cost-Sensitive Contextualized Model For Bangla Named Entity Recognition

Imranul Ashrafi, Muntasir Mohammad, Arani Shawkat Mauree, Galib Md. Azraf Nijhum, Redwanul Karim, Nabeel Mohammed, Sifat Momen
2020 IEEE Access  
an improvement of over 8% F1 MUC score on a recently introduced Bangla NER dataset when compared to previously published work.  ...  In this paper, we perform the NER task on Bangla Language using Word2Vec and contextual Bidirectional Encoder Representations from Transformers (BERT) embeddings.  ...  They also compared feature-based and fine-tuning based strategies. Xue et al. in [49] fine-tuned the BERT model to focus on the NER and Relation Extraction task words in medical texts.  ... 
doi:10.1109/access.2020.2982427 fatcat:ujdbt3urh5gzrkmo4yc66oputu

L-Boost: Identifying Offensive Texts from Social Media Post in Bengali

M. F. Mridha, Md. Anwar Hussen Wadud, Md. Abdul Hamid, Muhammad Mostafa Monowar, M. Abdullah-Al-Wadud, Atif Alamri
2021 IEEE Access  
A survey on different text categorization techniques for text filtration.  ...  w2 , ..., w768 ) of each text document.  ... 
doi:10.1109/access.2021.3134154 fatcat:jaaavefprne2xlukdtywtxzd6a

State of the Art in Authorship Attribution With Impact Analysis of Stylometric Features on Style Breach Prediction

Rajesh Shardanand Prasad, Midhun Chakkaravarthy
2022 Journal of Cases on Information Technology  
The reference material contributes robust classifiers with reasonable array of feature extraction techniques, such as Dirichlet–multinomial change point regression to extract the progress of inscription  ...  This paper presents quantifiable evaluation of the research in terms of year-wise research output, diversity of applications, nature of collaboration, characteristics of highly productive techniques and  ...  Furthermore, such hybridized feature space provides improved efficiency than the feature space used by the non-hybridized feature space.  ... 
doi:10.4018/jcit.296716 fatcat:5i6sb6od5bafvdrv4ly5vpz46u

Formal Modeling and Verification of Trusted OLSR Protocol Using I-SPIN Model Checker

Harpreet Kaur
2012 IOSR Journal of Computer Engineering  
To validate the improved version of the protocol a technique of formal modeling and verification is used by the utilization of established Model Checker I-SPIN and PROMELA language for validation of Trusted  ...  In order to enhance the security of conventional OLSR Protocol trust is incorporated as additional security measure in the functioning of the protocol.  ...  Shilong Ma in Beijing University of Aeronautics and Astronautics (BUAA), China. This work was supported in part by Prof. Dr. Mohammed Sakre, AL-Shorouk Academy, Egypt.  ... 
doi:10.9790/0661-0410105 fatcat:2ye2jbyhyzgclnjrmctzwkivwa

Regional Language Support for Patient-inclusive Decision Making in Breast Cancer Pathology Domain

2019 International journal of recent technology and engineering  
Medical documents generated in English by Medical practitioners may be understood only by patients with adequate medical knowledge and proficiency in English language.  ...  In India, Breast cancer is the number one killer disease among women. The fast-growing breast cancer patient population demands development of a CDSS for the domain with patient-inclusive features.  ...  ACKNOWLEDGMENT The authors thank the Department of Pathology, Christian Medical College and Hospital, Vellore for providing them with the sample data for their study. Our special thanks to Dr.  ... 
doi:10.35940/ijrte.c6518.098319 fatcat:d22azrqtk5c43pbs3y4xlny4ou

Computational intelligence in processing of speech acoustics: a survey

Amitoj Singh, Navkiran Kaur, Vinay Kukreja, Virender Kadyan, Munish Kumar
2022 Complex & Intelligent Systems  
When compared with non-Indian languages, the research on speech recognition of Indian languages (except Hindi) has not achieved the expected milestone yet.  ...  This paper presents a comprehensive survey on the speech recognition techniques for non-Indian and Indian languages, and compiled some of the computational models used for processing speech acoustics.  ...  The authors proposed a lightly supervised training scheme based on statistical language model transformation that fills the gap between faithful transcripts of spoken utterances and final texts for documentation  ... 
doi:10.1007/s40747-022-00665-1 fatcat:6pu2xccbq5as7bn2y2tav2fdwa


2021 2021 International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT)  
Cyberbullying in Bangla and Romanized Bangla text: A Comparative Study Md.  ...  Protocol in Wireless Sensor Network Sanjay Kumar Mirania and Kanika Sharma 295663 CHEC1087 Analysis of ECG Signal based on Feature Fusion and Two-Fold Classification Approach Nabanita Sinha and  ... 
doi:10.1109/icaect49130.2021.9392460 fatcat:a4xrica7hjegvfvcqypdzn2mfq
« Previous Showing results 1 — 15 out of 125 results