Filters








186 Hits in 5.9 sec

Machine Translation System in Indian Perspectives

Dwivedi
2010 Journal of Computer Science  
Approach: Most of the state government works in there provincial languages, whereas the central government's official documents and reports are in English and Hindi.  ...  Conclusion/Recommendations: There are certain machine translation systems that have been developed in India for translation from English to Indian languages by using different approaches.  ...  English to (Hindi, Kannada, Tamil) and Kannada to Tamil language-pair example based machine translation system: English to {Hindi, Kannada and Tamil} and Kannada to Tamil language-pair example based machine  ... 
doi:10.3844/jcssp.2010.1111.1116 fatcat:sn67snf37vcw3k3b2oajnw5roq

Machine Translation Approaches and Survey for Indian Languages [article]

Nadeem Jadoon Khan, Waqas Anwar, Nadir Durrani
2017 arXiv   pre-print
In this study, we present an analysis regarding the performance of the state-of-art Phrase-based Statistical Machine Translation (SMT) on multiple Indian languages.  ...  We report the performance of baseline systems translating from Indian languages (Bengali, Guajarati, Hindi, Malayalam, Punjabi, Tamil, Telugu and Urdu) into English with average 10% accurate results for  ...  In their work they discussed the simple implementation of default phrase-based model for SMT for English to Hindi and also give an overview of different Machine translation applications that are in use  ... 
arXiv:1701.04290v1 fatcat:awsv2wioljduzofvkinejcl3wu

Machine Translation Systems for Indian Languages

Latha R.Nair, David Peter S.
2012 International Journal of Computer Applications  
Syntactic structure transfer for verbs from Tamil to Hindi is discussed in [5] .  ...  An English to Hindi MT system which combines RBMT and phrase-based SMT approach was developed at IIIT Hyderabad in 2010.  ... 
doi:10.5120/4785-7014 fatcat:cnsprg7cyzaodn3wq5z6vhoolu

Neural Machine Translation for Low-Resourced Indian Languages [article]

Himanshu Choudhary, Shivansh Rao, Rajesh Rohilla
2020 arXiv   pre-print
In this paper, we have applied NMT on two of the most morphological rich Indian languages, i.e. English-Tamil and English-Malayalam.  ...  A large number of significant assets are available online in English, which is frequently translated into native languages to ease the information sharing among local people who are not much familiar with  ...  For instance, English has Subject-Verb-Object (SVO) whereas Tamil and Malayalam have Subject-Object-Verb (SOV).  ... 
arXiv:2004.13819v1 fatcat:qcdmyjouuve3xkrfivdo7ursve

Utilizing Language Relatedness to improve Machine Translation: A Case Study on Languages of the Indian Subcontinent [article]

Anoop Kunchukuttan, Pushpak Bhattacharyya
2020 arXiv   pre-print
In this work, we present an extensive study of statistical machine translation involving languages of the Indian subcontinent. These languages are related by genetic and contact relationships.  ...  Our observations span 90 language pairs from 9 Indic languages and English.  ...  Translation from Indic Languages to English We trained phrase-based SMT systems for translation from Indic languages to English using the Moses (Koehn et al., 2007) , with the grow-diag-final-and heuristic  ... 
arXiv:2003.08925v1 fatcat:f3vqq36alfewfeilgenw3t7nyy

INTEGRATING MACHINE TRANSLATION AND SPEECH SYNTHESIS COMPONENT FOR ENGLISH TO DRAVIDIAN LANGUAGE SPEECH TO SPEECH TRANSLATION SYSTEM

J. SANGEETHA, S. JOTHILAKSHMI
2015 Journal of Engineering Science and Technology  
This paper provides an interface between the machine translation and speech synthesis system for converting English speech to Tamil text in English to Tamil speech to speech translation system.  ...  In this paper, we focus on integration of machine translation and speech synthesis, and report a subjective evaluation to investigate the impact of speech synthesis, machine translation and the integration  ...  develop ment of English to Tamil hybrid machine translation system (co mb ination of rule based and statistical mach ine translation system) and syllab le based concaten ative speech synthesis system.  ... 
doaj:06bae093162d4267bff0ab88b507baa6 fatcat:idfuwpmxjbepxi33robdntaq6q

English to Malayalam translation

Mary Priya Sebastian, Sheena Kurian K, G. Santhosh Kumar
2010 Proceedings of the 1st Amrita ACM-W Celebration on Women in Computing in India - A2CWiC '10  
By using a monolingual Malayalam corpus and a bilingual English/Malayalam corpus in the training phase, the machine automatically generates Malayalam translations of English sentences.  ...  Pre-processing techniques like suffix separation from the Malayalam corpus and stop word elimination from the bilingual corpus also proved to be effective in training.  ...  Another work on English to Tamil statistical machine translation is also reported in [2] .  ... 
doi:10.1145/1858378.1858442 dblp:conf/a2cwic/SebastianKK10 fatcat:vsrk5irkknbclb6pvojatmoihy

XLIFF: Multilingual Translation Memory Management among Divergent Language Families

Guo Jiangzhen, Henan Polytechnic University, Priyanka Pawar, Pratik Ardhapurkar, Priyanka Jain, Anuradha Lele, Ajai Kumar, Hermant Darbari, Centre for Development of Advanced Computing, Centre for Development of Advanced Computing, Centre for Development of Advanced Computing, Centre for Development of Advanced Computing (+2 others)
2015 International Journal of Smart Business and Technology  
This process uses bilingual data set along with other language assets to frame language and phrase models which are used while translating text in machine translation.  ...  In this paper, we are introducing format retention utility using "XLIFF" as an important tool to English to Indian Language Machine Translation.  ...  Fig. 10 shows Format Retention of English webpage to Bengali, Hindi and Tamil Languages for HTML format.  ... 
doi:10.21742/ijsbt.2015.3.1.03 fatcat:jgobklexrbfmna2qd7kkhp7i3e

XLIFF : Multilingual Translation Memory Management among divergent language families

Priyanka Pawar, Pratik Ardhapurkar, Priyanka Jain, Anuradha Lele, Ajai Kumar, Hemant Darbari
2015 International Journal of Smart Business and Technology  
This process uses bilingual data set along with other language assets to frame language and phrase models which are used while translating text in machine translation.  ...  Indian languages belong majorly to Aryan and Dravidian languages which are flexible free order in Subject-Object-Verb [SOV] syntactic order where as English Language has syntactic order Subject-Verb-Object  ...  English Hindi and Hemant Darbari Bengali Tamil Figure 10.  ... 
doi:10.21742/ijsbt.2015.3.2.01 fatcat:ragrczxgzjawvbf2ch7uzrlciu

A Statistical Study of Telugu Treebanks

Praveen Gatla
2021 Translation Today  
Later it discusses the comparison of both the treebanks and tries to provide the reasons for the highest and lowest frequencies in both the treebanks. k1 and k2 have 60% of the coverage in karaka labels  ...  The most frequent non-k-labels are verb modifier (vmod) (949 times) and noun modifier (nmod) (1033 times) in both the treebanks.  ...  They are Bhojpuri, Hindi, Hindi-English, Kangri, Magahi, Marathi, Sanskrit, Tamil, and Urdu. There are four upcoming Indian languages under UD treebank.  ... 
doi:10.46623/tt/2021.15.1.ar6 fatcat:ie5wu6eeyrhezfokh4xtqo5c7m

On the Difficulty of Translating Free-Order Case-Marking Languages [article]

Arianna Bisazza, Ahmet Üstün, Stephan Sportel
2021 arXiv   pre-print
In this work, we investigate whether this class of languages is also more difficult to translate by state-of-the-art Neural Machine Translation models (NMT).  ...  Using a variety of synthetic languages and a newly introduced translation challenge set, we find that word order flexibility in the source language only leads to a very small loss of NMT quality, even  ...  We would like to thank the Center for Information Technology of the University of Groningen for providing access to the Peregrine HPC cluster, and the anonymous reviewers for their helpful comments.  ... 
arXiv:2107.06055v1 fatcat:sjpvlv5arjb6lb7hsc6hdr7ffe

A Study of Neural Machine Translation from Chinese to Urdu

. Zeeshan
2020 Journal of Autonomous Intelligence  
In this paper, the Chinese language is translated to the Urdu language with the help of Open Neural Machine Translation (OpenNMT) in Deep Learning.  ...  Neural Machine Translation (NMT) is one of the most standard machine translation methods, which has made great progress in the recent years especially in non-universal languages.  ...  According to modern exploration, the performance report of baseline systems in translating Indian languages-based text (Bengali, Hindi, Malayalam, Punjabi, Tamil, Telugu, Gujarati, and Urdu) into English  ... 
doi:10.32629/jai.v2i4.82 fatcat:d7qi6jadlvcozmkvyx4t6bevhy

Survey on Publicly Available Sinhala Natural Language Processing Tools and Research [article]

Nisansa de Silva
2022 arXiv   pre-print
However, due to poverty in both linguistic and economic capital, Sinhala, in the perspective of Natural Language Processing tools and research, remains a resource-poor language which has neither the economic  ...  As such, we shall be uploading this paper to arXiv and perpetually update it periodically to reflect the advances made in the field.  ...  Similarly, the authors would also like to thank Shravan Kale for checking the examples we have provided in Hindi for their accuracy.  ... 
arXiv:1906.02358v12 fatcat:3bvyvces4zhzzhnvjgaqqacbzy

Page 354 of Linguistics and Language Behavior Abstracts: LLBA Vol. 29, Issue 1 [page]

1995 Linguistics and Language Behavior Abstracts: LLBA  
English/indigenous languages translation development, South Africa, political /social variables; 9500762 English/Spanish determiner phrases model, sentence structure similar- ity, N.  ...  /lexical items; grammaticality judgments; educated speakers; 9500991 instructional language, English vs Kiswahili, Tanzania; 9501634 interlingua machine translation system design, English-Chinese appli  ... 

Multilingual Multiword Expressions [article]

Lahari Poddar
2016 arXiv   pre-print
The project aims to provide a semi-supervised approach to identify Multiword Expressions in a multilingual context consisting of English and most of the major Indian languages.  ...  The word sequences matching the required pattern are subjected to a series of linguistic tests which include verb filtering, named entity filtering and hyphenation filtering test to exclude false positives  ...  a machine translation system from English to Hindi is erroneous due to the ignorance of multiword expressions in the sentence.  ... 
arXiv:1612.00246v1 fatcat:4rs33ntycnadvnn6e3atd3ixra
« Previous Showing results 1 — 15 out of 186 results