Lexical-Based Alignment for Reconstruction of Structure in Parallel Texts [chapter]

Alexander Gelbukh, Grigori Sidorov, Liliana Chanona-Hernandez
Lecture Notes in Computer Science  
Also, we present a new method for evaluation of the algorithms of parallel texts alignment, which consists in restoration of the structure of the text in one of the languages using the units of the lower  ...  In this paper, we present an optimization algorithm for finding the best text alignment based on the lexical similarity and the results of its evaluation as compared with baseline methods (Gale and Church  ...  , and to propose an alternative method of evaluation of alignment algorithms based on reconstruction of the global text structure in one of the languages.  ... 
doi:10.1007/978-3-540-73351-5_37 fatcat:q2yyuqt475bztabu7qf6pasqde

Constructing a Family Tree of Ten Indo-European Languages with Delexicalized Cross-linguistic Transfer Patterns [article]

Yuanyuan Zhao, Weiwei Sun, Xiaojun Wan
2020 arXiv   pre-print
This allows us to quantitatively probe cross-linguistic transfer and extend inquiries of SLA.  ...  In this paper, we validate this hypothesis on ten Indo-European languages.  ...  Based on reliable syntactic analysis for aligned parallel data 3 , we can generate such patterns with grammar induction technologies.  ... 
arXiv:2007.09076v1 fatcat:bpzg5ww4bzgfdjpq3rmjqfueyi

Cross-Level Sentence Alignment [chapter]

Anna Ho, Francisco Oliveira, Fai Wong
2006 Computational Methods in Engineering & Science  
This paper describes a new model for a sentence alignment system for structurally different languages such as Chinese and Portuguese.  ...  We first complete the word level alignment by making use of the Chinese-Portuguese dictionary to get the basic translation rate between the two texts.  ...  [8] combines both statistical (length-based) and lexical methods in sentence alignment. Some lexical cues or parameters are included in order to maximize the result of a probability function.  ... 
doi:10.1007/978-3-540-48260-4_158 fatcat:yn7onem5z5arvgiwygbn6sudxy

Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen [article]

Yixin Cao, Ruihao Shui, Liangming Pan, Min-Yen Kan, Zhiyuan Liu, Tat-Seng Chua
2020 arXiv   pre-print
This is a challenging task, unaddressed in previous work, as it requires the models to have expert intelligence in order to modify text with a deep understanding of domain knowledge and structures.  ...  We establish the benchmark performance of five state-of-the-art models for style transfer and text simplification. The results demonstrate a significant gap between machine and human performance.  ...  Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of National Research Foundation, Singapore.  ... 
arXiv:2005.00701v1 fatcat:w5qfj34wiffifaq6xh2ohanxuu

Adapting Language Models for Non-Parallel Author-Stylized Rewriting

Bakhtiyar Syed, Gaurav Verma, Balaji Vasan Srinivasan, Anandhavelu Natarajan, Vasudeva Varma
2020 PROCEEDINGS OF THE THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE TWENTY-EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE  
To evaluate the efficacy of our approach, we propose a linguistically-motivated framework to quantify stylistic alignment of the generated text to the target author at lexical, syntactic and surface levels  ...  of a language model to rewrite an input text in a target author's style.  ...  We propose an evaluation framework to assess the efficacy of stylized text generation that accounts for alignment of lexical and syntactic aspects of style.  ... 
doi:10.1609/aaai.v34i05.6433 fatcat:qssbgjdvc5dhxkyyqmx767jlqm

Adapting Language Models for Non-Parallel Author-Stylized Rewriting [article]

Bakhtiyar Syed, Gaurav Verma, Balaji Vasan Srinivasan, Anandhavelu Natarajan, Vasudeva Varma
2020 arXiv   pre-print
To evaluate the efficacy of our approach, we propose a linguistically-motivated framework to quantify stylistic alignment of the generated text to the target author at lexical, syntactic and surface levels  ...  of a language model to rewrite an input text in a target author's style.  ...  We propose an evaluation framework to assess the efficacy of stylized text generation that accounts for alignment of lexical and syntactic aspects of style.  ... 
arXiv:1909.09962v3 fatcat:5of3gl35pvfkzd3nhugy7rgrjq

A Template-based Method for Constrained Neural Machine Translation [article]

Shuo Wang, Peng Li, Zhixing Tan, Zhaopeng Tu, Maosong Sun, Yang Liu
2022 arXiv   pre-print
Experimental results show that the proposed template-based methods can outperform several representative baselines in lexically and structurally constrained translation tasks.  ...  Machine translation systems are expected to cope with various types of constraints in many practical scenarios.  ...  Hashimoto et al. (2019) collect a parallel dataset consisting of structural text translated by human experts.  ... 
arXiv:2205.11255v1 fatcat:omqf6pfczraihklv35ghzu42pq

Neural Versus Non-Neural Text Simplification: A Case Study

Islam Nassar, Michelle Ananda-Rajah, Gholamreza Haffari
2019 Australasian Language Technology Association Workshop  
We propose a modular rule-based system for Text Simplification and show that it outperforms the state-of-the-art neural-based simplification system in terms of simplicity.  ...  Further, we present an adaptation of our system to handle domain-specific tasks, where we employ a hybrid approach of our rule-based system and phrase-based machine translation to simplify medical discharge  ...  Acknowledgements The authors are grateful to the reviewers for their insightful comments and feedback. This work is partly supported by the ARC Future Fellowship FT190100039 to G.  ... 
dblp:conf/acl-alta/NassarAH19 fatcat:hsjvqhpmdbhxfe6dsoayazph6a

Automatic Generation of Exercises for Second Language Learning from Parallel Corpus Data

2021 International Journal of TESOL Studies  
Comparing the text in one language with its translation into another (known) language makes the structure accessible to the learner.  ...  Authentic samples can be obtained from corpora, but it is necessary to identify material that is suitable for language learners. Parallel corpora of written text consist of translated material.  ...  large text collections and crowdsourcing techniques for innovative autonomous language learning applications".  ... 
doi:10.46451/ijts.2021.06.05 fatcat:z5i4rgd46zg4xljgja6g6hzhve

Grounded and Controllable Image Completion by Incorporating Lexical Semantics [article]

Shengyu Zhang, Tan Jiang, Qinghao Huang, Ziqi Tan, Zhou Zhao, Siliang Tang, Jin Yu, Hongxia Yang, Yi Yang, Fei Wu
2020 arXiv   pre-print
One major challenge for LSIC comes from modeling and aligning the structure of visual-semantic context and translating across different modalities.  ...  This can be true since the annotated captions for an image are often semantically equivalent in existing datasets, and thus there is only one paired text for a masked image in training.  ...  We propose to explicitly model the lexical semantic structure for Structure Completion.  ... 
arXiv:2003.00303v1 fatcat:shz72z6arrccxgs7lbwz5d6ftq

Potentials of aligned corpora in second language acquisition: Rethinking language pedagogy from a corpus perspective

Zoran Ristovic
2016 Nasleđe  
The paper encompasses mapping the theoretical background for the corpora exploitation in language pedagogy, pre-processing and the alignment process, exploitation of corpora with offering the repertoire  ...  The approach is based on the production of bilingual electronic corpus of English-Serbian texts, exploitation of the corpus, and tracing the cumulative effects on teaching, learning and acquisition of  ...  Acknowledgments We owe a huge debt of gratitude to Professor Vladislava Gordić-Petković, Faculty of Philosophy, Novi Sad, for the permission to incorporate her translations of Hemingway's stories into  ... 
doi:10.5937/naslkg1633095r fatcat:qly6xme3c5cunlnt644bdesbga

Page 2778 of Linguistics and Language Behavior Abstracts: LLBA Vol. 29, Issue 5 [page]

1995 Linguistics and Language Behavior Abstracts: LLBA  
right-handed male aged 47; 9509851 bilingual text word-for-word alignment achievement difficulty, internal-evidence-based algorithm use; 9511594 ro aise sentences, minimalist/movement analysis; cleftability  ...  , sentence meaning, interpretation instructions/point of view; 9511027 medieval Scandinavian provincial laws’ sentence/text structure, medieval German/English/Roman law comparisons; 9511224 parallel  ... 

Unsupervised Neural Text Simplification

Sai Surya, Abhijit Mishra, Anirban Laha, Parag Jain, Karthik Sankaranarayanan
2019 Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics  
The core framework is composed of a shared encoder and a pair of attentional-decoders, crucially assisted by discrimination-based losses and denoising.  ...  Our analysis (both quantitative and qualitative involving human evaluators) on public test data shows that the proposed model can perform text-simplification at both lexical and syntactic levels, competitive  ...  Sudeshna Sarkar for helpful discussions in this project. References Mikel Artetxe, Gorka Labaka, and Eneko Agirre. 2018a. Unsupervised statistical machine translation.  ... 
doi:10.18653/v1/p19-1198 dblp:conf/acl/SuryaMLJS19 fatcat:mvbgh5vzzndenbx3zx6e63mwpu

DRAG: Director-Generator Language Modelling Framework for Non-Parallel Author Stylized Rewriting [article]

Hrituraj Singh, Gaurav Verma, Aparna Garimella, Balaji Vasan Srinivasan
2021 arXiv   pre-print
Recent works in this area have leveraged Transformer-based language models in a denoising autoencoder setup to generate author stylized text without relying on a parallel corpus of data.  ...  Author stylized rewriting is the task of rewriting an input text in a particular author's style.  ...  For lexical style, the mean squared error is calculated between the 6-dimensional lexical alignment vector of the directives/generator outputs (calculated as the averaged sum of alignments of words in  ... 
arXiv:2101.11836v1 fatcat:c2h3g32jnrfn7pcpu5dw3lhel4

Unsupervised Neural Text Simplification [article]

Sai Surya, Abhijit Mishra, Anirban Laha, Parag Jain, Karthik Sankaranarayanan
2019 arXiv   pre-print
The core framework is composed of a shared encoder and a pair of attentional-decoders and gains knowledge of simplification through discrimination-based losses and denoising.  ...  Our analysis (both quantitative and qualitative involving human evaluators) on public test data shows that the proposed model can perform text-simplification at both lexical and syntactic levels, competitive  ...  Sudeshna Sarkar for helpful discussions in this project.  ... 
arXiv:1810.07931v6 fatcat:zwls6h3ovrc7xad443wluazoni