2,479 Hits in 4.1 sec

Text chunking using regularized Winnow

Tong Zhang, Fred Damerau, David Johnson
2001 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics - ACL '01  
Many machine learning methods have recently been applied to natural language processing tasks.  ...  Among them, the Winnow algorithm has been argued to be particularly suitable for NLP problems, due to its robustness to irrelevant features.  ...  Each kernel support vector machine is computationally significantly more expensive than a corresponding Winnow classifier, and they use an order of magnitude more classifiers.  ... 
doi:10.3115/1073012.1073081 dblp:conf/acl/ZhangDJ01 fatcat:gw2m5lirnzhh7chomyycnjgzqi

Robust and efficient multiclass SVM models for phrase pattern recognition

Yu-Chieh Wu, Yue-Shi Lee, Jie-Chi Yang
2008 Pattern Recognition  
Support vector machines (SVMs)-based methods had shown excellent performance in many sequential text pattern recognition tasks such as protein name finding, and noun phrase (NP)-chunking.  ...  Experiments on CoNLL-2000 chunking and Chinese base-chunking tasks showed that our method can achieve very competitive accuracy and at least 100 times faster than the state-of-the-art SVM-based phrase  ...  SVM-based phrase chunking models Support vector machines (SVM) [21] was originally designed for binary classification problems.  ... 
doi:10.1016/j.patcog.2008.02.010 fatcat:olvoty5gc5etnht55ngipbdrym

Domain based Chunking

Nilamadhaba Mohapatra, Namrata Sarraf, Swapna sarit Sahu
2021 International Journal on Natural Language Computing  
These annotated queries were used for training the machine learning model using an ensemble transformer-based deep neural network model [24.]  ...  Chunking means splitting the sentences into tokens and then grouping them in a meaningful way.  ...  Chunking has been done using machine learning-based models such as HMM(Hidden Markov Model) [7] [17] and Maximum Entropy model and has gradually seen a shift to Statistical models such as Support Vector  ... 
doi:10.5121/ijnlc.2021.10401 fatcat:ak7zrtzvfnei3gzxi4qz2r67ne

Integrating Text Chunking with Mixture Hidden Markov Models for Effective Biomedical Information Extraction [chapter]

Min Song, Il-Yeol Song, Xiaohua Hu, Robert B. Allen
2005 Lecture Notes in Computer Science  
We compared KXtractor with three IE techniques: 1) RAPIER, an inductive learning-based machine learning system, 2) a Dictionary-based extraction system, and 3) single POS HMM.  ...  This paper presents a new information extraction (IE) technique, KXtractor, which integrates a text chunking technique with Mixture Hidden Markov Models (MiHMM).  ...  Text chunking based on Support Vector Machines (SVMs) was reported to produce the highest accuracy in the text chunking task [5] .  ... 
doi:10.1007/11428848_124 fatcat:mbbaeslr3bhm5mgnmhq7qvhcnm

USAAR-SHEFFIELD: Semantic Textual Similarity with Deep Regression and Machine Translation Evaluation Metrics

Liling Tan, Carolina Scarton, Lucia Specia, Josef van Genabith
2015 Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015)  
We extend the work on using machine translation evaluation metrics in the STS task.  ...  In addition, we introduce a novel deep regressor architecture and evaluated its efficiency in the STS task.  ...  and base phrase chunks level.  ... 
doi:10.18653/v1/s15-2015 dblp:conf/semeval/TanSSG15 fatcat:wmjvuvvd4fc3blyy7cprb4y2qq

A Generic Online Parallel Learning Framework for Large Margin Models [article]

Shuming Ma, Xu Sun
2017 arXiv   pre-print
To speed up the training process, many existing systems use parallel technology for online learning algorithms.  ...  Our framework is lock-free and easy to implement on existing systems.  ...  Acknowledgements This work was supported in part by National Natural Science Foundation of China (No. 61673028), and National High Technology Research and Development Program of China (863 Program, No.  ... 
arXiv:1703.00786v1 fatcat:xzl7x5ecrvetxpiifwbtxiu3ae

Amharic Phrase Chunking with Conditional Random Fields

Alemebante Mulu, Vishal Goyal
2020 Journal of scientific research  
Totally, we are using 400,000 tagged words and have evaluated the result by the combination boundary identification and labeling.  ...  In this research work, our goal is to develop chunking for Amharic language by using different tagging schemes for identifying the chunk boundaries and incorporate the tangs in form of contextual information  ...  The task of chunking is ideally suited for machine learning and deep learning because of robustness and relatively easy training. C.  ... 
doi:10.37398/jsr.2020.640135 fatcat:cwefy2ypf5dg3g3iabpftzfabm

Semantic role labeling for protein transport predicates

Steven Bethard, Zhiyong Lu, James H Martin, Lawrence Hunter
2008 BMC Bioinformatics  
a word-chunking paradigm and trained support vector machine classifiers to classify words as being at the beginning, inside or outside of a protein transport role.  ...  We trained these models with the features of previous wordchunking models, features adapted from phrase-chunking models, and features derived from an analysis of our data.  ...  YamCha is based on Support Vector Machine models and has performed well on a variety of similar tasks [39, 40] .  ... 
doi:10.1186/1471-2105-9-277 pmid:18547432 pmcid:PMC2474622 fatcat:oewg2g4z6rctdkugaexhnxoyie

Keyphrases Extraction from Scientific Documents: Improving Machine Learning Approaches with Natural Language Processing [chapter]

Mikalai Krapivin, Aliaksandr Autayeu, Maurizio Marchese, Enrico Blanzieri, Nicola Segata
2010 Lecture Notes in Computer Science  
In this paper we use Natural Language Processing techniques to improve different machine learning approaches (Support Vector Machines (SVM), Local SVM, Random Forests) to the problem of automatic keyphrases  ...  Finally, we report a detailed analysis of the effect of the individual NLP features and data set size on the overall quality of extracted keyphrases.  ...  Support Vector Machines (SVMs) [17] are classifiers with sound foundations in statistical learning theory [18] considered, with RF, the state-of-the-art classification method.  ... 
doi:10.1007/978-3-642-13654-2_12 fatcat:kndnxam4wnf2hoqf4bz3wjlvem

Analysis of Intelligent English Chunk Recognition based on Knowledge Corpus

Mei Zhang
2022 Annals of Emerging Technologies in Computing  
To strengthen the RNN, it was improved by Long Short Term Memory (LSTM) for recognising English chunk. The LSTM-RNN was compared with support vector machine (SVM) and RNN in simulation experiments.  ...  Although corpora have already been widely used in the areas mentioned above, annotation and recognition of chunks are mainly done manually.  ...  The experiment based on three databases found that this method improved recognition accuracy, robustness, and recognition efficiency. Kai et al.  ... 
doi:10.33166/aetic.2022.03.002 fatcat:l6h6rcu6jrf5xex34zlcukkece

Very Short-Term Conflict Intensity Estimation Using Fisher Vectors

Gábor Gosztolya
2020 Interspeech 2020  
In this study we show that Support Vector Regression machine learning models using Fisher vectors as features, even when trained on longer utterances, allow us to efficiently and accurately detect conflict  ...  We also verify the validity of this approach by comparing the SVM predictions of the chunks with a manual annotation for the full and the 5-second-long cases.  ...  Acknowledgements This research was partially supported by the Ministry for Innovation and Technology, Hungary (grants TUDFO/47138-1/2019-ITM and the New National Excellence ProgramÚNKP-20-5), and by the  ... 
doi:10.21437/interspeech.2020-2349 dblp:conf/interspeech/Gosztolya20 fatcat:kvnmpux3pjazpnzaev6ylkvkmm

Filtering-Ranking Perceptron Learning for Partial Parsing

A. Xavier Carreras, B. Lluís Màrquez, C. Jorge Castro
2005 Machine Learning  
This work introduces a general phrase recognition system based on perceptrons, and a global online learning algorithm to train them together.  ...  the optimal phrase structure by discriminating among competing phrases.  ...  This work was supported by: the IST Programme of the European Community, under the PASCAL Network of Excellence (IST-2002-506778) and the Meaning project (IST-2001-34460); and the Spanish Research Department  ... 
doi:10.1007/s10994-005-0917-x fatcat:xk5fpnilhbg4loaiow4hvgdz6m

Chinese text chunking using lexicalized HMMs

Guo-Hong Fu, Rui-Feng Xu, Kang-Kwong Luke, Qin Lu
2005 2005 International Conference on Machine Learning and Cybernetics  
Acknowledgements We would like to thank the Institute of Computational Linguistics of the Peking University for their part-of-speech tagset, lexicon and corpus.  ...  (ME) [5] , memory-based learning (MBL) [4] [6] , and support vector machines (SVMs) [7] .  ...  In comparison with rule-based methods, machine-learning approaches are more adaptive and robust.  ... 
doi:10.1109/icmlc.2005.1526911 fatcat:55h62vhm25grthbuxgrt4hmbly

Chunk and Clause Identification for Basque by Filtering and Ranking with Perceptrons

Iñaki Alegria, Bertol Arrieta, Xavier Careras, Arantza Díaz de Ilarraza, Larraitz Uria
2008 Revista de Procesamiento de Lenguaje Natural (SEPLN)  
Resumen: Este artículo presenta sistemas de identificación de chunks y cláusulas para el euskera, combinando gramáticas basadas en reglas con técnicas de aprendizaje automático.  ...  Así, los resultados del identificador de chunks han mejorado considerablemente y se ha compensado la influencia del relativamente pequeño corpus de entrenamiento que disponemos para el euskera.  ...  Xavier Carreras was supported by the Catalan Ministry of Innovation, Universities and Enterprise.  ... 
dblp:journals/pdln/AlegriaACIU08 fatcat:2ctxw327v5ar5ang2g7ymsiyp4

A Hybrid Model For Phrase Chunking Employing Artificial Immunity System And Rule Based Methods

Bindu, Sumam Mary Idicula
2011 International Journal of Artificial Intelligence & Applications  
This ambiguity may be partially resolved by using chunking as an intermediate step. To the best of our knowledge no known work or tag set is available for phrase chunking in Malayalam.  ...  Identification of phrases or phrase chunking is an important step in natural language understanding (NLU). Chunker identifies and divides sentences into syntactically correlated word groups.  ...  In Tamil few taggars are developed employing support vector machines (SVM) and they obtained a tagging accuracy of 95.82%.  ... 
doi:10.5121/ijaia.2011.2408 fatcat:65f2ohdj6vfxhnch775ad32hpe
« Previous Showing results 1 — 15 out of 2,479 results