Extraction of Keyphrases from Text: Evaluation of Four Algorithms [article]

Peter D. Turney
2002 arXiv   pre-print
This report presents an empirical evaluation of four algorithms for automatically extracting keywords and keyphrases from documents.  ...  The target keyphrases were generated for human readers; they were not tailored for any of the four keyphrase extraction algorithms.  ...  Introduction This report evaluates four different methods for automatically extracting keywords and keyphrases from documents.  ... 
arXiv:cs/0212014v1 fatcat:zvwb4m3vr5d7bkmwkjqpjspzhm

Extended list of stop words: Does it work for keyphrase extraction from short texts?

Svetlana Popova, Gabriella Skitalinskaya
2017 2017 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies (CSIT)  
Extracted stop words allow to improve the quality of the key phrase extraction algorithm.  ...  In this paper we study the problem of key phrase extraction from short texts written in Russian. As texts we consider messages posted on Internet car forums related to the purchase or repair of cars.  ...  list of stop words was extracted from the collection, and the quality evaluation of the algorithm was performed on the y collection.  ... 
doi:10.1109/stc-csit.2017.8098815 fatcat:5oqyskhaz5h35abc42zleanmsa

Query-Based Keyphrase Extraction from Long Documents

Martin Dočekal, Pavel Smrž
2022 Proceedings of the ... International Florida Artificial Intelligence Research Society Conference  
This paper overcomes this issue for keyphrase extraction by chunking the long documents while keeping a global context as a query defining the topic for which relevant keyphrases should be extracted.  ...  The developed system employs a pre-trained BERT model and adapts it to estimate the probability that a given text span forms a keyphrase.  ...  The computation used the infrastructure supported by the Ministry of Education, Youth and Sports of the Czech Republic through the e-INFRA CZ (ID:90140).  ... 
doi:10.32473/flairs.v35i.130737 fatcat:azo7psgumveqdlojeaicmgdgsy

KERT: Automatic Extraction and Ranking of Topical Keyphrases from Content-Representative Document Titles [article]

Marina Danilevsky, Chi Wang, Nihit Desai, Jingyi Guo, Jiawei Han
2013 arXiv   pre-print
By shifting from the unigram-centric traditional methods of unsupervised keyphrase extraction to a phrase-centric approach, we are able to directly compare and rank phrases of different lengths.  ...  We introduce KERT (Keyphrase Extraction and Ranking by Topic), a framework for topical keyphrase generation and ranking.  ...  Unlike most of these methods which extract keyphrases from documents, we aim to extract keyphrases from a corpus of short texts.  ... 
arXiv:1306.0271v1 fatcat:w6iib3arkjbvjpbbnama73ovxy

An Efficient Approach for Keyphrase Extraction from English Document

Imtiaz Hossain Emu, Asraf Uddin Ahmed, Manowarul Islam, Selim Al Mamun, Ashraf Uddin
2017 International Journal of Intelligent Systems and Applications  
Keyphrases are set of  ...  Finally, we evaluate some keyphrases with highest weight measure. The number of keyphrases to be extracted is maintained by a predefined threshold.  ...  PERFORMANCE EVALUATION We evaluate the proposed prototype keyphrase extraction system using the two well-known metrics: Precision and Recall.  ... 
doi:10.5815/ijisa.2017.12.06 fatcat:ydzo2dzoprgpphb2fnjgoe4dhq

Topical Keyphrase Extraction from Twitter

Wayne Xin Zhao, Jing Jiang, Jing He, Yang Song, Palakorn Achananuparp, Ee-Peng Lim, Xiaoming Li
2011 Annual Meeting of the Association for Computational Linguistics  
We evaluate our proposed methods on a large Twitter data set. Experiments show that these methods are very effective for topical keyphrase extraction.  ...  In this paper, we propose to extract topical keyphrases as one way to summarize Twitter.  ...  Evaluation Metrics Traditionally keyphrase extraction is evaluated using precision and recall on all the extracted keyphrases.  ... 
dblp:conf/acl/ZhaoJHSALL11 fatcat:hu2i5hwe4rbvti6wkoztxz2yyq

Automatic Keyphrase Extraction from Scientific Chinese Medical Abstracts Based on Character-Level Sequence Labeling

Liangping Ding, Zhixiong Zhang, Huan Liu, Jie Li, Gaihong Yu
2021 Journal of Data and Information Science  
Abstract. Purpose: Automatic keyphrase extraction (AKE) is an important task for grasping the main points of the text.  ...  And our proposed dataset provides a unified method for model evaluation and can promote the development of Chinese automatic keyphrase extraction to some extent.  ...  G190091) from the National Science Library, Chinese Academy of Sciences and the project "Design and Research on a Next Generation of Open Knowledge Services System and Key Technologies" (2019XM55).  ... 
doi:10.2478/jdis-2021-0013 fatcat:7ugts2hqsjhknghscpuy4gzq3y

Evaluating the Practical Applicability of Thesaurus-Based Keyphrase Extraction in the Agricultural Domain: Insights from the VOA3R Project

David Martín-Moncunill, Elena García-Barriocanal, Miguel-Angel Sicilia, and Salvador Sánchez-Alonso
2015 Knowledge organization  
This paper presents an evaluation of keyphrase extraction using the KEA software and the AGROVOC vocabulary on a sample of a large collection of metadata in the field of agriculture from the AGRIS database  ...  This effort includes a double evaluation, the classical automatic evaluation based on precision and recall measures, plus a blind evaluation aimed to contrast the quality of the keyphrases extracted against  ...  A fragment of a resource description using AGRIS AP. The code shows four keyphrases from AGROVOC (in boldface).  ... 
doi:10.5771/0943-7444-2015-2-76 fatcat:wbqfcrsb4rdsnki6czsvy77vfq

Keyphrases Extraction from User-Generated Contents in Healthcare Domain Using Long Short-Term Memory Networks

Ilham Fathy Saputra, Rahmad Mahendra, Alfan Farizki Wicaksono
2018 Proceedings of the BioNLP 2018 workshop  
RAKE and CRF, on the task of extracting keyphrases from Indonesian health forum posts.  ...  We propose keyphrases extraction technique to extract important terms from the healthcare user-generated contents. We employ deep learning architecture, i.e.  ...  We extract four types of medical entity from the text, i.e. drug, treatment, symptom, and disease. The medical entity often become part of a keyphrase of the sentence or document.  ... 
doi:10.18653/v1/w18-2304 dblp:conf/bionlp/SaputraMW18 fatcat:wdawzrjukzfape4vn2rpyj4zwe

Citation-Enhanced Keyphrase Extraction from Research Papers: A Supervised Approach

Cornelia Caragea, Florin Adrian Bulgarov, Andreea Godea, Sujatha Das Gollapalli
2014 Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)  
In this paper, we propose a supervised model for keyphrase extraction from research papers, which are embedded in citation networks.  ...  Given the large amounts of online textual documents available these days, e.g., news articles, weblogs, and scientific papers, effective methods for extracting keyphrases, which provide a high-level topic  ...  Any opinions, findings, and conclusions expressed here are those of the authors and do not necessarily reflect the views of NSF.  ... 
doi:10.3115/v1/d14-1150 dblp:conf/emnlp/CarageaBGG14 fatcat:hinzsef3lzfd3irmhgkdm6t34e

A Distributed Framework for NLP-Based Keyword and Keyphrase Extraction From Web Pages and Documents

Paolo Nesi, Gianni Pantaleo, Gianmarco Sanesi
2015 Proceedings of the 21st International Conference on Distributed Multimedia Systems  
The proposed framework has been evaluated against a real corpus of web pages and documents.  ...  In order to automatically ingest and process such huge amounts of data, single-machine, non-distributed architectures are proving to be inefficient for tasks like Big Data mining and intensive text processing  ...  CONCLUSIONS AND FUTURE WORK In this paper, a distributed system for keywords and keyphrases extraction from text content of web pages and documents has been presented.  ... 
doi:10.18293/dms2015-024 dblp:conf/dms/NesiPS15 fatcat:ieenhxagojenfdivup23wt42h4

Extracting Significant Phrases from Text

Yuan J. Lui, Richard Brent, Ani Calinescu
2007 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07)  
The algorithm approaches the problem of keyphrase extraction as a classification task, and uses a combination of statistical and computational linguistics techniques, a new set of attributes, and a new  ...  Therefore, there is a need for automatic keyphrase extraction. This paper introduces a new domain independent keyphrase extraction algorithm.  ...  The data are used to train the algorithm to distinguish keyphrases from non-keyphrases. The resulting algorithm can then be applied to new documents for keyphrase extraction.  ... 
doi:10.1109/ainaw.2007.180 dblp:conf/aina/LuiBC07 fatcat:b52p4ypt3razpk2k2dhc5774vu

Extraction Of Significant Phrases From Text

Yuan J. Lui
2007 Zenodo  
The algorithm approaches the problem of keyphrase extraction as a classification task, and uses a combination of statistical and computational linguistics techniques, a new set of attributes, and a new  ...  Therefore, there is a need for automatic keyphrase extraction. This paper introduces a new domain independent keyphrase extraction algorithm.  ...  of a phrase and the number of occurrences of a phrase in the narrative text, anchor text and special text to extract keyphrases. 6) Use the extracted keywords and keyphrases to extract key sentences. 7  ... 
doi:10.5281/zenodo.1072618 fatcat:hsldpgufvfem7kpgztjh4p26qi

Extracting Information-rich Part of Texts using Text Denoising [article]

Rushdi Shams
2013 arXiv   pre-print
When applied on tasks like biomedical relation bearing text extraction, keyphrase indexing and extracting sentences describing protein interactions, it is evident that the reduced set of text produced  ...  The aim of this paper is to report on a novel text reduction technique, called Text Denoising, that highlights information-rich content when processing a large volume of text data, especially from the  ...  One is technical-processing large data, like that from biomedical texts, slows down many algorithms; another is even more important-algorithms can exhibit a decreased accuracy because of the noise, which  ... 
arXiv:1307.8060v1 fatcat:piays5giavewrkwfvqubfsnd3i

Text Preprocessing using Annotated Suffix Tree with Matching Keyphrase

Ionia Veritawati, Ito Wasito, T Basaruddin
2015 International Journal of Electrical and Computer Engineering (IJECE)  
Content of text is represented by keyphrases, which consist of one or more meaningful words. Keyphrases can be extracted from text through several steps of processing, including text preprocessing.  ...  Combination of four variations of preprocessing is used.  ...  Boris Mirkin for his contributions to this research. This research was supported partially by Grant from Directorate of Higher Education of Indonesia.  ... 
doi:10.11591/ijece.v5i3.pp409-420 fatcat:xm42wcqaufckfbvacogct2x7cm