A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2004; you can also visit the original URL.
The file type is application/pdf
.
Automated labeling of bibliographic data extracted from biomedical online journals
2003
Document Recognition and Retrieval X
A prototype system has been designed to automate the extraction of bibliographic data (e.g., article title, authors, abstract, affiliation and others) from online biomedical journals to populate the National Library of Medicine's MEDLINE database. This paper describes a key module in this system: the labeling module that employs statistics and fuzzy rule-based algorithms to identify segmented zones in an article's HTML pages as specific bibliographic data. Results from experiments conducted
doi:10.1117/12.476047
dblp:conf/drr/KimLT03
fatcat:d63h5lijjfhrrp6nm2o2q2htja