6,755 Hits in 6.5 sec

Article-level matching of Web of Science to a local database in a comparative context [article]

Linda Sīle, Raf Guns
We provide a step-by-step description for matching metadata for journal articles from a national database with metadata from the Web of Science using both algorithmic as well semi-manual methods.  ...  This paper presents an approach for matching article-level bibliographic metadata developed at the University of Anwerp branch of ECOOM.  ...  For example, in WoS bibliographic metadata for articles published in a non-English language are recorded in English. Hence, the matching of records on the basis of the title is not possible.  ... 
doi:10.6084/m9.figshare.13013816.v1 fatcat:3pizqm27ezdf5j7rygoavhgedq

Discovering Metadata Inconsistencies

Bruno Angeles, Cory McKay, Ichiro Fujinaga
2010 Zenodo  
The second statistic that is not higher in Table 1 than in Table 2 is -identical metadata‖ (artist, album, and title) for Classical music.  ...  This was done in order to find the percentage of metadata that was identical between the manuallymaintained metadata and that found on the MusicBrainz server of metadata.  ... 
doi:10.5281/zenodo.1415549 fatcat:jyfuoigggrde5dsyf6rd4r2heu

Characterizing Same Work Relationships in Large-Scale Digital Libraries [chapter]

Peter Organisciak, Summer Shetenhelm, Danielle Francisco Albuquerque Vasques, Krystyna Matusiak
2019 Lecture Notes in Computer Science  
Addressing this problem is complicated by metadata inconsistencies as well as structural/content differences.  ...  Following from work in algorithmically identifying duplicate works in the HathiTrust Digital Library, we present some cases that complicate our existing language for work entity relationships.  ...  Identical Manifestations with differing catalogue metadata, control numbers. Why not match same-work relationships by metadata, particularly by title and author?  ... 
doi:10.1007/978-3-030-15742-5_40 fatcat:vjpg2x4iwza4pa33mmgsyjjn5i

Misinformation and Bias in Metadata Processing: Matching in Large Databases

Gail Thornburg, W. Michael Oskins
2007 Information Technology and Libraries  
This article discusses structural, systems, and other types of bias that arise in matching new records to large databases.  ...  Problems of satisfying a "match" with sufficient flexibility and rigor in an environment of imperfect data are presented, and sources of unintentional variance are discussed.  ...  , Rogers, and Oskins article. 6 The authors discuss their analysis of differences in records that are similar but not identical, and which elements caused failure to match two records for the same item  ... 
doi:10.6017/ital.v26i2.3278 fatcat:ug7ym2q7ird45fw3hdtbreux44

Metadata for Name Disambiguation and Collocation

Jeffrey Beall
2010 Future Internet  
organization's name may appear in different forms in various online documents.  ...  Databases and search engines can use metadata as a tool to solve the problem of name ambiguity and name variation in online databases.  ...  For example, a natural-language search on "White, Edmund" doesn't match metadata recorded as "White.  ... 
doi:10.3390/fi2010001 fatcat:54anocj4bjanpjpqg3e76gv7mq

Side by Side: The Use of Multiple Subject Languages in Capturing Shifting Contexts around Historical Collections

Brian Dobreski, Jian Qin, Melissa Resnick
2019 NASKO  
Furthermore, neither of these perspectives may match with self-identifying language used in contemporary society.  ...  At the same time, original language used in these images is now often seen as demeaning, exploitative, and offensive.  ...  In the first phase, descriptive analysis was conducted on the title and subject metadata for the total set of 1,411 records.  ... 
doi:10.7152/nasko.v7i1.15615 fatcat:kipetj3nw5g3lphvy4p6jjxfaq

Author name disambiguation for collaboration network analysis and visualization

Andreas Strotmann, Dangzhi Zhao, Tania Bubela
2009 Proceedings of the American Society for Information Science and Technology  
A new algorithm for the cross-language disambiguation of author names is presented. The algorithm uses the matching of Russian and English papers and journal titles.  ...  For example, the paper titled as SCHEMATOLOGY IN A MULTI-LANGUAGE OPTIMIZER in has the title Schematology in a MJ I/T I-language OPT imizer in SpringerLink.  ...  An algorithm for the cross-language identity resolution using the SBRAS Open Archive is presented in [2] .  ... 
doi:10.1002/meet.2009.1450460218 fatcat:t5dolzdijnbbtl7znmni3forui


Zinaida Apanovich, A.P. Ershov Institute of Informatics Systems SB RAS
2019 Bulletin of the Novosibirsk Computing Center Series Computer Science  
Of special interest is the problem of crosslingual entity resolution for multilingual data integration since local language data sources are often more complete and accurate than global data sources.  ...  Nowadays, entity resolution is being intensively investigated in the context of the integration of heterogeneous data sets.  ...  An algorithm for the cross-language identity resolution using the SBRAS Open Archive is presented in [3] .  ... 
doi:10.31144/bncc.cs.2542-1972.2019.n43.p1-14 fatcat:piebj7ypfzfurkptwdjm5b4wv4

Developing practical automatic metadata assignment and evaluation tools for internet resources

Gordon W. Paynter
2005 Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries - JCDL '05  
This paper describes the development of practical automatic metadata assignment tools to support automatic record creation for virtual libraries, metadata repositories and digital libraries, with particular  ...  In this paper, we describe the form and function of common metadata fields, and identify appropriate performance measures for these fields.  ...  Language Assignment INFOMINE is a purely English-language resource, and any automatically-discovered records must also be predominantly in English. iVia assigns Language metadata based on two sources.  ... 
doi:10.1145/1065385.1065454 dblp:conf/jcdl/Paynter05 fatcat:rctmi2twfnfe3b2bqkisc7rlje

GIRT and the Use of Subject Metadata for Retrieval [chapter]

Vivien Petras
2005 Lecture Notes in Computer Science  
As in previous years, we compare our techniques of thesaurus matching and Entry Vocabulary Modules to simple machine translation techniques in bilingual retrieval.  ...  Retrieval results for the German and English GIRT collection for monolingual as well as bilingual retrieval (with English and German as query languages) will be represented.  ...  INTRODUCTION For several years now, the Berkeley group has been interested in how the use of subject metadata (additional to the full text of title and abstract of documents) can improve information retrieval  ... 
doi:10.1007/11519645_31 fatcat:niqguum4qjabba7s4mdt7sbb5a

Search across Different Media: Numeric Data Sets and Text Files

Michael Buckland, Aitao Chen, Fredric C. Gey, Ray R. Larson
2006 Information Technology and Libraries  
Topic searches are described in two different media: text files and socioeconomic numeric databases and also for transverse searching, whereby retrieved text is used to find topically related numeric data  ...  Statistical association techniques and natural-language processing can help. Searches in socioeconomic numeric databases ordinarily require that place and time be specified.  ...  Training records containing two different metadata vocabularies can be used to create direct mappings between the values of the two metadata vocabularies.  ... 
doi:10.6017/ital.v25i4.3352 fatcat:aqcmzlsjljajve2z2k2wcdzj7q

Only the first results count: user-feedback-modified relevance ranking in E-LIB Bremen

Martin Blenkle, Rachel Ellis, Elmar Haake
2015 Insights: The UKSG Journal  
Users of these new, continually expanding catalogue systems need to be able to find titles in a convenient way.  ...  However, in 2011, the State and University Library Bremen (SuUB) developed modified ranking methods for its own discovery system, Electronic Library Bremen (E-LIB Bremen).  ...  Another problem is caused by the relatively small amount of text in library metadata records.  ... 
doi:10.1629/uksg.235 fatcat:mvs3dbgu3veznnj3i365oxyina

Refcat: The Internet Archive Scholar Citation Graph [article]

Martin Czygan, Helge Holzmann, Bryan Newbold
2021 arXiv   pre-print
The source code used for the derivation process, including exact and fuzzy citation matching, is released under an MIT license.  ...  It is composed of data gathered by the fatcat cataloging project (the catalog that underpins IA Scholar), related web-scale crawls targeting primary and secondary scholarly outputs, as well as metadata  ...  Each combination of fields may require a slightly different processing path. For example, references with an Arxiv identifier can be processed differently from references with only a title.  ... 
arXiv:2110.06595v2 fatcat:dt2btzfllbbmhfpb3xpx3pz6o4

ISNI and traditional authority work

Amy Armitage, Mary Jane Cuneo, Isabel Quintana, Karen Carlson Young
This article looks not only at the differences in workflows, but also explains how these theoretical differences can affect the way librarians identify and collocate named entities.  ...  The focus of the work becomes identity management rather than the establishment of authorized name forms.  ...  There are also key differences in how metadata is recorded between ISNI and NACO. This is partly due to input workflows.  ... 
doi:10.4403/ doaj:8e5039e1d4034d99a6859c013e3d7653 fatcat:rihr5553drbbjop3fgtj4fayri

The coverage of Microsoft Academic: analyzing the publication output of a university

Sven E. Hug, Martin P. Brändle
2017 Scientometrics  
A Perl script was written to retrieve metadata from MA based on publication titles. The script is freely available on GitHub.  ...  We find that the publication year is correct for 89.5% of all publications and the number of authors is correct for 95.1% of the journal articles.  ...  Acknowledgments The authors thank the ZORA editorial team for their advice, Robin Haunschild for comments, and Mirjam Aeschbach for proofreading.  ... 
doi:10.1007/s11192-017-2535-3 fatcat:34pmwmjytvfixkrw3nktyepzl4
« Previous Showing results 1 — 15 out of 6,755 results