A heuristic approach to author name disambiguation in bibliometrics databases for large-scale research assessments

Ciriaco Andrea D'Angelo, Cristiano Giuffrida, Giovanni Abramo
2010 Journal of the American Society for Information Science and Technology  
We will address this problem by presenting a heuristic approach to author name disambiguation in bibliometric datasets for large-scale research assessments.  ...  Bibliometrics has not been able to offer a valid large-scale alternative because of almost overwhelming difficulties in identifying the true author of each publication.  ...  The development and application of the algorithm described has allowed the authors to set up a unique national bibliometric database, apt to support large scale research evaluations at the level of the  ... 
doi:10.1002/asi.21460 fatcat:lluouhebcvhhtn7twnyedjmzya

Collecting large-scale publication data at the level of individual researchers: a practical proposal for author name disambiguation

Ciriaco Andrea D'Angelo, Nees Jan van Eck
2020 Scientometrics  
The proposed approach can serve as a starting point for large-scale census of publication portfolios for bibliometric analyses at the level of individual researchers.  ...  The disambiguation of author names is an important and challenging task in bibliometrics.  ...  performed in an approach proposed for "large scale" author name disambiguation.  ... 
doi:10.1007/s11192-020-03410-y fatcat:avvb2ptxgfh4jhg3aktknjdp6u

Why name ambiguity resolution matters for scholarly big data research

Jinseok Kim, Jana Diesner, Heejun Kim, Amirhossein Aleyasen, Hwan-Min Kim
2014 2014 IEEE International Conference on Big Data (Big Data)  
This calls for special attention to data pre-processing choices in scholarly big data research.  ...  to 285.28% for degree centrality for initial based methods compared to algorithmic disambiguation.  ...  .), and Mark Newman (Univ. of Michigan) for helping us to disambiguate author names in the APS dataset.  ... 
doi:10.1109/bigdata.2014.7004345 dblp:conf/bigdataconf/KimDKAK14 fatcat:peovaf6g2fgwllkukghmziueqa

Comparing Institutional-Level Bibliometric Research Performance Indicator Values Based on Different Affiliation Disambiguation Systems

Paul Donner, Christine Rimmert, Nees Jan van Eck
2019 Quantitative Science Studies  
We study the coverage of the disambiguation systems and, in particular, the differences in a number of commonly used bibliometric indicators.  ...  The Web of Science normalized institution names and Organization Enhanced system, and the Scopus Affiliation ID system are tested against a complete, independent institution disambiguation system for a  ...  We are therefore in a position to study the degree to which the use of a sophisticated disambiguation system with near-complete national-scale coverage leads to different bibliometric indicator values  ... 
doi:10.1162/qss_a_00013 fatcat:xrxqsehzkfcp5kfo5ejssh2kr4

AuthCrowd: Author Name Disambiguation and Entity Matching using Crowdsourcing

Antonio Correia, Diogo Guimaraes, Dennis Paulino, Shoaib Jameel, Daniel Schneider, Benjamim Fonseca, Hugo Paredes
2021 2021 IEEE 24th International Conference on Computer Supported Cooperative Work in Design (CSCWD)  
Despite decades of research and development in named entity resolution, dealing with name ambiguity is still a challenging issue for many bibliometric-enhanced information retrieval (IR) tasks.  ...  Experimental results on a real-world dataset of publicly available papers published in peer-reviewed venues demonstrate the potential of our proposed approach for improving author name disambiguation.  ...  In a more recent study designed in part to create a large-scale and manually-labeled dataset of author names, Xiao et al. [6] framed their findings under a crowdsourcing annotation framework.  ... 
doi:10.1109/cscwd49262.2021.9437769 fatcat:wuc2n5a2bjenpkyzeasmcqdsqi

Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies

Jeroen Baas, Michiel Schotten, Andrew Plume, Grégoire Côté, Reza Karimi
2020 Quantitative Science Studies  
The trustworthiness of Scopus has led to its use as bibliometric data source for large-scale analyses in research assessments, research landscape studies, science policy evaluations, and university rankings  ...  a virtual laboratory where researchers will be able to utilize Scopus data.  ...  ACKNOWLEDGMENTS The authors wish to express their gratitude to Elsevier colleague Roy Boverhof, who provided the design of the charts in Figure 1.  ... 
doi:10.1162/qss_a_00019 fatcat:3sfrla2epbch7hew6u3bcjta5a

Disambiguation and co-authorship networks of the U.S. patent inventor database (1975–2010)

Guan-Cheng Li, Ronald Lai, Alexander D'Amour, David M. Doolin, Ye Sun, Vetle I. Torvik, Amy Z. Yu, Lee Fleming
2014 Research Policy  
The authors address this issue contributing (1) an application of the Author-ity disambiguation approach (Torvik et al., 2005; Torvik and Smalheiser, 2009) to the US utility patent database, (2) a new  ...  in the form of a database of inventors and their associated patents.  ...  Software and computation We wrote a generic disambiguation engine in C/C++, in order to provide developers with a modular and computationally efficient way to specify any disambiguation strategy on any  ... 
doi:10.1016/j.respol.2014.01.012 fatcat:esv57l4b2bdolo2iuz3brcp2mi

Evaluating author name disambiguation for digital libraries: a case of DBLP

Jinseok Kim
2018 Scientometrics  
A discussion follows on strengths and weaknesses of labeled datasets used in this study for future efforts to evaluate author name disambiguation on a digital library scale.  ...  In doing so, this study takes a triangulation approach that author name disambiguation for a digital library can be better evaluated when its performance is assessed on multiple labeled datasets with comparison  ...  Acknowledgements I would like to thank Florian Reitz (Leibniz Center for Informatics, Schloss Dagstuhl, Germany) for providing the list of synonyms in DBLP and Alan Filipe Santana (Departamento de Ciência  ... 
doi:10.1007/s11192-018-2824-5 fatcat:kvhu24js2rglrpozn4n4ckksuq

An empirical and theoretical critique of the Euclidean index

Jens Peter Andersen
2017 Journal of Informetrics  
The recently proposed Euclidean index offers a novel approach to measure the citation impact of academic authors, in particular as an alternative to the h-index.  ...  We test if the index provides new, robust information, not covered by existing bibliometric indicators, discuss the measurement scale and the degree of distinction between analytical units the index offers  ...  Acknowledgments The author wishes to thank Jesper W. Schneider and Carter W. Bloch for helpful comments and discussion.  ... 
doi:10.1016/j.joi.2017.02.009 fatcat:v4cqe54e65hxlc2snnc7yhgjsy

Effect of Forename String on Author Name Disambiguation

Jinseok Kim, Jenna Kim
2019 Journal of the Association for Information Science and Technology  
In author name disambiguation, author forenames are used to decide which name instances are disambiguated together and how much they are likely to refer to the same author.  ...  This study assesses the contributions of forenames in author name disambiguation using multiple labeled datasets under varying ratios and lengths of full forenames, reflecting real-world scenarios in which  ...  to efforts trying to disambiguate author names at scale.  ... 
doi:10.1002/asi.24298 fatcat:cipy3637nfhmdpmxuymo7aep6e

Generating automatically labeled data for author name disambiguation: an iterative clustering method

Jinseok Kim, Jinmo Kim, Jason Owen-Smith
2018 Scientometrics  
Several challenges are discussed for applying this method to resolving author name ambiguity in large-scale scholarly data.  ...  To train algorithms for supervised author name disambiguation, many studies have relied on hand-labeled truth data that are very laborious to generate.  ...  Thus, Author C is depicted to cite Author A and Author B, and Author D to cite Author A and Author B. If Author C is the same as Author A, they are in self-citation relation.  ... 
doi:10.1007/s11192-018-2968-3 fatcat:tjeflvfjpva5rn5fp2vislzpci

Cloud-Based Big Data Management and Analytics for Scholarly Resources: Current Trends, Challenges and Scope for Future Research [article]

Samiya Khan, Kashish A. Shakil, Mansaf Alam
2016 arXiv pre-print
In addition to this, the evolving nature of research problems has made them essentially interdisciplinary.  ...  As a result, there is a growing need for scholarly applications like collaborator discovery, expert finding and research recommendation systems.  ...  Fundamentally, two issues exist in author disambiguation. Two authors may have the same name while one author may use different names.  ... 
arXiv:1606.01808v1 fatcat:pl6eoais75dxpckxw7xiz5cdri

Paths Study on Knowledge Convergence and Development in Computational Social Science: Data Metric Analysis Based on Web of Science

Yuxi Liu, Xin Feng, Yue Zhang, Ying Kong, Rongyao Yang, Fujun Lai
2022 Complexity  
This study is to conduct a systematic quantitative analysis from a bibliometric perspective, aiming to provide a reference for scholars to explore the paths and changing rules in the field.  ...  evolution, so as to summarize and sort out the paths of computational social science research.  ...  proposing a heuristic approach to author name disambiguation in bibliometrics databases for large-scale research assessments.  ... 
doi:10.1155/2022/3200371 fatcat:lpmafcaifvhrrck63eo7rcsff4

A survey on scholarly data: From big data perspective

Samiya Khan, Xiufeng Liu, Kashish A. Shakil, Mansaf Alam
2017 Information Processing & Management  
Fundamentally, two issues exist in author disambiguation. Two authors may have the same name while one author may use different names.  ...  Parallel research in the development of cost-effective devices for large-scale visualization is also underway [21].  ... 
doi:10.1016/j.ipm.2017.03.006 fatcat:3asm74kqwrg4bdqe6l7u2wgseq

Publications on COVID-19 in High Impact Factor Journals: A Bibliometric Analysis

Wilson López-López, Gonzalo Salas, Maribel Vega-Arce, Claudia A. Cornejo-Araya, Miguel Barboza-Palomino, Yun-Shan Ho
2020 Universitas Psychologica  
This article aims to provide a bibliometric analysis of the publications on COVID-19 in five high-impact journals indexed to the Web of Science Core Collection's Science Citation Index Expanded (SCI-EXPANDED  ...  Future bibliometric and scientometric studies on COVID-19 should provide updated information to analyse other relevant indicators in this field.  ...  A., Giuffrida, C., & Abramo, G. (2011). A heuristic approach to author name disambiguation in bibliometric databases for large-scale research assessments.  ... 
doi:10.11144/javeriana.upsy19.pchi fatcat:qb2z4vxo25bxhkxoqkbd5ifpzu