A structural topic model approach to scientific reorientation of economics and chemistry after German reunification

Andreas Rehs, Universität Kassel
The detection of differences or similarities in large numbers of scientific publications is an open problem in scientometric research. In this paper we therefore develop and apply a machine learning approach based on structural topic modelling in combination with cosine similarity and a linear regression framework in order to identify differences in dissertation titles written at East and West German universities before and after German reunification. German reunification and its surrounding
more » ... its surrounding time period is used because it provides a structure with both minor and major differences in research topics that could be detected by our approach. Our dataset is based on dissertation titles in economics and business administration and chemistry from 1980 to 2010. We use university affiliation and year of the dissertation to train a structural topic model and then test the model on a set of unseen dissertation titles. Subsequently, we compare the resulting topic distribution of each title to every other title with cosine similarity. The cosine similarities and the regional and temporal origin of the dissertation titles they come from are then used in a linear regression approach. Our results on research topics in economics and business administration suggest substantial differences between East and West Germany before the reunification and a rapid conformation thereafter. In chemistry we observe minor differences between East and West before the reunification and a slightly increased similarity thereafter. Our main effort was in applying a probabilistic text model ("structural topic model") to these dissertation titles, aggregating the outcomes and then incorporating them into a linear regression framework, which allows us to calculate the level of difference between
doi:10.17170/kobra-202012082413 fatcat:dna5wsmrgranjimasqsc7lx6gi