Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference

Alexandre Hannud Abdo, Jean‐Philippe Cointet, Pascale Bourret, Alberto Cambrosio
2021 Journal of the Association for Information Science and Technology  
This paper presents a contribution to the study of bibliographic corpora through science mapping. From a graph representation of documents and their textual dimension, stochastic block models can provide a simultaneous clustering of documents and words that we call a domain-topic model. Previous work investigated the resulting topics, or word clusters, while ours focuses on the study of the document clusters we call domains. To enable the description and interactive navigation of domains, we
more » ... roduce measures and interfaces that consider the structure of the model to relate both types of clusters. We then present a procedure that extends the block model to cluster metadata attributes of documents, which we call a domain-chained model, noting that our measures and interfaces transpose to metadata clusters. We provide an example application to a corpus relevant to current science, technology and society (STS) research and an interesting case for our approach: the abstracts presented between 1995 and 2017 at the American Society of Clinical Oncology Annual Meeting, the major oncology research conference. Through a sequence of domain-topic and domain-chained models, we identify and describe a group of domains that have notably grown through the last decades and which we relate to the establishment of "oncopolicy" as a major concern in oncology.
doi:10.1002/asi.24606 pmid:35873705 pmcid:PMC9299004 fatcat:c4m7mbhs7zgbldtdtwz3dbxq4a