Inferring Social Network Structure from Bacterial Sequence Data

Mateusz M. Pluciński, Richard Starfield, Rodrigo P. P. Almeida, Petter Holme
2011 PLoS ONE  
Using DNA sequence data from pathogens to infer transmission networks has traditionally been done in the context of epidemics and outbreaks. Sequence data could analogously be applied to cases of ubiquitous commensal bacteria; however, instead of inferring chains of transmission to track the spread of a pathogen, sequence data for bacteria circulating in an endemic equilibrium could be used to infer information about host contact networks. Here, we showusing simulated data-that multilocus DNA
more » ... quence data, based on multilocus sequence typing schemes (MLST), from isolates of commensal bacteria can be used to infer both local and global properties of the contact networks of the populations being sampled. Specifically, for MLST data simulated from small-world networks, the small world parameter controlling the degree of structure in the contact network can robustly be estimated. Moreover, we show that pairwise distances in the network-degrees of separation-correlate with genetic distances between isolates, so that how far apart two individuals in the network are can be inferred from MLST analysis of their commensal bacteria. This result has important consequences, and we show an example from epidemiology: how this result could be used to test for infectious origins of diseases of unknown etiology.
doi:10.1371/journal.pone.0022685 pmid:21829645 pmcid:PMC3148245 fatcat:zcznsqc7m5f23jckowdfvh4bvm