Variant calling for cpn60 barcode sequence-based microbiome profiling
release_37aaqrztgfaydhjpwfxm66f6nm
by
Sarah J Vancuren,
Janet E. Hill
2019
Abstract
Amplification and sequencing of conserved genetic barcodes such as the cpn60 gene is a common approach to determining the taxonomic composition of microbiomes. Exact sequence variant calling has been proposed as an alternative to previously established methods for aggregation of sequence reads into operational taxonomic units (OTU). We investigated the utility of variant calling for cpn60 barcode sequences and determined the minimum sequence length required to provide species level resolution. Sequence data from the 5' region of the cpn60 barcode amplified from the human vaginal microbiome (n=25), infant stool (n=47), and a mock community were used to compare variant calling to de novo assembly of reads, and mapping to a reference sequence database. Results were compared in terms of number of OTU formed, resolution of OTU sequences, and overall community composition. Variant calling resulted in high resolution profiles that were consistent in composition to those generated with previous methods but with significant logistical advantages. Variant calling is rapid, achieves high resolution of taxa, and does not require reference sequence data. Our results further demonstrate that 150 bp from the 5' end of the cpn60 barcode sequence is sufficient to provide species-level resolution of microbiota.
In application/xml+jats
format
Archived Files and Locations
application/pdf
2.4 MB
file_dp5y2g7oz5grvj2siftmars3sq
|
www.biorxiv.org (web) web.archive.org (webarchive) |
application/pdf
1.5 MB
file_fqzrv3c76zcgxd6diln2lz7tzu
|
www.biorxiv.org (repository) web.archive.org (webarchive) |
post
Stage
unknown
Date 2019-08-29
10.1101/749267
access all versions, variants, and formats of this works (eg, pre-prints)
Crossref Metadata (via API)
Worldcat
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar