Knowledge-based antibody repertoire simulation, a novel allele detection tool evaluation and application [article]

Xiujia Yang, Yan Zhu, Huikun Zeng, Sen Chen, Junjie Guan, Qi-Long Wang, Chunhong Lan, Deqiang Sun, Xueqing Yu, Zhenhai Zhang
2021 bioRxiv   pre-print
Detailed knowledge of the diverse immunoglobulin germline genes is critical for the study of humoral immunity. Hundreds of alleles have been discovered by analyzing antibody repertoire sequencing (Rep-seq or Ig-seq) data via multiple novel allele detection tools (NADTs). However, the performance of these NADTs through antibody sequences with intrinsic somatic hypermutations (SHMs) is unclear. Here, we developed a tool to simulate repertoires by integrating the full spectrum features of an
more » ... dy repertoire such as germline gene usage, junctional modification, position-specific SHM and clonal expansion based on 2152 high-quality datasets. We then systematically evaluated these NADTs using both simulated and genuine Ig-seq datasets. Finally, we applied these NADTs to 687 Ig-seq datasets and identified 43 novel alleles using defined criteria. Twenty-five alleles were validated through findings of other sources. In addition to the novel alleles detected, our simulation tool, the results of our comparison, and the streamline of this process may benefit further humoral immunity studies via Ig-seq.
doi:10.1101/2021.07.01.450681 fatcat:mityo3d7g5g25izodenmfi3wiq