Genetic Variants Detection Based on Weighted Sparse Group Lasso

Kai Che, Xi Chen, Maozu Guo, Chunyu Wang, Xiaoyan Liu
2020 Frontiers in Genetics  
Identification of genetic variants associated with complex traits is a critical step for improving plant resistance and breeding. Although most existing methods for variant detection have good predictive performance in the average case, they cannot precisely identify the variants present in a small number of target genes. In this paper, we propose a weighted sparse group lasso (WSGL) method to select both common and low-frequency variants in groups. Under the biologically realistic assumption that complex traits are influenced by a few single loci in a small number of genes, our method uses a sparse group lasso approach to simultaneously select associated groups along with the loci within each group. To increase the probability of selecting low-frequency variants, biological prior information is introduced into the model by re-weighting the lasso regularization with weights calculated from the input data. Experimental results from both simulated and real data of single nucleotide polymorphisms (SNPs) associated with Arabidopsis flowering traits demonstrate the superiority of WSGL over other competitive approaches for genetic variant detection.
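
The general idea of a sparse group lasso with a re-weighted lasso term can be illustrated with a minimal proximal-gradient sketch. This is not the authors' implementation: the abstract does not give the objective, so the squared-error loss, the per-locus `weights` vector, the square-root group-size scaling, and the names `prox_wsgl` and `fit_wsgl` are all assumptions made for illustration. The assumed objective is (1/2n)||y - X b||^2 + lam1 * sum_j w_j |b_j| + lam2 * sum_g sqrt(p_g) ||b_g||_2, where a smaller w_j makes locus j easier to select.

```python
import numpy as np

def soft_threshold(z, t):
    # Elementwise soft-thresholding; t may be a scalar or a per-coordinate array.
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def prox_wsgl(beta, groups, weights, lam1, lam2, step):
    # Assumed penalty: lam1 * sum_j w_j |b_j| + lam2 * sum_g sqrt(p_g) ||b_g||_2.
    # Step 1: weighted lasso shrinkage on individual loci.
    out = soft_threshold(beta, step * lam1 * weights)
    # Step 2: group-level shrinkage, which can zero out a whole gene (group).
    for g in np.unique(groups):
        idx = groups == g
        norm = np.linalg.norm(out[idx])
        thresh = step * lam2 * np.sqrt(idx.sum())
        out[idx] = 0.0 if norm <= thresh else (1.0 - thresh / norm) * out[idx]
    return out

def fit_wsgl(X, y, groups, weights, lam1, lam2, n_iter=500):
    # Plain proximal gradient descent on the assumed objective.
    n, p = X.shape
    step = n / (np.linalg.norm(X, 2) ** 2)  # 1 / Lipschitz constant of the gradient
    beta = np.zeros(p)
    for _ in range(n_iter):
        grad = X.T @ (X @ beta - y) / n
        beta = prox_wsgl(beta - step * grad, groups, weights, lam1, lam2, step)
    return beta
```

Here `groups` is an integer array assigning each SNP column of `X` to a gene, and `weights` would be derived from the data (for example, from allele frequency) so that low-frequency variants are penalized less; how the paper computes these weights is not stated in the abstract.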
doi:10.3389/fgene.2020.00155 pmid:32194631 pmcid:PMC7063084 fatcat:lvt6ezzcoja6rptavxuwdwxrji