CollapsABEL: An R library for detecting compound heterozygote alleles in genome-wide association studies

Zhong, Kaiyin; Karssen, Lennart; Kayser, Manfred; Liu, Fan

doi:10.1186/s12859-016-1006-9

K. Zhong (Kaiyin), L.C. Karssen (Lennart), M.H. Kayser (Manfred) and F. Liu (Fan)

2016-04-08

CollapsABEL: An R library for detecting compound heterozygote alleles in genome-wide association studies

B M C Bioinformatics , Volume 17 - Issue 1

Background: Compound Heterozygosity (CH) in classical genetics is the presence of two different recessive mutations at a particular gene locus. A relaxed form of CH alleles may account for an essential proportion of the missing heritability, i.e. heritability of phenotypes so far not accounted for by single genetic variants. Methods to detect CH-like effects in genome-wide association studies (GWAS) may facilitate explaining the missing heritability, but to our knowledge no viable software tools for this purpose are currently available. Results: In this work we present the Generalized Compound Double Heterozygosity (GCDH) test and its implementation in the R package CollapsABEL. Time-consuming procedures are optimized for computational efficiency using Java or C++. Intermediate results are stored either in an SQL database or in a so-called big.matrix file to achieve reasonable memory footprint. Our large scale simulation studies show that GCDH is capable of discovering genetic associations due to CH-like interactions with much higher power than a conventional single-SNP approach under various settings, whether the causal genetic variations are available or not. CollapsABEL provides a user-friendly pipeline for genotype collapsing, statistical testing, power estimation, type I error control and graphics generation in the R language. Conclusions: CollapsABEL provides a computationally efficient solution for screening general forms of CH alleles in densely imputed microarray or whole genome sequencing datasets. The GCDH test provides an improved power over single-SNP based methods in detecting the prevalence of CH in human complex phenotypes, offering an opportunity for tackling the missing heritability problem. Binary and source packages of CollapsABEL are available on CRAN (https://cran.r-project.org/web/packages/CollapsABEL) and the website of the GenABEL project (http://www.genabel.org/packages).

Additional Metadata
Keywords	Compound heterozygosity, Genome wide association study, Missing heritability, Next generation sequencing
Persistent URL	doi.org/10.1186/s12859-016-1006-9, hdl.handle.net/1765/88483
Journal	B M C Bioinformatics
Grant	This work was funded by the European Commission 7th Framework Programme; grant id fp7/305280 - Methods for Integrated analysis of Multiple Omics datasets (MIMOMICS), This work was funded by the European Commission 7th Framework Programme; grant id fp7/602736 - Multi-dimensional omics approach to stratification of patients with low back pain (PAIN-OMICS)
Organisation	Department of Genetic Identification
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Zhong, K., Karssen, L., Kayser, M., & Liu, F. (2016). CollapsABEL: An R library for detecting compound heterozygote alleles in genome-wide association studies. B M C Bioinformatics, 17(1). doi:10.1186/s12859-016-1006-9

Free Full Text ( Final Version , 1mb )

Additional Files
pubmedcentral Author Manuscript
pubmedcentral Author Manuscript

CollapsABEL: An R library for detecting compound heterozygote alleles in genome-wide association studies

Publication

Publication

About

CollapsABEL: An R library for detecting compound heterozygote alleles in genome-wide association studies

Publication

Publication

Workflow

Workflow

Add Content