High-throughput experimental methods such as medical sequencing and genome-wide association studies (GWAS) identify increasingly large numbers of potential relations between genetic variants and diseases. Both biological complexity (millions of potential gene-disease associations) and the accelerating rate of data production necessitate computational approaches to prioritize and rationalize potential gene-disease relations. Here, we use concept profile technology to expose from the biomedical literature both explicitly stated gene-disease relations (the explicitome) and a much larger set of implied gene-disease associations (the implicitome). Implicit relations are largely unknown to, or are even unintended by the original authors, but they vastly extend the reach of existing biomedical knowledge for identification and interpretation of gene-disease associations. The implicitome can be used in conjunction with experimental data resources to rationalize both known and novel associations. We demonstrate the usefulness of the implicitome by rationalizing known and novel gene-disease associations, including those from GWAS. To facilitate the re-use of implicit gene-disease associations, we publish our data in compliance with FAIR Data Publishing recommendations [https://www.force11.org/group/fairgroup] using nanopublications. An online tool (http://knowledge.bio) is available to explore established and potential gene-disease associations in the context of other biomedical relations.

Additional Metadata
Persistent URL dx.doi.org/10.1371/journal.pone.0149621, hdl.handle.net/1765/81862
Journal PLoS ONE
Grant This work was funded by the European Commission 7th Framework Programme; grant id fp7/305444 - RD-CONNECT: An integrated platform connecting registries, biobanks and clinical bioinformatics for rare disease research (RD-CONNECT), This work was funded by the European Commission 7th Framework Programme; grant id imi/115191 - The Open Pharmacological Concepts Triple Store (OPEN PHACTS)
Citation
Hettne, K.M, Thompson, M, van Haagen, H.H.H.B.M, Van Der Horst, E, Kaliyaperumal, R, Mina, E, … Schultes, E. (2016). The implicitome: A resource for rationalizing gene-disease associations. PLoS ONE, 11(2). doi:10.1371/journal.pone.0149621