TY - JOUR
T1 - Quantification of Phenotype Information Aids the Identification of Novel Disease Genes
AU - Vulto-van Silfhout, Anneke T.
AU - Gilissen, Christian
AU - Goeman, Jelle J.
AU - Jansen, Sandra
AU - van Amen-Hellebrekers, Claudia J.M.
AU - van Bon, Bregje W.M.
AU - Koolen, David A.
AU - Sistermans, Erik A.
AU - Brunner, Han G.
AU - de Brouwer, Arjan P.M.
AU - de Vries, Bert B.A.
PY - 2017/5/1
Y1 - 2017/5/1
N2 - Next-generation sequencing led to the identification of many potential novel disease genes. The presence of mutations in the same gene in multiple unrelated patients is, however, a priori insufficient to establish that these genes are truly involved in the respective disease. Here, we show how phenotype information can be incorporated within statistical approaches to provide additional evidence for the causality of mutations. We developed a broadly applicable statistical model that integrates gene-specific mutation rates, cohort size, mutation type, and phenotype frequency information to assess the chance of identifying de novo mutations affecting the same gene in multiple patients with shared phenotype features. We demonstrate our approach based on the frequency of phenotype features present in a unique cohort of 6,149 patients with intellectual disability. We show that our combined approach can decrease the number of patients required to identify novel disease genes, especially for patients with combinations of rare phenotypes. In conclusion, we show how integrating genotype–phenotype information can aid significantly in the interpretation of de novo mutations in potential novel disease genes.
AB - Next-generation sequencing led to the identification of many potential novel disease genes. The presence of mutations in the same gene in multiple unrelated patients is, however, a priori insufficient to establish that these genes are truly involved in the respective disease. Here, we show how phenotype information can be incorporated within statistical approaches to provide additional evidence for the causality of mutations. We developed a broadly applicable statistical model that integrates gene-specific mutation rates, cohort size, mutation type, and phenotype frequency information to assess the chance of identifying de novo mutations affecting the same gene in multiple patients with shared phenotype features. We demonstrate our approach based on the frequency of phenotype features present in a unique cohort of 6,149 patients with intellectual disability. We show that our combined approach can decrease the number of patients required to identify novel disease genes, especially for patients with combinations of rare phenotypes. In conclusion, we show how integrating genotype–phenotype information can aid significantly in the interpretation of de novo mutations in potential novel disease genes.
KW - de novo mutations
KW - intellectual disability
KW - patient cohorts
KW - phenotype features
KW - statistical approach
KW - systematic phenotyping
UR - http://www.scopus.com/inward/record.url?scp=85011707913&partnerID=8YFLogxK
U2 - https://doi.org/10.1002/humu.23176
DO - https://doi.org/10.1002/humu.23176
M3 - Article
C2 - 28074630
SN - 1059-7794
VL - 38
SP - 594
EP - 599
JO - Human Mutation
JF - Human Mutation
IS - 5
ER -