Abstract
Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring families and constructed a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions. The intermediate coverage (∼13×) and trio design enabled extensive characterization of structural variation, including midsize events (30-500 bp) previously poorly catalogued and de novo mutations. We demonstrate that the quality of the haplotypes boosts imputation accuracy in independent samples, especially for lower frequency alleles. Population genetic analyses demonstrate fine-scale structure across the country and support multiple ancient migrations, consistent with historical changes in sea level and flooding. The GoNL Project illustrates how single-population whole-genome sequencing can provide detailed characterization of genetic variation and may guide the design of future population studies.
Original language | English |
---|---|
Pages (from-to) | 818-825 |
Number of pages | 8 |
Journal | Nature Genetics |
Volume | 46 |
Issue number | 8 |
DOIs | |
Publication status | Published - Aug 2014 |
Cohort Studies
- Netherlands Twin Register (NTR)
Access to Document
Other files and links
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver
}
In: Nature Genetics, Vol. 46, No. 8, 08.2014, p. 818-825.
Research output: Contribution to journal › Article › Academic › peer-review
TY - JOUR
T1 - Whole-genome sequence variation, population structure and demographic history of the Dutch population
AU - The Genome of the Netherlands consortium
AU - Francioli, Laurent C.
AU - Menelaou, Androniki
AU - Pulit, Sara L.
AU - Van Dijk, Freerk
AU - Palamara, Pier Francesco
AU - Elbers, Clara C.
AU - Neerincx, Pieter B.T.
AU - Ye, Kai
AU - Guryev, Victor
AU - Kloosterman, Wigard P.
AU - Deelen, Patrick
AU - Abdellaoui, Abdel
AU - Van Leeuwen, Elisabeth M.
AU - Van Oven, Mannis
AU - Vermaat, Martijn
AU - Li, Mingkun
AU - Laros, Jeroen F.J.
AU - Karssen, Lennart C.
AU - Kanterakis, Alexandros
AU - Amin, Najaf
AU - Hottenga, Jouke Jan
AU - Lameijer, Eric Wubbo
AU - Kattenberg, Mathijs
AU - Dijkstra, Martijn
AU - Byelas, Heorhiy
AU - Van Setten, Jessica
AU - Van Schaik, Barbera D.C.
AU - Bot, Jan
AU - Nijman, Isaäc J.
AU - Renkens, Ivo
AU - Marschall, Tobias
AU - Schönhuth, Alexander
AU - Hehir-Kwa, Jayne Y.
AU - Handsaker, Robert E.
AU - Polak, Paz
AU - Sohail, Mashaal
AU - Vuzman, Dana
AU - Hormozdiari, Fereydoun
AU - Van Enckevort, David
AU - Mei, Hailiang
AU - Koval, Vyacheslav
AU - Moed, Matthijs H.
AU - Van Der Velde, KJoeri
AU - Rivadeneira, Fernando
AU - Estrada, Karol
AU - Medina-Gomez, Carolina
AU - Isaacs, Aaron
AU - McCarroll, Steven A.
AU - Beekman, Marian
AU - Mde Craen, Anton J.
AU - de Craen, A.J.
AU - Suchiman, H.E.D.
AU - Hofman, A.
AU - Oostra, B.A.
AU - Uitterlinden, A.G.
AU - Willemsen, G.
AU - Platteel, M.
AU - Veldink, J.H.
AU - Van den Berg, L.H.
AU - Pitts, S.J.
AU - Potluri, S.
AU - Sundar, P.
AU - Cox, D.R.
AU - Sunyaev, S.R.
AU - den Dunnen, J.T.
AU - Stoneking, M.
AU - de Knijff, P.
AU - Kayser, M.
AU - Li, Q.
AU - Li, Y.
AU - Du, Y.
AU - Chen, R.
AU - Cao, H.
AU - Cao, S.
AU - Wang, J.
AU - Bovenberg, J.A.
AU - Pe'er, I.
AU - Slagboom, P.E.
AU - van Duijn, C.M.
AU - Boomsma, D.I.
AU - van Ommen, G.J.B
AU - Bakker, P.I.W.
AU - Swertz, M.
AU - Wijmenga, C.
N1 - Funding Information: We wish to dedicate this work to the memory of David R. Cox, an enthusiastic supporter of human genetic research in the Netherlands for many years. The GoNL Project is funded by the BBMRI-NL, a research infrastructure financed by the Netherlands Organization for Scientific Research (NWO project 184.021.007). We acknowledge additional financial support from eBioGrid, CTMM/TraIT, the Ubbo Emmius Fund, the Netherlands Bioinformatics Center (NBIC) and EU-BioSHARE. We thank the individual participants of the biobanks; M. Depristo, E. Banks, R. Poplin and G. del Angel from the Broad Institute for expert advice on setting up our alignment and calling pipeline; K. Garimella for the initial implementation of PhaseByTransmission; G. Strikwerda, W. Albers, R. Teeninga, H. Gankema and H. Wind of the Groningen Center for Information Technology (see URLs) for support of the compute cluster and Target storage; E. Valentyn and R. Williams of Target (see URLs) for hosting project data on IBM GPFS storage; T. Visser and I. Nooren of BiG Grid (see URLs) and SURFsara for providing backup storage, additional computing capacity and expert advice; the team from MOLGENIS (see URLs) for software development support; H. Lauvenberg for handling data access requests; K. Zych for design of the GoNL logo; L. Franke, H.-J. Westra and J. Gutierrez-Achury for useful discussions; and S. Raychaudhuri and B. Neale for their critical reading of the manuscript. Target is supported by Samenwerkingsverband Noord Nederland, the European Fund for Regional Development, the Dutch Ministry of Economic Affairs, Pieken in de Delta and the provinces of Groningen and Drenthe. Target operates under the auspices of Sensor Universe. BiG Grid and the Life Science Grid are financially supported by the Netherlands Organization for Scientific Research (NWO). A.A. is funded by the Center for Medical Systems Biology-2, and D.I.B. is funded by the European Research Council (ERC 230374). A.S. and P.I.W.d.B. are recipients of VIDI awards (NWO projects 016.138.318 and 016.126.354, respectively).
PY - 2014/8
Y1 - 2014/8
N2 - Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring families and constructed a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions. The intermediate coverage (∼13×) and trio design enabled extensive characterization of structural variation, including midsize events (30-500 bp) previously poorly catalogued and de novo mutations. We demonstrate that the quality of the haplotypes boosts imputation accuracy in independent samples, especially for lower frequency alleles. Population genetic analyses demonstrate fine-scale structure across the country and support multiple ancient migrations, consistent with historical changes in sea level and flooding. The GoNL Project illustrates how single-population whole-genome sequencing can provide detailed characterization of genetic variation and may guide the design of future population studies.
AB - Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring families and constructed a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions. The intermediate coverage (∼13×) and trio design enabled extensive characterization of structural variation, including midsize events (30-500 bp) previously poorly catalogued and de novo mutations. We demonstrate that the quality of the haplotypes boosts imputation accuracy in independent samples, especially for lower frequency alleles. Population genetic analyses demonstrate fine-scale structure across the country and support multiple ancient migrations, consistent with historical changes in sea level and flooding. The GoNL Project illustrates how single-population whole-genome sequencing can provide detailed characterization of genetic variation and may guide the design of future population studies.
UR - http://www.scopus.com/inward/record.url?scp=84905579746&partnerID=8YFLogxK
U2 - https://doi.org/10.1038/ng.3021
DO - https://doi.org/10.1038/ng.3021
M3 - Article
C2 - 24974849
SN - 1061-4036
VL - 46
SP - 818
EP - 825
JO - Nature Genetics
JF - Nature Genetics
IS - 8
ER -