TY - JOUR
T1 - A saturated map of common genetic variants associated with human height
AU - 23andMe Research Team
AU - Yengo, Loïc
AU - Vedantam, Sailaja
AU - Marouli, Eirini
AU - Sidorenko, Julia
AU - Bartell, Eric
AU - Sakaue, Saori
AU - Graff, Marielisa
AU - Eliasen, Anders U
AU - Jiang, Yunxuan
AU - Raghavan, Sridharan
AU - Miao, Jenkai
AU - Arias, Joshua D
AU - Graham, Sarah E
AU - Mukamel, Ronen E
AU - Spracklen, Cassandra N
AU - Yin, Xianyong
AU - Chen, Shyh-Huei
AU - Ferreira, Teresa
AU - Highland, Heather H
AU - Ji, Yingjie
AU - Karaderi, Tugce
AU - Lin, Kuang
AU - Lüll, Kreete
AU - Malden, Deborah E
AU - Medina-Gomez, Carolina
AU - Machado, Moara
AU - Moore, Amy
AU - Rüeger, Sina
AU - Sim, Xueling
AU - Vrieze, Scott
AU - Ahluwalia, Tarunveer S
AU - Akiyama, Masato
AU - Allison, Matthew A
AU - Alvarez, Marcus
AU - Andersen, Mette K
AU - Ani, Alireza
AU - Appadurai, Vivek
AU - Arbeeva, Liubov
AU - Chen, Chien-Hsiun
AU - Hottenga, Jouke-Jan
AU - Lee, Jong-Young
AU - Milaneschi, Yuri
AU - Wu, Yang
AU - Boomsma, Dorret I
AU - Heckbert, Susan R
AU - Huang, Wei
AU - Penninx, Brenda W J H
AU - Posthuma, Danielle
AU - van Dam, Rob M
AU - Willemsen, Gonneke
AU - LifeLines Cohort Study
AU - Understanding Society Scientific Group
AU - VA Million Veteran Program
AU - DiscovEHR (DiscovEHR and MyCode Community Health Initiative)
AU - Bhaskar, Seema
AU - Bielak, Lawrence F.
AU - Bollepalli, Sailalitha
AU - Bonnycastle, Lori L.
AU - Bork-Jensen, Jette
AU - Bradfield, Jonathan P.
AU - Bradford, Yuki
AU - Braund, Peter S.
AU - eMERGE (Electronic Medical Records and Genomics Network)
AU - The PRACTICAL consortium
AU - van Klinken, Jan B.
AU - Asselbergs, Folkert W.
AU - Rosendaal, Frits R.
AU - van der Velde, Nathalie
AU - Jansen, Iris E.
AU - Slieker, Roderick C.
AU - Adams, Hieab H. H.
AU - Chaturvedi, Nish
AU - Elders, Petra J. M.
AU - Rutters, Femke
AU - van Schoor, Natasja M.
AU - Yuan, Jian-Min
N1 - © 2022. The Author(s).
PY - 2022/10
Y1 - 2022/10
N2 - Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
AB - Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
KW - Body Height/genetics
KW - Chromosome Mapping
KW - Europe/ethnology
KW - Gene Frequency/genetics
KW - Genome, Human/genetics
KW - Genome-Wide Association Study
KW - Haplotypes/genetics
KW - Humans
KW - Linkage Disequilibrium/genetics
KW - Phenotype
KW - Polymorphism, Single Nucleotide/genetics
KW - Sample Size
UR - http://www.scopus.com/inward/record.url?scp=85139748621&partnerID=8YFLogxK
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85139748621&origin=inward
UR - https://www.ncbi.nlm.nih.gov/pubmed/36224396
U2 - https://doi.org/10.1038/s41586-022-05275-y
DO - https://doi.org/10.1038/s41586-022-05275-y
M3 - Article
C2 - 36224396
SN - 1476-4687
VL - 610
SP - 704
EP - 712
JO - Nature
JF - Nature
IS - 7933
ER -