TY - JOUR
T1 - Identifying cardiovascular risk factor-related dietary patterns with reduced rank regression and random forest in the EPIC-NL cohort
AU - Biesbroek, Sander
AU - Van Der A., Daphne L.
AU - Brosens, Marinka C.C.
AU - Beulens, Joline W.J.
AU - Verschuren, W. M.Monique
AU - Van Der Schouw, Yvonne T.
AU - Boer, Jolanda M.A.
PY - 2015/7/1
Y1 - 2015/7/1
N2 - Background: Several methods are used to determine dietary patterns. Hybrid methods incorporate information on nutrient intake or biological factors to extract patterns relevant to disease etiology. Objective: We explore differences between patterns derived with 2 hybrid methods with those obtained by a posteriori methods and compare associations of these patterns with coronary artery disease (CAD) and stroke risk. Design: Food-frequency questionnaires were used to estimate dietary intake in 34,644 participants of European Prospective Investigation into Cancer-Netherlands at baseline (1993-1997). Follow- up was complete until 31 December 2007. Hybrid methods to determine dietary patterns were reduced rank regression (RRR) and random forest with classification tree analysis (RF-CTA). Included risk factors were body mass index, total:high-density lipoprotein cholesterol ratio, and systolic blood pressure. Results were compared with those from principal component analysis (PCA) and k-means cluster analysis (KCA), respectively. Results: Both RRR and PCA derived a "Western," "prudent," and "traditional pattern." All RRR patterns were significantly associated with CAD risk [highest vs. lowest quartile factor score; HR: 1.45 (95% CI: 1.25, 1.69), 0.86 (0.74, 0.99), and 1.25 (1.07, 1.47), respectively]. Only the prudent RRR factor was statistically significant associated with stroke (HR: 0.76; 95% CI: 0.59, 0.97). From the PCA patterns, only the traditional pattern was associated with CAD (HR: 1.29; 95% CI: 1.11, 1.50). RF-CTA derived 7 dietary patterns that could be categorized as "Western-like," "prudent-like," and "traditional-like." KCA established a prudent and Western cluster. Compared with the RF-CTA "prudent-like 1" pattern, only the "traditional-like 1" pattern was associated with CAD (HR: 1.36; 955 CI: 1.12, 1.65). None of the RF-CTA groups were associated with stroke. Compared with the Western KCA cluster, the prudent cluster was not associated with CAD or stroke. Conclusion: Including risk factors in RRR and RF-CTA resulted in small differences in food groups, contributing to similar patterns that showed in general stronger associations with CAD than PCA and KCA, respectively.
AB - Background: Several methods are used to determine dietary patterns. Hybrid methods incorporate information on nutrient intake or biological factors to extract patterns relevant to disease etiology. Objective: We explore differences between patterns derived with 2 hybrid methods with those obtained by a posteriori methods and compare associations of these patterns with coronary artery disease (CAD) and stroke risk. Design: Food-frequency questionnaires were used to estimate dietary intake in 34,644 participants of European Prospective Investigation into Cancer-Netherlands at baseline (1993-1997). Follow- up was complete until 31 December 2007. Hybrid methods to determine dietary patterns were reduced rank regression (RRR) and random forest with classification tree analysis (RF-CTA). Included risk factors were body mass index, total:high-density lipoprotein cholesterol ratio, and systolic blood pressure. Results were compared with those from principal component analysis (PCA) and k-means cluster analysis (KCA), respectively. Results: Both RRR and PCA derived a "Western," "prudent," and "traditional pattern." All RRR patterns were significantly associated with CAD risk [highest vs. lowest quartile factor score; HR: 1.45 (95% CI: 1.25, 1.69), 0.86 (0.74, 0.99), and 1.25 (1.07, 1.47), respectively]. Only the prudent RRR factor was statistically significant associated with stroke (HR: 0.76; 95% CI: 0.59, 0.97). From the PCA patterns, only the traditional pattern was associated with CAD (HR: 1.29; 95% CI: 1.11, 1.50). RF-CTA derived 7 dietary patterns that could be categorized as "Western-like," "prudent-like," and "traditional-like." KCA established a prudent and Western cluster. Compared with the RF-CTA "prudent-like 1" pattern, only the "traditional-like 1" pattern was associated with CAD (HR: 1.36; 955 CI: 1.12, 1.65). None of the RF-CTA groups were associated with stroke. Compared with the Western KCA cluster, the prudent cluster was not associated with CAD or stroke. Conclusion: Including risk factors in RRR and RF-CTA resulted in small differences in food groups, contributing to similar patterns that showed in general stronger associations with CAD than PCA and KCA, respectively.
KW - Cardiovascular diseases
KW - Dietary patterns
KW - Principal component analysis
KW - Random forest
KW - Reduced rank regression
UR - http://www.scopus.com/inward/record.url?scp=84937397177&partnerID=8YFLogxK
U2 - https://doi.org/10.3945/ajcn.114.092288
DO - https://doi.org/10.3945/ajcn.114.092288
M3 - Article
C2 - 25971717
SN - 0002-9165
VL - 102
SP - 146
EP - 154
JO - American Journal of Clinical Nutrition
JF - American Journal of Clinical Nutrition
IS - 1
ER -