TY - JOUR
T1 - Multivariate Adaptive Shrinkage Improves Cross-Population Transcriptome Prediction and Association Studies in Underrepresented Populations
AU - Araujo, Daniel
AU - Nguyen, Chris
AU - Hu, Xiaowei
AU - Mikhaylova, Anna V.
AU - Gignoux, Christopher R.
AU - Ardlie, Kristin
AU - Taylor, Kent D.
AU - Durda, Peter
AU - Liu, Yongmei
AU - Papanicolaou, George
AU - Cho, Michael H.
AU - Rich, Stephen S.
AU - Rotter, Jerome I.
AU - Im, Hae Kyung
AU - Manichaikul, Ani
AU - Wheeler, Heather
PY - 2023/10/12
Y1 - 2023/10/12
N2 - Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across different conditions, in this case, across different populations, may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWASs) using different methods (elastic net, joint-tissue imputation [JTI], matrix expression quantitative trait loci [Matrix eQTL], multivariate adaptive shrinkage in R [MASHR], and transcriptome-integrated genetic association resource [TIGAR]) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWASs, we integrated publicly available multiethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study and Pan-ancestry genetic analysis of the UK Biobank (PanUKBB) with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multiethnic TWASs, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWASs and loci previously not found in GWASs. Overall, our study demonstrates the importance of using methods that benefit from different populations’ effect size estimates in order to improve TWASs for multiethnic or underrepresented populations.
AB - Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across different conditions, in this case, across different populations, may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWASs) using different methods (elastic net, joint-tissue imputation [JTI], matrix expression quantitative trait loci [Matrix eQTL], multivariate adaptive shrinkage in R [MASHR], and transcriptome-integrated genetic association resource [TIGAR]) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWASs, we integrated publicly available multiethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study and Pan-ancestry genetic analysis of the UK Biobank (PanUKBB) with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multiethnic TWASs, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWASs and loci previously not found in GWASs. Overall, our study demonstrates the importance of using methods that benefit from different populations’ effect size estimates in order to improve TWASs for multiethnic or underrepresented populations.
KW - genetics
KW - genomics
KW - human genetics
KW - transcriptome-wide association studies
KW - transcriptome prediction
KW - multivarite adaptive shrinkage
KW - multi-ancestry GWAS
KW - PrediXcan
UR - https://ecommons.luc.edu/biology_facpubs/192
U2 - 10.1016/j.xhgg.2023.100216
DO - 10.1016/j.xhgg.2023.100216
M3 - Article
C2 - 37869564
VL - 4
JO - History: Faculty Publications and Other Works
JF - History: Faculty Publications and Other Works
IS - 4
ER -