Genome-wide association studies have identified 20 genomic regions associated with risk of epithelial ovarian cancer (EOC), but many additional risk variants may exist. Here, we evaluated associations between common genetic variants [single nucleotide polymorphisms (SNPs) and indels] in DNA repair genes and EOC risk. We genotyped 2896 common variants at 143 gene loci in DNA samples from 15 397 patients with invasive EOC and controls. We found evidence of associations with EOC risk for variants at FANCA, EXO1, E2F4, E2F2, CREB5 and CHEK2 genes (P ≤ 0.001). The strongest risk association was for CHEK2 SNP rs17507066 with serous EOC (P = 4.74 x 10(-7)). Additional genotyping and imputation of genotypes from the 1000 genomes project identified a slightly more significant association for CHEK2 SNP rs6005807 (r (2) with rs17507066 = 0.84, odds ratio (OR) 1.17, 95% CI 1.11-1.24, P = 1.1×10(-7)). We identified 293 variants in the region with likelihood ratios of less than 1:100 for representing the causal variant. Functional annotation identified 25 candidate SNPs that alter transcription factor binding sites within regulatory elements active in EOC precursor tissues. In The Cancer Genome Atlas dataset, CHEK2 gene expression was significantly higher in primary EOCs compared to normal fallopian tube tissues (P = 3.72×10(-8)). We also identified an association between genotypes of the candidate causal SNP rs12166475 (r (2) = 0.99 with rs6005807) and CHEK2 expression (P = 2.70×10(-8)). These data suggest that common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene.
To identify credible causal risk variants (CCVs) associated with different histotypes of epithelial ovarian cancer (EOC), we performed genome-wide association analysis for 470,825 genotyped and 10,163,797 imputed SNPs in 25,981 EOC cases and 105,724 controls of European origin. We identified five histotype-specific EOC risk regions (p value <5 × 10-8) and confirmed previously reported associations for 27 risk regions. Conditional analyses identified an additional 11 signals independent of the primary signal at six risk regions (p value <10-5). Fine mapping identified 4,008 CCVs in these regions, of which 1,452 CCVs were located in ovarian cancer-related chromatin marks with significant enrichment in active enhancers, active promoters, and active regions for CCVs from each EOC histotype. Transcriptome-wide association and colocalization analyses across histotypes using tissue-specific and cross-tissue datasets identified 86 candidate susceptibility genes in known EOC risk regions and 32 genes in 23 additional genomic regions that may represent novel EOC risk loci (false discovery rate <0.05). Finally, by integrating genome-wide HiChIP interactome analysis with transcriptome-wide association study (TWAS), variant effect predictor, transcription factor ChIP-seq, and motifbreakR data, we identified candidate gene-CCV interactions at each locus. This included risk loci where TWAS identified one or more candidate susceptibility genes (e.g., HOXD-AS2, HOXD8, and HOXD3 at 2q31) and other loci where no candidate gene was identified (e.g., MYC and PVT1 at 8q24) by TWAS. In summary, this study describes a functional framework and provides a greater understanding of the biological significance of risk alleles and candidate gene targets at EOC susceptibility loci identified by a genome-wide association study.
Polygenic risk scores (PRS) for epithelial ovarian cancer (EOC) have the potential to improve risk stratification. Joint estimation of Single Nucleotide Polymorphism (SNP) effects in models could improve predictive performance over standard approaches of PRS construction. Here, we implemented computationally efficient, penalized, logistic regression models (lasso, elastic net, stepwise) to individual level genotype data and a Bayesian framework with continuous shrinkage, "select and shrink for summary statistics" (S4), to summary level data for epithelial non-mucinous ovarian cancer risk prediction. We developed the models in a dataset consisting of 23,564 non-mucinous EOC cases and 40,138 controls participating in the Ovarian Cancer Association Consortium (OCAC) and validated the best models in three populations of different ancestries: prospective data from 198,101 women of European ancestries; 7,669 women of East Asian ancestries; 1,072 women of African ancestries, and in 18,915 BRCA1 and 12,337 BRCA2 pathogenic variant carriers of European ancestries. In the external validation data, the model with the strongest association for non-mucinous EOC risk derived from the OCAC model development data was the S4 model (27,240 SNPs) with odds ratios (OR) of 1.38 (95% CI: 1.28-1.48, AUC: 0.588) per unit standard deviation, in women of European ancestries; 1.14 (95% CI: 1.08-1.19, AUC: 0.538) in women of East Asian ancestries; 1.38 (95% CI: 1.21-1.58, AUC: 0.593) in women of African ancestries; hazard ratios of 1.36 (95% CI: 1.29-1.43, AUC: 0.592) in BRCA1 pathogenic variant carriers and 1.49 (95% CI: 1.35-1.64, AUC: 0.624) in BRCA2 pathogenic variant carriers. Incorporation of the S4 PRS in risk prediction models for ovarian cancer may have clinical utility in ovarian cancer prevention programs.