MATERIALS AND METHOD: 180 SNPs, shown to be previously associated with prostate cancer, were used to develop a PHS model in men with European ancestry. A machine-learning approach, LASSO-regularized Cox regression, was used to select SNPs and to estimate their coefficients in the training set (75,596 men). Performance of the resulting model was evaluated in the testing/validation set (6,411 men) with two metrics: (1) hazard ratios (HRs) and (2) positive predictive value (PPV) of prostate-specific antigen (PSA) testing. HRs were estimated between individuals with PHS in the top 5% to those in the middle 40% (HR95/50), top 20% to bottom 20% (HR80/20), and bottom 20% to middle 40% (HR20/50). PPV was calculated for the top 20% (PPV80) and top 5% (PPV95) of PHS as the fraction of individuals with elevated PSA that were diagnosed with clinically significant prostate cancer on biopsy.
RESULTS: 166 SNPs had non-zero coefficients in the Cox model (PHS166). All HR metrics showed significant improvements for PHS166 compared to PHS46: HR95/50 increased from 3.72 to 5.09, HR80/20 increased from 6.12 to 9.45, and HR20/50 decreased from 0.41 to 0.34. By contrast, no significant differences were observed in PPV of PSA testing for clinically significant prostate cancer.
CONCLUSIONS: Incorporating 120 additional SNPs (PHS166 vs PHS46) significantly improved HRs for prostate cancer, while PPV of PSA testing remained the same.
METHODS: We evaluated the potential of gene-based aggregation in the Breast Cancer Association Consortium cohorts including 83,471 cases and 59,199 controls. Low-frequency variants were aggregated for individual genes' coding and regulatory regions. Association results in European ancestry samples were compared to single-marker association results in the same cohort. Gene-based associations were also combined in meta-analysis across individuals with European, Asian, African, and Latin American and Hispanic ancestry.
RESULTS: In European ancestry samples, 14 genes were significantly associated (q
METHODS AND FINDINGS: Genetic instruments to proxy 12 risk factors were constructed by identifying single nucleotide polymorphisms (SNPs) that were robustly (P < 5 × 10-8) and independently associated with each respective risk factor in previously reported genome-wide association studies. These risk factors included genetic liability to 3 factors (endometriosis, polycystic ovary syndrome, type 2 diabetes) scaled to reflect a 50% higher odds liability to disease. We obtained summary statistics for the association of these SNPs with risk of overall and histotype-specific invasive epithelial ovarian cancer (22,406 cases; 40,941 controls) and low malignant potential tumours (3,103 cases; 40,941 controls) from the Ovarian Cancer Association Consortium (OCAC). The OCAC dataset comprises 63 genotyping project/case-control sets with participants of European ancestry recruited from 14 countries (US, Australia, Belarus, Germany, Belgium, Denmark, Finland, Norway, Canada, Poland, UK, Spain, Netherlands, and Sweden). SNPs were combined into multi-allelic inverse-variance-weighted fixed or random effects models to generate effect estimates and 95% confidence intervals (CIs). Three complementary sensitivity analyses were performed to examine violations of MR assumptions: MR-Egger regression and weighted median and mode estimators. A Bonferroni-corrected P value threshold was used to establish strong evidence (P < 0.0042) and suggestive evidence (0.0042 < P < 0.05) for associations. In MR analyses, there was strong or suggestive evidence that 2 of the 12 risk factors were associated with invasive epithelial ovarian cancer and 8 of the 12 were associated with 1 or more invasive epithelial ovarian cancer histotypes. There was strong evidence that genetic liability to endometriosis was associated with an increased risk of invasive epithelial ovarian cancer (odds ratio [OR] per 50% higher odds liability: 1.10, 95% CI 1.06-1.15; P = 6.94 × 10-7) and suggestive evidence that lifetime smoking exposure was associated with an increased risk of invasive epithelial ovarian cancer (OR per unit increase in smoking score: 1.36, 95% CI 1.04-1.78; P = 0.02). In analyses examining histotypes and low malignant potential tumours, the strongest associations found were between height and clear cell carcinoma (OR per SD increase: 1.36, 95% CI 1.15-1.61; P = 0.0003); age at natural menopause and endometrioid carcinoma (OR per year later onset: 1.09, 95% CI 1.02-1.16; P = 0.007); and genetic liability to polycystic ovary syndrome and endometrioid carcinoma (OR per 50% higher odds liability: 0.89, 95% CI 0.82-0.96; P = 0.002). There was little evidence for an association of genetic liability to type 2 diabetes, parity, or circulating levels of 25-hydroxyvitamin D and sex hormone binding globulin with ovarian cancer or its subtypes. The primary limitations of this analysis include the modest statistical power for analyses of risk factors in relation to some less common ovarian cancer histotypes (low grade serous, mucinous, and clear cell carcinomas), the inability to directly examine the association of some ovarian cancer risk factors that did not have robust genetic variants available to serve as proxies (e.g., oral contraceptive use, hormone replacement therapy), and the assumption of linear relationships between risk factors and ovarian cancer risk.
CONCLUSIONS: Our comprehensive examination of possible aetiological drivers of ovarian carcinogenesis using germline genetic variants to proxy risk factors supports a role for few of these factors in invasive epithelial ovarian cancer overall and suggests distinct aetiologies across histotypes. The identification of novel risk factors remains an important priority for the prevention of epithelial ovarian cancer.
METHODS: We used a panel of 34 putative susceptibility genes to perform sequencing on samples from 60,466 women with breast cancer and 53,461 controls. In separate analyses for protein-truncating variants and rare missense variants in these genes, we estimated odds ratios for breast cancer overall and tumor subtypes. We evaluated missense-variant associations according to domain and classification of pathogenicity.
RESULTS: Protein-truncating variants in 5 genes (ATM, BRCA1, BRCA2, CHEK2, and PALB2) were associated with a risk of breast cancer overall with a P value of less than 0.0001. Protein-truncating variants in 4 other genes (BARD1, RAD51C, RAD51D, and TP53) were associated with a risk of breast cancer overall with a P value of less than 0.05 and a Bayesian false-discovery probability of less than 0.05. For protein-truncating variants in 19 of the remaining 25 genes, the upper limit of the 95% confidence interval of the odds ratio for breast cancer overall was less than 2.0. For protein-truncating variants in ATM and CHEK2, odds ratios were higher for estrogen receptor (ER)-positive disease than for ER-negative disease; for protein-truncating variants in BARD1, BRCA1, BRCA2, PALB2, RAD51C, and RAD51D, odds ratios were higher for ER-negative disease than for ER-positive disease. Rare missense variants (in aggregate) in ATM, CHEK2, and TP53 were associated with a risk of breast cancer overall with a P value of less than 0.001. For BRCA1, BRCA2, and TP53, missense variants (in aggregate) that would be classified as pathogenic according to standard criteria were associated with a risk of breast cancer overall, with the risk being similar to that of protein-truncating variants.
CONCLUSIONS: The results of this study define the genes that are most clinically useful for inclusion on panels for the prediction of breast cancer risk, as well as provide estimates of the risks associated with protein-truncating variants, to guide genetic counseling. (Funded by European Union Horizon 2020 programs and others.).