Chromosome 5p15.33 has been identified as a lung cancer susceptibility locus; however, the underlying causal mechanisms have not been fully elucidated. Previous fine-mapping studies of this locus have relied on imputation or investigated a small number of known, common variants. This study extends that work by investigating a large number of novel, rare variants, as well as their potential mechanism of action through telomere length. Variants for this fine-mapping study were identified through targeted deep sequencing (average depth of coverage greater than 4000×) of 576 individuals. Subsequently, 4652 SNPs, including 1108 novel SNPs, were genotyped in 5164 cases and 5716 controls of European ancestry. After adjusting for the known risk loci rs2736100 and rs401681, we identified a new, independent lung cancer susceptibility variant in LPCAT1, rs139852726 (OR = 0.46, P = 4.73 × 10⁻⁹), and three new adenocarcinoma risk variants in TERT: rs61748181 (OR = 0.53, P = 2.64 × 10⁻⁶), rs112290073 (OR = 1.85, P = 1.27 × 10⁻⁵) and rs138895564 (OR = 2.16, P = 2.06 × 10⁻⁵; among young cases, OR = 3.77, P = 8.41 × 10⁻⁴). In addition, we found that rs139852726 was associated with telomere length in a sample of 922 healthy individuals (P = 1.44 × 10⁻³). The gene-based SKAT-O analysis implicated TERT as the most relevant gene in the 5p15.33 region for adenocarcinoma (P = 7.84 × 10⁻⁷) and lung cancer (P = 2.37 × 10⁻⁵) risk. In the largest fine-mapping study to investigate a large number of rare and novel variants within 5p15.33, we identified novel lung cancer and adenocarcinoma susceptibility loci with large effects and provided support for telomere length as the potential underlying mechanism.
A genome-wide association study (GWAS) of bladder cancer identified the genetic marker rs8102137 within the 19q12 region as a novel susceptibility variant. This marker is located upstream of the CCNE1 gene, which encodes cyclin E, a cell-cycle protein. We performed genetic fine-mapping analysis of the CCNE1 region using data from two bladder cancer GWAS (5,942 cases and 10,857 controls). We found that the original GWAS marker rs8102137 represents a group of 47 linked SNPs (with r² ≥ 0.7) associated with increased bladder cancer risk. From this group, we selected a functional promoter variant, rs7257330, which showed strong allele-specific binding of nuclear proteins in several cell lines. In both GWAS, rs7257330 was associated only with aggressive bladder cancer, with a combined per-allele OR = 1.18 [95% confidence interval (CI), 1.09-1.27, P = 4.67 × 10⁻⁵] versus OR = 1.01 (95% CI, 0.93-1.10, P = 0.79) for nonaggressive disease, with P = 0.0015 for case-only analysis. Cyclin E protein expression, analyzed in 265 bladder tumors, was increased in aggressive tumors (P = 0.013) and, independently, with each rs7257330-A risk allele (P for trend = 0.024). Overexpression of recombinant cyclin E in cell lines caused significant acceleration of the cell cycle. In conclusion, we defined the 19q12 signal as the first GWAS signal specific for aggressive bladder cancer. Molecular mechanisms of this genetic association may be related to cyclin E overexpression and alteration of the cell cycle in carriers of CCNE1 risk variants. In combination with established bladder cancer risk factors and other somatic and germline genetic markers, the CCNE1 variants could be useful for inclusion in bladder cancer risk prediction models.
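As a reading aid for the per-allele odds ratios quoted above, the following minimal sketch shows how a reported OR and its 95% CI relate to an approximate z-score and two-sided P-value. It assumes a symmetric Wald interval on the log-odds scale, which the abstract does not state; the function name `z_and_p_from_or_ci` is hypothetical. Because the published OR and CI are rounded, the recovered P-values will only roughly agree with the reported ones.

```python
import math

def z_and_p_from_or_ci(or_est, ci_low, ci_high, z_crit=1.959964):
    """Approximate SE, z and two-sided P from an OR and its 95% CI.

    Assumption (not from the abstract): the CI is a Wald interval,
    i.e. symmetric around log(OR) with half-width z_crit * SE.
    """
    log_or = math.log(or_est)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = log_or / se
    # Two-sided P under a standard normal: P = erfc(|z| / sqrt(2))
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p

# Values quoted in the abstract for rs7257330
aggressive = z_and_p_from_or_ci(1.18, 1.09, 1.27)
nonaggressive = z_and_p_from_or_ci(1.01, 0.93, 1.10)

for label, (se, z, p) in [("aggressive", aggressive),
                          ("nonaggressive", nonaggressive)]:
    print(f"{label}: SE={se:.4f}, z={z:.2f}, two-sided P={p:.2e}")
```

Under these assumptions the aggressive-disease estimate gives a z-score of roughly 4 and the nonaggressive estimate a z-score near 0, which matches the qualitative contrast the abstract reports.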
Candidate gene and genome-wide association studies (GWAS) have identified 15 independent genomic regions associated with bladder cancer risk. In search of additional susceptibility variants, we followed up on four promising single-nucleotide polymorphisms (SNPs) that had not achieved genome-wide significance in 6911 cases and 11 814 controls (rs6104690, rs4510656, rs5003154 and rs4907479, P < 1 × 10⁻⁶), using additional data from existing GWAS datasets and targeted genotyping for studies that did not have GWAS data. In a combined analysis, which included data on up to 15 058 cases and 286 270 controls, two SNPs achieved genome-wide statistical significance: rs6104690 in a gene desert at 20p12.2 (P = 2.19 × 10⁻¹¹) and rs4907479 within the MCF2L gene at 13q34 (P = 3.3 × 10⁻¹⁰). Imputation and fine-mapping analyses were performed in these two regions for a subset of 5551 bladder cancer cases and 10 242 controls. Analyses at the 13q34 region suggest a single signal marked by rs4907479. In contrast, we detected two signals in the 20p12.2 region: the first signal is marked by rs6104690, and the second signal is marked by two moderately correlated SNPs (r² = 0.53), rs6108803 and the previously reported rs62185668. The second 20p12.2 signal is more strongly associated with the risk of muscle-invasive (T2-T4 stage) than with non-muscle-invasive (Ta, T1 stage) bladder cancer (case-case P ≤ 0.02 for both rs62185668 and rs6108803). Functional analyses are needed to explore the biological mechanisms underlying these novel genetic associations with risk for bladder cancer.
Genome-wide association studies (GWAS) have identified common pancreatic cancer susceptibility variants at 13 chromosomal loci in individuals of European descent. To identify new susceptibility variants, we performed imputation based on 1000 Genomes (1000G) Project data and association analysis using 5,107 case and 8,845 control subjects from 27 cohort and case-control studies that participated in the PanScan I-III GWAS. This analysis, in combination with a two-stage replication in an additional 6,076 case and 7,555 control subjects from the PANcreatic Disease ReseArch (PANDoRA) and Pancreatic Cancer Case-Control (PanC4) Consortia, uncovered 3 new pancreatic cancer risk signals marked by single nucleotide polymorphisms (SNPs) rs2816938 at chromosome 1q32.1 (per allele odds ratio (OR) = 1.20, P = 4.88 × 10⁻¹⁵), rs10094872 at 8q24.21 (OR = 1.15, P = 3.22 × 10⁻⁹) and rs35226131 at 5p15.33 (OR = 0.71, P = 1.70 × 10⁻⁸). These SNPs represent independent risk variants at previously identified pancreatic cancer risk loci on chr1q32.1 (NR5A2), chr8q24.21 (MYC) and chr5p15.33 (CLPTM1L-TERT), as per analyses conditioned on previously reported susceptibility variants. We assessed expression of candidate genes at the three risk loci in histologically normal (n = 10) and tumor (n = 8) derived pancreatic tissue samples and observed a marked reduction of NR5A2 expression (chr1q32.1) in the tumors (fold change -7.6, P = 5.7 × 10⁻⁸). This finding was validated in a second set of paired (n = 20) histologically normal and tumor derived pancreatic tissue samples (average fold change for three NR5A2 isoforms -31.3 to -95.7, P = 7.5 × 10⁻⁴ to 2.0 × 10⁻³). Our study has identified new susceptibility variants independently conferring pancreatic cancer risk that merit functional follow-up to identify target genes and explain the underlying biology.
To investigate large structural clonal mosaicism of chromosome X, we analysed the SNP microarray intensity data of 38,303 women from cancer genome-wide association studies (20,878 cases and 17,425 controls) and detected 124 mosaic X events >2 Mb in 97 (0.25%) women. Here we show that rates of X-chromosome mosaicism are four times higher than mean autosomal rates, that X mosaic events more often include the entire chromosome, and that participants with X events are more likely to harbour autosomal mosaic events. X mosaicism frequency increases with age (0.11% in 50-year-olds; 0.45% in 75-year-olds), as reported for Y and autosomes. Methylation array analyses of 33 women with X mosaicism indicate that events preferentially involve the inactive X chromosome. Our results provide further evidence that the sex chromosomes undergo mosaic events more frequently than autosomes, which could have implications for understanding the underlying mechanisms of mosaic events and their possible contribution to risk for chronic diseases.
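The frequencies quoted above can be reproduced with simple arithmetic. The sketch below recomputes the overall proportion of women with a mosaic X event from the reported counts and attaches a Wilson score interval. The interval is an illustrative addition, not something reported in the abstract, and `wilson_interval` is a hypothetical helper name.

```python
import math

def wilson_interval(k, n, z=1.959964):
    """Wilson score interval for a binomial proportion k/n (95% by default)."""
    p = k / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half

# Counts taken from the abstract
cases, controls = 20878, 17425
women_total = 38303
women_with_events = 97

freq = women_with_events / women_total
low, high = wilson_interval(women_with_events, women_total)

# Ratio of the two age-specific frequencies quoted in the abstract
age_ratio = 0.45 / 0.11

print(f"cases + controls = {cases + controls}")
print(f"frequency = {100 * freq:.2f}%")
print(f"Wilson 95% interval = {100 * low:.2f}% to {100 * high:.2f}%")
print(f"75- vs 50-year-old frequency ratio = {age_ratio:.1f}")
```

The Wilson interval is used here rather than the simpler normal approximation because it behaves better for rare events such as these.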