Epithelial-mesenchymal transition (EMT) is a process whereby epithelial cells assume mesenchymal characteristics to facilitate cancer metastasis. However, EMT also contributes to the initiation and development of primary tumors. Prior studies that explored the hypothesis that EMT gene variants contribute to epithelial ovarian carcinoma (EOC) risk have been based on small sample sizes and none have sought replication in an independent population. We screened 15,816 single-nucleotide polymorphisms (SNPs) in 296 genes in a discovery phase using data from a genome-wide association study of EOC among women of European ancestry (1,947 cases and 2,009 controls) and identified 793 variants in 278 EMT-related genes that were nominally (P < 0.05) associated with invasive EOC. These SNPs were then genotyped in a larger study of 14,525 invasive-cancer patients and 23,447 controls. A P-value <0.05 and a false discovery rate (FDR) <0.2 were considered statistically significant. In the larger dataset, GPC6/GPC5 rs17702471 was associated with the endometrioid subtype among Caucasians (odds ratio (OR) = 1.16, 95% CI = 1.07-1.25, P = 0.0003, FDR = 0.19), whereas F8 rs7053448 (OR = 1.69, 95% CI = 1.27-2.24, P = 0.0003, FDR = 0.12), F8 rs7058826 (OR = 1.69, 95% CI = 1.27-2.24, P = 0.0003, FDR = 0.12), and CAPN13 rs1983383 (OR = 0.79, 95% CI = 0.69-0.90, P = 0.0005, FDR = 0.12) were associated with combined invasive EOC among Asians. In silico functional analyses revealed that GPC6/GPC5 rs17702471 coincided with DNA regulatory elements. These results suggest that EMT gene variants do not appear to play a significant role in the susceptibility to EOC.
The default causal single-nucleotide polymorphism (SNP) effect size prior in Bayesian fine-mapping studies is usually the Normal distribution. This choice is often based on computational convenience, rather than evidence that it is the most suitable prior distribution. The choice of prior is important because previous studies have shown considerable sensitivity of causal SNP Bayes factors to the form of the prior. In some well-studied diseases there are now considerable numbers of genome-wide association study (GWAS) top hits along with estimates of the number of yet-to-be-discovered causal SNPs. We show how the effect sizes of the top hits and estimates of the number of yet-to-be-discovered causal SNPs can be used to choose between the Laplace and Normal priors, to estimate the prior parameters and to quantify the uncertainty in this estimation. The methodology can readily be applied to other priors. We show that the top hits available from breast cancer GWAS provide overwhelming support for the Laplace over the Normal prior, which has important consequences for variant prioritisation. This work in this paper enables practitioners to derive more objective priors than are currently being used and could lead to prioritisation of different variants.