RESULTS: We analyzed the whole-genome deep sequencing data (~ 30×) of five native trios from Peninsular Malaysia and North Borneo, and characterized the genomic variants, including single nucleotide variants (SNVs), small insertions and deletions (indels) and copy number variants (CNVs). We discovered approximately 6.9 million SNVs, 1.2 million indels, and 9000 CNVs in the 15 samples, of which 2.7% SNVs, 2.3% indels and 22% CNVs were novel, implying the insufficient coverage of population diversity in existing databases. We identified a higher proportion of novel variants in the Orang Asli (OA) samples, i.e., the indigenous people from Peninsular Malaysia, than that of the North Bornean (NB) samples, likely due to more complex demographic history and long-time isolation of the OA groups. We used the pedigree information to identify de novo variants and estimated the autosomal mutation rates to be 0.81 × 10- 8 - 1.33 × 10- 8, 1.0 × 10- 9 - 2.9 × 10- 9, and ~ 0.001 per site per generation for SNVs, indels, and CNVs, respectively. The trio-genomes also allowed for haplotype phasing with high accuracy, which serves as references to the future genomic studies of OA and NB populations. In addition, high-frequency inherited CNVs specific to OA or NB were identified. One example is a 50-kb duplication in DEFA1B detected only in the Negrito trios, implying plausible effects on host defense against the exposure of diverse microbial in tropical rainforest environment of these hunter-gatherers. The CNVs shared between OA and NB groups were much fewer than those specific to each group. Nevertheless, we identified a 142-kb duplication in AMY1A in all the 15 samples, and this gene is associated with the high-starch diet. Moreover, novel insertions shared with archaic hominids were identified in our samples.
CONCLUSION: Our study presents a full catalogue of the genome variants of the native Malaysian populations, which is a complement of the genome diversity in Southeast Asians. It implies specific population history of the native inhabitants, and demonstrated the necessity of more genome sequencing efforts on the multi-ethnic native groups of Malaysia and Southeast Asia.
Objectives: To identify novel genome-wide significant loci for PD in Asian individuals and to compare genetic risk between Asian and European cohorts.
Design Setting, and Participants: Genome-wide association data generated from PD cases and controls in an Asian population (ie, Singapore/Malaysia, Hong Kong, Taiwan, mainland China, and South Korea) were collected from January 1, 2016, to December 31, 2018, as part of an ongoing study. Results were combined with inverse variance meta-analysis, and replication of top loci in European and Japanese samples was performed. Discovery samples of 31 575 individuals passing quality control of 35 994 recruited were used, with a greater than 90% participation rate. A replication cohort of 1 926 361 European-ancestry and 3509 Japanese samples was analyzed. Parkinson disease was diagnosed using UK Parkinson's Disease Society Brain Bank Criteria.
Main Outcomes and Measures: Genotypes of common variants, association with disease status, and polygenic risk scores.
Results: Of 31 575 samples identified, 6724 PD cases (mean [SD] age, 64.3 [10] years; age at onset, 58.8 [10.6] years; 3472 [53.2%] men) and 24 851 controls (age, 59.4 [11.4] years; 11 030 [45.0%] men) were analyzed in the discovery study. Eleven genome-wide significant loci were identified; 2 of these loci were novel (SV2C and WBSCR17) and 9 were previously found in Europeans. Replication in European-ancestry and Japanese samples showed robust association for SV2C (rs246814; odds ratio, 1.16; 95% CI, 1.11-1.21; P = 1.17 × 10-10 in meta-analysis of discovery and replication samples) but showed potential genetic heterogeneity at WBSCR17 (rs9638616; I2=67.1%; P = 3.40 × 10-3 for hetereogeneity). Polygenic risk score models including variants at these 11 loci were associated with a significant improvement in area under the curve over the model based on 78 European loci alone (63.1% vs 60.2%; P = 6.81 × 10-12).
Conclusions and Relevance: This study identified 2 apparently novel gene loci and found 9 previously identified European loci to be associated with PD in this large, meta-genome-wide association study in a worldwide population of Asian individuals and reports similarities and differences in genetic risk factors between Asian and European individuals in the risk for PD. These findings may lead to improved stratification of Asian patients and controls based on polygenic risk scores. Our findings have potential academic and clinical importance for risk stratification and precision medicine in Asia.