Genetic structure can be altered by chemical mutagenesis, which is a common method applied in molecular biology and genetics. Second-generation sequencing provides a platform to reveal base alterations occurring in the whole genome due to mutagenesis. A model legume, Lotus japonicus ecotype Miyakojima, was chemically mutated with alkylating ethyl methanesulfonate (EMS) for the scanning of DNA lesions throughout the genome. Using second-generation sequencing, two individually mutated third-generation progeny (M3, named AM and AS) were sequenced and analyzed to identify single nucleotide polymorphisms and reveal the effects of EMS on nucleotide sequences in these mutant genomes. Single-nucleotide polymorphisms were found in every 208 kb (AS) and 202 kb (AM) with a bias mutation of G/C-to-A/T changes at low percentage. Most mutations were intergenic. The mutation spectrum of the genomes was comparable in their individual chromosomes; however, each mutated genome has unique alterations, which are useful to identify causal mutations for their phenotypic changes. The data obtained demonstrate that whole genomic sequencing is applicable as a high-throughput tool to investigate genomic changes due to mutagenesis. The identification of these single-point mutations will facilitate the identification of phenotypically causative mutations in EMS-mutated germplasm.
Mosses and other bryophytes are vital components of forests, because they sustain a tremendous diversity of invertebrates and influence significant ecological functions. There have been few studies on moss population diversity in Southeast Asia, despite the escalating deforestation in this region of rich biodiversity. The genetic diversity of the tropical moss Acanthorrhynchium papillatum (Harv.) Fleisch., collected from forested areas in Singapore and Peninsular Malaysia, was elucidated using eight microsatellite markers developed for this species. Significant levels of allelic and haplotypic diversity were observed among clumps of the moss. Differences in allelic richness and genotypic diversity among the populations were higher in less disturbed forests compared to the more disturbed areas, suggesting that genetic diversity is affected by habitat quality. Genetic diversity levels within the clumps studied were low, indicating that vegetative reproduction was more important within clumps than sexual reproduction. However, multilocus genotypes of samples within the clumps studied were not all alike, providing evidence of microsatellite mutation or of occasional sexuality. Despite the isolation of populations, A. papillatum can introduce genetic variability by mutation among vegetatively propagated individuals. This study provides baseline information on the genetic diversity of A. papillatum tropical rain forests.
Primary angle closure glaucoma (PACG) is a major cause of blindness worldwide. We conducted a genome-wide association study (GWAS) followed by replication in a combined total of 10,503 PACG cases and 29,567 controls drawn from 24 countries across Asia, Australia, Europe, North America, and South America. We observed significant evidence of disease association at five new genetic loci upon meta-analysis of all patient collections. These loci are at EPDR1 rs3816415 (odds ratio (OR) = 1.24, P = 5.94 × 10(-15)), CHAT rs1258267 (OR = 1.22, P = 2.85 × 10(-16)), GLIS3 rs736893 (OR = 1.18, P = 1.43 × 10(-14)), FERMT2 rs7494379 (OR = 1.14, P = 3.43 × 10(-11)), and DPM2-FAM102A rs3739821 (OR = 1.15, P = 8.32 × 10(-12)). We also confirmed significant association at three previously described loci (P < 5 × 10(-8) for each sentinel SNP at PLEKHA7, COL11A1, and PCMTD1-ST18), providing new insights into the biology of PACG.
Genome-wide association studies (GWAS) and fine-mapping efforts to date have identified more than 100 prostate cancer (PrCa)-susceptibility loci. We meta-analyzed genotype data from a custom high-density array of 46,939 PrCa cases and 27,910 controls of European ancestry with previously genotyped data of 32,255 PrCa cases and 33,202 controls of European ancestry. Our analysis identified 62 novel loci associated (P C, p.Pro1054Arg) in ATM and rs2066827 (OR = 1.06; P = 2.3 × 10-9; T>G, p.Val109Gly) in CDKN1B. The combination of all loci captured 28.4% of the PrCa familial relative risk, and a polygenic risk score conferred an elevated PrCa risk for men in the ninetieth to ninety-ninth percentiles (relative risk = 2.69; 95% confidence interval (CI): 2.55-2.82) and first percentile (relative risk = 5.71; 95% CI: 5.04-6.48) risk stratum compared with the population average. These findings improve risk prediction, enhance fine-mapping, and provide insight into the underlying biology of PrCa1.
Over 3000 candidate gene association studies have been performed to elucidate the genetic underpinnings of schizophrenia. However, a comprehensive evaluation of these studies' findings has not been undertaken since the decommissioning of the schizophrenia gene (SzGene) database in 2011. As such, we systematically identified and carried out random-effects meta-analyses for all polymorphisms with four or more independent studies in schizophrenia along with a series of expanded meta-analyses incorporating published and unpublished genome-wide association (GWA) study data. Based on 550 meta-analyses, 11 SNPs in eight linkage disequilibrium (LD) independent loci showed Bonferroni-significant associations with schizophrenia. Expanded meta-analyses identified an additional 10 SNPs, for a total of 21 Bonferroni-significant SNPs in 14 LD-independent loci. Three of these loci (MTHFR, DAOA, ARVCF) had never been implicated by a schizophrenia GWA study. In sum, the present study has provided a comprehensive summary of the current schizophrenia genetics knowledgebase and has made available all the collected data as a resource for the research community.
Salmonellae display intricate evolutionary patterns comprising over 2500 serovars having diverse pathogenic profiles. The acquisition and/or exchange of various virulence factors influences the evolutionary framework. To gain insights into evolution of Salmonella in association with the CRISPR-Cas genes we performed phylogenetic surveillance across strains of 22 Salmonella serovars. The strains differed in their CRISPR1-leader and cas operon features assorting into two main clades, CRISPR1-STY/cas-STY and CRISPR1-STM/cas-STM, comprising majorly typhoidal and non-typhoidal Salmonella serovars respectively. Serovars of these two clades displayed better relatedness, concerning CRISPR1-leader and cas operon, across genera than between themselves. This signifies the acquisition of CRISPR1/Cas region could be through a horizontal gene transfer event owing to the presence of mobile genetic elements flanking CRISPR1 array. Comparison of CRISPR and cas phenograms with that of multilocus sequence typing (MLST) suggests differential evolution of CRISPR/Cas system. As opposed to broad-host-range, the host-specific serovars harbor fewer spacers. Mapping of protospacer sources suggested a partial correlation of spacer content with habitat diversity of the serovars. Some serovars like serovar Enteritidis and Typhimurium that inhabit similar environment/infect similar hosts hardly shared their protospacer sources.
In this study, we first conducted a genome survey assay for Sillago sihama by Illumina sequencing platform, and then developed 15 polymorphic microsatellite loci in a wild population. A total of 129.46 Gb raw data were obtained, of which 115.07 Gb were clean data, with a sequencing depth of 179.3-folds. This genome was estimated to be 522.6 Mb in size, with the heterozygosity, repeat content and GC content being 0.63%, 21% and 44%. A total of 630,028 microsatellites were identified from the genome, of which, dinucleotide repeat was the most abundant (56.80%), followed by mononucleotide repeat (30.23%). Furthermore, 60 pairs of primers were designed and synthesized based on microsatellite sequences, of which 15 were polymorphic in a wild population. A total of 91 alleles were found, with an average of 6.07 per locus. Number of alleles, observed and expected heterozygosity per locus ranged from two to 13, from 0.250 to 0.862, and from 0.396 to 0.901, respectively. Twelve loci were highly informative (PIC > 0.5), and the others were medium informative (0.25 loci deviated from Hardy-Weinberg equilibrium after Bonferroni correction (P loci pairs. This study provided a large number of genomic resources and 15 polymorphic microsatellite loci that should be helpful for the further genetic studies in S. sihama.
DNA profiling of X-chromosomal short tandem repeats (X-STR) has exceptional value in criminal investigations, especially for complex kinship and incest cases. In this study, Investigator® Argus X-12 Quality Sensor (QS) kits were successfully used to characterize 12 X-STR loci in 199 unrelated healthy Kedayan individuals living in Sabah and Sarawak, Malaysia. The LG1 haplogroup (DXS8378 - DXS10135 - DXS10148) has the largest HD (0.9799) as compared with all other closely linked haplotype groups examined (LG2; DXS7132-DXS10074-DXS10079, LG3; DXS10103-DXS10101-HPRTB and LG4; DXS10134-DXS7423-DXS10146). Data from statistical analysis showed that high combined of PDM, PDF, MEC_Krüger, MEC_Kishida, MEC_Desmarais, and MEC_Desmarais_duo values (0.999999994405922, 0.99999999999999, 0.999990463834938, 0.999999975914808, 0.999999975985006, and 0.999996491927194, respectively) in the Kedayan. In a two-dimensional scaling (MDS) plot and dendrogram constructed using allele frequencies at the 12 X-STR loci, Kedayan appear to be most closely related to their other Austronesian populations including the Malays and Filipinos as compared with other reference population groups. Findings from the present study thus demonstrate high genetic variability across the 12 tested X-STR loci and can be used for population studies and forensic applications.
Frullania subgenus Microfrullania is a clade of ca. 15 liverwort species occurring in Australasia, Malesia, and southern South America. We used combined nuclear and chloroplast sequence data from 265 ingroup accessions to test species circumscriptions and estimate the biogeographic history of the subgenus. With dense infra-specific sampling, we document an important role of long-distance dispersal in establishing phylogeographic patterns of extant species. At deeper time scales, a combination of phylogenetic analyses, divergence time estimation and ancestral range estimation were used to reject vicariance and to document the role of long-distance dispersal in explaining the evolution and biogeography of the clade across the southern Hemisphere. A backbone phylogeny for the subgenus is proposed, providing insight into evolution of morphological patterns and establishing the basis for an improved sectional classification of species within Microfrullania. Several species complexes are identified, the presence of two undescribed but genetically and morphologically distinct species is noted, and previously neglected names are discussed.
Because of variable inconvenient living conditions in some places around the world, it is difficult to collect reliable physiological data for ostriches. Therefore, this study aims to provide a comprehensive in silico insight for the nature of polymorphism of important genetic loci that are related to physiological and reproductive traits. Sixty-nine mature ostriches ranging over half of Iraq were screened. Six exonic genetic loci, including cytochrome c oxidase I (COX1), cytochrome b (CYTB), secretogranin V (SCG5), feather keratin 2-like (FK2), prolactin (PRL) and placenta growth factor (PGF) were genotyped by PCR-single stranded conformation polymorphism (SSCP). Thirty-six novel SNPs, including seventeen nonsynonymous (ns) SNPs, were observed. Several computational software programs were utilized to assess the extent of the nsSNPs on their corresponding proteins structure, function and stability. The results showed several deleterious functional and stability changes in almost all the proteins studied. The total severity of each missense mutation was evaluated and compared with other nsSNPs accumulatively. It is evident from the extensive cumulative in silico computation that both p.E34D and p.E60K in PGF have the highest deleterious effect. The cumulative predictions from the present study are an impressive guide for the genotypes of African ostriches, which bypassed the expensive protocols for wet laboratory screening, to identify the effects of variants. To the best of our knowledge, this is the first investigation of its kind on the analyses and prediction outcome of missense mutations in African ostrich populations. The highly deleterious nsSNPs in the placenta growth factor are possible adaptive mutations which might be associated with adaptation in extreme and new environments. The flow and protocol of the computational predictions can be extended for various wild animals to identify the molecular nature of adaptations.
The Arracacia clade (Apiaceae, Apioideae) is a heterogeneous assemblage of 12 genera, comprising 111 known species distributed in high montane temperate and sub-alpine habitats of meso- and South America. Previous studies have indicated that the genera Arracacia, Coulterophytum, and Prionosciadium are polyphyletic, but for the most part relationships among the members of the clade are largely unknown. Initially, cladistic analyses of nrDNA ITS sequences were carried out on 212 accessions (122 taxa), representing 92 species of the Arracacia clade and outgroups from the closely-related páramo genera Cotopaxia, Niphogeton, and Perissocoeleum and members of the Perennial Endemic North American clade and its allies. Using the ITS results to inform sampling of a small subset of taxa, a pilot study examining the phylogenetic utility of 20 noncoding chloroplast loci was subsequently performed to identify those regions most useful at resolving relationships. A cost-benefit analysis determined that five loci (trnQ-5'rps16, trnD-trnT, rpl32-trnL, psbD-trnT, ndhA intron) would maximize resolution and branch support in the clade. Cladistic analyses of four of these loci (trnQ-5'rps16, trnD-trnT, rpl32-trnL, ndhA intron) and the ITS region, separately and combined, revealed that Arracacia, Coaxana, Coulterophytum, Prionosciadium, and Rhodosciadium are each polyphyletic and that Donnellsmithia and Myrrhidendron are each monophyletic. Although most relationships in the Arracacia clade and among the closely-related genera Cotopaxia, Niphogeton, and Perissocoeleum are poorly resolved and supported, ten groups are recognized for future revisionary studies. Polyploidy and rapid species radiation have likely confounded generic circumscriptions and interpretation of relationships.
Phylogeography can provide insight into the potential for speciation and identify geographic regions and evolutionary processes associated with species richness and evolutionary endemism. In the marine environment, highly mobile species sometimes show structured patterns of diversity, but the processes isolating populations and promoting differentiation are often unclear. The Delphinidae (oceanic dolphins) are a striking case in point and, in particular, bottlenose dolphins (Tursiops spp.). Understanding the radiation of species in this genus is likely to provide broader inference about the processes that determine patterns of biogeography and speciation, because both fine-scale structure over a range of kilometers and relative panmixia over an oceanic range are known for Tursiops populations. In our study, novel Tursiops spp. sequences from the northwest Indian Ocean (including mitogenomes and two nuDNA loci) are included in a worldwide Tursiops spp. phylogeographic analysis. We discover a new 'aduncus' type lineage in the Arabian Sea (off India, Pakistan and Oman) that diverged from the Australasian lineage ∼261 Ka. Effective management of coastal dolphins in the region will need to consider this new lineage as an evolutionarily significant unit. We propose that the establishment of this lineage could have been in response to climate change during the Pleistocene and show data supporting hypotheses for multiple divergence events, including vicariance across the Indo-Pacific barrier and in the northwest Indian Ocean. These data provide valuable transferable inference on the potential mechanisms for population and species differentiation across this geographic range.
Bipolar Disorder and Schizophrenia Working Group of the Psychiatric Genomics Consortium. Electronic address: douglas.ruderfer@vanderbilt.edu, Bipolar Disorder and Schizophrenia Working Group of the Psychiatric Genomics Consortium
Schizophrenia and bipolar disorder are two distinct diagnoses that share symptomology. Understanding the genetic factors contributing to the shared and disorder-specific symptoms will be crucial for improving diagnosis and treatment. In genetic data consisting of 53,555 cases (20,129 bipolar disorder [BD], 33,426 schizophrenia [SCZ]) and 54,065 controls, we identified 114 genome-wide significant loci implicating synaptic and neuronal pathways shared between disorders. Comparing SCZ to BD (23,585 SCZ, 15,270 BD) identified four genomic regions including one with disorder-independent causal variants and potassium ion response genes as contributing to differences in biology between the disorders. Polygenic risk score (PRS) analyses identified several significant correlations within case-only phenotypes including SCZ PRS with psychotic features and age of onset in BD. For the first time, we discover specific loci that distinguish between BD and SCZ and identify polygenic components underlying multiple symptom dimensions. These results point to the utility of genetics to inform symptomology and potential treatment.
Genome-wide association studies have identified 20 genomic regions associated with risk of epithelial ovarian cancer (EOC), but many additional risk variants may exist. Here, we evaluated associations between common genetic variants [single nucleotide polymorphisms (SNPs) and indels] in DNA repair genes and EOC risk. We genotyped 2896 common variants at 143 gene loci in DNA samples from 15 397 patients with invasive EOC and controls. We found evidence of associations with EOC risk for variants at FANCA, EXO1, E2F4, E2F2, CREB5 and CHEK2 genes (P ≤ 0.001). The strongest risk association was for CHEK2 SNP rs17507066 with serous EOC (P = 4.74 x 10(-7)). Additional genotyping and imputation of genotypes from the 1000 genomes project identified a slightly more significant association for CHEK2 SNP rs6005807 (r (2) with rs17507066 = 0.84, odds ratio (OR) 1.17, 95% CI 1.11-1.24, P = 1.1×10(-7)). We identified 293 variants in the region with likelihood ratios of less than 1:100 for representing the causal variant. Functional annotation identified 25 candidate SNPs that alter transcription factor binding sites within regulatory elements active in EOC precursor tissues. In The Cancer Genome Atlas dataset, CHEK2 gene expression was significantly higher in primary EOCs compared to normal fallopian tube tissues (P = 3.72×10(-8)). We also identified an association between genotypes of the candidate causal SNP rs12166475 (r (2) = 0.99 with rs6005807) and CHEK2 expression (P = 2.70×10(-8)). These data suggest that common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene.
OBJECTIVES: Recently, several genome-wide association studies have demonstrated a cumulative association of 5 polymorphic variants in chromosomes 8q24 and 17q with prostate cancer (CaP) risk in Caucasians, particularly those harboring aggressive clinicopathologic characteristics. The purpose of this study was to evaluate the influence of these variants on CaP susceptibility in Singaporean Asian men.
MATERIALS AND METHODS: We performed a case-control study in 289 Chinese CaP patients and 412 healthy subjects (144 Chinese, 134 Malays, and 134 Indians), and examined the association of the 5 single nucleotide polymorphisms (SNPs) with CaP.
RESULTS: In the healthy subjects, rs16901979 A-allele frequency was highest amongst Chinese (0.32) compared with Malays (0.13; P < 0.0001) or Indians (0.09; P < 0.0001); rs6983267 G-allele was highest in Indians (0.51) compared with Chinese (0.42; P = 0.041) or Malays (0.43; P = 0.077); whereas rs1859962 G-allele frequency was highest amongst Indians (0.56) compared with Chinese (0.40; P = 0.0002) or Malays (0.38; P < 0.0001). Individuals with the rs4430796 TT genotype were at increased CaP risk in the Chinese via a recessive model (odds ratios (OR) = 1.56, 95% CI = 1.04-2.33). Significant associations were observed for rs4430796 TT with Gleason scores of ≥ 7 (OR = 1.76, 95% CI = 1.14-2.73) and prostate-specific antigen (PSA) levels of ≥ 10 ng/ml at diagnosis (OR = 1.63, 95% CI = 1.01-2.63), as well as for rs6983267 GG with stage 3-4 CaPs (OR = 1.91, 95% CI = 1.01-3.61). A cumulative gene interaction influence on disease risk, which approximately doubled for individuals with at least 2 susceptibility genotypes, was also identified (OR = 2.18, 95% CI = 1.10-4.32).
CONCLUSIONS: This exploratory analysis suggests that the 5 genetic variants previously described may contribute to prostate cancer risk in Singaporean men.
KEYWORDS: Cancer; Ethnicity; Gleason; Pharmacogenetics; Polymorphism; Prostate
We recently identified a novel susceptibility variant, rs865686, for estrogen-receptor positive breast cancer at 9q31.2. Here, we report a fine-mapping analysis of the 9q31.2 susceptibility locus using 43 160 cases and 42 600 controls of European ancestry ascertained from 52 studies and a further 5795 cases and 6624 controls of Asian ancestry from nine studies. Single nucleotide polymorphism (SNP) rs676256 was most strongly associated with risk in Europeans (odds ratios [OR] = 0.90 [0.88-0.92]; P-value = 1.58 × 10(-25)). This SNP is one of a cluster of highly correlated variants, including rs865686, that spans ∼14.5 kb. We identified two additional independent association signals demarcated by SNPs rs10816625 (OR = 1.12 [1.08-1.17]; P-value = 7.89 × 10(-09)) and rs13294895 (OR = 1.09 [1.06-1.12]; P-value = 2.97 × 10(-11)). SNP rs10816625, but not rs13294895, was also associated with risk of breast cancer in Asian individuals (OR = 1.12 [1.06-1.18]; P-value = 2.77 × 10(-05)). Functional genomic annotation using data derived from breast cancer cell-line models indicates that these SNPs localise to putative enhancer elements that bind known drivers of hormone-dependent breast cancer, including ER-α, FOXA1 and GATA-3. In vitro analyses indicate that rs10816625 and rs13294895 have allele-specific effects on enhancer activity and suggest chromatin interactions with the KLF4 gene locus. These results demonstrate the power of dense genotyping in large studies to identify independent susceptibility variants. Analysis of associations using subjects with different ancestry, combined with bioinformatic and genomic characterisation, can provide strong evidence for the likely causative alleles and their functional basis.
In the Arabian Peninsula malaria control is progressing steadily, backed by adequate logistic and political support. As a result, transmission has been interrupted throughout the region, with exception of limited sites in Yemen and Saudi Arabia. Here we examined Plasmodium falciparum parasites in these sites to assess if the above success has limited diversity and gene flow.
Leptin is known as the adipose peptide hormone. It plays an important role in the regulation of body fat and inhibits food intake by its action. Moreover, it is believed that leptin level deductions might be the cause of obesity and may play an important role in the development of Type 2 Diabetes Mellitus (T2DM), as well as in cardiovascular diseases (CVD). The Leptin Receptor (LEPR) gene and its polymorphisms have not been extensively studied in relation to the T2DM and its complications in various populations. In this study, we have determined the association of Gln223Agr loci of LEPR gene in three ethnic groups of Malaysia, namely: Malays, Chinese and Indians. A total of 284 T2DM subjects and 281 healthy individuals were recruited based on International Diabetes Federation (IDF) criteria. Genomic DNA was extracted from the buccal specimens of the subjects. The commercial polymerase chain reaction (PCR) method was carried out by proper restriction enzyme MSP I to both amplify and digest the Gln223Agr polymorphism. The p-value among the three studied races was 0.057, 0.011 and 0.095, respectively. The values such as age, WHR, FPG, HbA1C, LDL, HDL, Chol and Family History were significantly different among the subjects with Gln223Agr polymorphism of LEPR (p < 0.05).
Brucellosis is one of the most common zoonotic diseases worldwide. It can cause acute febrile illness in human and is a major health problem. Studies in human brucellosis in Malaysia is limited and so far no genotyping studies has been done on Brucella isolates. The aim of the study was to determine the genetic diversity among Brucella species isolated from human brucellosis, obtained over a 6-year period (2009-2014).
There are two very different interpretations of the prehistory of Island Southeast Asia (ISEA), with genetic evidence invoked in support of both. The "out-of-Taiwan" model proposes a major Late Holocene expansion of Neolithic Austronesian speakers from Taiwan. An alternative, proposing that Late Glacial/postglacial sea-level rises triggered largely autochthonous dispersals, accounts for some otherwise enigmatic genetic patterns, but fails to explain the Austronesian language dispersal. Combining mitochondrial DNA (mtDNA), Y-chromosome and genome-wide data, we performed the most comprehensive analysis of the region to date, obtaining highly consistent results across all three systems and allowing us to reconcile the models. We infer a primarily common ancestry for Taiwan/ISEA populations established before the Neolithic, but also detected clear signals of two minor Late Holocene migrations, probably representing Neolithic input from both Mainland Southeast Asia and South China, via Taiwan. This latter may therefore have mediated the Austronesian language dispersal, implying small-scale migration and language shift rather than large-scale expansion.