METHODS: Eleven full-length pkmsp1 sequences obtained from clinical isolates of Malaysia along with the H-strain were downloaded from the database for domain wise characterization of pkmsp1 gene. Additionally, 76 pkmsp-142 sequences from Thailand and Malaysia were downloaded from the database for intra and inter-population analysis. DnaSP 5.10 and MEGA 5.0 software were used to determine genetic diversity, polymorphism, haplotypes and natural selection. Genealogical relationships were determined using haplotype network tree in NETWORK software v5.0. Population genetic differentiation index (FST) of parasites were analysed using Arlequin v3.5.
RESULTS: Sequence analysis of 11 full-length pkmsp1 sequences along with the H-strain identified 477 (8.4%) polymorphic sites, of which 107 were singleton sites. The overall diversity observed in the full-length genes were high in comparison to its ortholog pvmsp1 and the 4 variable domains showed extensive size variations. The nucleotide diversity was low towards the pkmsp1-42 compared to the conserved domains. The 19 kDa domain was less diverse and completely conserved among isolates from Malaysian Borneo. The nucleotide diversity of isolates from Peninsular Malaysia and Thailand were higher than Malaysian Borneo. Network analysis of pkmsp1-42 haplotypes showed geographical clustering of the isolates from Malaysian Borneo and grouping of isolates from Peninsular Malaysia and Thailand. Population differentiation analysis indicated high FST values between parasite populations originating from Malaysian Borneo, Peninsular Malaysia and Thailand attributing to geographical distance. Moderate genetic differentiation was observed for parasite populations from Thailand and Peninsular Malaysia. Evidence of population expansion and purifying selection were observed in all conserved domains with strongest selection within the pkmsp1-42 domain.
CONCLUSIONS: This study is the first to report on inter country genetic diversity and population structure of P. knowlesi based on msp1. Strong evidence of negative selection was observed in the 42 kDa domain, indicating functional constrains. Geographical clustering of P. knowlesi and moderate to high genetic differentiation values between populations identified in this study highlights the importance of further evaluation using larger number of clinical samples from Southeast Asian countries.
Method: Thirty-five full-length pk41 sequences from clinical isolates of Malaysia along with four laboratory lines (along with H-strain) were downloaded from public databases. For comparative analysis between species, orthologous P41 genes from P. falciparum, P. vivax, P. coatneyi and P. cynomolgi were also downloaded. Genetic diversity, polymorphism, haplotype and natural selection were determined using DnaSP 5.10 software. Phylogenetic relationships between Pk41 genes were determined using MEGA 5.0 software.
Results: Analysis of 39 full-length pk41 sequences along with the H-strain identified 36 SNPs (20 non-synonymous and 16 synonymous substitutions) resulting in 31 haplotypes. Nucleotide diversity across the full-length gene was low and was similar to its ortholog in P. vivax; pv41. Domain-wise amino acid analysis of the two s48/45 domains indicated low level of polymorphisms for both the domains, and the glutamic acid rich region had extensive size variations. In the central domain, upstream to the glutamate rich region, a unique two to six (K-E)n repeat region was identified within the clinical isolates. Overall, the pk41 genes were indicative of negative/purifying selection due to functional constraints. Domain-wise analysis of the s48/45 domains also indicated purifying selection. However, analysis of Tajima's D across the genes identified non-synonymous SNPs in the s48/45 domain II with high positive values indicating possible epitope binding regions. All the 6-cysteine residues within the s48/45 domains were conserved within the clinical isolates indicating functional conservation of these regions. Phylogenetic analysis of full-length pk41 genes indicated geographical clustering and identified three subpopulations of P. knowlesi; one originating in the laboratory lines and two originating from Sarawak, Malaysian Borneo.
Conclusion: This is the first study to report on the polymorphism and natural selection of pk41 genes from clinical isolates of Malaysia. The results reveal that there is low level of polymorphism in both s48/45 domains, indicating that this antigen could be a potential vaccine target. However, genetic and molecular immunology studies involving higher number of samples from various parts of Malaysia would be necessary to validate this antigen's candidacy as a vaccine target for P. knowlesi.
PRESENTATION OF THE HYPOTHESIS: The hypothesis that is presented consists of two parts. First, that shell ornamentation is the result of sexual selection. Second, that such sexual selection has caused the divergence in shell shape in different species.
TESTING THE HYPOTHESIS: The first part of the hypothesis may be tested by searching for sexual dimorphism in shell ornamentation in gonochoristic snails, by searching for increased variance in shell ornamentation relative to other shell traits, and by mate choice experiments using individuals with experimentally enhanced ornamentation. The second part of the hypothesis may be tested by comparing sister groups and correlating shell diversity with degree of polygamy.
IMPLICATIONS OF THE HYPOTHESIS: If the hypothesis were true, it would provide an explanation for the many cases of allopatric evolutionary radiation in snails, where shell diversity cannot be related to any niche differentiation or environmental differences.
RESULTS: Positively significant departures from neutral expectations were detected on the surf4.1region encoding C-terminus of the variable region 2 (Var2) by 3 population-based tests in the western Kenyan population as similar in the Thai population, which was not covered by the previous analysis for eastern Kenyan population. Significant excess of non-synonymous substitutions per nonsynonymous site over synonymous substitutions per synonymous site was also detected in the Var2 region. Negatively significant departures from neutral expectations was detected on the region encoding Var1 C-terminus consistent to the previous observation in the eastern Kenyan population. Parasites possessing a frameshift mutation resulting a product without intracellular Trp-rich (WR) domains were 22/23 in western Kenya and 22/36 in Thailand. More than one copy of surf4.1gene was detected in western Kenya (4/24), but no CNV was found in Thailand (0/36).
CONCLUSIONS: The authors infer that the high polymorphism of SURFIN4.1Var2 C-terminus in both Kenyan and Thai populations were shaped-up by diversifying selection and maintained by balancing selection. These phenomena were most likely driven by immunological pressure. Whereas the SURFIN4.1Var1 C-terminus is suggested to be under directional selection consistent to the previous report for the eastern Kenyan population. Most western Kenyan isolates possess a frameshift mutation that would limit the expression of SURFIN4.1on the merozoite, but only 60% of Thai isolates possess this frameshift, which would affect the level and type of the selection pressure against this protein as seen in the two extremities of Tajima's D values for Var1 C-terminus between Kenyan and Thai populations. CNV observed in Kenyan isolates may be a consequence of this frameshift mutation to increase benefits on the merozoite surface.
RESULTS: We identified a total of 644,225 SNPs in 131 neuropeptide genes in 6 worldwide population groups from a public database. Of these, 5163 SNPs that had ΔDAF |(African - non-African)| ≥ 0.20 were identified and fully annotated. A total of 20 outlier SNPs that included 19 missense SNPs with a moderate impact and one stop lost SNP with high impact, were identified in 16 neuropeptide genes. Our results indicate that an overall strong population differentiation was observed in the non-African populations that had a higher derived allele frequency for 15/20 of those SNPs. Highly differentiated SNPs in four genes were particularly striking: NPPA (rs5065) with high impact stop lost variant; CHGB (rs6085324, rs236150, rs236152, rs742710 and rs742711) with multiple moderate impact missense variants; IGF2 (rs10770125) and INS (rs3842753) with moderate impact missense variants that are in linkage disequilibrium. Phenotype and disease associations of these differentiated SNPs indicated their association with hypertension and diabetes and highlighted the pleiotropic effects of these neuropeptides and their role in maintaining physiological homeostasis in humans.
CONCLUSIONS: We compiled a list of 131 human neuropeptide genes from multiple databases and literature survey. We detect significant population differentiation in the derived allele frequencies of variants in several neuropeptide genes in African and non-African populations. The results highlights SNPs in these genes that may also contribute to population disparities in prevalence of diseases such as hypertension and diabetes.