RESULTS: The kinship coefficient between individuals in this family ranged from 0.35 to 0.62. S/F and O/DM had the highest genomic heritability, whereas F/B and O/P had the lowest. The accuracies using 135 SSRs were low, with accuracies of the traits around 0.20. The average accuracy of machine learning methods was 0.24, as compared to 0.20 achieved by other methods. The trait with the highest mean accuracy was F/B (0.28), while the lowest were both M/F and O/P (0.18). By using whole genomic SNPs, the accuracies for all traits, especially for O/DM (0.43), S/F (0.39) and M/F (0.30) were improved. The average accuracy of machine learning methods was 0.32, compared to 0.31 achieved by other methods.
CONCLUSION: Due to high genomic resolution, the use of whole-genome SNPs improved the efficiency of GS dramatically for oil palm and is recommended for dura breeding programs. Machine learning slightly outperformed other methods, but required parameters optimization for GS implementation.
RESULTS: Positively significant departures from neutral expectations were detected on the surf4.1region encoding C-terminus of the variable region 2 (Var2) by 3 population-based tests in the western Kenyan population as similar in the Thai population, which was not covered by the previous analysis for eastern Kenyan population. Significant excess of non-synonymous substitutions per nonsynonymous site over synonymous substitutions per synonymous site was also detected in the Var2 region. Negatively significant departures from neutral expectations was detected on the region encoding Var1 C-terminus consistent to the previous observation in the eastern Kenyan population. Parasites possessing a frameshift mutation resulting a product without intracellular Trp-rich (WR) domains were 22/23 in western Kenya and 22/36 in Thailand. More than one copy of surf4.1gene was detected in western Kenya (4/24), but no CNV was found in Thailand (0/36).
CONCLUSIONS: The authors infer that the high polymorphism of SURFIN4.1Var2 C-terminus in both Kenyan and Thai populations were shaped-up by diversifying selection and maintained by balancing selection. These phenomena were most likely driven by immunological pressure. Whereas the SURFIN4.1Var1 C-terminus is suggested to be under directional selection consistent to the previous report for the eastern Kenyan population. Most western Kenyan isolates possess a frameshift mutation that would limit the expression of SURFIN4.1on the merozoite, but only 60% of Thai isolates possess this frameshift, which would affect the level and type of the selection pressure against this protein as seen in the two extremities of Tajima's D values for Var1 C-terminus between Kenyan and Thai populations. CNV observed in Kenyan isolates may be a consequence of this frameshift mutation to increase benefits on the merozoite surface.
METHODS: A total of 36 full-length pkmsp1p sequences along with the reference H-strain and 40 C-terminal pkmsp1p sequences from clinical isolates of Malaysia were downloaded from published genomes. Genetic diversity, polymorphism, haplotype and natural selection were determined using DnaSP 5.10 and MEGA 5.0 software. Genealogical relationships were determined using haplotype network tree in NETWORK software v5.0. Population genetic differentiation index (F ST ) and population structure of parasite was determined using Arlequin v3.5 and STRUCTURE v2.3.4 software.
RESULTS: Comparison of 36 full-length pkmsp1p sequences along with the H-strain identified 339 SNPs (175 non-synonymous and 164 synonymous substitutions). The nucleotide diversity across the full-length gene was low compared to its ortholog pvmsp1p. The nucleotide diversity was higher toward the N-terminal domains (pkmsp1p-83 and 30) compared to the C-terminal domains (pkmsp1p-38, 33 and 19). Phylogenetic analysis of full-length genes identified 2 distinct clusters of P. knowlesi from Malaysian Borneo. The 40 pkmsp1p-19 sequences showed low polymorphisms with 16 polymorphisms leading to 18 haplotypes. In total there were 10 synonymous and 6 non-synonymous substitutions and 12 cysteine residues were intact within the two EGF domains. Evidence of strong purifying selection was observed within the full-length sequences as well in all the domains. Shared haplotypes of 40 pkmsp1p-19 were identified within Malaysian Borneo haplotypes.
CONCLUSIONS: This study is the first to report on the genetic diversity and natural selection of pkmsp1p. A low level of genetic diversity and strong evidence of negative selection was detected and observed in all the domains of pkmsp1p of P. knowlesi indicating functional constrains. Shared haplotypes were identified within pkmsp1p-19 highlighting further evaluation using larger number of clinical samples from Malaysia.
METHODS: Blood samples from 78 knowlesi malaria patients were used. Forty-eight of the samples were from Peninsular Malaysia, and 30 were from Malaysia Borneo. The genomic DNA of the samples was extracted and used as template for the PCR amplification of the PkγRII. The PCR product was cloned and sequenced. The sequences obtained were analysed for genetic diversity and natural selection using MEGA6 and DnaSP (version 5.10.00) programmes. Genetic differentiation between the PkγRII of Peninsular Malaysia and North Borneo isolates was estimated using the Wright's FST fixation index in DnaSP (version 5.10.00). Haplotype analysis was carried out using the Median-Joining approach in NETWORK (version 4.6.1.3).
RESULTS: A total of 78 PkγRII sequences was obtained. Comparative analysis showed that the PkγRII have similar range of haplotype (Hd) and nucleotide diversity (π) with that of PkDBPαRII. Other similarities between PkγRII and PkDBPαRII include undergoing purifying (negative) selection, geographical clustering of haplotypes, and high inter-population genetic differentiation (FST index). The main differences between PkγRII and PkDBPαRII include length polymorphism and no departure from neutrality (as measured by Tajima's D statistics) in the PkγRII.
CONCLUSION: Despite the biological difference between PkγRII and PkDBPαRII, both generally have similar genetic diversity level, natural selection, geographical haplotype clustering and inter-population genetic differentiation index.
RESULTS: We identified a total of 644,225 SNPs in 131 neuropeptide genes in 6 worldwide population groups from a public database. Of these, 5163 SNPs that had ΔDAF |(African - non-African)| ≥ 0.20 were identified and fully annotated. A total of 20 outlier SNPs that included 19 missense SNPs with a moderate impact and one stop lost SNP with high impact, were identified in 16 neuropeptide genes. Our results indicate that an overall strong population differentiation was observed in the non-African populations that had a higher derived allele frequency for 15/20 of those SNPs. Highly differentiated SNPs in four genes were particularly striking: NPPA (rs5065) with high impact stop lost variant; CHGB (rs6085324, rs236150, rs236152, rs742710 and rs742711) with multiple moderate impact missense variants; IGF2 (rs10770125) and INS (rs3842753) with moderate impact missense variants that are in linkage disequilibrium. Phenotype and disease associations of these differentiated SNPs indicated their association with hypertension and diabetes and highlighted the pleiotropic effects of these neuropeptides and their role in maintaining physiological homeostasis in humans.
CONCLUSIONS: We compiled a list of 131 human neuropeptide genes from multiple databases and literature survey. We detect significant population differentiation in the derived allele frequencies of variants in several neuropeptide genes in African and non-African populations. The results highlights SNPs in these genes that may also contribute to population disparities in prevalence of diseases such as hypertension and diabetes.