Affiliations 

  • 1 Pathogen Molecular Biology Department, London School of Hygiene and Tropical Medicine, London, WC1E 7HT United Kingdom; Malaria Research Centre, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Sarawak, Malaysia; david.conway@lshtm.ac.uk arnab.pain@kaust.edu.sa bskhaira55@gmail.com
  • 2 Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Jeddah, Kingdom of Saudi Arabia; Malaria Research Centre, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Sarawak, Malaysia; Center for Zoonosis Control, Global Institution for Collaborative Research and Education, Hokkaido University, N20 W10 Kita-ku, Sapporo, Japan david.conway@lshtm.ac.uk arnab.pain@kaust.edu.sa bskhaira55@gmail.com
  • 3 Malaria Research Centre, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Sarawak, Malaysia; david.conway@lshtm.ac.uk arnab.pain@kaust.edu.sa bskhaira55@gmail.com
Proc Natl Acad Sci U S A, 2015 Oct 20;112(42):13027-32.
PMID: 26438871 DOI: 10.1073/pnas.1509534112

Abstract

Malaria cases caused by the zoonotic parasite Plasmodium knowlesi are being increasingly reported throughout Southeast Asia and in travelers returning from the region. To test for evidence of signatures of selection or unusual population structure in this parasite, we surveyed genome sequence diversity in 48 clinical isolates recently sampled from Malaysian Borneo and in five lines maintained in laboratory rhesus macaques after isolation in the 1960s from Peninsular Malaysia and the Philippines. Overall genomewide nucleotide diversity (π = 6.03 × 10(-3)) was much higher than has been seen in worldwide samples of either of the major endemic malaria parasite species Plasmodium falciparum and Plasmodium vivax. A remarkable substructure is revealed within P. knowlesi, consisting of two major sympatric clusters of the clinical isolates and a third cluster comprising the laboratory isolates. There was deep differentiation between the two clusters of clinical isolates [mean genomewide fixation index (FST) = 0.21, with 9,293 SNPs having fixed differences of FST = 1.0]. This differentiation showed marked heterogeneity across the genome, with mean FST values of different chromosomes ranging from 0.08 to 0.34 and with further significant variation across regions within several chromosomes. Analysis of the largest cluster (cluster 1, 38 isolates) indicated long-term population growth, with negatively skewed allele frequency distributions (genomewide average Tajima's D = -1.35). Against this background there was evidence of balancing selection on particular genes, including the circumsporozoite protein (csp) gene, which had the top Tajima's D value (1.57), and scans of haplotype homozygosity implicate several genomic regions as being under recent positive selection.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.