METHODS: This was a cross-sectional study where 1293 healthy women aged between 18 and 60 years were recruited via convenience sampling from five community-based clinics in Selangor, Malaysia. Cervicovaginal self-samples were obtained and DNA was extracted for HPV detection and genotyping. A comprehensive questionnaire was administered to determine the sociodemographics and behavioural patterns of participants.
RESULTS: The median age at enrolment was 37 years old (IQR: 30-47). In total, 86/1190 (7.2%) of the samples collected were positive for HPV infection, with the highest HPV prevalence (11.9%) detected in the subgroup of 18-24 years old. The top three most prevalent HPV genotypes were HPV 16, 52 and 58. The independent risk factors associated with higher rates of HPV infection included Indian ethnicity, widowed status and women with partners who are away from home for long periods and/or has another sexual partner.
CONCLUSIONS: The overall prevalence of HPV infection in this Malaysian multiethnic population was 7.2%, with 6.5% being high-risk genotypes. The top three most common high-risk HPV types were HPV 16, 52 and 58. This information is important for the planning of primary (HPV vaccination) and secondary (screening) cervical cancer prevention programmes in Malaysia.
RESULTS: In this study we generated Whole Exome Sequencing (WES), Reduced Representation Bisulfite Sequencing (RRBS) and RNA sequencing (RNA-seq) data from samples with known mixtures of mouse and human DNA or RNA and from a cohort of human breast cancers and their derived PDTXs. We show that using an In silico Combined human-mouse Reference Genome (ICRG) for alignment discriminates between human and mouse reads with up to 99.9% accuracy and decreases the number of false positive somatic mutations caused by misalignment by >99.9%. We also derived a model to estimate the human DNA content in independent PDTX samples. For RNA-seq and RRBS data analysis, the use of the ICRG allows dissecting computationally the transcriptome and methylome of human tumour cells and mouse stroma. In a direct comparison with previously reported approaches, our method showed similar or higher accuracy while requiring significantly less computing time.
CONCLUSIONS: The computational pipeline we describe here is a valuable tool for the molecular analysis of PDTXs as well as any other mixture of DNA or RNA species.
METHODS: Blood samples were collected from P. knowlesi malaria patients within a period of 4 years (2008-2012). The pkmsp3 gene of the isolates was amplified via PCR, and subsequently cloned and sequenced. The full length pkmsp3 sequence was divided into Domain A and Domain B. Natural selection, genetic diversity, and haplotypes of pkmsp3 were analysed using MEGA6 and DnaSP ver. 5.10.00 programmes.
RESULTS: From 23 samples, 48 pkmsp3 sequences were successfully obtained. At the nucleotide level, 101 synonymous and 238 non-synonymous mutations were observed. Tests of neutrality were not significant for the full length, Domain A or Domain B sequences. However, the dN/dS ratio of Domain B indicates purifying selection for this domain. Analysis of the deduced amino acid sequences revealed 42 different haplotypes. Neighbour Joining phylogenetic tree and haplotype network analyses revealed that the haplotypes clustered into two distinct groups.
CONCLUSIONS: A moderate level of genetic diversity was observed in the pkmsp3 and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of the pkmsp3 into two haplotype groups provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, where large numbers of human knowlesi malaria infection still occur.