METHODS: A total of 322 samples of mainly human origin were analysed using eight protocols, applying a wide variety of laboratory components. Several samples (60% of human specimens) were processed using different protocols. In total, 712 sequencing libraries were investigated for viral sequence contamination.
RESULTS: Among sequences showing similarity to viruses, 493 were significantly associated with the use of laboratory components. Each of these viral sequences had sporadic appearance, only being identified in a subset of the samples treated with the linked laboratory component, and some were not identified in the non-template control samples. Remarkably, more than 65% of all viral sequences identified were within viral clusters linked to the use of laboratory components.
CONCLUSIONS: We show that high prevalence of contaminating viral sequences can be expected in HTS-based virome data and provide an extensive list of novel contaminating viral sequences that can be used for evaluation of viral findings in future virome and metagenome studies. Moreover, we show that detection can be problematic due to stochastic appearance and limited non-template controls. Although the exact origin of these viral sequences requires further research, our results support laboratory-component-linked viral sequence contamination of both biological and synthetic origin.
METHODS: The purpose of this study was to design an assay for the detection of triplications, common and rare deletional alpha thalassaemia using droplet digital PCR (ddPCR).
RESULTS: This is a quantitative detection method to measure the changes of copy number which can detect deletions, duplications and triplications of the alpha globin gene simultaneously.
CONCLUSION: In conclusion, ddPCR is an alternative method for rapid detection of alpha thalassaemia variants in Malaysia.
METHODS AND RESULTS: In view of the lack of study on their mitogenome, we sequenced (by next generation sequencing) and annotated the complete mitogenome of D. vijaysegarani from Malaysia to determine its features and phylogenetic relationship. The whole mitogenome of D. vijaysegarani has identical gene order with the published mitogenomes of the genus Dacus, with 13 protein-coding genes, two rRNA genes, 22 tRNAs, a non-coding A + T rich control region, and intergenic spacer and overlap sequences. Phylogenetic analysis based on 15 mitochondrial genes (13 PCGs and two rRNA genes), reveals Dacus, Zeugodacus and Bactrocera forming a distinct clade. The genus Dacus forms a monophyletic group in the subclade containing also the Zeugodacus group; this Dacus-Zeugodacus subclade is distinct from the Bactrocera subclade. D. (Mellesis) vijaysegarani forms a lineage with D. (Mellesis) trimacula in the subcluster containing also the lineage of D. (Mellesis) conopsoides and D. (Callantra) longicornis. D. (Dacus) bivittatus and D. (Didacus) ciliatus form a distinct subcluster. Based on cox1 sequences, the Malaysia and Vietnam taxa of D. vijaysegarani may not be conspecific.
CONCLUSIONS: Overall, the mitochondrial genome of D. vijaysegarani provided essential molecular data that could be useful for further studies for species diagnosis, evolution and phylogeny research of other tephritid fruit flies in the future.
RESULTS: In this study we generated Whole Exome Sequencing (WES), Reduced Representation Bisulfite Sequencing (RRBS) and RNA sequencing (RNA-seq) data from samples with known mixtures of mouse and human DNA or RNA and from a cohort of human breast cancers and their derived PDTXs. We show that using an In silico Combined human-mouse Reference Genome (ICRG) for alignment discriminates between human and mouse reads with up to 99.9% accuracy and decreases the number of false positive somatic mutations caused by misalignment by >99.9%. We also derived a model to estimate the human DNA content in independent PDTX samples. For RNA-seq and RRBS data analysis, the use of the ICRG allows dissecting computationally the transcriptome and methylome of human tumour cells and mouse stroma. In a direct comparison with previously reported approaches, our method showed similar or higher accuracy while requiring significantly less computing time.
CONCLUSIONS: The computational pipeline we describe here is a valuable tool for the molecular analysis of PDTXs as well as any other mixture of DNA or RNA species.