FINDINGS: We optimized the assembly of a Hevea bark transcriptome based on 16 Gb Illumina PE RNA-Seq reads using the Oases assembler across a range of k-mer sizes. We then assessed assembly quality based on transcript N50 length and transcript mapping statistics in relation to (a) known Hevea cDNAs with complete open reading frames, (b) a set of core eukaryotic genes and (c) Hevea genome scaffolds. This was followed by a systematic transcript mapping process where sub-assemblies from a series of incremental amounts of bark transcripts were aligned to transcripts from the entire bark transcriptome assembly. The exercise served to relate read amounts to the degree of transcript mapping level, the latter being an indicator of the coverage of gene transcripts expressed in the sample. As read amounts or datasize increased toward 16 Gb, the number of transcripts mapped to the entire bark assembly approached saturation. A colour matrix was subsequently generated to illustrate sequencing depth requirement in relation to the degree of coverage of total sample transcripts.
CONCLUSIONS: We devised a procedure, the "transcript mapping saturation test", to estimate the amount of RNA-Seq reads needed for deep coverage of transcriptomes. For Hevea de novo assembly, we propose generating between 5-8 Gb reads, whereby around 90% transcript coverage could be achieved with optimized k-mers and transcript N50 length. The principle behind this methodology may also be applied to other non-model plants, or with reads from other second generation sequencing platforms.
METHODS: We used the genome-wide screening tool TraDIS (Transposon Directed Insertion-site Sequencing) to identify B. pseudomallei essential genes. Transposon-flanking regions were sequenced and gene essentiality was assessed based on the frequency of transposon insertions within each gene. Transposon mutants were grown in LB and M9 minimal medium to determine conditionally essential genes required for growth under laboratory conditions. The Caenorhabditis elegans infection model was used to assess genes associated with in vivo B. pseudomallei survival. Transposon mutants were fed to the worms, recovered from worm intestines, and sequenced. Two selected mutants were constructed and evaluated for the bacteria's ability to survive and proliferate in the nematode intestinal lumen.
RESULTS: Approximately 500,000 transposon-insertion mutants of B. pseudomallei strain R15 were generated. A total of 848,811 unique transposon insertion sites were identified in the B. pseudomallei R15 genome and 492 genes carrying low insertion frequencies were predicted to be essential. A total of 96 genes specifically required to support growth under nutrient-depleted conditions were identified. Genes most likely to be involved in B. pseudomallei survival and adaptation in the C. elegans intestinal lumen, were identified. When compared to wild type B. pseudomallei, a Tn5 mutant of bpsl2988 exhibited reduced survival in the worm intestine, was attenuated in C. elegans killing and showed decreased colonization in the organs of infected mice.
DISCUSSION: The B. pseudomallei conditional essential proteins should provide further insights into the bacteria's niche adaptation, pathogenesis, and virulence.
RESULTS: Two fungal isolates (UM 1400 and UM 1020) from human specimens were identified as Daldinia eschscholtzii by morphological features and ITS-based phylogenetic analysis. Both genomes were similar in size with 10,822 predicted genes in UM 1400 (35.8 Mb) and 11,120 predicted genes in UM 1020 (35.5 Mb). A total of 751 gene families were shared among both UM isolates, including gene families associated with fungus-host interactions. In the CAZyme comparative analysis, both genomes were found to contain arrays of CAZyme related to plant cell wall degradation. Genes encoding secreted peptidases were found in the genomes, which encode for the peptidases involved in the degradation of structural proteins in plant cell wall. In addition, arrays of secondary metabolite backbone genes were identified in both genomes, indicating of their potential to produce bioactive secondary metabolites. Both genomes also contained an abundance of gene encoding signaling components, with three proposed MAPK cascades involved in cell wall integrity, osmoregulation, and mating/filamentation. Besides genomic evidence for degrading capability, both isolates also harbored an array of genes encoding stress response proteins that are potentially significant for adaptation to living in the hostile environments.
CONCLUSIONS: Our genomic studies provide further information for the biological understanding of the D. eschscholtzii and suggest that these wood-decaying fungi are also equipped for adaptation to adverse environments in the human host.
METHODOLOGY: Complete rpoB gene sequences of globally distributed Brucella melitensis strains were analyzed. Single nucleotides polymorphisms (SNPs) of the rpoB gene sequences were identified and used to type Brucella melitensis strains.
RESULTS: Six DNA polymorphisms were identified, of which two (nucleotides 3201 and 558) were novel. Analysis of the geographical distribution of the strains revealed a spatial clustering pattern with rpoB type 1 representing European and American strains, rpoB type 2 representing European, African, and Asian strains, rpoB type 3 representing Mediterranean strains, and rpoB type 4 representing African (C3201T) and European (C3201T/T558A) strains.
CONCLUSIONS: We report the discovery of two novel SNPs of rpoB gene that can serve as useful markers for epidemiology and geographical tracking of B. melitensis.