MicroRNAs (miRNAs) are ~20-22 nucleotides, non protein-coding RNA regulatory genes that post-transcriptionally regulate many protein-coding genes, influencing critical biological and metabolic processes. While the number of known microRNA is increasing, there is currently no published data for miRNA from giant freshwater prawns, Macrobrachium rosenbergii (M. rosenbergii), a commercially cultured and economically important food species. In this study, we identified novel miRNAs in the gill and hepatopancreas of M. rosenbergii. Through a deep parallel sequencing analysis and an in silico data analysis approach, 327 miRNA families were identified from small RNA libraries with reference to both the de novo transcriptome of M. rosenbergii obtained from RNA-Seq and to miRBase (Release 18.0, November 2012). Based on the identified mature miRNA and recovered precursor sequences that form appropriate hairpin structures, three conserved miRNA (miR125, miR750, miR993) and 27 novel miRNA candidates encoding messenger-like non-coding RNA were identified. miR-125, miR-750, G-m0002/H-m0009, G-m0005, G-m0008/H-m0016, G-m0011/H-m0027 and G-m0015 were selected for experimental validation with stem-loop quantitative RT-PCR and were found to be coherent with the expression profile of deep sequencing data as evaluated with Pearson's correlation coefficient (r = 0.835178 for miRNA in gill, r = 0.724131 for miRNA in hepatopancreas). Using a combinatorial approach of pathway enrichment analysis and inverse expression relationship of miRNA and mRNA, four co-expressed novel miRNA candidates (G-m0005, G-m0008/H-m0016, G-m0011/H-m0027, and G-m0015) were found to be associated with energy metabolism. In addition, the expression of the three novel miRNA candidates (G-m0005, G-m0008/H-m0016, and G-m0011/H-m0027) were also found to be significantly reduced at 9 and 24 h post infection in M. rosenbergii challenged with infectious hypodermal and hematopoietic necrosis virus, suggesting a functional role of these miRNAs in crustacean immune defense.
Morphological identification of fish taxa can sometimes prove difficult because phenotypic variation is either being affected by environmental factors, phenotypic characters are highly conserved or marker selection has been inappropriate. DNA based markers especially neutral mitochondrial DNA (mtDNA) have been used widely in recent times to provide better resolution of systematic relationships among vertebrate taxa. The Asian Arowana (Scleropages formosus) is a high value ornamental fish belonging to the family Osteoglossidae with a number of different colour variants distributed geographically across different locations around Southeast Asia. Systematic relationships among colour variants still remain unresolved. Partial sequences of the Cytochrome B (Cyt B) and DNA barcoding gene, Cytochrome C Oxidase I (COI) were used here to assess genetic relationships among colour variants and as a tool for molecular identification for differentiating among colour variants in this species. Results of the study show that in general, colour pattern shows no relationship with extent of COI or Cyt B mtDNA differentiation and so cannot be used to identify taxa. Partial sequences of the mtDNA genes were sufficient however, to identify S. formosus from a closely related species within the order Osteoglossidae.
Gene discovery in the Malaysian giant freshwater prawn (Macrobrachium rosenbergii) has been limited to small scale data collection, despite great interest in various research fields related to the commercial significance of this species. Next generation sequencing technologies that have been developed recently and enabled whole transcriptome sequencing (RNA-seq), have allowed generation of large scale functional genomics data sets in a shorter time than was previously possible. Using this technology, transcriptome sequencing of three tissue types: hepatopancreas, gill and muscle, has been undertaken to generate functional genomics data for M. rosenbergii at a massive scale. De novo assembly of 75-bp paired end Ilumina reads has generated 102,230 unigenes. Sequence homology search and in silico prediction have identified known and novel protein coding candidate genes (∼24%), non-coding RNA, and repetitive elements in the transcriptome. Potential markers consisting of simple sequence repeats associated with known protein coding genes have been successfully identified. Using KEGG pathway enrichment, differentially expressed genes in different tissues were systematically represented. The functions of gill and hepatopancreas in the context of neuroactive regulation, metabolism, reproduction, environmental stress and disease responses are described and support relevant experimental studies conducted previously in M. rosenbergii and other crustaceans. This large scale gene discovery represents the most extensive transcriptome data for freshwater prawn. Comparison with model organisms has paved the path to address the possible conserved biological entities shared between vertebrates and crustaceans. The functional genomics resources generated from this study provide the basis for constructing hypotheses for future molecular research in the freshwater shrimp.