Gene discovery in the Malaysian giant freshwater prawn (Macrobrachium rosenbergii) has been limited to small-scale data collection, despite great interest across research fields owing to the commercial significance of this species. Recently developed next-generation sequencing technologies, which enable whole-transcriptome sequencing (RNA-seq), allow large-scale functional genomics data sets to be generated in a shorter time than was previously possible. Using this technology, transcriptome sequencing of three tissue types (hepatopancreas, gill and muscle) was undertaken to generate functional genomics data for M. rosenbergii at a large scale. De novo assembly of 75-bp paired-end Illumina reads generated 102,230 unigenes. Sequence homology searches and in silico prediction identified known and novel protein-coding candidate genes (∼24%), non-coding RNAs, and repetitive elements in the transcriptome. Potential markers consisting of simple sequence repeats associated with known protein-coding genes were also identified. Genes differentially expressed among tissues were systematically represented using KEGG pathway enrichment. The functions of gill and hepatopancreas in the context of neuroactive regulation, metabolism, reproduction, environmental stress and disease responses are described and support relevant experimental studies conducted previously in M. rosenbergii and other crustaceans. This large-scale gene discovery represents the most extensive transcriptome data set for the freshwater prawn. Comparison with model organisms has paved the way to address possible conserved biological entities shared between vertebrates and crustaceans. The functional genomics resources generated in this study provide a basis for constructing hypotheses for future molecular research in the freshwater prawn.