In this study, we first conducted a genome survey of Sillago sihama using the Illumina sequencing platform, and then developed 15 polymorphic microsatellite loci in a wild population. A total of 129.46 Gb of raw data were obtained, of which 115.07 Gb were clean data, with a sequencing depth of 179.3-fold. The genome was estimated to be 522.6 Mb in size, with heterozygosity, repeat content and GC content of 0.63%, 21% and 44%, respectively. A total of 630,028 microsatellites were identified from the genome, of which dinucleotide repeats were the most abundant (56.80%), followed by mononucleotide repeats (30.23%). Furthermore, 60 pairs of primers were designed and synthesized based on microsatellite sequences, of which 15 were polymorphic in a wild population. A total of 91 alleles were found, with an average of 6.07 per locus. The number of alleles, observed heterozygosity and expected heterozygosity per locus ranged from two to 13, from 0.250 to 0.862, and from 0.396 to 0.901, respectively. Twelve loci were highly informative (PIC > 0.5), and the others were moderately informative (0.25
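The per-locus statistics reported above can be illustrated with a minimal sketch. It assumes the usual definitions: expected heterozygosity as one minus the sum of squared allele frequencies, and the polymorphism information content (PIC) of Botstein et al. (1980). The allele frequencies are hypothetical, and the abstract does not state which software or sample-size corrections the authors used.

```python
from itertools import combinations


def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2), without small-sample correction."""
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs):
    """PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2.0 * (pi * pi) * (pj * pj)
                     for pi, pj in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term


def informativeness(pic_value):
    """Conventional classes: > 0.5 high, 0.25 to 0.5 moderate, else low."""
    if pic_value > 0.5:
        return "highly informative"
    if pic_value > 0.25:
        return "moderately informative"
    return "slightly informative"


# Hypothetical allele frequencies for one locus (not data from the study).
example_freqs = [0.5, 0.3, 0.2]

# Mean number of alleles per locus from the reported totals: 91 alleles, 15 loci.
mean_alleles = round(91 / 15, 2)

if __name__ == "__main__":
    value = pic(example_freqs)
    print(f"He  = {expected_heterozygosity(example_freqs):.3f}")
    print(f"PIC = {value:.3f} ({informativeness(value)})")
    print(f"mean alleles per locus = {mean_alleles}")
```

The last value reproduces the reported average of 6.07 alleles per locus.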
Invasive alien fish species have become a silent threat to ecosystems in Malaysia, especially to native fish populations. There is a need for rapid identification methods that can help management teams identify fish species that are not native to our ecosystem. Current visual identification methods are tedious and time-consuming, delaying action to curb invasions. The loop-mediated isothermal amplification (LAMP) assay successfully identified six popular invasive fish species in Malaysia. None of the LAMP assays showed false positives, and the LAMP primers were highly sensitive, detecting DNA at concentrations as low as 1 × 10⁻¹⁵ ng/μl. The LAMP primers were highly specific to the target species and did not amplify non-target species. DNA sequencing was performed to confirm the accuracy of the LAMP assay results. This study demonstrates that LAMP is a suitable tool for identifying invasive fish species in Malaysia.
We examine genetic structuring in three commercially important species of the teleost family Carangidae from Malaysian waters: yellowtail scad Atule mate, bigeye scad Selar crumenophthalmus and yellowstripe scad Selaroides leptolepis, from the Indo-Malay Archipelago. In view of their distribution across contrasting habitats, we tested the hypothesis that pelagic species display less genetic divergence than demersal species, owing to their potential to undertake long-distance migrations in oceanic waters. To evaluate population genetic structure, we sequenced two mitochondrial (mt)DNA markers [650 bp of cytochrome oxidase I (coI), 450 bp of control region (CR)] and one nuclear gene (910 bp of rag1) in each species. One hundred and eighty samples from four geographical regions within the Indo-Malay Archipelago, including a population of yellowtail from Kuwait, were examined. Findings revealed that the extent of genetic structuring among populations in the semi-pelagic and pelagic species (yellowtail and bigeye) was lower than in the demersal yellowstripe, consistent with the hypothesis that pelagic species display less genetic divergence than demersal species. The yellowtail phylogeny identified three distinct clades with bootstrap values of 86%-99% in mtDNA and 63%-67% in rag1. In bigeye, three clades were also observed in the mtDNA data, whereas only one clade was identified in the rag1 dataset. In yellowstripe, the mtDNA tree was split into three closely related clades and the rag1 tree into two clades, with bootstrap values of 73%-99% and 56%, respectively. However, no geographic structure was apparent in either the mtDNA or the rag1 dataset. Hierarchical analysis of molecular variance (AMOVA), pairwise FST comparisons and the nearest-neighbour statistic (Snn) showed significant genetic differences between Kuwait and Indo-Malay yellowtail. Within the Indo-Malay Archipelago itself, two distinct mitochondrial lineages were detected in yellowtail, suggesting potential cryptic species.
Findings suggest varying degrees of genetic structuring, key information relevant to the management of exploited stocks, though more rapidly evolving genetic markers should be used in future to better delimit the nature and dynamics of putative stock boundaries.
Mislabelling of fish products is a significant emerging issue in the world fish trade in terms of health and economic concerns. DNA barcoding is an efficient sequencing-based tool for detecting fish species substitution, but because of DNA degradation it is in many cases difficult to amplify PCR products of the full-length barcode marker (~650 bp), especially in severely processed products. In the present study, a pair of universal primers targeting a 198 bp sequence of the mitochondrial 16S rRNA gene was designed for identification of fish species in the processed fish products commonly consumed in Malaysia. The specificity of the universal primers was tested both in silico using bioinformatics software and through cross-reaction assessment by practical PCR experiments against DNA from 38 fish species and 22 non-target species (animals and plants); the primers were found to be specific for all the tested fish species. To eliminate the possibility of false-negative detection, a eukaryotic endogenous control was used during specificity evaluation. The developed primer set was validated with various heat-treated (boiled, autoclaved and microwaved) fish samples and was found to show high stability under all processing conditions. The newly developed marker successfully identified 92% of the tested commercial fish products with 96-100% sequence similarities. This study reveals a considerable degree of species mislabelling (20.8%); 5 out of 24 fish products were found to be mislabelled. The new marker developed in this work is a reliable tool to identify fish species even in highly processed products and might be useful in detecting fish species substitution, thus protecting consumers' health and economic interests.
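The two percentages in this abstract rest on simple ratios, sketched below. The `percent_identity` helper is a hypothetical illustration for two pre-aligned, equal-length sequences; the abstract does not say how the authors computed sequence similarity, so it is not a reconstruction of their pipeline. The example sequences are made up.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of matching positions between two pre-aligned sequences.

    Assumes equal length and no gap handling; real barcode matching
    would normally rely on an alignment or database search tool.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


def mislabelling_rate(n_mislabelled, n_tested):
    """Percentage of tested products whose label did not match the species found."""
    return 100.0 * n_mislabelled / n_tested


if __name__ == "__main__":
    # Counts reported in the abstract: 5 mislabelled out of 24 products.
    print(f"mislabelling rate = {mislabelling_rate(5, 24):.1f}%")
    # Made-up 10 bp fragments differing at one position.
    print(f"identity = {percent_identity('ACGTACGTAC', 'ACGTACGTAA'):.1f}%")
```

With the reported counts, the rate rounds to the 20.8% given in the text.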
The migration of anadromous fish in heterogeneous environments unceasingly imposes a selective pressure that results in genetic variation for local adaptation. However, discrimination of anadromous fish populations by fine-scale local adaptation is challenging because of their high rate of gene flow, highly connected divergent populations, and large population size. Recent advances in next-generation sequencing (NGS) have expanded the prospects of defining the weakly structured populations of anadromous fish. Therefore, we used NGS-based restriction site-associated DNA (NextRAD) techniques on 300 individuals of the anadromous Hilsa shad (Tenualosa ilisha), collected from nine strategic habitats across their diverse migratory range, which includes sea, estuary, and different freshwater rivers. The NextRAD technique successfully identified 15,453 single nucleotide polymorphism (SNP) loci. Outlier tests using the FST OutFLANK and pcadapt approaches identified 74 and 449 SNPs (49 SNPs being common), respectively, as putative adaptive loci under divergent selection. Our results, based on different cluster analyses of these putatively adaptive loci, suggested that local adaptation has divided the Hilsa shad population into two genetically structured clusters, in which marine and estuarine collection sites were dominated by individuals of one genetic cluster and riverine collection sites were dominated by individuals of another. The phylogenetic analysis revealed that the riverine populations of Hilsa shad were further subdivided into north-western riverine (turbid freshwater) and north-eastern riverine (clear freshwater) ecotypes. Among all of the putatively adaptive loci, only 36 were observed to be in coding regions, and the encoded genes might be associated with important biological functions related to the local adaptation of Hilsa shad.
In summary, our study provides both neutral and adaptive contexts for the observed genetic divergence of Hilsa shad and, consequently, resolves the previous inconclusive findings on their population genetic structure across their diverse migratory habitats. Moreover, the study has clearly demonstrated that NextRAD sequencing is an innovative approach to explore how dispersal and local adaptation can shape genetic divergence of non-model anadromous fish that intersect diverse migratory habitats during their life-history stages.
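The overlap between the two outlier tests reported in this abstract (74 SNPs from OutFLANK, 449 from pcadapt, 49 shared) can be made concrete with set arithmetic. The SNP identifiers below are placeholders constructed only to reproduce those counts. Whether the downstream analyses used the union or only the shared loci is not stated in the abstract, so the sketch reports both.

```python
# Placeholder SNP identifiers chosen so that the set sizes match the
# reported counts; they are not real locus names from the study.
outflank_hits = {f"snp_{i}" for i in range(0, 74)}     # 74 loci
pcadapt_hits = {f"snp_{i}" for i in range(25, 474)}    # 449 loci

shared = outflank_hits & pcadapt_hits        # flagged by both methods
either = outflank_hits | pcadapt_hits        # flagged by at least one method
outflank_only = outflank_hits - pcadapt_hits
pcadapt_only = pcadapt_hits - outflank_hits

if __name__ == "__main__":
    print(f"OutFLANK: {len(outflank_hits)}, pcadapt: {len(pcadapt_hits)}")
    print(f"shared: {len(shared)}, union: {len(either)}")
    print(f"OutFLANK only: {len(outflank_only)}, pcadapt only: {len(pcadapt_only)}")
```

By inclusion-exclusion, 74 + 449 - 49 gives 474 distinct candidate loci, of which 25 are unique to OutFLANK and 400 to pcadapt.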
Snakehead fishes of the family Channidae are predatory freshwater teleosts from Africa and Asia comprising 38 valid species. Snakeheads are important food fishes (aquaculture, live food trade) and have been introduced widely with several species becoming highly invasive. A channid barcode library was recently assembled by Serrao and co-workers to better detect and identify potential and established invasive snakehead species outside their native range. Comparing our own recent phylogenetic results of this taxonomically confusing group with those previously reported revealed several inconsistencies that prompted us to expand and improve on previous studies. By generating 343 novel snakehead coxI sequences and combining them with an additional 434 coxI sequences from GenBank we highlight several problems with previous efforts towards the assembly of a snakehead reference barcode library. We found that 16.3% of the channid coxI sequences deposited in GenBank are based on misidentifications. With the inclusion of our own data we were, however, able to solve these cases of perpetuated taxonomic confusion. Different species delimitation approaches we employed (BIN, GMYC, and PTP) were congruent in suggesting a potentially much higher species diversity within snakeheads than currently recognized. In total, 90 BINs were recovered and within a total of 15 currently recognized species multiple BINs were identified. This higher species diversity is mostly due to either the incorporation of undescribed, narrow range, endemics from the Eastern Himalaya biodiversity hotspot or the incorporation of several widespread species characterized by deep genetic splits between geographically well-defined lineages. In the latter case, over-lumping in the past has deflated the actual species numbers. Further integrative approaches are clearly needed for providing a better taxonomic understanding of snakehead diversity, new species descriptions and taxonomic revisions of the group.
Adaptive differences across species' ranges can have important implications for population persistence and conservation management decisions. Despite advances in genomic technologies, detecting adaptive variation in natural populations remains challenging. Key challenges in gene-environment association studies involve distinguishing the effects of drift from those of selection and identifying subtle signatures of polygenic adaptation. We used paired-end restriction site-associated DNA sequencing data (6,605 biallelic single nucleotide polymorphisms; SNPs) to examine population structure and test for signatures of adaptation across the geographic range of an iconic Australian endemic freshwater fish species, the Murray cod Maccullochella peelii. Two univariate gene-association methods identified 61 genomic regions associated with climate variation. We also tested for subtle signatures of polygenic adaptation using a multivariate method (redundancy analysis; RDA). The RDA analysis suggested that climate (temperature- and precipitation-related variables) and geography had similar magnitudes of effect in shaping the distribution of SNP genotypes across the sampled range of Murray cod. Although there was poor agreement among the candidate SNPs identified by the univariate methods, the top 5% of SNPs contributing to significant RDA axes included 67% of the SNPs identified by univariate methods. We discuss the potential implications of our findings for the management of Murray cod and other species generally, particularly in relation to informing conservation actions such as translocations to improve evolutionary resilience of natural populations. Our results highlight the value of using a combination of different approaches, including polygenic methods, when testing for signatures of adaptation in landscape genomic studies.
The Merbok Estuary comprises one of the largest remaining mangrove forests in Peninsular Malaysia. Its value is significant as it provides important services to local and global communities. It also offers a unique opportunity to study the structure and functioning of mangrove ecosystems. However, its biodiversity is still only partially inventoried, limiting its research value. A recent checklist based on morphological examination reported 138 fish species residing in, frequenting or occasionally entering the Merbok Estuary. In this work, we reassessed the fish diversity of the Merbok Estuary by DNA barcoding 350 specimens assignable to 134 species initially identified based on morphology. Our results consistently revealed the presence of 139 Molecular Operational Taxonomic Units (MOTUs). Of these, 123 are congruent with morphology-based species delimitation (one species = one MOTU). In two cases, two morphological species share the same MOTU (two species = one MOTU), while we unveiled cryptic diversity (i.e. COI-based genetic variability > 2%) within seven other species (one species = two MOTUs), calling for further taxonomic investigations. This study provides a comprehensive core list of fish taxa in the Merbok Estuary, demonstrating the advantages of combining morphological and molecular evidence to describe diverse but still poorly studied tropical fish communities. It also delivers a large DNA reference collection for brackish-water fishes occurring in this region, which will facilitate further biodiversity-oriented research and management activities.
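The species and MOTU counts in this abstract can be checked against each other with a short tally. The variable names are illustrative; the three category counts are taken directly from the text.

```python
# Category counts as reported in the abstract.
congruent = 123       # one species = one MOTU
merged_cases = 2      # two morphological species share one MOTU
split_species = 7     # one morphological species = two MOTUs

# Each merged case contributes 1 MOTU but 2 species;
# each split species contributes 2 MOTUs but 1 species.
n_motus = congruent + merged_cases * 1 + split_species * 2
n_species = congruent + merged_cases * 2 + split_species * 1

if __name__ == "__main__":
    print(f"MOTUs: {n_motus}")
    print(f"morphological species: {n_species}")
```

The tally gives 139 MOTUs and 134 morphological species, matching the totals stated in the abstract.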