  1. Choo SW, Rishik S, Wee WY
    Microb Genom, 2020 12;6(12).
    PMID: 33295861 DOI: 10.1099/mgen.0.000495
    Mycobacteroides immunogenum is an emerging opportunistic pathogen implicated in nosocomial infections. Comparative genome analyses may provide better insights into its genomic structure, functions and evolution. The present analysis showed that M. immunogenum has an open pan-genome. Approximately 36.8% of putative virulence genes were identified in the accessory regions of M. immunogenum. Phylogenetic analyses revealed two potential novel subspecies of M. immunogenum, supported by evidence from ANIb (average nucleotide identity using BLAST) and GGDC (Genome to Genome Distance Calculator) analyses. We identified 74 genomic islands (GIs) in Subspecies 1 and 23 GIs in Subspecies 2. None of the GIs harboured by Subspecies 2 were found in Subspecies 1, indicating that they might have been acquired by Subspecies 2 after their divergence. Subspecies 2 has more defence genes than Subspecies 1, suggesting that it might be more resistant to the insertion of foreign DNA and probably explaining why Subspecies 2 has fewer GIs. Positive selection analysis suggests that M. immunogenum has a lower selection pressure compared to non-pathogenic mycobacteria. Thirteen genes were positively selected and many were involved in virulence.
    Matched MeSH terms: Genomics/methods*
  2. Wong EH, Ng CG, Chua EG, Tay AC, Peters F, Marshall BJ, et al.
    PLoS One, 2016;11(11):e0166835.
    PMID: 27870886 DOI: 10.1371/journal.pone.0166835
    BACKGROUND: Biofilm formation by Helicobacter pylori may be one of the factors influencing eradication outcome. However, genetic differences between good and poor biofilm forming strains have not been studied.

    MATERIALS AND METHODS: The biofilm yields of 32 Helicobacter pylori strains (a standard strain and 31 clinical strains) were determined by crystal-violet assay, and the strains were grouped into poor, moderate and good biofilm forming groups. Whole genome sequencing of these 32 strains was performed on the Illumina MiSeq platform. Annotation and comparison of the differences between the genomic sequences were carried out using RAST (Rapid Annotation using Subsystem Technology) and SEED viewer. Genes identified were confirmed using PCR.

    RESULTS: Genes identified to be associated with biofilm formation in H. pylori include alpha (1,3)-fucosyltransferase, flagellar protein, 3 hypothetical proteins, outer membrane protein and a cag pathogenicity island protein. These genes play a role in bacterial motility, lipopolysaccharide (LPS) synthesis, Lewis antigen synthesis, adhesion and/or the type-IV secretion system (T4SS). Deletion of cagA and cagPAI confirmed that CagA and T4SS were involved in H. pylori biofilm formation.

    CONCLUSIONS: Results from this study suggest that biofilm formation in H. pylori might be genetically determined and might be influenced by multiple genes. Good, moderate and poor biofilm forming strains might differ during the initiation of biofilm formation.

    Matched MeSH terms: Genomics/methods*
  3. Leong WM, Ripen AM, Mirsafian H, Mohamad SB, Merican AF
    Genomics, 2019 07;111(4):899-905.
    PMID: 29885984 DOI: 10.1016/j.ygeno.2018.05.019
    High-depth next generation sequencing data provide valuable insights into the number and distribution of RNA editing events. Here, we report the RNA editing events at the cellular level in human primary monocytes using high-depth whole genomic and transcriptomic sequencing data. We identified over ten thousand putative RNA editing sites, and 69% of the sites were A-to-I editing sites. The sites were enriched in repetitive sequences and intronic regions. High-depth sequencing datasets revealed that 90% of the canonical sites were edited at lower frequencies (<0.7). Single and multiple human monocyte and brain tissue samples were analyzed through a genome sequence-independent approach. The latter approach was observed to identify more editing sites. Monocytes were observed to contain more C-to-U editing sites compared to brain tissues. Our results establish a comparable pipeline that can address current limitations as well as demonstrate the potential for highly sensitive detection of RNA editing events in a single cell type.
    Matched MeSH terms: Genomics/methods*
  4. Swain A, Gnanasekar P, Prava J, Rajeev AC, Kesarwani P, Lahiri C, et al.
    Microb Drug Resist, 2021 Feb;27(2):212-226.
    PMID: 32936741 DOI: 10.1089/mdr.2020.0161
    Many members of nontuberculous mycobacteria (NTM) are opportunistic pathogens causing several infections in animals. The incidence of NTM infections and emergence of drug-resistant NTM strains are rising worldwide, emphasizing the need to develop novel anti-NTM drugs. The present study aimed to identify broad-spectrum drug targets in NTM using a comparative genomics approach. The study identified 537 core proteins in NTM, of which 45 were pathogen-specific and essential for the survival of pathogens. Furthermore, druggability analysis indicated that 15 of those 45 proteins were druggable. These 15 proteins, which were core proteins, pathogen-specific, essential, and druggable, were considered as potential broad-spectrum candidates. Based on their locations in cytoplasm and membrane, targets were classified as drug and vaccine targets. The identified 15 targets were different enzymes, carrier proteins, transcriptional regulator, two-component system protein, ribosomal, and binding proteins. The identified targets could further be utilized by researchers to design inhibitors for the discovery of antimicrobial agents.
    Matched MeSH terms: Genomics/methods
  5. Ashkani S, Yusop MR, Shabanimofrad M, Azady A, Ghasemzadeh A, Azizi P, et al.
    Curr Issues Mol Biol, 2015;17:57-73.
    PMID: 25706446
    Allele mining is a promising way to dissect naturally occurring allelic variants of candidate genes with essential agronomic qualities. With the identification, isolation and characterisation of blast resistance genes in rice, it is now possible to dissect the actual allelic variants of these genes within an array of rice cultivars via allele mining. Multiple alleles from the complex locus serve as a reservoir of variation to generate functional genes. The routine sequence exchange is one of the main mechanisms of R gene evolution and development. Allele mining for resistance genes can be an important method to identify additional resistance alleles and new haplotypes along with the development of allele-specific markers for use in marker-assisted selection. Allele mining can be visualised as a vital link between effective utilisation of genetic and genomic resources in genomics-driven modern plant breeding. This review studies the actual concepts and potential of mining approaches for the discovery of alleles and their utilisation for blast resistance genes in rice. The details provided here will be important to provide the rice breeder with a worthwhile introduction to allele mining and its methodology for breakthrough discovery of fresh alleles hidden in hereditary diversity, which is vital for crop improvement.
    Matched MeSH terms: Genomics/methods*
  6. Chaisson MJP, Sanders AD, Zhao X, Malhotra A, Porubsky D, Rausch T, et al.
    Nat Commun, 2019 04 16;10(1):1784.
    PMID: 30992455 DOI: 10.1038/s41467-018-08148-z
    The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50 bp) and 27,622 SVs (≥50 bp) per genome. We also discover 156 inversions per genome and 58 of the inversions intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community allowing us to make recommendations for maximizing structural variation sensitivity for future genome sequencing studies.
    Matched MeSH terms: Genomics/methods*
  7. Feng S, Stiller J, Deng Y, Armstrong J, Fang Q, Reeve AH, et al.
    Nature, 2020 11;587(7833):252-257.
    PMID: 33177665 DOI: 10.1038/s41586-020-2873-9
    Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity [1-4]. Sparse taxon sampling has previously been proposed to confound phylogenetic inference [5], and captures only a fraction of the genomic diversity. Here we report a substantial step towards the dense representation of avian phylogenetic and molecular diversity, by analysing 363 genomes from 92.4% of bird families, including 267 newly sequenced genomes produced for phase II of the Bird 10,000 Genomes (B10K) Project. We use this comparative genome dataset in combination with a pipeline that leverages a reference-free whole-genome alignment to identify orthologous regions in greater numbers than has previously been possible and to recognize genomic novelties in particular bird lineages. The densely sampled alignment provides a single-base-pair map of selection, has more than doubled the fraction of bases that are confidently predicted to be under conservation and reveals extensive patterns of weak selection in predominantly non-coding DNA. Our results demonstrate that increasing the diversity of genomes used in comparative studies can reveal more shared and lineage-specific variation, and improve the investigation of genomic characteristics. We anticipate that this genomic resource will offer new perspectives on evolutionary processes in cross-species comparative analyses and assist in efforts to conserve species.
    Matched MeSH terms: Genomics/methods*
  8. Callari M, Batra AS, Batra RN, Sammut SJ, Greenwood W, Clifford H, et al.
    BMC Genomics, 2018 01 05;19(1):19.
    PMID: 29304755 DOI: 10.1186/s12864-017-4414-y
    BACKGROUND: Patient-Derived Tumour Xenografts (PDTXs) have emerged as the pre-clinical models that best represent clinical tumour diversity and intra-tumour heterogeneity. The molecular characterization of PDTXs using High-Throughput Sequencing (HTS) is essential; however, the presence of mouse stroma is challenging for HTS data analysis. Indeed, the high homology between the two genomes results in a proportion of mouse reads being mapped as human.

    RESULTS: In this study we generated Whole Exome Sequencing (WES), Reduced Representation Bisulfite Sequencing (RRBS) and RNA sequencing (RNA-seq) data from samples with known mixtures of mouse and human DNA or RNA and from a cohort of human breast cancers and their derived PDTXs. We show that using an In silico Combined human-mouse Reference Genome (ICRG) for alignment discriminates between human and mouse reads with up to 99.9% accuracy and decreases the number of false positive somatic mutations caused by misalignment by >99.9%. We also derived a model to estimate the human DNA content in independent PDTX samples. For RNA-seq and RRBS data analysis, the use of the ICRG allows dissecting computationally the transcriptome and methylome of human tumour cells and mouse stroma. In a direct comparison with previously reported approaches, our method showed similar or higher accuracy while requiring significantly less computing time.

    CONCLUSIONS: The computational pipeline we describe here is a valuable tool for the molecular analysis of PDTXs as well as any other mixture of DNA or RNA species.

    Matched MeSH terms: Genomics/methods*
  9. Vasilakis N, Tesh RB, Popov VL, Widen SG, Wood TG, Forrester NL, et al.
    Viruses, 2019 05 23;11(5).
    PMID: 31126128 DOI: 10.3390/v11050471
    In recent years, it has become evident that a generational gap has developed in the community of arbovirus research. This apparent gap is due to the dis-investment of training for the next generation of arbovirologists, which threatens to derail the rich history of virus discovery, field epidemiology, and understanding of the richness of diversity that surrounds us. On the other hand, new technologies have resulted in an explosion of virus discovery that is constantly redefining the virosphere and the evolutionary relationships between viruses. This paradox presents new challenges that may have immediate and disastrous consequences for public health when yet to be discovered arboviruses emerge. In this review we endeavor to bridge this gap by providing a historical context for the work being conducted today and provide continuity between the generations. To this end, we will provide a narrative of the thrill of scientific discovery and excitement and the challenges lying ahead.
    Matched MeSH terms: Genomics/methods
  10. Lee BKB, Gan CP, Chang JK, Tan JL, Fadlullah MZ, Abdul Rahman ZA, et al.
    J Dent Res, 2018 07;97(8):909-916.
    PMID: 29512401 DOI: 10.1177/0022034518759038
    Head and neck cancer (HNC)-derived cell lines represent fundamental models for studying the biological mechanisms underlying cancer development and precision therapies. However, mining the genomic information of HNC cells from available databases requires knowledge of bioinformatics and computational skill sets. Here, we developed a user-friendly web resource for exploring, visualizing, and analyzing genomics information of commonly used HNC cell lines. We populated the current version of GENIPAC with 44 HNC cell lines from 3 studies: ORL Series, OPC-22, and H Series. Specifically, the mRNA expressions for all the 3 studies were derived with RNA-seq. The copy number alterations analysis of ORL Series was performed on the Genome Wide Human Cytoscan HD array, while copy number alterations for OPC-22 were derived from whole exome sequencing. Mutations from ORL Series and H Series were derived from RNA-seq information, while OPC-22 was based on whole exome sequencing. All genomic information was preprocessed with customized scripts and underwent data validation and correction through data set validator tools provided by cBioPortal. The clinical and genomic information of 44 HNC cell lines is easily accessible in GENIPAC. The functional utility of GENIPAC was demonstrated with some of the genomic alterations that are commonly reported in HNC, such as TP53, EGFR, CCND1, and PIK3CA. We showed that these genomic alterations as reported in The Cancer Genome Atlas database were recapitulated in the HNC cell lines in GENIPAC. Importantly, genomic alterations within pathways could be simultaneously visualized. We developed GENIPAC to create access to genomic information on HNC cell lines. This cancer omics initiative will help the research community to accelerate better understanding of HNC and the development of new precision therapeutic options for HNC treatment. GENIPAC is freely available at http://genipac.cancerresearch.my/ .
    Matched MeSH terms: Genomics/methods*
  11. Chan KL, Rosli R, Tatarinova TV, Hogan M, Firdaus-Raih M, Low EL
    BMC Bioinformatics, 2017 Jan 27;18(Suppl 1):1426.
    PMID: 28466793 DOI: 10.1186/s12859-016-1426-6
    BACKGROUND: Gene prediction is one of the most important steps in the genome annotation process. A large number of software tools and pipelines developed by various computing techniques are available for gene prediction. However, these systems have yet to accurately predict all or even most of the protein-coding regions. Furthermore, none of the currently available gene-finders has a universal Hidden Markov Model (HMM) that can perform gene prediction for all organisms equally well in an automatic fashion.

    RESULTS: We present an automated gene prediction pipeline, Seqping, that uses self-training HMM models and transcriptomic data. The pipeline processes the genome and transcriptome sequences of the target species using GlimmerHMM, SNAP, and AUGUSTUS pipelines, followed by the MAKER2 program to combine predictions from the three tools in association with the transcriptomic evidence. Seqping generates species-specific HMMs that are able to offer unbiased gene predictions. The pipeline was evaluated using the Oryza sativa and Arabidopsis thaliana genomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the pipeline was able to identify at least 95% of BUSCO's plantae dataset. Our evaluation shows that Seqping was able to generate better gene predictions compared to three HMM-based programs (MAKER2, GlimmerHMM and AUGUSTUS) using their respective available HMMs. Seqping had the highest accuracy in rice (0.5648 for CDS, 0.4468 for exon, and 0.6695 for nucleotide structure) and A. thaliana (0.5808 for CDS, 0.5955 for exon, and 0.8839 for nucleotide structure).

    CONCLUSIONS: Seqping provides researchers a seamless pipeline to train species-specific HMMs and predict genes in newly sequenced or less-studied genomes. We conclude that the Seqping pipeline predictions are more accurate than gene predictions using the other three approaches with the default or available HMMs.

    Matched MeSH terms: Genomics/methods*
  12. Rhie A, McCarthy SA, Fedrigo O, Damas J, Formenti G, Koren S, et al.
    Nature, 2021 Apr;592(7856):737-746.
    PMID: 33911273 DOI: 10.1038/s41586-021-03451-0
    High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species [1-4]. To address this issue, the international Genome 10K (G10K) consortium [5,6] has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.
    Matched MeSH terms: Genomics/methods*
  13. Tan SY, Dutta A, Jakubovics NS, Ang MY, Siow CC, Mutha NV, et al.
    BMC Bioinformatics, 2015;16:9.
    PMID: 25591325 DOI: 10.1186/s12859-014-0422-y
    Yersinia is a genus of Gram-negative bacteria that includes serious pathogens such as Yersinia pestis, which causes plague, Yersinia pseudotuberculosis and Yersinia enterocolitica. The remaining species are generally considered non-pathogenic to humans, although there is evidence that at least some of these species can cause occasional infections using distinct mechanisms from the more pathogenic species. With the advances in sequencing technologies, many genomes of Yersinia have been sequenced. However, there is currently no specialized platform to hold the rapidly-growing Yersinia genomic data and to provide analysis tools particularly for comparative analyses, which are required to provide improved insights into their biology, evolution and pathogenicity.
    Matched MeSH terms: Genomics/methods*
  14. Pearson RD, Amato R, Auburn S, Miotto O, Almagro-Garcia J, Amaratunga C, et al.
    Nat Genet, 2016 Aug;48(8):959-964.
    PMID: 27348299 DOI: 10.1038/ng.3599
    The widespread distribution and relapsing nature of Plasmodium vivax infection present major challenges for the elimination of malaria. To characterize the genetic diversity of this parasite in individual infections and across the population, we performed deep genome sequencing of >200 clinical samples collected across the Asia-Pacific region and analyzed data on >300,000 SNPs and nine regions of the genome with large copy number variations. Individual infections showed complex patterns of genetic structure, with variation not only in the number of dominant clones but also in their level of relatedness and inbreeding. At the population level, we observed strong signals of recent evolutionary selection both in known drug resistance genes and at new loci, and these varied markedly between geographical locations. These findings demonstrate a dynamic landscape of local evolutionary adaptation in the parasite population and provide a foundation for genomic surveillance to guide effective strategies for control and elimination of P. vivax.
    Matched MeSH terms: Genomics/methods*
  15. Mohd-Shamsudin MI, Kang Y, Lili Z, Tan TT, Kwong QB, Liu H, et al.
    PLoS One, 2013;8(5):e60839.
    PMID: 23734171 DOI: 10.1371/journal.pone.0060839
    Gene discovery in the Malaysian giant freshwater prawn (Macrobrachium rosenbergii) has been limited to small scale data collection, despite great interest in various research fields related to the commercial significance of this species. Next generation sequencing technologies that have been developed recently and enabled whole transcriptome sequencing (RNA-seq) have allowed generation of large scale functional genomics data sets in a shorter time than was previously possible. Using this technology, transcriptome sequencing of three tissue types (hepatopancreas, gill and muscle) has been undertaken to generate functional genomics data for M. rosenbergii at a massive scale. De novo assembly of 75-bp paired-end Illumina reads has generated 102,230 unigenes. Sequence homology search and in silico prediction have identified known and novel protein coding candidate genes (∼24%), non-coding RNA, and repetitive elements in the transcriptome. Potential markers consisting of simple sequence repeats associated with known protein coding genes have been successfully identified. Using KEGG pathway enrichment, differentially expressed genes in different tissues were systematically represented. The functions of gill and hepatopancreas in the context of neuroactive regulation, metabolism, reproduction, environmental stress and disease responses are described and support relevant experimental studies conducted previously in M. rosenbergii and other crustaceans. This large scale gene discovery represents the most extensive transcriptome data for freshwater prawn. Comparison with model organisms has paved the path to address the possible conserved biological entities shared between vertebrates and crustaceans. The functional genomics resources generated from this study provide the basis for constructing hypotheses for future molecular research in the freshwater shrimp.
    Matched MeSH terms: Genomics/methods*
  16. Sahebi M, Hanafi MM, Azizi P, Hakim A, Ashkani S, Abiri R
    Mol Biotechnol, 2015 Oct;57(10):880-903.
    PMID: 26271955 DOI: 10.1007/s12033-015-9884-z
    Suppression subtractive hybridization (SSH) is an effective method to identify different genes with different expression levels involved in a variety of biological processes. This method has often been used to study molecular mechanisms of plants in complex relationships with different pathogens and a variety of biotic stresses. Compared to other techniques used in gene expression profiling, SSH needs relatively smaller amounts of the initial materials, with lower costs, and fewer false positives present within the results. Extraction of total RNA from plant species rich in phenolic compounds, carbohydrates, and polysaccharides that easily bind to nucleic acids through cellular mechanisms is difficult and needs to be considered. Remarkable advancement has been achieved in the next-generation sequencing (NGS) field. As a result of progress within fields related to molecular chemistry and biology as well as specialized engineering, parallelization in the sequencing reaction has exceptionally enhanced the overall read number of generated sequences per run. Currently available sequencing platforms support an earlier unparalleled view directly into complex mixes associated with RNA in addition to DNA samples. NGS technology has demonstrated the ability to sequence DNA with remarkable swiftness, therefore allowing previously unthinkable scientific accomplishments along with novel biological purposes. However, the massive amounts of data generated by NGS impose a substantial challenge with regard to data safe-keeping and analysis. This review examines some simple but vital points involved in preparing the initial material for SSH and introduces this method as well as its associated applications to detect different novel genes from different plant species. This review evaluates general concepts, basic applications, plus the probable results of NGS technology in genomics, with unique mention of feasible potential tools as well as bioinformatics.
    Matched MeSH terms: Genomics/methods
  17. Briggs MT, Condina MR, Ho YY, Everest-Dass AV, Mittal P, Kaur G, et al.
    Proteomics, 2019 11;19(21-22):e1800482.
    PMID: 31364262 DOI: 10.1002/pmic.201800482
    Epithelial ovarian cancer is one of the most fatal gynecological malignancies in adult women. As studies on protein N-glycosylation have extensively reported aberrant patterns in the ovarian cancer tumor microenvironment, obtaining spatial information will uncover tumor-specific N-glycan alterations in ovarian cancer development and progression. Matrix-assisted laser desorption/ionization (MALDI) mass spectrometry imaging (MSI) is employed to investigate N-glycan distribution on formalin-fixed paraffin-embedded ovarian cancer tissue sections from early- and late-stage patients. Tumor-specific N-glycans are identified and structurally characterized by porous graphitized carbon-liquid chromatography-electrospray ionization-tandem mass spectrometry (PGC-LC-ESI-MS/MS), and then assigned to high-resolution images obtained from MALDI-MSI. Spatial distribution of 14 N-glycans is obtained by MALDI-MSI and 42 N-glycans (including structural and compositional isomers) are identified and structurally characterized by LC-MS. The spatial distribution of oligomannose, complex neutral, bisecting, and sialylated N-glycan families is localized to the tumor regions of late-stage ovarian cancer patients relative to early-stage patients. Potential N-glycan diagnostic markers that emerge include the oligomannose structure, (Hex)6 + (Man)3 (GlcNAc)2, and the complex neutral structure, (Hex)2 (HexNAc)2 (Deoxyhexose)1 + (Man)3 (GlcNAc)2. The distribution of these markers is evaluated using a tissue microarray of early- and late-stage patients.
    Matched MeSH terms: Genomics/methods
  18. Bhalla R, Narasimhan K, Swarup S
    Plant Cell Rep, 2005 Dec;24(10):562-71.
    PMID: 16220342
    A natural shift is taking place in the approaches being adopted by plant scientists in response to the accessibility of systems-based technology platforms. Metabolomics is one such field, which involves a comprehensive non-biased analysis of metabolites in a given cell at a specific time. This review briefly introduces the emerging field and a range of analytical techniques that are most useful in metabolomics when combined with computational approaches in data analyses. Using cases from Arabidopsis and other selected plant systems, this review highlights how information can be integrated from metabolomics and other functional genomics platforms to obtain a global picture of plant cellular responses. We discuss how metabolomics is enabling large-scale and parallel interrogation of cell states under different stages of development and defined environmental conditions to uncover novel interactions among various pathways. Finally, we discuss selected applications of metabolomics.
    Matched MeSH terms: Genomics/methods
  19. Biswas MK, Bagchi M, Biswas D, Harikrishna JA, Liu Y, Li C, et al.
    Genes (Basel), 2020 12 09;11(12).
    PMID: 33317074 DOI: 10.3390/genes11121479
    Trait tagging through molecular markers is an important molecular breeding tool for crop improvement. SSR markers encoded by functionally relevant parts of a genome are well suited for this task because they may be directly related to traits. However, a limited number of these markers are known for Musa spp. Here, we report 35,136 novel functionally relevant SSR markers (FRSMs). Among these, 17,561, 15,373 and 16,286 FRSMs were mapped in-silico to the genomes of Musa acuminata, M. balbisiana and M. schizocarpa, respectively. A set of 273 markers was validated using eight accessions of Musa spp., from which 259 markers (95%) produced a PCR product of the expected size and 203 (74%) were polymorphic. In-silico comparative mapping of FRSMs onto Musa and related species indicated sequence-based orthology and synteny relationships among the chromosomes of Musa and other plant species. Fifteen FRSMs were used to estimate the phylogenetic relationships among 50 banana accessions, and the results revealed that all banana accessions group into two major clusters according to their genomic background. Here, we report the first large-scale development and characterization of functionally relevant Musa SSR markers. We demonstrate their utility for germplasm characterization, genetic diversity studies, and comparative mapping in Musa spp. and other monocot species. The sequences for these novel markers are freely available via a searchable web interface called Musa Marker Database.
    Matched MeSH terms: Genomics/methods
  20. Rahman F, Hassan M, Rosli R, Almousally I, Hanano A, Murphy DJ
    PLoS One, 2018;13(5):e0196669.
    PMID: 29771926 DOI: 10.1371/journal.pone.0196669
    Bioinformatics analyses of caleosin/peroxygenases (CLO/PXG) demonstrated that these genes are present in the vast majority of Viridiplantae taxa for which sequence data are available. Functionally active CLO/PXG proteins with roles in abiotic stress tolerance and lipid droplet storage are present in some Trebouxiophycean and Chlorophycean green algae but are absent from the small number of sequenced Prasinophyceaen genomes. CLO/PXG-like genes are expressed during dehydration stress in Charophyte algae, a sister clade of the land plants (Embryophyta). CLO/PXG-like sequences are also present in all of the >300 sequenced Embryophyte genomes, where some species contain as many as 10-12 genes that have arisen via selective gene duplication. Angiosperm genomes harbour at least one copy each of two distinct CLO/PXG isoforms, termed H (high) and L (low), where H-forms contain an additional C-terminal motif of about 30-50 residues that is absent from L-forms. In contrast, species in other Viridiplantae taxa, including green algae, non-vascular plants, ferns and gymnosperms, contain only one (or occasionally both) of these isoforms per genome. Transcriptome and biochemical data show that CLO/PXG-like genes have complex patterns of developmental and tissue-specific expression. CLO/PXG proteins can associate with cytosolic lipid droplets and/or bilayer membranes. Many of the analysed isoforms also have peroxygenase activity and are involved in oxylipin metabolism. The distribution of CLO/PXG-like genes is consistent with an origin >1 billion years ago in at least two of the earliest diverging groups of the Viridiplantae, namely the Chlorophyta and the Streptophyta, after the Viridiplantae had already diverged from other Archaeplastidal groups such as the Rhodophyta and Glaucophyta. While algal CLO/PXGs have roles in lipid packaging and stress responses, the Embryophyte proteins have a much wider spectrum of roles and may have been instrumental in the colonisation of terrestrial habitats and the subsequent diversification as the major land flora.
    Matched MeSH terms: Genomics/methods