MyMedR

Structural dissection of two redox proteins from the shipworm symbiont Teredinibacter turnerae

Rajagopal BS, Yates N, Smith J, Paradisi A, Tétard-Jones C, Willats WGT, et al.

IUCrJ, 2024 Mar 01;11(Pt 2):260-274.
PMID: 38446458 DOI: 10.1107/S2052252524001386

The discovery of lytic polysaccharide monooxygenases (LPMOs), a family of copper-dependent enzymes that play a major role in polysaccharide degradation, has revealed the importance of oxidoreductases in the biological utilization of biomass. In fungi, a range of redox proteins have been implicated as working in harness with LPMOs to bring about polysaccharide oxidation. In bacteria, less is known about the interplay between redox proteins and LPMOs, or how the interaction between the two contributes to polysaccharide degradation. We therefore set out to characterize two previously unstudied proteins from the shipworm symbiont Teredinibacter turnerae that were initially identified by the presence of carbohydrate binding domains appended to uncharacterized domains with probable redox functions. Here, X-ray crystal structures of several domains from these proteins are presented together with initial efforts to characterize their functions. The analysis suggests that the target proteins are unlikely to function as LPMO electron donors, raising new questions as to the potential redox functions that these large extracellular multi-haem-containing c-type cytochromes may perform in these bacteria.
Biochemical and in silico structural characterization of a cold-active arginase from the psychrophilic yeast, Glaciozyma antarctica PI12

Yusof NY, Quay DHX, Kamaruddin S, Jonet MA, Md Illias R, Mahadi NM, et al.

Extremophiles, 2024 Feb 01;28(1):15.
PMID: 38300354 DOI: 10.1007/s00792-024-01333-7

Glaciozyma antarctica PI12 is a psychrophilic yeast isolated from Antarctica. In this work, we describe the heterologous production, biochemical properties and in silico structure analysis of an arginase from this yeast (GaArg). GaArg is a metalloenzyme that catalyses the hydrolysis of L-arginine to L-ornithine and urea. The cDNA of GaArg was reverse transcribed, cloned, expressed and purified as a recombinant protein in Escherichia coli. The purified protein was active against L-arginine as its substrate in a reaction at 20 °C, pH 9. At 10-35 °C and pH 7-9, the protein retained around 50% of its catalytic activity. Mn2+, Ni2+, Co2+ and K+ were able to enhance the enzyme activity more than two-fold, while GaArg is most sensitive to SDS, EDTA and DTT. The predicted structure model of GaArg showed a very similar overall fold with other known arginases. GaArg possesses predominantly smaller and uncharged amino acids, fewer salt bridges, hydrogen bonds and hydrophobic interactions compared to the other counterparts. GaArg is the first reported arginase that is cold-active, facilitated by unique structural characteristics for its adaptation of catalytic functions at low-temperature environments. The structure and function of cold-active GaArg provide insights into the potentiality of new applications in various biotechnology and pharmaceutical industries.
Structural and functional analyses of Burkholderia pseudomallei BPSL1038 reveal a Cas-2/VapD nuclease sub-family

Shaibullah S, Shuhaimi N, Ker DS, Mohd-Sharif N, Ho KL, Teh AH, et al.

Commun Biol, 2023 Sep 08;6(1):920.
PMID: 37684342 DOI: 10.1038/s42003-023-05265-4

Burkholderia pseudomallei is a highly versatile pathogen with ~25% of its genome annotated to encode hypothetical proteins. One such hypothetical protein, BPSL1038, is conserved across seven bacterial genera and 654 Burkholderia spp. Here, we present a 1.55 Å resolution crystal structure of BPSL1038. The overall structure folded into a modified βαββαβα ferredoxin fold similar to known Cas2 nucleases. The Cas2 equivalent catalytic aspartate (D11) pairs are conserved in BPSL1038 although B. pseudomallei has no known CRISPR associated system. Functional analysis revealed that BPSL1038 is a nuclease with endonuclease activity towards double-stranded DNA. The DNase activity is divalent ion independent and optimum at pH 6. The concentration of monovalent ions (Na+ and K+) is crucial for nuclease activity. An active site with a unique D11(X20)SST motif was identified and proposed for BPSL1038 and its orthologs. Structure modelling indicates the catalytic role of the D11(X20)SST motif and that the arginine residues R10 and R30 may interact with the nucleic acid backbone. The structural similarity of BPSL1038 to Cas2 proteins suggests that BPSL1038 may represent a sub-family of nucleases that share a common ancestor with Cas2.
Dissecting the Biology of Rafflesia Species: Current Progress and Future Directions Made Possible with High-Throughput Sequencing Data

Mursyidah AK, Hafizzudin-Fedeli M, Nor Muhammad NA, Latiff A, Firdaus-Raih M, Wan KL

Plant Cell Physiol, 2023 Apr 17;64(4):368-377.
PMID: 36611267 DOI: 10.1093/pcp/pcad004

The angiosperm Rafflesia exhibits a unique biology, including a growth strategy that involves endophytic parasitism of a specific host, with only the gigantic flower externally visible. The Rafflesia possesses many unique evolutionary, developmental and morphological features that are rooted in yet-to-be-explained physiological processes. Although studies on the molecular biology of Rafflesia are limited by sampling difficulties due to its rarity in the wild and the short life span of its flower, current advances in high-throughput sequencing technology have allowed for the genome- and transcriptome-level dissection of the molecular mechanisms behind the unique characteristics of this parasitic plant. In this review, we summarize major findings on the cryptic biology of Rafflesia and provide insights into future research directions. The wealth of data obtained can improve our understanding of Rafflesia species and contribute toward the conservation strategy of this endangered plant.
Transitioning from Soil to Host: Comparative Transcriptome Analysis Reveals the Burkholderia pseudomallei Response to Different Niches

Ghazali AK, Firdaus-Raih M, Uthaya Kumar A, Lee WK, Hoh CC, Nathan S

Microbiol Spectr, 2023 Mar 01;11(2):e0383522.
PMID: 36856434 DOI: 10.1128/spectrum.03835-22

Burkholderia pseudomallei, a soil and water saprophyte, is responsible for the tropical human disease melioidosis. A hundred years since its discovery, there is still much to learn about B. pseudomallei proteins that are essential for the bacterium's survival in and interaction with the infected host, as well as their roles within the bacterium's natural soil habitat. To address this gap, bacteria grown under conditions mimicking the soil environment were subjected to transcriptome sequencing (RNA-seq) analysis. A dual RNA-seq approach was used on total RNA from spleens isolated from a B. pseudomallei mouse infection model at 5 days postinfection. Under these conditions, a total of 1,434 bacterial genes were induced, with 959 induced in the soil environment and 475 induced in bacteria residing within the host. Genes encoding metabolism and transporter proteins were induced when the bacteria were present in soil, while virulence factors, metabolism, and bacterial defense mechanisms were upregulated during active infection of mice. On the other hand, capsular polysaccharide and quorum-sensing pathways were inhibited during infection. In addition to virulence factors, reactive oxygen species, heat shock proteins, siderophores, and secondary metabolites were also induced to assist bacterial adaptation and survival in the host. Overall, this study provides crucial insights into the transcriptome-level adaptations which facilitate infection by soil-dwelling B. pseudomallei. Targeting novel therapeutics toward B. pseudomallei proteins required for adaptation provides an alternative treatment strategy given its intrinsic antimicrobial resistance and the absence of a vaccine.
IMPORTANCE: Burkholderia pseudomallei, a soil-dwelling bacterium, is the causative agent of melioidosis, a fatal infectious disease of humans and animals. The bacterium has a large genome consisting of two chromosomes carrying genes that encode proteins with important roles for survival in diverse environments as well as in the infected host. While a general mechanism of pathogenesis has been proposed, it is not clear which proteins have major roles when the bacteria are in the soil and whether the same proteins are key to successful infection and spread. To address this question, we grew the bacteria in soil medium and then in infected mice. At 5 days postinfection, bacteria were recovered from infected mouse organs and their gene expression was compared against that of bacteria grown in soil medium. The analysis revealed a list of genes expressed under soil growth conditions and a different set of genes encoding proteins which may be important for survival, replication, and dissemination in an infected host. These proteins are a potential resource for understanding the full adaptation mechanism of this pathogen. In the absence of a vaccine for melioidosis and with treatment being reliant on combinatorial antibiotic therapy, these proteins may be ideal targets for designing antimicrobials to treat melioidosis.
Biofilm Signaling, Composition and Regulation in Burkholderia pseudomallei

Nyanasegran PK, Nathan S, Firdaus-Raih M, Muhammad NAN, Ng CL

J Microbiol Biotechnol, 2023 Jan 28;33(1):15-27.
PMID: 36451302 DOI: 10.4014/jmb.2207.07032

The incidence of melioidosis cases caused by the gram-negative pathogen Burkholderia pseudomallei (BP) is seeing an increasing trend that has spread beyond its previously known endemic regions. Biofilms produced by BP have been associated with antimicrobial therapy limitation and relapse melioidosis, thus making it urgently necessary to understand the mechanisms of biofilm formation and their role in BP biology. Microbial cells aggregate and enclose within a self-produced matrix of extracellular polymeric substances (EPSs) to form biofilm. The transition mechanism of bacterial cells from planktonic state to initiate biofilm formation, which involves the formation of surface attachment microcolonies and the maturation of the biofilm matrix, is a dynamic and complex process. Despite the emerging findings on the biofilm formation process, systemic knowledge on the molecular mechanisms of biofilm formation in BP remains fractured. This review provides insights into the signaling systems, matrix composition, and the biosynthesis regulation of EPSs (exopolysaccharide, eDNA and proteins) that facilitate the formation of biofilms in order to present an overview of our current knowledge and the questions that remain regarding BP biofilms.
Modeling and computational characterization of a Xanthomonas sp. Hypothetical protein identifies a remote ortholog of Burkholderia lethal factor 1

Muhamad Ismail NAS, Yap SH, Mohamad Yussoff MA, Nor Muhammad NA, Firdaus-Raih M, Quay DHX

J Biomol Struct Dyn, 2023;41(13):6027-6039.
PMID: 35862639 DOI: 10.1080/07391102.2022.2100827

Burkholderia Lethal Factor 1 (BLF1) is a deamidase first characterized in Burkholderia pseudomallei. This enzyme inhibits cellular protein synthesis by deamidating a glutamine residue to a glutamic acid in its target protein, the eukaryotic translation initiation factor 4A (eIF4A). In this work, we present the characterization of a hypothetical protein from Xanthomonas sp. Leaf131 as the first report of a BLF1 family ortholog outside of the Burkholderia genus. Although standard sequence similarity searches such as BLAST were not able to detect the homology between the Xanthomonas sp. Leaf131 hypothetical protein sequence and BLF1, our computed structure model for the Xanthomonas sp. hypothetical protein revealed structural similarities with an RMSD of 2.7 Å/164 Cα atoms and a TM-score of 0.72 when superposed. Structural comparisons of the Xanthomonas model structure against BLF1 and Escherichia coli cytotoxic necrotizing factor 1 (CNF1) revealed that the conserved signature LXGC motif and putative catalytic residues are structurally aligned thus signifying a level of functional or mechanistic similarity. Protein-protein docking analysis and molecular dynamics simulations also demonstrated that eIF4A could still be a possible target substrate for deamidation by XLF1 as it is for BLF1. We therefore propose that this Xanthomonas hypothetical protein be renamed as Xanthomonas Lethal Factor 1 (XLF1). Our work also provides further evidence of the utility of programs such as AlphaFold in bridging the computational function annotation transfer gap despite very low sequence identities of under 20%. Communicated by Ramaswamy H. Sarma.
GrAfSS: a webserver for substructure similarity searching and comparisons in the structures of proteins and RNA

Ghani NSA, Emrizal R, Moffit SM, Hamdani HY, Ramlan EI, Firdaus-Raih M

Nucleic Acids Res, 2022 Jul 05;50(W1):W375-W383.
PMID: 35639505 DOI: 10.1093/nar/gkac402

The GrAfSS (Graph theoretical Applications for Substructure Searching) webserver is a platform to search for three-dimensional substructures of: (i) amino acid side chains in protein structures; and (ii) base arrangements in RNA structures. The webserver interfaces the functions of five different graph theoretical algorithms - ASSAM, SPRITE, IMAAAGINE, NASSAM and COGNAC - into a single substructure searching suite. Users will be able to identify whether a three-dimensional (3D) arrangement of interest, such as a ligand binding site or 3D motif, observed in a protein or RNA structure can be found in other structures available in the Protein Data Bank (PDB). The webserver also allows users to determine whether a protein or RNA structure of interest contains substructural arrangements that are similar to known motifs or 3D arrangements. These capabilities allow for the functional annotation of new structures that were either experimentally determined or computationally generated (such as the coordinates generated by AlphaFold2) and can provide further insights into the diversity or conservation of functional mechanisms of structures in the PDB. The computed substructural superpositions are visualized using integrated NGL viewers. The GrAfSS server is available at http://mfrlab.org/grafss/.
Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures

Emrizal R, Hamdani HY, Firdaus-Raih M

Int J Mol Sci, 2021 Aug 09;22(16).
PMID: 34445259 DOI: 10.3390/ijms22168553

The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures. Such base arrangements and motifs have diverse roles that range from contributions to structural stability to more direct involvement in the molecule's functions, such as the sites for ligand binding and catalytic activity. We review the utility of computational approaches in annotating RNA tertiary base motifs in a dataset of PDB structures, particularly the use of graph theoretical algorithms that can search for such base motifs and annotate them or find and annotate clusters of hydrogen-bond-connected bases. We also demonstrate how such graph theoretical algorithms can be integrated into a workflow that allows for functional analysis and comparisons of base arrangements and sub-structures, such as those involved in ligand binding. The capacity to carry out such automatic curations has led to the discovery of novel motifs and can give new context to known motifs as well as enable the rapid compilation of RNA 3D motifs into a database.
Regulation of Glycine Cleavage and Detoxification by a Highly Conserved Glycine Riboswitch in Burkholderia spp

Munyati-Othman N, Appasamy SD, Damiri N, Emrizal R, Alipiah NM, Ramlan EI, et al.

Curr Microbiol, 2021 Aug;78(8):2943-2955.
PMID: 34076709 DOI: 10.1007/s00284-021-02550-5

The glycine riboswitch is a known regulatory element that is unique in having two aptamers that are joined by a linker region. In this study, we investigated a glycine riboswitch located in the 5' untranslated region of a glycine cleavage system homolog (gcvTHP) in Burkholderia spp. Structure prediction using the sequence generated a model with a glycine binding pocket composed of base-triple interactions (G62-A64-A86 and G65-U84-C85) that are supported by A/G minor interactions (A17-C60-G88 and G16-C61-G87, respectively) and two ribose-zipper motifs (C11-G12 interacting with A248-A247 and C153-U154 interacting with A79-A78) which had not been previously reported. The capacity of the riboswitch to bind to glycine was experimentally validated by native gel assays and the crucial role of interactions that make up the glycine binding pocket were proven by mutations of A17U and G16C which resulted in conformational differences that may lead to dysfunction. Using glycine supplemented minimal media, we were able to prove that the expression of the gcvTHP genes found downstream of the riboswitch responded to the glycine concentrations introduced thus confirming the role of this highly conserved Burkholderia riboswitch and its associated genes as a putative glycine detoxification system in Burkholderia spp.
Side chain similarity comparisons for integrated drug repositioning and potential toxicity assessments in epidemic response scenarios: The case for COVID-19

Ab Ghani NS, Emrizal R, Makmur H, Firdaus-Raih M

Comput Struct Biotechnol J, 2020;18:2931-2944.
PMID: 33101604 DOI: 10.1016/j.csbj.2020.10.013

Structures of protein-drug-complexes provide an atomic level profile of drug-target interactions. In this work, the three-dimensional arrangements of amino acid side chains in known drug binding sites (substructures) were used to search for similarly arranged sites in SARS-CoV-2 protein structures in the Protein Data Bank for the potential repositioning of approved compounds. We were able to identify 22 target sites for the repositioning of 16 approved drug compounds as potential therapeutics for COVID-19. Using the same approach, we were also able to investigate the potentially promiscuous binding of the 16 compounds to off-target sites that could be implicated in toxicity and side effects that had not been provided by any previous studies. The investigations of binding properties in disease-related proteins derived from the comparison of amino acid substructure arrangements allows for effective mechanism driven decision making to rank and select only the compounds with the highest potential for success and safety to be prioritized for clinical trials or treatments. The intention of this work is not to explicitly identify candidate compounds but to present how an integrated drug repositioning and potential toxicity pipeline using side chain similarity searching algorithms are of great utility in epidemic scenarios involving novel pathogens. In the case of the COVID-19 pandemic caused by the SARS-CoV-2 virus, we demonstrate that the pipeline can identify candidate compounds quickly and sustainably in combination with associated risk factors derived from the analysis of potential off-target site binding by the compounds to be repurposed.
Drug ReposER: a web server for predicting similar amino acid arrangements to known drug binding interfaces for potential drug repositioning

Ab Ghani NS, Ramlan EI, Firdaus-Raih M

Nucleic Acids Res, 2019 Jul 02;47(W1):W350-W356.
PMID: 31106379 DOI: 10.1093/nar/gkz391

A common drug repositioning strategy is the re-application of an existing drug to address alternative targets. A crucial aspect to enable such repurposing is that the drug's binding site on the original target is similar to that on the alternative target. Based on the assumption that proteins with similar binding sites may bind to similar drugs, the 3D substructure similarity data can be used to identify similar sites in other proteins that are not known targets. The Drug ReposER (DRug REPOSitioning Exploration Resource) web server is designed to identify potential targets for drug repurposing based on sub-structural similarity to the binding interfaces of known drug binding sites. The application has pre-computed amino acid arrangements from protein structures in the Protein Data Bank that are similar to the 3D arrangements of known drug binding sites thus allowing users to explore them as alternative targets. Users can annotate new structures for sites that are similarly arranged to the residues found in known drug binding interfaces. The search results are presented as mappings of matched sidechain superpositions. The results of the searches can be visualized using an integrated NGL viewer. The Drug ReposER server has no access restrictions and is available at http://mfrlab.org/drugreposer/.
Computational discovery and annotation of conserved small open reading frames in fungal genomes

Mat-Sharani S, Firdaus-Raih M

BMC Bioinformatics, 2019 Feb 04;19(Suppl 13):551.
PMID: 30717662 DOI: 10.1186/s12859-018-2550-2

BACKGROUND: Small open reading frames (smORF/sORFs) that encode short protein sequences are often overlooked during the standard gene prediction process thus leading to many sORFs being left undiscovered and/or misannotated. For many genomes, a second round of sORF targeted gene prediction can complement the existing annotation. In this study, we specifically targeted the identification of ORFs encoding for 80 amino acid residues or less from 31 fungal genomes. We then compared the predicted sORFs and analysed those that are highly conserved among the genomes.
RESULTS: A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.
CONCLUSIONS: It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.
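As a rough illustration of the 80-residue criterion described in the abstract above, the following minimal sketch scans the three forward reading frames of a DNA string for ATG-to-stop ORFs encoding 80 residues or fewer. It is not the authors' pipeline: the function name `find_sorfs`, the `min_codons` floor and the forward-strand-only scan are assumptions made for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_sorfs(seq, max_codons=80, min_codons=10):
    """Return (start, end, n_codons) for short ORFs on the forward strand.

    start/end are 0-based, end-exclusive and include the stop codon.
    n_codons counts coding codons only (the stop codon is excluded).
    min_codons is a hypothetical lower bound to skip trivial hits.
    """
    seq = seq.upper()
    hits = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                n_codons = (i - start) // 3
                if min_codons <= n_codons <= max_codons:
                    hits.append((start, i + 3, n_codons))
                start = None  # look for the next ORF in this frame
    return hits


if __name__ == "__main__":
    toy = "CC" + "ATG" + "GCT" * 11 + "TAA" + "G"
    print(find_sorfs(toy))
```

A fuller scan would also cover the reverse complement and, as the abstract notes, would be followed by a conservation analysis across genomes to filter out prediction artefacts.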
Comparative analysis of nucleus-encoded plastid-targeting proteins in Rafflesia cantleyi against photosynthetic and non-photosynthetic representatives reveals orthologous systems with potentially divergent functions

Ng SM, Lee XW, Mat-Isa MN, Aizat-Juhari MA, Adam JH, Mohamed R, et al.

Sci Rep, 2018 Nov 22;8(1):17258.
PMID: 30467394 DOI: 10.1038/s41598-018-35173-1

Parasitic plants are known to discard photosynthesis thus leading to the deletion or loss of the plastid genes. Despite plastid genome reduction in non-photosynthetic plants, some nucleus-encoded proteins are transported back to the plastid to carry out specific functions. In this work, we study such proteins in Rafflesia cantleyi, a member of the holoparasitic genus well-known for producing the largest single flower in the world. Our analyses of three transcriptome datasets, two holoparasites (R. cantleyi and Phelipanche aegyptiaca) and one photosynthetic plant (Arabidopsis thaliana), suggest that holoparasites, such as R. cantleyi, retain some common plastid associated processes such as biosynthesis of amino acids and lipids, but are missing photosynthesis components that can be extensions of these pathways. The reconstruction of two selected biosynthetic pathways involving plastids correlates the trend of plastid retention to pathway complexity - transcriptome evidence for R. cantleyi suggests alternate mechanisms in regulating the plastidial heme and terpenoid backbone biosynthesis pathways. The evolution to holoparasitism from autotrophy trends towards devolving the plastid genes to the nuclear genome despite the functional sites remaining in the plastid, or maintaining non-photosynthetic processes in the plastid, before the eventual loss of the plastid and any site dependent functions.
Unravelling the adaptation strategies employed by Glaciozyma antarctica PI12 on Antarctic sea ice

Bharudin I, Abu Bakar MF, Hashim NHF, Mat Isa MN, Alias H, Firdaus-Raih M, et al.

Mar Environ Res, 2018 Jun;137:169-176.
PMID: 29598997 DOI: 10.1016/j.marenvres.2018.03.007

Glaciozyma antarctica PI12 is a psychrophilic yeast isolated from Antarctic sea. In this work, Expressed Sequence Tags (EST) from cells exposed to three different temperatures (15 °C, 0 °C and -12 °C) were generated to identify genes associated with cold adaptation. A total of 5376 clones from each library were randomly picked and sequenced. Comparative analyses from the resulting ESTs in each condition identified several groups of genes required for cold adaptation. Additionally, 319 unique transcripts that encoded uncharacterised functions were identified in the -12 °C library and are currently unique to G. antarctica. Gene expression analysis using RT-qPCR revealed two of the unknown genes to be up-regulated at -12 °C compared to 0 °C and 15 °C. These findings further contribute to the collective knowledge into G. antarctica cold adaptation and as a resource for understanding the ecological and physiological tolerance of psychrophilic microbes in general.
The Glaciozyma antarctica genome reveals an array of systems that provide sustained responses towards temperature variations in a persistently cold habitat

Firdaus-Raih M, Hashim NHF, Bharudin I, Abu Bakar MF, Huang KK, Alias H, et al.

PLoS One, 2018;13(1):e0189947.
PMID: 29385175 DOI: 10.1371/journal.pone.0189947

Extremely low temperatures present various challenges to life that include ice formation and effects on metabolic capacity. Psychrophilic microorganisms typically have an array of mechanisms to enable survival in cold temperatures. In this study, we sequenced and analysed the genome of a psychrophilic yeast isolated in the Antarctic region, Glaciozyma antarctica. The genome annotation identified 7857 protein coding sequences. From the genome sequence analysis we were able to identify genes that encoded for proteins known to be associated with cold survival, in addition to annotating genes that are unique to G. antarctica. For genes that are known to be involved in cold adaptation such as anti-freeze proteins (AFPs), our gene expression analysis revealed that they were differentially transcribed over time and in response to different temperatures. This indicated the presence of an array of adaptation systems that can respond to a changing but persistent cold environment. We were also able to validate the activity of all the AFPs annotated where the recombinant AFPs demonstrated anti-freeze capacity. This work is an important foundation for further collective exploration into psychrophilic microbiology where among other potential, the genes unique to this species may represent a pool of novel mechanisms for cold survival.
Identification of sRNA mediated responses to nutrient depletion in Burkholderia pseudomallei

Mohd-Padil H, Damiri N, Sulaiman S, Chai SF, Nathan S, Firdaus-Raih M

Sci Rep, 2017 Dec 07;7(1):17173.
PMID: 29215024 DOI: 10.1038/s41598-017-17356-4

The Burkholderia genus includes many species that are known to survive in diverse environmental conditions including low nutrient environments. One species, Burkholderia pseudomallei is a versatile pathogen that can survive in a wide range of hosts and environmental conditions. In this study, we investigated how a nutrient depleted growth environment evokes sRNA mediated responses by B. pseudomallei. Computationally predicted B. pseudomallei D286 sRNAs were mapped to RNA-sequencing data for cultures grown under two conditions: (1) BHIB as a nutrient rich media reference environment and (2) M9 media as a nutrient depleted stress environment. The sRNAs were further selected to identify potentially cis-encoded systems by investigating their possible interactions with their flanking genes. The mappings of predicted sRNA genes and interactions analysis to their flanking genes identified 12 sRNA candidates that may possibly have cis-acting regulatory roles that are associated to a nutrient depleted growth environment. Our approach can be used for identifying novel sRNA genes and their possible role as cis-mediated regulatory systems.
Evidence-based gene models for structural and functional annotations of the oil palm genome

Chan KL, Tatarinova TV, Rosli R, Amiruddin N, Azizi N, Halim MAA, et al.

Biol. Direct, 2017 Sep 08;12(1):21.
PMID: 28886750 DOI: 10.1186/s13062-017-0191-4

BACKGROUND: Oil palm is an important source of edible oil. The importance of the crop, as well as its long breeding cycle (10-12 years) has led to the sequencing of its genome in 2013 to pave the way for genomics-guided breeding. Nevertheless, the first set of gene predictions, although useful, had many fragmented genes. Classification and characterization of genes associated with traits of interest, such as those for fatty acid biosynthesis and disease resistance, were also limited. Lipid-, especially fatty acid (FA)-related genes are of particular interest for the oil palm as they specify oil yields and quality. This paper presents the characterization of the oil palm genome using different gene prediction methods and comparative genomics analysis, identification of FA biosynthesis and disease resistance genes, and the development of an annotation database and bioinformatics tools.
RESULTS: Using two independent gene-prediction pipelines, Fgenesh++ and Seqping, 26,059 oil palm genes with transcriptome and RefSeq support were identified from the oil palm genome. These coding regions of the genome have a characteristic broad distribution of GC3 (fraction of cytosine and guanine in the third position of a codon) with over half the GC3-rich genes (GC3 ≥ 0.75286) being intronless. In comparison, only one-seventh of the oil palm genes identified are intronless. Using comparative genomics analysis, characterization of conserved domains and active sites, and expression analysis, 42 key genes involved in FA biosynthesis in oil palm were identified. For three of them, namely EgFABF, EgFABH and EgFAD3, segmental duplication events were detected. Our analysis also identified 210 candidate resistance genes in six classes, grouped by their protein domain structures.
CONCLUSIONS: We present an accurate and comprehensive annotation of the oil palm genome, focusing on analysis of important categories of genes (GC3-rich and intronless), as well as those associated with important functions, such as FA biosynthesis and disease resistance. The study demonstrated the advantages of having an integrated approach to gene prediction and developed a computational framework for combining multiple genome annotations. These results, available in the oil palm annotation database ( http://palmxplore.mpob.gov.my ), will provide important resources for studies on the genomes of oil palm and related crops.
REVIEWERS: This article was reviewed by Alexander Kel, Igor Rogozin, and Vladimir A. Kuznetsov.
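The abstract above defines GC3 as the fraction of cytosine and guanine in the third position of a codon and uses GC3 ≥ 0.75286 as the cut-off for GC3-rich genes. A minimal sketch of that calculation follows; the function names `gc3` and `is_gc3_rich` are hypothetical, and the code assumes an in-frame coding sequence given as a plain string, which is not necessarily how the authors computed it.

```python
def gc3(cds):
    """Fraction of G or C at the third position of each complete codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    third_bases = cds[2:usable:3]
    if not third_bases:
        raise ValueError("sequence contains no complete codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)


def is_gc3_rich(cds, threshold=0.75286):
    """Apply the GC3-rich cut-off quoted in the abstract."""
    return gc3(cds) >= threshold


if __name__ == "__main__":
    example = "ATGGCCTTA"  # third positions: G, C, A
    print(round(gc3(example), 3), is_gc3_rich(example))
```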
Seqping: gene prediction pipeline for plant genomes using self-training gene models and transcriptomic data

Chan KL, Rosli R, Tatarinova TV, Hogan M, Firdaus-Raih M, Low EL

BMC Bioinformatics, 2017 Jan 27;18(Suppl 1):1426.
PMID: 28466793 DOI: 10.1186/s12859-016-1426-6

BACKGROUND: Gene prediction is one of the most important steps in the genome annotation process. A large number of software tools and pipelines developed by various computing techniques are available for gene prediction. However, these systems have yet to accurately predict all or even most of the protein-coding regions. Furthermore, none of the currently available gene-finders has a universal Hidden Markov Model (HMM) that can perform gene prediction for all organisms equally well in an automatic fashion.
RESULTS: We present an automated gene prediction pipeline, Seqping that uses self-training HMM models and transcriptomic data. The pipeline processes the genome and transcriptome sequences of the target species using GlimmerHMM, SNAP, and AUGUSTUS pipelines, followed by MAKER2 program to combine predictions from the three tools in association with the transcriptomic evidence. Seqping generates species-specific HMMs that are able to offer unbiased gene predictions. The pipeline was evaluated using the Oryza sativa and Arabidopsis thaliana genomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the pipeline was able to identify at least 95% of BUSCO's plantae dataset. Our evaluation shows that Seqping was able to generate better gene predictions compared to three HMM-based programs (MAKER2, GlimmerHMM and AUGUSTUS) using their respective available HMMs. Seqping had the highest accuracy in rice (0.5648 for CDS, 0.4468 for exon, and 0.6695 nucleotide structure) and A. thaliana (0.5808 for CDS, 0.5955 for exon, and 0.8839 nucleotide structure).
CONCLUSIONS: Seqping provides researchers a seamless pipeline to train species-specific HMMs and predict genes in newly sequenced or less-studied genomes. We conclude that the Seqping pipeline predictions are more accurate than gene predictions using the other three approaches with the default or available HMMs.
An analysis of simple computational strategies to facilitate the design of functional molecular information processors

Lee Y, Roslan R, Azizan S, Firdaus-Raih M, Ramlan EI

BMC Bioinformatics, 2016 Oct 28;17(1):438.
PMID: 27793081

BACKGROUND: Biological macromolecules (DNA, RNA and proteins) are capable of processing physical or chemical inputs to generate outputs that parallel conventional Boolean logical operators. However, the design of functional modules that will enable these macromolecules to operate as synthetic molecular computing devices is challenging.
RESULTS: Using three simple heuristics, we designed RNA sensors that can mimic the function of a seven-segment display (SSD). Ten independent and orthogonal sensors representing the numerals 0 to 9 are designed and constructed. Each sensor has its own unique oligonucleotide binding site region that is activated uniquely by a specific input. Each operator was subjected to a stringent in silico filtering. Random sensors were selected and functionally validated via ribozyme self cleavage assays that were visualized via electrophoresis.
CONCLUSIONS: By utilising simple permutation and randomisation in the sequence design phase, we have developed functional RNA sensors thus demonstrating that even the simplest of computational methods can greatly aid the design phase for constructing functional molecular devices.
