Displaying publications 1 - 20 of 38 in total

Abstract:
Sort:
  1. Shaibullah S, Mohd-Sharif N, Ho KL, Firdaus-Raih M, Nathan S, Mohamed R, et al.
    Acta Crystallogr F Struct Biol Commun, 2014 Dec 01;70(Pt 12):1697-700.
    PMID: 25484229 DOI: 10.1107/S2053230X14025278
    Melioidosis is an infectious disease caused by the pathogenic bacterium Burkholderia pseudomallei. Whole-genome sequencing revealed that the B. pseudomallei genome includes 5855 coding DNA sequences (CDSs), of which ∼25% encode hypothetical proteins. A pathogen-associated hypothetical protein, BPSL1038, was overexpressed in Escherichia coli, purified and crystallized using vapour-diffusion methods. A BPSL1038 protein crystal that grew using sodium formate as precipitant diffracted to 1.55 Å resolution. It belonged to space group C2221, with unit-cell parameters a = 85.36, b = 115.63, c = 46.73 Å. The calculated Matthews coefficient (VM) suggests that there are two molecules per asymmetric unit, with a solvent content of 48.8%.
  2. Firdaus Raih M, Ahmad HA, Sharum MY, Azizi N, Mohamed R
    Appl. Bioinformatics, 2005;4(2):147-50.
    PMID: 16128617
    Bacterial proteases are an important group of enzymes that have very diverse biochemical and cellular functions. Proteases from prokaryotic sources also have a wide range of uses, either in medicine as pathogenic factors or in industry and therapeutics. ProLysED (Prokaryotic Lysis Enzymes Database), our meta-server integrated database of bacterial proteases, is a useful, albeit very niche, resource. The features include protease classification browsing and searching, organism-specific protease browsing, molecular information and visualisation of protease structures from the Protein Data Bank (PDB) as well as predicted protease structures.
  3. Lee Y, Roslan R, Azizan S, Firdaus-Raih M, Ramlan EI
    BMC Bioinformatics, 2016 Oct 28;17(1):438.
    PMID: 27793081
    BACKGROUND: Biological macromolecules (DNA, RNA and proteins) are capable of processing physical or chemical inputs to generate outputs that parallel conventional Boolean logical operators. However, the design of functional modules that will enable these macromolecules to operate as synthetic molecular computing devices is challenging.

    RESULTS: Using three simple heuristics, we designed RNA sensors that can mimic the function of a seven-segment display (SSD). Ten independent and orthogonal sensors representing the numerals 0 to 9 are designed and constructed. Each sensor has its own unique oligonucleotide binding site region that is activated uniquely by a specific input. Each operator was subjected to a stringent in silico filtering. Random sensors were selected and functionally validated via ribozyme self cleavage assays that were visualized via electrophoresis.

    CONCLUSIONS: By utilising simple permutation and randomisation in the sequence design phase, we have developed functional RNA sensors thus demonstrating that even the simplest of computational methods can greatly aid the design phase for constructing functional molecular devices.

  4. Mat-Sharani S, Firdaus-Raih M
    BMC Bioinformatics, 2019 Feb 04;19(Suppl 13):551.
    PMID: 30717662 DOI: 10.1186/s12859-018-2550-2
    BACKGROUND: Small open reading frames (smORF/sORFs) that encode short protein sequences are often overlooked during the standard gene prediction process thus leading to many sORFs being left undiscovered and/or misannotated. For many genomes, a second round of sORF targeted gene prediction can complement the existing annotation. In this study, we specifically targeted the identification of ORFs encoding for 80 amino acid residues or less from 31 fungal genomes. We then compared the predicted sORFs and analysed those that are highly conserved among the genomes.

    RESULTS: A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.

    CONCLUSIONS: It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.

  5. Chan KL, Rosli R, Tatarinova TV, Hogan M, Firdaus-Raih M, Low EL
    BMC Bioinformatics, 2017 Jan 27;18(Suppl 1):1426.
    PMID: 28466793 DOI: 10.1186/s12859-016-1426-6
    BACKGROUND: Gene prediction is one of the most important steps in the genome annotation process. A large number of software tools and pipelines developed by various computing techniques are available for gene prediction. However, these systems have yet to accurately predict all or even most of the protein-coding regions. Furthermore, none of the currently available gene-finders has a universal Hidden Markov Model (HMM) that can perform gene prediction for all organisms equally well in an automatic fashion.

    RESULTS: We present an automated gene prediction pipeline, Seqping that uses self-training HMM models and transcriptomic data. The pipeline processes the genome and transcriptome sequences of the target species using GlimmerHMM, SNAP, and AUGUSTUS pipelines, followed by MAKER2 program to combine predictions from the three tools in association with the transcriptomic evidence. Seqping generates species-specific HMMs that are able to offer unbiased gene predictions. The pipeline was evaluated using the Oryza sativa and Arabidopsis thaliana genomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the pipeline was able to identify at least 95% of BUSCO's plantae dataset. Our evaluation shows that Seqping was able to generate better gene predictions compared to three HMM-based programs (MAKER2, GlimmerHMM and AUGUSTUS) using their respective available HMMs. Seqping had the highest accuracy in rice (0.5648 for CDS, 0.4468 for exon, and 0.6695 nucleotide structure) and A. thaliana (0.5808 for CDS, 0.5955 for exon, and 0.8839 nucleotide structure).

    CONCLUSIONS: Seqping provides researchers a seamless pipeline to train species-specific HMMs and predict genes in newly sequenced or less-studied genomes. We conclude that the Seqping pipeline predictions are more accurate than gene predictions using the other three approaches with the default or available HMMs.

  6. Yap HY, Chooi YH, Firdaus-Raih M, Fung SY, Ng ST, Tan CS, et al.
    BMC Genomics, 2014;15:635.
    PMID: 25073817 DOI: 10.1186/1471-2164-15-635
    The sclerotium of Lignosus rhinocerotis (Cooke) Ryvarden or Tiger milk mushroom (Polyporales, Basidiomycota) is a valuable folk medicine for indigenous peoples in Southeast Asia. Despite the increasing interest in this ethnobotanical mushroom, very little is known about the molecular and genetic basis of its medicinal and nutraceutical properties.
  7. Khoo JS, Chai SF, Mohamed R, Nathan S, Firdaus-Raih M
    BMC Genomics, 2012;13 Suppl 7:S13.
    PMID: 23282220 DOI: 10.1186/1471-2164-13-S7-S13
    The sRNAs of bacterial pathogens are known to be involved in various cellular roles including environmental adaptation as well as regulation of virulence and pathogenicity. It is expected that sRNAs may also have similar functions for Burkholderia pseudomallei, a soil bacterium that can adapt to diverse environmental conditions, which causes the disease melioidosis and is also able to infect a wide variety of hosts.
  8. Ahmad L, Hung TL, Mat Akhir NA, Mohamed R, Nathan S, Firdaus-Raih M
    BMC Microbiol, 2015;15:270.
    PMID: 26597807 DOI: 10.1186/s12866-015-0604-4
    There are still numerous protein subfamilies within families and superfamilies that do not yet have conclusive empirical experimental evidence providing a specific function. These proteins persist in databases with the annotation of a specific 'putative' function made by association with discernible features in the protein sequence.
  9. Chan KL, Tatarinova TV, Rosli R, Amiruddin N, Azizi N, Halim MAA, et al.
    Biol. Direct, 2017 Sep 08;12(1):21.
    PMID: 28886750 DOI: 10.1186/s13062-017-0191-4
    BACKGROUND: Oil palm is an important source of edible oil. The importance of the crop, as well as its long breeding cycle (10-12 years) has led to the sequencing of its genome in 2013 to pave the way for genomics-guided breeding. Nevertheless, the first set of gene predictions, although useful, had many fragmented genes. Classification and characterization of genes associated with traits of interest, such as those for fatty acid biosynthesis and disease resistance, were also limited. Lipid-, especially fatty acid (FA)-related genes are of particular interest for the oil palm as they specify oil yields and quality. This paper presents the characterization of the oil palm genome using different gene prediction methods and comparative genomics analysis, identification of FA biosynthesis and disease resistance genes, and the development of an annotation database and bioinformatics tools.

    RESULTS: Using two independent gene-prediction pipelines, Fgenesh++ and Seqping, 26,059 oil palm genes with transcriptome and RefSeq support were identified from the oil palm genome. These coding regions of the genome have a characteristic broad distribution of GC3 (fraction of cytosine and guanine in the third position of a codon) with over half the GC3-rich genes (GC3 ≥ 0.75286) being intronless. In comparison, only one-seventh of the oil palm genes identified are intronless. Using comparative genomics analysis, characterization of conserved domains and active sites, and expression analysis, 42 key genes involved in FA biosynthesis in oil palm were identified. For three of them, namely EgFABF, EgFABH and EgFAD3, segmental duplication events were detected. Our analysis also identified 210 candidate resistance genes in six classes, grouped by their protein domain structures.

    CONCLUSIONS: We present an accurate and comprehensive annotation of the oil palm genome, focusing on analysis of important categories of genes (GC3-rich and intronless), as well as those associated with important functions, such as FA biosynthesis and disease resistance. The study demonstrated the advantages of having an integrated approach to gene prediction and developed a computational framework for combining multiple genome annotations. These results, available in the oil palm annotation database ( http://palmxplore.mpob.gov.my ), will provide important resources for studies on the genomes of oil palm and related crops.

    REVIEWERS: This article was reviewed by Alexander Kel, Igor Rogozin, and Vladimir A. Kuznetsov.

  10. Shaibullah S, Shuhaimi N, Ker DS, Mohd-Sharif N, Ho KL, Teh AH, et al.
    Commun Biol, 2023 Sep 08;6(1):920.
    PMID: 37684342 DOI: 10.1038/s42003-023-05265-4
    Burkholderia pseudomallei is a highly versatile pathogen with ~25% of its genome annotated to encode hypothetical proteins. One such hypothetical protein, BPSL1038, is conserved across seven bacterial genera and 654 Burkholderia spp. Here, we present a 1.55 Å resolution crystal structure of BPSL1038. The overall structure folded into a modified βαββαβα ferredoxin fold similar to known Cas2 nucleases. The Cas2 equivalent catalytic aspartate (D11) pairs are conserved in BPSL1038 although B. pseudomallei has no known CRISPR associated system. Functional analysis revealed that BPSL1038 is a nuclease with endonuclease activity towards double-stranded DNA. The DNase activity is divalent ion independent and optimum at pH 6. The concentration of monovalent ions (Na+ and K+) is crucial for nuclease activity. An active site with a unique D11(X20)SST motif was identified and proposed for BPSL1038 and its orthologs. Structure modelling indicates the catalytic role of the D11(X20)SST motif and that the arginine residues R10 and R30 may interact with the nucleic acid backbone. The structural similarity of BPSL1038 to Cas2 proteins suggests that BPSL1038 may represent a sub-family of nucleases that share a common ancestor with Cas2.
  11. Ong HS, Mohamed R, Firdaus-Raih M
    Comp. Funct. Genomics, 2012;2012:752867.
    PMID: 22991502
    Members of the Burkholderia family occupy diverse ecological niches. In pathogenic family members, glycan-associated proteins are often linked to functions that include virulence, protein conformation maintenance, surface recognition, cell adhesion, and immune system evasion. Comparative analysis of available Burkholderia genomes has revealed a core set of 178 glycan-associated proteins shared by all Burkholderia of which 68 are homologous to known essential genes. The genome sequence comparisons revealed insights into species-specific gene acquisitions through gene transfers, identified an S-layer protein, and proposed that significantly reactive surface proteins are associated to sugar moieties as a potential means to circumvent host defense mechanisms. The comparative analysis using a curated database of search queries enabled us to gain insights into the extent of conservation and diversity, as well as the possible virulence-associated roles of glycan-associated proteins in members of the Burkholderia spp. The curated list of glycan-associated proteins used can also be directed to screen other genomes for glycan-associated homologs.
  12. Ab Ghani NS, Emrizal R, Makmur H, Firdaus-Raih M
    Comput Struct Biotechnol J, 2020;18:2931-2944.
    PMID: 33101604 DOI: 10.1016/j.csbj.2020.10.013
    Structures of protein-drug-complexes provide an atomic level profile of drug-target interactions. In this work, the three-dimensional arrangements of amino acid side chains in known drug binding sites (substructures) were used to search for similarly arranged sites in SARS-CoV-2 protein structures in the Protein Data Bank for the potential repositioning of approved compounds. We were able to identify 22 target sites for the repositioning of 16 approved drug compounds as potential therapeutics for COVID-19. Using the same approach, we were also able to investigate the potentially promiscuous binding of the 16 compounds to off-target sites that could be implicated in toxicity and side effects that had not been provided by any previous studies. The investigations of binding properties in disease-related proteins derived from the comparison of amino acid substructure arrangements allows for effective mechanism driven decision making to rank and select only the compounds with the highest potential for success and safety to be prioritized for clinical trials or treatments. The intention of this work is not to explicitly identify candidate compounds but to present how an integrated drug repositioning and potential toxicity pipeline using side chain similarity searching algorithms are of great utility in epidemic scenarios involving novel pathogens. In the case of the COVID-19 pandemic caused by the SARS-CoV-2 virus, we demonstrate that the pipeline can identify candidate compounds quickly and sustainably in combination with associated risk factors derived from the analysis of potential off-target site binding by the compounds to be repurposed.
  13. Mohamed Salleh FH, Arif SM, Zainudin S, Firdaus-Raih M
    Comput Biol Chem, 2015 Dec;59 Pt B:3-14.
    PMID: 26278974 DOI: 10.1016/j.compbiolchem.2015.04.012
    A gene regulatory network (GRN) is a large and complex network consisting of interacting elements that, over time, affect each other's state. The dynamics of complex gene regulatory processes are difficult to understand using intuitive approaches alone. To overcome this problem, we propose an algorithm for inferring the regulatory interactions from knock-out data using a Gaussian model combines with Pearson Correlation Coefficient (PCC). There are several problems relating to GRN construction that have been outlined in this paper. We demonstrated the ability of our proposed method to (1) predict the presence of regulatory interactions between genes, (2) their directionality and (3) their states (activation or suppression). The algorithm was applied to network sizes of 10 and 50 genes from DREAM3 datasets and network sizes of 10 from DREAM4 datasets. The predicted networks were evaluated based on AUROC and AUPR. We discovered that high false positive values were generated by our GRN prediction methods because the indirect regulations have been wrongly predicted as true relationships. We achieved satisfactory results as the majority of sub-networks achieved AUROC values above 0.5.
  14. Munyati-Othman N, Appasamy SD, Damiri N, Emrizal R, Alipiah NM, Ramlan EI, et al.
    Curr Microbiol, 2021 Aug;78(8):2943-2955.
    PMID: 34076709 DOI: 10.1007/s00284-021-02550-5
    The glycine riboswitch is a known regulatory element that is unique in having two aptamers that are joined by a linker region. In this study, we investigated a glycine riboswitch located in the 5' untranslated region of a glycine cleavage system homolog (gcvTHP) in Burkholderia spp. Structure prediction using the sequence generated a model with a glycine binding pocket composed of base-triple interactions (G62-A64-A86 and G65-U84-C85) that are supported by A/G minor interactions (A17-C60-G88 and G16-C61-G87, respectively) and two ribose-zipper motifs (C11-G12 interacting with A248-A247 and C153-U154 interacting with A79-A78) which had not been previously reported. The capacity of the riboswitch to bind to glycine was experimentally validated by native gel assays and the crucial role of interactions that make up the glycine binding pocket were proven by mutations of A17U and G16C which resulted in conformational differences that may lead to dysfunction. Using glycine supplemented minimal media, we were able to prove that the expression of the gcvTHP genes found downstream of the riboswitch responded to the glycine concentrations introduced thus confirming the role of this highly conserved Burkholderia riboswitch and its associated genes as a putative glycine detoxification system in Burkholderia spp.
  15. Yusof NY, Quay DHX, Kamaruddin S, Jonet MA, Md Illias R, Mahadi NM, et al.
    Extremophiles, 2024 Feb 01;28(1):15.
    PMID: 38300354 DOI: 10.1007/s00792-024-01333-7
    Glaciozyma antarctica PI12 is a psychrophilic yeast isolated from Antarctica. In this work, we describe the heterologous production, biochemical properties and in silico structure analysis of an arginase from this yeast (GaArg). GaArg is a metalloenzyme that catalyses the hydrolysis of L-arginine to L-ornithine and urea. The cDNA of GaArg was reversed transcribed, cloned, expressed and purified as a recombinant protein in Escherichia coli. The purified protein was active against L-arginine as its substrate in a reaction at 20 °C, pH 9. At 10-35 °C and pH 7-9, the catalytic activity of the protein was still present around 50%. Mn2+, Ni2+, Co2+ and K+ were able to enhance the enzyme activity more than two-fold, while GaArg is most sensitive to SDS, EDTA and DTT. The predicted structure model of GaArg showed a very similar overall fold with other known arginases. GaArg possesses predominantly smaller and uncharged amino acids, fewer salt bridges, hydrogen bonds and hydrophobic interactions compared to the other counterparts. GaArg is the first reported arginase that is cold-active, facilitated by unique structural characteristics for its adaptation of catalytic functions at low-temperature environments. The structure and function of cold-active GaArg provide insights into the potentiality of new applications in various biotechnology and pharmaceutical industries.
  16. Lim MP, Firdaus-Raih M, Nathan S
    Front Microbiol, 2016;7:1436.
    PMID: 27672387 DOI: 10.3389/fmicb.2016.01436
    Burkholderia pseudomallei, the causative agent of melioidosis, is among a growing number of bacterial pathogens that are increasingly antibiotic resistant. Antimicrobial peptides (AMPs) have been investigated as an alternative approach to treat microbial infections, as generally, there is a lower likelihood that a pathogen will develop resistance to AMPs. In this study, 36 candidate Caenorhabditis elegans genes that encode secreted peptides of <150 amino acids and previously shown to be overexpressed during infection by B. pseudomallei were identified from the expression profile of infected nematodes. RNA interference (RNAi)-based knockdown of 12/34 peptide-encoding genes resulted in enhanced nematode susceptibility to B. pseudomallei without affecting worm fitness. A microdilution test demonstrated that two peptides, NLP-31 and Y43C5A.3, exhibited anti-B. pseudomallei activity in a dose dependent manner on different pathogens. Time kill analysis proposed that these peptides were bacteriostatic against B. pseudomallei at concentrations up to 8× MIC90. The SYTOX green assay demonstrated that NLP-31 and Y43C5A.3 did not disrupt the B. pseudomallei membrane. Instead, gel retardation assays revealed that both peptides were able to bind to DNA and interfere with bacterial viability. In parallel, microscopic examination showed induction of cellular filamentation, a hallmark of DNA synthesis inhibition, of NLP-31 and Y43C5A.3 treated cells. In addition, the peptides also regulated the expression of inflammatory cytokines in B. pseudomallei infected macrophage cells. Collectively, these findings demonstrate the potential of NLP-31 and Y43C5A.3 as anti-B. pseudomallei peptides based on their function as immune modulators.
  17. Rajagopal BS, Yates N, Smith J, Paradisi A, Tétard-Jones C, Willats WGT, et al.
    IUCrJ, 2024 Mar 01;11(Pt 2):260-274.
    PMID: 38446458 DOI: 10.1107/S2052252524001386
    The discovery of lytic polysaccharide monooxygenases (LPMOs), a family of copper-dependent enzymes that play a major role in polysaccharide degradation, has revealed the importance of oxidoreductases in the biological utilization of biomass. In fungi, a range of redox proteins have been implicated as working in harness with LPMOs to bring about polysaccharide oxidation. In bacteria, less is known about the interplay between redox proteins and LPMOs, or how the interaction between the two contributes to polysaccharide degradation. We therefore set out to characterize two previously unstudied proteins from the shipworm symbiont Teredinibacter turnerae that were initially identified by the presence of carbohydrate binding domains appended to uncharacterized domains with probable redox functions. Here, X-ray crystal structures of several domains from these proteins are presented together with initial efforts to characterize their functions. The analysis suggests that the target proteins are unlikely to function as LPMO electron donors, raising new questions as to the potential redox functions that these large extracellular multi-haem-containing c-type cytochromes may perform in these bacteria.
  18. Nadzirin N, Firdaus-Raih M
    Int J Mol Sci, 2012;13(10):12761-72.
    PMID: 23202924 DOI: 10.3390/ijms131012761
    Proteins of uncharacterized functions form a large part of many of the currently available biological databases and this situation exists even in the Protein Data Bank (PDB). Our analysis of recent PDB data revealed that only 42.53% of PDB entries (1084 coordinate files) that were categorized under "unknown function" are true examples of proteins of unknown function at this point in time. The remainder 1465 entries also annotated as such appear to be able to have their annotations re-assessed, based on the availability of direct functional characterization experiments for the protein itself, or for homologous sequences or structures thus enabling computational function inference.
  19. Emrizal R, Hamdani HY, Firdaus-Raih M
    Int J Mol Sci, 2021 Aug 09;22(16).
    PMID: 34445259 DOI: 10.3390/ijms22168553
    The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures. Such base arrangements and motifs have diverse roles that range from contributions to structural stability to more direct involvement in the molecule's functions, such as the sites for ligand binding and catalytic activity. We review the utility of computational approaches in annotating RNA tertiary base motifs in a dataset of PDB structures, particularly the use of graph theoretical algorithms that can search for such base motifs and annotate them or find and annotate clusters of hydrogen-bond-connected bases. We also demonstrate how such graph theoretical algorithms can be integrated into a workflow that allows for functional analysis and comparisons of base arrangements and sub-structures, such as those involved in ligand binding. The capacity to carry out such automatic curations has led to the discovery of novel motifs and can give new context to known motifs as well as enable the rapid compilation of RNA 3D motifs into a database.
  20. Muhamad Ismail NAS, Yap SH, Mohamad Yussoff MA, Nor Muhammad NA, Firdaus-Raih M, Quay DHX
    J Biomol Struct Dyn, 2023;41(13):6027-6039.
    PMID: 35862639 DOI: 10.1080/07391102.2022.2100827
    Burkholderia Lethal Factor 1 (BLF1) is a deamidase first characterized in Burkholderia pseudomallei. This enzyme inhibits cellular protein synthesis by deamidating a glutamine residue to a glutamic acid in its target protein, the eukaryotic translation initiation factor 4 A (eIF4A). In this work, we present the characterization of a hypothetical protein from Xanthomonas sp. Leaf131 as the first report of a BLF1 family ortholog outside of the Burkholderia genus. Although standard sequence similarity searches such as BLAST were not able to detect the homology between the Xanthomonas sp. Leaf131 hypothetical protein sequence and BLF1, our computed structure model for the Xanthomonas sp. hypothetical protein revealed structural similarities with an RMSD of 2.7 Å/164 Cα atoms and a TM-score of 0.72 when superposed. Structural comparisons of the Xanthomonas model structure against BLF1 and Escherichia coli cytotoxic necrotizing factor 1 (CNF1) revealed that the conserved signature LXGC motif and putative catalytic residues are structurally aligned thus signifying a level of functional or mechanistic similarity. Protein-protein docking analysis and molecular dynamics simulations also demonstrated that eIF4A could still be a possible target substrate for deamidation by XLF1 as it is for BLF1. We therefore propose that this Xanthomonas hypothetical protein be renamed as Xanthomonas Lethal Factor 1 (XLF1). Our work also provides further evidence of the utility of programs such as AlphaFold in bridging the computational function annotation transfer gap despite very low sequence identities of under 20%.Communicated by Ramaswamy H. Sarma.
Filters
Contact Us

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links