Displaying publications 1 - 20 of 57 in total

Abstract:
Sort:
  1. Abdullah-Zawawi MR, Ahmad-Nizammuddin NF, Govender N, Harun S, Mohd-Assaad N, Mohamed-Hussein ZA
    Sci Rep, 2021 10 04;11(1):19678.
    PMID: 34608238 DOI: 10.1038/s41598-021-99206-y
    Transcription factors (TFs) form the major class of regulatory genes and play key roles in multiple plant stress responses. In most eukaryotic plants, transcription factor (TF) families (WRKY, MADS-box and MYB) activate unique cellular-level abiotic and biotic stress-responsive strategies, which are considered as key determinants for defense and developmental processes. Arabidopsis and rice are two important representative model systems for dicot and monocot plants, respectively. A comprehensive comparative study on 101 OsWRKY, 34 OsMADS box and 122 OsMYB genes (rice genome) and, 71 AtWRKY, 66 AtMADS box and 144 AtMYB genes (Arabidopsis genome) showed various relationships among TFs across species. The phylogenetic analysis clustered WRKY, MADS-box and MYB TF family members into 10, 7 and 14 clades, respectively. All clades in WRKY and MYB TF families and almost half of the total number of clades in the MADS-box TF family are shared between both species. Chromosomal and gene structure analysis showed that the Arabidopsis-rice orthologous TF gene pairs were unevenly localized within their chromosomes whilst the distribution of exon-intron gene structure and motif conservation indicated plausible functional similarity in both species. The abiotic and biotic stress-responsive cis-regulatory element type and distribution patterns in the promoter regions of Arabidopsis and rice WRKY, MADS-box and MYB orthologous gene pairs provide better knowledge on their role as conserved regulators in both species. Co-expression network analysis showed the correlation between WRKY, MADs-box and MYB genes in each independent rice and Arabidopsis network indicating their role in stress responsiveness and developmental processes.
    Matched MeSH terms: Genomics/methods*
  2. Abdulrauf Sharifai G, Zainol Z
    Genes (Basel), 2020 06 27;11(7).
    PMID: 32605144 DOI: 10.3390/genes11070717
    The training machine learning algorithm from an imbalanced data set is an inherently challenging task. It becomes more demanding with limited samples but with a massive number of features (high dimensionality). The high dimensional and imbalanced data set has posed severe challenges in many real-world applications, such as biomedical data sets. Numerous researchers investigated either imbalanced class or high dimensional data sets and came up with various methods. Nonetheless, few approaches reported in the literature have addressed the intersection of the high dimensional and imbalanced class problem due to their complicated interactions. Lately, feature selection has become a well-known technique that has been used to overcome this problem by selecting discriminative features that represent minority and majority class. This paper proposes a new method called Robust Correlation Based Redundancy and Binary Grasshopper Optimization Algorithm (rCBR-BGOA); rCBR-BGOA has employed an ensemble of multi-filters coupled with the Correlation-Based Redundancy method to select optimal feature subsets. A binary Grasshopper optimisation algorithm (BGOA) is used to construct the feature selection process as an optimisation problem to select the best (near-optimal) combination of features from the majority and minority class. The obtained results, supported by the proper statistical analysis, indicate that rCBR-BGOA can improve the classification performance for high dimensional and imbalanced datasets in terms of G-mean and the Area Under the Curve (AUC) performance metrics.
    Matched MeSH terms: Genomics/methods
  3. Ang MY, Heydari H, Jakubovics NS, Mahmud MI, Dutta A, Wee WY, et al.
    Database (Oxford), 2014;2014.
    PMID: 25149689 DOI: 10.1093/database/bau082
    Fusobacterium are anaerobic gram-negative bacteria that have been associated with a wide spectrum of human infections and diseases. As the biology of Fusobacterium is still not well understood, comparative genomic analysis on members of this species will provide further insights on their taxonomy, phylogeny, pathogenicity and other information that may contribute to better management of infections and diseases. To facilitate the ongoing genomic research on Fusobacterium, a specialized database with easy-to-use analysis tools is necessary. Here we present FusoBase, an online database providing access to genome-wide annotated sequences of Fusobacterium strains as well as bioinformatics tools, to support the expanding scientific community. Using our custom-developed Pairwise Genome Comparison tool, we demonstrate how differences between two user-defined genomes and how insertion of putative prophages can be identified. In addition, Pathogenomics Profiling Tool is capable of clustering predicted genes across Fusobacterium strains and visualizing the results in the form of a heat map with dendrogram.
    Matched MeSH terms: Genomics/methods*
  4. Ashkani S, Yusop MR, Shabanimofrad M, Azady A, Ghasemzadeh A, Azizi P, et al.
    Curr Issues Mol Biol, 2015;17:57-73.
    PMID: 25706446
    Allele mining is a promising way to dissect naturally occurring allelic variants of candidate genes with essential agronomic qualities. With the identification, isolation and characterisation of blast resistance genes in rice, it is now possible to dissect the actual allelic variants of these genes within an array of rice cultivars via allele mining. Multiple alleles from the complex locus serve as a reservoir of variation to generate functional genes. The routine sequence exchange is one of the main mechanisms of R gene evolution and development. Allele mining for resistance genes can be an important method to identify additional resistance alleles and new haplotypes along with the development of allele-specific markers for use in marker-assisted selection. Allele mining can be visualised as a vital link between effective utilisation of genetic and genomic resources in genomics-driven modern plant breeding. This review studies the actual concepts and potential of mining approaches for the discovery of alleles and their utilisation for blast resistance genes in rice. The details provided here will be important to provide the rice breeder with a worthwhile introduction to allele mining and its methodology for breakthrough discovery of fresh alleles hidden in hereditary diversity, which is vital for crop improvement.
    Matched MeSH terms: Genomics/methods*
  5. Asnet MJ, Rubia AG, Ramya G, Nagalakshmi RN, Shenbagarathai R
    J Vector Borne Dis, 2014 Jun;51(2):82-5.
    PMID: 24947213
    DENVirDB is a web portal that provides the sequence information and computationally curated information of dengue viral proteins. The advent of genomic technology has increased the sequences available in the public databases. In order to create relevant concise information on Dengue Virus (DENV), the genomic sequences were collected, analysed with the bioinformatics tools and presented as DENVirDB. It provides the comprehensive information of complete genome sequences of dengue virus isolates of Southeast Asia, viz. India, Bangladesh, Sri Lanka, East Timor, Philippines, Malaysia, Papua New Guinea, Brunei and China. DENVirDB also includes the structural and non-structural protein sequences of DENV. It intends to provide the integrated information on the physicochemical properties, topology, secondary structure, domain and structural properties for each protein sequences. It contains over 99 entries in complete genome sequences and 990 entries in protein sequences, respectively. Therefore, DENVirDB could serve as a user friendly database for researchers in acquiring sequences and proteomic information in one platform.
    Matched MeSH terms: Genomics/methods
  6. Barnard RT
    Expert Rev Vaccines, 2010 May;9(5):461-3.
    PMID: 20450319 DOI: 10.1586/erv.10.48
    The Recombinant Vaccines: Strategies for Candidate Discovery and Vaccine Delivery conference, organized by EuroSciCon, hosted a group of UK-based and international scientists from as far afield as Malaysia and Australia. Genomic analyses of pathogens and elucidation of mechanisms of pathogenesis has advanced candidate discovery and development of vaccines. Therefore, it was timely that this conference featured, in addition to detailed expositions of target selection and clinical trials, presentations on manufacturability, scale-up and delivery of vaccines. Ten talks were presented. This meeting report describes the key topics presented and the themes that emerged from this conference.
    Matched MeSH terms: Genomics/methods
  7. Bhalla R, Narasimhan K, Swarup S
    Plant Cell Rep, 2005 Dec;24(10):562-71.
    PMID: 16220342
    A natural shift is taking place in the approaches being adopted by plant scientists in response to the accessibility of systems-based technology platforms. Metabolomics is one such field, which involves a comprehensive non-biased analysis of metabolites in a given cell at a specific time. This review briefly introduces the emerging field and a range of analytical techniques that are most useful in metabolomics when combined with computational approaches in data analyses. Using cases from Arabidopsis and other selected plant systems, this review highlights how information can be integrated from metabolomics and other functional genomics platforms to obtain a global picture of plant cellular responses. We discuss how metabolomics is enabling large-scale and parallel interrogation of cell states under different stages of development and defined environmental conditions to uncover novel interactions among various pathways. Finally, we discuss selected applications of metabolomics.
    Matched MeSH terms: Genomics/methods
  8. Biswas MK, Bagchi M, Biswas D, Harikrishna JA, Liu Y, Li C, et al.
    Genes (Basel), 2020 12 09;11(12).
    PMID: 33317074 DOI: 10.3390/genes11121479
    Trait tagging through molecular markers is an important molecular breeding tool for crop improvement. SSR markers encoded by functionally relevant parts of a genome are well suited for this task because they may be directly related to traits. However, a limited number of these markers are known for Musa spp. Here, we report 35136 novel functionally relevant SSR markers (FRSMs). Among these, 17,561, 15,373 and 16,286 FRSMs were mapped in-silico to the genomes of Musa acuminata, M. balbisiana and M. schizocarpa, respectively. A set of 273 markers was validated using eight accessions of Musa spp., from which 259 markers (95%) produced a PCR product of the expected size and 203 (74%) were polymorphic. In-silico comparative mapping of FRSMs onto Musa and related species indicated sequence-based orthology and synteny relationships among the chromosomes of Musa and other plant species. Fifteen FRSMs were used to estimate the phylogenetic relationships among 50 banana accessions, and the results revealed that all banana accessions group into two major clusters according to their genomic background. Here, we report the first large-scale development and characterization of functionally relevant Musa SSR markers. We demonstrate their utility for germplasm characterization, genetic diversity studies, and comparative mapping in Musa spp. and other monocot species. The sequences for these novel markers are freely available via a searchable web interface called Musa Marker Database.
    Matched MeSH terms: Genomics/methods
  9. Briggs MT, Condina MR, Ho YY, Everest-Dass AV, Mittal P, Kaur G, et al.
    Proteomics, 2019 11;19(21-22):e1800482.
    PMID: 31364262 DOI: 10.1002/pmic.201800482
    Epithelial ovarian cancer is one of the most fatal gynecological malignancies in adult women. As studies on protein N-glycosylation have extensively reported aberrant patterns in the ovarian cancer tumor microenvironment, obtaining spatial information will uncover tumor-specific N-glycan alterations in ovarian cancer development and progression. matrix-assisted laser desorption/ionization (MALDI) mass spectrometry imaging (MSI) is employed to investigate N-glycan distribution on formalin-fixed paraffin-embedded ovarian cancer tissue sections from early- and late-stage patients. Tumor-specific N-glycans are identified and structurally characterized by porous graphitized carbon-liquid chromatography-electrospray ionization-tandem mass spectrometry (PGC-LC-ESI-MS/MS), and then assigned to high-resolution images obtained from MALDI-MSI. Spatial distribution of 14 N-glycans is obtained by MALDI-MSI and 42 N-glycans (including structural and compositional isomers) identified and structurally characterized by LC-MS. The spatial distribution of oligomannose, complex neutral, bisecting, and sialylated N-glycan families are localized to the tumor regions of late-stage ovarian cancer patients relative to early-stage patients. Potential N-glycan diagnostic markers that emerge include the oligomannose structure, (Hex)6 + (Man)3 (GlcNAc)2 , and the complex neutral structure, (Hex)2 (HexNAc)2 (Deoxyhexose)1 + (Man)3 (GlcNAc)2 . The distribution of these markers is evaluated using a tissue microarray of early- and late-stage patients.
    Matched MeSH terms: Genomics/methods
  10. Callari M, Batra AS, Batra RN, Sammut SJ, Greenwood W, Clifford H, et al.
    BMC Genomics, 2018 01 05;19(1):19.
    PMID: 29304755 DOI: 10.1186/s12864-017-4414-y
    BACKGROUND: Patient-Derived Tumour Xenografts (PDTXs) have emerged as the pre-clinical models that best represent clinical tumour diversity and intra-tumour heterogeneity. The molecular characterization of PDTXs using High-Throughput Sequencing (HTS) is essential; however, the presence of mouse stroma is challenging for HTS data analysis. Indeed, the high homology between the two genomes results in a proportion of mouse reads being mapped as human.

    RESULTS: In this study we generated Whole Exome Sequencing (WES), Reduced Representation Bisulfite Sequencing (RRBS) and RNA sequencing (RNA-seq) data from samples with known mixtures of mouse and human DNA or RNA and from a cohort of human breast cancers and their derived PDTXs. We show that using an In silico Combined human-mouse Reference Genome (ICRG) for alignment discriminates between human and mouse reads with up to 99.9% accuracy and decreases the number of false positive somatic mutations caused by misalignment by >99.9%. We also derived a model to estimate the human DNA content in independent PDTX samples. For RNA-seq and RRBS data analysis, the use of the ICRG allows dissecting computationally the transcriptome and methylome of human tumour cells and mouse stroma. In a direct comparison with previously reported approaches, our method showed similar or higher accuracy while requiring significantly less computing time.

    CONCLUSIONS: The computational pipeline we describe here is a valuable tool for the molecular analysis of PDTXs as well as any other mixture of DNA or RNA species.

    Matched MeSH terms: Genomics/methods*
  11. Chaisson MJP, Sanders AD, Zhao X, Malhotra A, Porubsky D, Rausch T, et al.
    Nat Commun, 2019 04 16;10(1):1784.
    PMID: 30992455 DOI: 10.1038/s41467-018-08148-z
    The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50 bp) and 27,622 SVs (≥50 bp) per genome. We also discover 156 inversions per genome and 58 of the inversions intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community allowing us to make recommendations for maximizing structural variation sensitivity for future genome sequencing studies.
    Matched MeSH terms: Genomics/methods*
  12. Chan KL, Rosli R, Tatarinova TV, Hogan M, Firdaus-Raih M, Low EL
    BMC Bioinformatics, 2017 Jan 27;18(Suppl 1):1426.
    PMID: 28466793 DOI: 10.1186/s12859-016-1426-6
    BACKGROUND: Gene prediction is one of the most important steps in the genome annotation process. A large number of software tools and pipelines developed by various computing techniques are available for gene prediction. However, these systems have yet to accurately predict all or even most of the protein-coding regions. Furthermore, none of the currently available gene-finders has a universal Hidden Markov Model (HMM) that can perform gene prediction for all organisms equally well in an automatic fashion.

    RESULTS: We present an automated gene prediction pipeline, Seqping that uses self-training HMM models and transcriptomic data. The pipeline processes the genome and transcriptome sequences of the target species using GlimmerHMM, SNAP, and AUGUSTUS pipelines, followed by MAKER2 program to combine predictions from the three tools in association with the transcriptomic evidence. Seqping generates species-specific HMMs that are able to offer unbiased gene predictions. The pipeline was evaluated using the Oryza sativa and Arabidopsis thaliana genomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the pipeline was able to identify at least 95% of BUSCO's plantae dataset. Our evaluation shows that Seqping was able to generate better gene predictions compared to three HMM-based programs (MAKER2, GlimmerHMM and AUGUSTUS) using their respective available HMMs. Seqping had the highest accuracy in rice (0.5648 for CDS, 0.4468 for exon, and 0.6695 nucleotide structure) and A. thaliana (0.5808 for CDS, 0.5955 for exon, and 0.8839 nucleotide structure).

    CONCLUSIONS: Seqping provides researchers a seamless pipeline to train species-specific HMMs and predict genes in newly sequenced or less-studied genomes. We conclude that the Seqping pipeline predictions are more accurate than gene predictions using the other three approaches with the default or available HMMs.

    Matched MeSH terms: Genomics/methods*
  13. Choo SW, Ang MY, Dutta A, Tan SY, Siow CC, Heydari H, et al.
    Sci Rep, 2015 Dec 15;5:18227.
    PMID: 26666970 DOI: 10.1038/srep18227
    Mycobacterium spp. are renowned for being the causative agent of diseases like leprosy, Buruli ulcer and tuberculosis in human beings. With more and more mycobacterial genomes being sequenced, any knowledge generated from comparative genomic analysis would provide better insights into the biology, evolution, phylogeny and pathogenicity of this genus, thus helping in better management of diseases caused by Mycobacterium spp.With this motivation, we constructed MycoCAP, a new comparative analysis platform dedicated to the important genus Mycobacterium. This platform currently provides information of 2108 genome sequences of at least 55 Mycobacterium spp. A number of intuitive web-based tools have been integrated in MycoCAP particularly for comparative analysis including the PGC tool for comparison between two genomes, PathoProT for comparing the virulence genes among the Mycobacterium strains and the SuperClassification tool for the phylogenic classification of the Mycobacterium strains and a specialized classification system for strains of Mycobacterium abscessus. We hope the broad range of functions and easy-to-use tools provided in MycoCAP makes it an invaluable analysis platform to speed up the research discovery on mycobacteria for researchers. Database URL: http://mycobacterium.um.edu.my.
    Matched MeSH terms: Genomics/methods*
  14. Choo SW, Rishik S, Wee WY
    Microb Genom, 2020 12;6(12).
    PMID: 33295861 DOI: 10.1099/mgen.0.000495
    Mycobacteroides immunogenum is an emerging opportunistic pathogen implicated in nosocomial infections. Comparative genome analyses may provide better insights into its genomic structure, functions and evolution. The present analysis showed that M. immunogenum has an open pan-genome. Approximately 36.8% of putative virulence genes were identified in the accessory regions of M. immunogenum. Phylogenetic analyses revealed two potential novel subspecies of M. immunogenum, supported by evidence from ANIb (average nucleotide identity using blast) and GGDC (Genome to Genome Distance Calculator) analyses. We identified 74 genomic islands (GIs) in Subspecies 1 and 23 GIs in Subspecies 2. All Subspecies 2-harboured GIs were not found in Subspecies 1, indicating that they might have been acquired by Subspecies 2 after their divergence. Subspecies 2 has more defence genes than Subspecies 1, suggesting that it might be more resistant to the insertion of foreign DNA and probably explaining why Subspecies 2 has fewer GIs. Positive selection analysis suggest that M. immunogenum has a lower selection pressure compared to non-pathogenic mycobacteria. Thirteen genes were positively selected and many were involved in virulence.
    Matched MeSH terms: Genomics/methods*
  15. Chung FF, Tan PF, Raja VJ, Tan BS, Lim KH, Kam TS, et al.
    Sci Rep, 2017 02 15;7:42504.
    PMID: 28198434 DOI: 10.1038/srep42504
    Precursor mRNA (pre-mRNA) splicing is catalyzed by a large ribonucleoprotein complex known as the spliceosome. Numerous studies have indicated that aberrant splicing patterns or mutations in spliceosome components, including the splicing factor 3b subunit 1 (SF3B1), are associated with hallmark cancer phenotypes. This has led to the identification and development of small molecules with spliceosome-modulating activity as potential anticancer agents. Jerantinine A (JA) is a novel indole alkaloid which displays potent anti-proliferative activities against human cancer cell lines by inhibiting tubulin polymerization and inducing G2/M cell cycle arrest. Using a combined pooled-genome wide shRNA library screen and global proteomic profiling, we showed that JA targets the spliceosome by up-regulating SF3B1 and SF3B3 protein in breast cancer cells. Notably, JA induced significant tumor-specific cell death and a significant increase in unspliced pre-mRNAs. In contrast, depletion of endogenous SF3B1 abrogated the apoptotic effects, but not the G2/M cell cycle arrest induced by JA. Further analyses showed that JA stabilizes endogenous SF3B1 protein in breast cancer cells and induced dissociation of the protein from the nucleosome complex. Together, these results demonstrate that JA exerts its antitumor activity by targeting SF3B1 and SF3B3 in addition to its reported targeting of tubulin polymerization.
    Matched MeSH terms: Genomics/methods
  16. Doni F, Suhaimi NSM, Mispan MS, Fathurrahman F, Marzuki BM, Kusmoro J, et al.
    Int J Mol Sci, 2022 Jan 10;23(2).
    PMID: 35054923 DOI: 10.3390/ijms23020737
    Rice, the main staple food for about half of the world's population, has had the growth of its production stagnate in the last two decades. One of the ways to further improve rice production is to enhance the associations between rice plants and the microbiome that exists around, on, and inside the plant. This article reviews recent developments in understanding how microorganisms exert positive influences on plant growth, production, and health, focusing particularly on rice. A variety of microbial species and taxa reside in the rhizosphere and the phyllosphere of plants and also have multiple roles as symbiotic endophytes while living within plant tissues and even cells. They alter the morphology of host plants, enhance their growth, health, and yield, and reduce their vulnerability to biotic and abiotic stresses. The findings of both agronomic and molecular analysis show ways in which microorganisms regulate the growth, physiological traits, and molecular signaling within rice plants. However, many significant scientific questions remain to be resolved. Advancements in high-throughput multi-omics technologies can be used to elucidate mechanisms involved in microbial-rice plant associations. Prospectively, the use of microbial inoculants and associated approaches offers some new, cost-effective, and more eco-friendly practices for increasing rice production.
    Matched MeSH terms: Genomics/methods
  17. Feng S, Stiller J, Deng Y, Armstrong J, Fang Q, Reeve AH, et al.
    Nature, 2020 11;587(7833):252-257.
    PMID: 33177665 DOI: 10.1038/s41586-020-2873-9
    Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity1-4. Sparse taxon sampling has previously been proposed to confound phylogenetic inference5, and captures only a fraction of the genomic diversity. Here we report a substantial step towards the dense representation of avian phylogenetic and molecular diversity, by analysing 363 genomes from 92.4% of bird families-including 267 newly sequenced genomes produced for phase II of the Bird 10,000 Genomes (B10K) Project. We use this comparative genome dataset in combination with a pipeline that leverages a reference-free whole-genome alignment to identify orthologous regions in greater numbers than has previously been possible and to recognize genomic novelties in particular bird lineages. The densely sampled alignment provides a single-base-pair map of selection, has more than doubled the fraction of bases that are confidently predicted to be under conservation and reveals extensive patterns of weak selection in predominantly non-coding DNA. Our results demonstrate that increasing the diversity of genomes used in comparative studies can reveal more shared and lineage-specific variation, and improve the investigation of genomic characteristics. We anticipate that this genomic resource will offer new perspectives on evolutionary processes in cross-species comparative analyses and assist in efforts to conserve species.
    Matched MeSH terms: Genomics/methods*
  18. Graham NS, Hammond JP, Lysenko A, Mayes S, O Lochlainn S, Blasco B, et al.
    Plant Cell, 2014 Jul;26(7):2818-30.
    PMID: 25082855 DOI: 10.1105/tpc.114.128603
    Although Ca transport in plants is highly complex, the overexpression of vacuolar Ca(2+) transporters in crops is a promising new technology to improve dietary Ca supplies through biofortification. Here, we sought to identify novel targets for increasing plant Ca accumulation using genetical and comparative genomics. Expression quantitative trait locus (eQTL) mapping to 1895 cis- and 8015 trans-loci were identified in shoots of an inbred mapping population of Brassica rapa (IMB211 × R500); 23 cis- and 948 trans-eQTLs responded specifically to altered Ca supply. eQTLs were screened for functional significance using a large database of shoot Ca concentration phenotypes of Arabidopsis thaliana. From 31 Arabidopsis gene identifiers tagged to robust shoot Ca concentration phenotypes, 21 mapped to 27 B. rapa eQTLs, including orthologs of the Ca(2+) transporters At-CAX1 and At-ACA8. Two of three independent missense mutants of BraA.cax1a, isolated previously by targeting induced local lesions in genomes, have allele-specific shoot Ca concentration phenotypes compared with their segregating wild types. BraA.CAX1a is a promising target for altering the Ca composition of Brassica, consistent with prior knowledge from Arabidopsis. We conclude that multiple-environment eQTL analysis of complex crop genomes combined with comparative genomics is a powerful technique for novel gene identification/prioritization.
    Matched MeSH terms: Genomics/methods*
  19. Heydari H, Mutha NV, Mahmud MI, Siow CC, Wee WY, Wong GJ, et al.
    Database (Oxford), 2014;2014:bau010.
    PMID: 24578355 DOI: 10.1093/database/bau010
    With the advent of high-throughput sequencing technologies, many staphylococcal genomes have been sequenced. Comparative analysis of these strains will provide better understanding of their biology, phylogeny, virulence and taxonomy, which may contribute to better management of diseases caused by staphylococcal pathogens. We developed StaphyloBase with the goal of having a one-stop genomic resource platform for the scientific community to access, retrieve, download, browse, search, visualize and analyse the staphylococcal genomic data and annotations. We anticipate this resource platform will facilitate the analysis of staphylococcal genomic data, particularly in comparative analyses. StaphyloBase currently has a collection of 754 032 protein-coding sequences (CDSs), 19 258 rRNAs and 15 965 tRNAs from 292 genomes of different staphylococcal species. Information about these features is also included, such as putative functions, subcellular localizations and gene/protein sequences. Our web implementation supports diverse query types and the exploration of CDS- and RNA-type information in detail using an AJAX-based real-time search system. JBrowse has also been incorporated to allow rapid and seamless browsing of staphylococcal genomes. The Pairwise Genome Comparison tool is designed for comparative genomic analysis, for example, to reveal the relationships between two user-defined staphylococcal genomes. A newly designed Pathogenomics Profiling Tool (PathoProT) is also included in this platform to facilitate comparative pathogenomics analysis of staphylococcal strains. In conclusion, StaphyloBase offers access to a range of staphylococcal genomic resources as well as analysis tools for comparative analyses. Database URL: http://staphylococcus.um.edu.my/.
    Matched MeSH terms: Genomics/methods*
  20. Heydari H, Siow CC, Tan MF, Jakubovics NS, Wee WY, Mutha NV, et al.
    PLoS One, 2014;9(1):e86318.
    PMID: 24466021 DOI: 10.1371/journal.pone.0086318
    Corynebacteria are used for a wide variety of industrial purposes but some species are associated with human diseases. With increasing number of corynebacterial genomes having been sequenced, comparative analysis of these strains may provide better understanding of their biology, phylogeny, virulence and taxonomy that may lead to the discoveries of beneficial industrial strains or contribute to better management of diseases. To facilitate the ongoing research of corynebacteria, a specialized central repository and analysis platform for the corynebacterial research community is needed to host the fast-growing amount of genomic data and facilitate the analysis of these data. Here we present CoryneBase, a genomic database for Corynebacterium with diverse functionality for the analysis of genomes aimed to provide: (1) annotated genome sequences of Corynebacterium where 165,918 coding sequences and 4,180 RNAs can be found in 27 species; (2) access to comprehensive Corynebacterium data through the use of advanced web technologies for interactive web interfaces; and (3) advanced bioinformatic analysis tools consisting of standard BLAST for homology search, VFDB BLAST for sequence homology search against the Virulence Factor Database (VFDB), Pairwise Genome Comparison (PGC) tool for comparative genomic analysis, and a newly designed Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomic analysis. CoryneBase offers the access of a range of Corynebacterium genomic resources as well as analysis tools for comparative genomics and pathogenomics. It is publicly available at http://corynebacterium.um.edu.my/.
    Matched MeSH terms: Genomics/methods*
Filters
Contact Us

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links