Displaying publications 61 - 80 of 92 in total

Abstract:
Sort:
  1. Mohd Salleh F, Ramos-Madrigal J, Peñaloza F, Liu S, Mikkel-Holger SS, Riddhi PP, et al.
    Gigascience, 2017 08 01;6(8):1-8.
    PMID: 28873965 DOI: 10.1093/gigascience/gix053
    Southeast (SE) Asia is 1 of the most biodiverse regions in the world, and it holds approximately 20% of all mammal species. Despite this, the majority of SE Asia's genetic diversity is still poorly characterized. The growing interest in using environmental DNA to assess and monitor SE Asian species, in particular threatened mammals-has created the urgent need to expand the available reference database of mitochondrial barcode and complete mitogenome sequences. We have partially addressed this need by generating 72 new mitogenome sequences reconstructed from DNA isolated from a range of historical and modern tissue samples. Approximately 55 gigabases of raw sequence were generated. From this data, we assembled 72 complete mitogenome sequences, with an average depth of coverage of ×102.9 and ×55.2 for modern samples and historical samples, respectively. This dataset represents 52 species, of which 30 species had no previous mitogenome data available. The mitogenomes were geotagged to their sampling location, where known, to display a detailed geographical distribution of the species. Our new database of 52 taxa will strongly enhance the utility of environmental DNA approaches for monitoring mammals in SE Asia as it greatly increases the likelihoods that identification of metabarcoding sequencing reads can be assigned to reference sequences. This magnifies the confidence in species detections and thus allows more robust surveys and monitoring programmes of SE Asia's threatened mammal biodiversity. The extensive collections of historical samples from SE Asia in western and SE Asian museums should serve as additional valuable material to further enrich this reference database.
    Matched MeSH terms: Molecular Sequence Annotation
  2. da Fonseca RR, Couto A, Machado AM, Brejova B, Albertin CB, Silva F, et al.
    Gigascience, 2020 Jan 01;9(1).
    PMID: 31942620 DOI: 10.1093/gigascience/giz152
    BACKGROUND: The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked.

    FINDINGS: We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.

    CONCLUSIONS: This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.

    Matched MeSH terms: Molecular Sequence Annotation
  3. Tan MH, Austin CM, Hammer MP, Lee YP, Croft LJ, Gan HM
    Gigascience, 2018 03 01;7(3):1-6.
    PMID: 29342277 DOI: 10.1093/gigascience/gix137
    Background: Some of the most widely recognized coral reef fishes are clownfish or anemonefish, members of the family Pomacentridae (subfamily: Amphiprioninae). They are popular aquarium species due to their bright colours, adaptability to captivity, and fascinating behavior. Their breeding biology (sequential hermaphrodites) and symbiotic mutualism with sea anemones have attracted much scientific interest. Moreover, there are some curious geographic-based phenotypes that warrant investigation. Leveraging on the advancement in Nanopore long read technology, we report the first hybrid assembly of the clown anemonefish (Amphiprion ocellaris) genome utilizing Illumina and Nanopore reads, further demonstrating the substantial impact of modest long read sequencing data sets on improving genome assembly statistics.

    Results: We generated 43 Gb of short Illumina reads and 9 Gb of long Nanopore reads, representing approximate genome coverage of 54× and 11×, respectively, based on the range of estimated k-mer-predicted genome sizes of between 791 and 967 Mbp. The final assembled genome is contained in 6404 scaffolds with an accumulated length of 880 Mb (96.3% BUSCO-calculated genome completeness). Compared with the Illumina-only assembly, the hybrid approach generated 94% fewer scaffolds with an 18-fold increase in N50 length (401 kb) and increased the genome completeness by an additional 16%. A total of 27 240 high-quality protein-coding genes were predicted from the clown anemonefish, 26 211 (96%) of which were annotated functionally with information from either sequence homology or protein signature searches.

    Conclusions: We present the first genome of any anemonefish and demonstrate the value of low coverage (∼11×) long Nanopore read sequencing in improving both genome assembly contiguity and completeness. The near-complete assembly of the A. ocellaris genome will be an invaluable molecular resource for supporting a range of genetic, genomic, and phylogenetic studies specifically for clownfish and more generally for other related fish species of the family Pomacentridae.

    Matched MeSH terms: Molecular Sequence Annotation
  4. Austin CM, Tan MH, Harrisson KA, Lee YP, Croft LJ, Sunnucks P, et al.
    Gigascience, 2017 08 01;6(8):1-6.
    PMID: 28873963 DOI: 10.1093/gigascience/gix063
    One of the most iconic Australian fish is the Murray cod, Maccullochella peelii (Mitchell 1838), a freshwater species that can grow to ∼1.8 metres in length and live to age ≥48 years. The Murray cod is of a conservation concern as a result of strong population contractions, but it is also popular for recreational fishing and is of growing aquaculture interest. In this study, we report the whole genome sequence of the Murray cod to support ongoing population genetics, conservation, and management research, as well as to better understand the evolutionary ecology and history of the species. A draft Murray cod genome of 633 Mbp (N50 = 109 974bp; BUSCO and CEGMA completeness of 94.2% and 91.9%, respectively) with an estimated 148 Mbp of putative repetitive sequences was assembled from the combined sequencing data of 2 fish individuals with an identical maternal lineage; 47.2 Gb of Illumina HiSeq data and 804 Mb of Nanopore data were generated from the first individual while 23.2 Gb of Illumina MiSeq data were generated from the second individual. The inclusion of Nanopore reads for scaffolding followed by subsequent gap-closing using Illumina data led to a 29% reduction in the number of scaffolds and a 55% and 54% increase in the scaffold and contig N50, respectively. We also report the first transcriptome of Murray cod that was subsequently used to annotate the Murray cod genome, leading to the identification of 26 539 protein-coding genes. We present the whole genome of the Murray cod and anticipate this will be a catalyst for a range of genetic, genomic, and phylogenetic studies of the Murray cod and more generally other fish species of the Percichthydae family.
    Matched MeSH terms: Molecular Sequence Annotation
  5. Lau YL, Lee WC, Gudimella R, Zhang G, Ching XT, Razali R, et al.
    PLoS One, 2016;11(6):e0157901.
    PMID: 27355363 DOI: 10.1371/journal.pone.0157901
    Toxoplasmosis is a widespread parasitic infection by Toxoplasma gondii, a parasite with at least three distinct clonal lineages. This article reports the whole genome sequencing and de novo assembly of T. gondii RH (type I representative strain), as well as genome-wide comparison across major T. gondii lineages. Genomic DNA was extracted from tachyzoites of T. gondii RH strain and its identity was verified by PCR and LAMP. Subsequently, whole genome sequencing was performed, followed by sequence filtering, genome assembly, gene annotation assignments, clustering of gene orthologs and phylogenetic tree construction. Genome comparison was done with the already archived genomes of T. gondii. From this study, the genome size of T. gondii RH strain was found to be 69.35Mb, with a mean GC content of 52%. The genome shares high similarity to the archived genomes of T. gondii GT1, ME49 and VEG strains. Nevertheless, 111 genes were found to be unique to T. gondii RH strain. Importantly, unique genes annotated to functions that are potentially critical for T. gondii virulence were found, which may explain the unique phenotypes of this particular strain. This report complements the genomic archive of T. gondii. Data obtained from this study contribute to better understanding of T. gondii and serve as a reference for future studies on this parasite.
    Matched MeSH terms: Molecular Sequence Annotation
  6. Baxter JS, Johnson N, Tomczyk K, Gillespie A, Maguire S, Brough R, et al.
    Am J Hum Genet, 2021 Jul 01;108(7):1190-1203.
    PMID: 34146516 DOI: 10.1016/j.ajhg.2021.05.013
    A combination of genetic and functional approaches has identified three independent breast cancer risk loci at 2q35. A recent fine-scale mapping analysis to refine these associations resulted in 1 (signal 1), 5 (signal 2), and 42 (signal 3) credible causal variants at these loci. We used publicly available in silico DNase I and ChIP-seq data with in vitro reporter gene and CRISPR assays to annotate signals 2 and 3. We identified putative regulatory elements that enhanced cell-type-specific transcription from the IGFBP5 promoter at both signals (30- to 40-fold increased expression by the putative regulatory element at signal 2, 2- to 3-fold by the putative regulatory element at signal 3). We further identified one of the five credible causal variants at signal 2, a 1.4 kb deletion (esv3594306), as the likely causal variant; the deletion allele of this variant was associated with an average additional increase in IGFBP5 expression of 1.3-fold (MCF-7) and 2.2-fold (T-47D). We propose a model in which the deletion allele of esv3594306 juxtaposes two transcription factor binding regions (annotated by estrogen receptor alpha ChIP-seq peaks) to generate a single extended regulatory element. This regulatory element increases cell-type-specific expression of the tumor suppressor gene IGFBP5 and, thereby, reduces risk of estrogen receptor-positive breast cancer (odds ratio = 0.77, 95% CI 0.74-0.81, p = 3.1 × 10-31).
    Matched MeSH terms: Molecular Sequence Annotation*
  7. Appasamy SD, Ramlan EI, Firdaus-Raih M
    PLoS One, 2013;8(9):e73984.
    PMID: 24040136 DOI: 10.1371/journal.pone.0073984
    The tertiary motifs in complex RNA molecules play vital roles to either stabilize the formation of RNA 3D structure or to provide important biological functionality to the molecule. In order to better understand the roles of these tertiary motifs in riboswitches, we examined 11 representative riboswitch PDB structures for potential agreement of both motif occurrences and conservations. A total of 61 unique tertiary interactions were found in the reference structures. In addition to the expected common A-minor motifs and base-triples mainly involved in linking distant regions the riboswitch structures three highly conserved variants of A-minor interactions called G-minors were found in the SAM-I and FMN riboswitches where they appear to be involved in the recognition of the respective ligand's functional groups. From our structural survey as well as corresponding structure and sequence alignments, the agreement between motif occurrences and conservations are very prominent across the representative riboswitches. Our analysis provide evidence that some of these tertiary interactions are essential components to form the structure where their sequence positions are conserved despite a high degree of diversity in other parts of the respective riboswitches sequences. This is indicative of a vital role for these tertiary interactions in determining the specific biological function of riboswitch.
    Matched MeSH terms: Molecular Sequence Annotation
  8. Hamdani HY, Appasamy SD, Willett P, Artymiuk PJ, Firdaus-Raih M
    Nucleic Acids Res, 2012 Jul;40(Web Server issue):W35-41.
    PMID: 22661578 DOI: 10.1093/nar/gks513
    Similarities in the 3D patterns of RNA base interactions or arrangements can provide insights into their functions and roles in stabilization of the RNA 3D structure. Nucleic Acids Search for Substructures and Motifs (NASSAM) is a graph theoretical program that can search for 3D patterns of base arrangements by representing the bases as pseudo-atoms. The geometric relationship of the pseudo-atoms to each other as a pattern can be represented as a labeled graph where the pseudo-atoms are the graph's nodes while the edges are the inter-pseudo-atomic distances. The input files for NASSAM are PDB formatted 3D coordinates. This web server can be used to identify matches of base arrangement patterns in a query structure to annotated patterns that have been reported in the literature or that have possible functional and structural stabilization implications. The NASSAM program is freely accessible without any login requirement at http://mfrlab.org/grafss/nassam/.
    Matched MeSH terms: Molecular Sequence Annotation*
  9. Appasamy SD, Hamdani HY, Ramlan EI, Firdaus-Raih M
    Nucleic Acids Res, 2016 Jan 4;44(D1):D266-71.
    PMID: 26553798 DOI: 10.1093/nar/gkv1186
    A major component of RNA structure stabilization are the hydrogen bonded interactions between the base residues. The importance and biological relevance for large clusters of base interactions can be much more easily investigated when their occurrences have been systematically detected, catalogued and compared. In this paper, we describe the database InterRNA (INTERactions in RNA structures database-http://mfrlab.org/interrna/) that contains records of known RNA 3D motifs as well as records for clusters of bases that are interconnected by hydrogen bonds. The contents of the database were compiled from RNA structural annotations carried out by the NASSAM (http://mfrlab.org/grafss/nassam) and COGNAC (http://mfrlab.org/grafss/cognac) computer programs. An analysis of the database content and comparisons with the existing corpus of knowledge regarding RNA 3D motifs clearly show that InterRNA is able to provide an extension of the annotations for known motifs as well as able to provide novel interactions for further investigations.
    Matched MeSH terms: Molecular Sequence Annotation
  10. Emrizal R, Hamdani HY, Firdaus-Raih M
    Int J Mol Sci, 2021 Aug 09;22(16).
    PMID: 34445259 DOI: 10.3390/ijms22168553
    The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures. Such base arrangements and motifs have diverse roles that range from contributions to structural stability to more direct involvement in the molecule's functions, such as the sites for ligand binding and catalytic activity. We review the utility of computational approaches in annotating RNA tertiary base motifs in a dataset of PDB structures, particularly the use of graph theoretical algorithms that can search for such base motifs and annotate them or find and annotate clusters of hydrogen-bond-connected bases. We also demonstrate how such graph theoretical algorithms can be integrated into a workflow that allows for functional analysis and comparisons of base arrangements and sub-structures, such as those involved in ligand binding. The capacity to carry out such automatic curations has led to the discovery of novel motifs and can give new context to known motifs as well as enable the rapid compilation of RNA 3D motifs into a database.
    Matched MeSH terms: Molecular Sequence Annotation*
  11. Mat-Sharani S, Firdaus-Raih M
    BMC Bioinformatics, 2019 Feb 04;19(Suppl 13):551.
    PMID: 30717662 DOI: 10.1186/s12859-018-2550-2
    BACKGROUND: Small open reading frames (smORF/sORFs) that encode short protein sequences are often overlooked during the standard gene prediction process thus leading to many sORFs being left undiscovered and/or misannotated. For many genomes, a second round of sORF targeted gene prediction can complement the existing annotation. In this study, we specifically targeted the identification of ORFs encoding for 80 amino acid residues or less from 31 fungal genomes. We then compared the predicted sORFs and analysed those that are highly conserved among the genomes.

    RESULTS: A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.

    CONCLUSIONS: It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.

    Matched MeSH terms: Molecular Sequence Annotation/methods*
  12. Yahaya B, McLachlan G, McCorquodale C, Collie D
    PLoS One, 2013;8(4):e58930.
    PMID: 23593124 DOI: 10.1371/journal.pone.0058930
    BACKGROUND: Understanding the way in which the airway heals in response to injury is fundamental to dissecting the mechanisms underlying airway disease pathology. As only limited data is available in relation to the in vivo characterisation of the molecular features of repair in the airway we sought to characterise the dynamic changes in gene expression that are associated with the early response to physical injury in the airway wall.

    METHODOLOGY/PRINCIPAL FINDINGS: We profiled gene expression changes in the airway wall using a large animal model of physical injury comprising bronchial brush biopsy in anaesthetised sheep. The experimental design featured sequential studies in the same animals over the course of a week and yielded data relating to the response at 6 hours, and 1, 3 and 7 days after injury. Notable features of the transcriptional response included the early and sustained preponderance of down-regulated genes associated with angiogenesis and immune cell activation, selection and differentiation. Later features of the response included the up-regulation of cell cycle genes at d1 and d3, and the latter pronounced up-regulation of extracellular matrix-related genes at d3 and d7.

    CONCLUSIONS/SIGNIFICANCE: It is possible to follow the airway wall response to physical injury in the same animal over the course of time. Transcriptional changes featured coordinate expression of functionally related genes in a reproducible manner both within and between animals. This characterisation will provide a foundation against which to assess the perturbations that accompany airway disease pathologies of comparative relevance.

    Matched MeSH terms: Molecular Sequence Annotation
  13. Heydari H, Wee WY, Lokanathan N, Hari R, Mohamed Yusoff A, Beh CY, et al.
    PLoS One, 2013;8(4):e62443.
    PMID: 23658631 DOI: 10.1371/journal.pone.0062443
    Mycobacterium abscessus is a rapidly growing non-tuberculous mycobacterial species that has been associated with a wide spectrum of human infections. As the classification and biology of this organism is still not well understood, comparative genomic analysis on members of this species may provide further insights on their taxonomy, phylogeny, pathogenicity and other information that may contribute to better management of infections. The MabsBase described in this paper is a user-friendly database providing access to whole-genome sequences of newly discovered M. abscessus strains as well as resources for whole-genome annotations and computational predictions, to support the expanding scientific community interested in M. abscessus research. The MabsBase is freely available at http://mabscessus.um.edu.my.
    Matched MeSH terms: Molecular Sequence Annotation*
  14. Tan TK, Tan KY, Hari R, Mohamed Yusoff A, Wong GJ, Siow CC, et al.
    Database (Oxford), 2016;2016.
    PMID: 27616775 DOI: 10.1093/database/baw063
    Pangolins (order Pholidota) are the only mammals covered by scales. We have recently sequenced and analyzed the genomes of two critically endangered Asian pangolin species, namely the Malayan pangolin (Manis javanica) and the Chinese pangolin (Manis pentadactyla). These complete genome sequences will serve as reference sequences for future research to address issues of species conservation and to advance knowledge in mammalian biology and evolution. To further facilitate the global research effort in pangolin biology, we developed the Pangolin Genome Database (PGD), as a future hub for hosting pangolin genomic and transcriptomic data and annotations, and with useful analysis tools for the research community. Currently, the PGD provides the reference pangolin genome and transcriptome data, gene sequences and functional information, expressed transcripts, pseudogenes, genomic variations, organ-specific expression data and other useful annotations. We anticipate that the PGD will be an invaluable platform for researchers who are interested in pangolin and mammalian research. We will continue updating this hub by including more data, annotation and analysis tools particularly from our research consortium.Database URL: http://pangolin-genome.um.edu.my.
    Matched MeSH terms: Molecular Sequence Annotation
  15. Sakharkar MK, Kashmir Singh SK, Rajamanickam K, Mohamed Essa M, Yang J, Chidambaram SB
    PLoS One, 2019;14(9):e0220995.
    PMID: 31487305 DOI: 10.1371/journal.pone.0220995
    Parkinson's disease (PD) is an irreversible and incurable multigenic neurodegenerative disorder. It involves progressive loss of mid brain dopaminergic neurons in the substantia nigra pars compacta (SN). We compared brain gene expression profiles with those from the peripheral blood cells of a separate sample of PD patients to identify disease-associated genes. Here, we demonstrate the use of gene expression profiling of brain and blood for detecting valid targets and identifying early PD biomarkers. Implementing this systematic approach, we discovered putative PD risk genes in brain, delineated biological processes and molecular functions that may be particularly disrupted in PD and also identified several putative PD biomarkers in blood. 20 of the differentially expressed genes in SN were also found to be differentially expressed in the blood. Further application of this methodology to other brain regions and neurological disorders should facilitate the discovery of highly reliable and reproducible candidate risk genes and biomarkers for PD. The identification of valid peripheral biomarkers for PD may ultimately facilitate early identification, intervention, and prevention efforts as well.
    Matched MeSH terms: Molecular Sequence Annotation
  16. Hong KW, Asmah Hani AW, Nurul Aina Murni CA, Pusparani RR, Chong CK, Verasahib K, et al.
    Infect Genet Evol, 2017 Oct;54:263-270.
    PMID: 28711373 DOI: 10.1016/j.meegid.2017.07.015
    In this study, we report the comparative genomics and phylogenetic analysis of Corynebacterium diphtheriae strain B-D-16-78 that was isolated from a clinical specimen in 2016. The complete genome of C. diphtheriae strain B-D-16-78 was sequenced using PacBio Single Molecule, Real-Time sequencing technology and consists of a 2,474,151-bp circular chromosome with an average GC content of 53.56%. The core genome of C. diphtheriae was also deduced from a total of 74 strains with complete or draft genome sequences and the core genome-based phylogenetic analysis revealed close genetic relationship among strains that shared the same MLST allelic profile. In the context of CRISPR-Cas system, which confers adaptive immunity against re-invading DNA, 73 out of 86 spacer sequences were found to be unique to Malaysian strains which harboured only type-II-C and/or type-I-E-a systems. A total of 48 tox genes which code for the diphtheria toxin were retrieved from the 74 genomes and with the exception of one truncated gene, only nucleotide substitutions were detected when compared to the tox gene sequence of PW8. More than half were synonymous substitution and only two were nonsynonymous substitutions whereby H24Y was predicted to have a damaging effect on the protein function whilst T262V was predicted to be tolerated. Both toxigenic and non-toxigenic toxin-gene bearing strains have been isolated in Malaysia but the repeated isolation of toxigenic strains with the same MLST profile suggests the possibility of some of these strains may be circulating in the population. Hence, efforts to increase herd immunity should be continued and supported by an effective monitoring and surveillance system to track, manage and control outbreak of cases.
    Matched MeSH terms: Molecular Sequence Annotation
  17. Chin PS, Yu CY, Ang GY, Yin WF, Chan KG
    J Glob Antimicrob Resist, 2017 06;9:41-42.
    PMID: 28300643 DOI: 10.1016/j.jgar.2016.12.017
    OBJECTIVES: Salmonella spp. represent one of the main diarrhoeal pathogens that are transmitted via the food supply chain. Here we report the draft genome sequence of a multidrug-resistant Salmonella enterica serovar Brancaster (PS01) that was isolated from poultry meat in Malaysia.

    METHODS: Genomic DNA was extracted from Salmonella strain PS01 and was sequenced using an Illumina HiSeq 2000 platform. The generated reads were de novo assembled using CLC Genomics Workbench. The draft genome was annotated and the presence of antimicrobial resistance genes was identified.

    RESULTS: The 5 036 442bp genome contains various antimicrobial resistance genes conferring resistance to aminoglycosides, fluoroquinolones, fosfomycin, macrolides, phenicols, sulphonamides, tetracyclines and trimethoprim. The β-lactamase gene blaTEM-176 encoding TEM-176 was also found in this strain.

    CONCLUSIONS: The genome sequence will aid in the understanding of drug resistance mechanisms in foodborne Salmonella Brancaster and highlights the need to ensure the judicious use of antibiotics in animal husbandry as well as the importance of implementing proper food handling and preparation practices.

    Matched MeSH terms: Molecular Sequence Annotation
  18. Yong HS, Song SL, Chua KO, Wayan Suana I, Eamsobhana P, Tan J, et al.
    Sci Rep, 2021 May 21;11(1):10680.
    PMID: 34021208 DOI: 10.1038/s41598-021-90162-1
    Spiders of the genera Nephila and Trichonephila are large orb-weaving spiders. In view of the lack of study on the mitogenome of these genera, and the conflicting systematic status, we sequenced (by next generation sequencing) and annotated the complete mitogenomes of N. pilipes, T. antipodiana and T. vitiana (previously N. vitiana) to determine their features and phylogenetic relationship. Most of the tRNAs have aberrant clover-leaf secondary structure. Based on 13 protein-coding genes (PCGs) and 15 mitochondrial genes (13 PCGs and two rRNA genes), Nephila and Trichonephila form a clade distinctly separated from the other araneid subfamilies/genera. T. antipodiana forms a lineage with T. vitiana in the subclade containing also T. clavata, while N. pilipes forms a sister clade to Trichonephila. The taxon vitiana is therefore a member of the genus Trichonephila and not Nephila as currently recognized. Studies on the mitogenomes of other Nephila and Trichonephila species and related taxa are needed to provide a potentially more robust phylogeny and systematics.
    Matched MeSH terms: Molecular Sequence Annotation
  19. Goh KM, Gan HM, Chan KG, Chan GF, Shahar S, Chong CS, et al.
    PLoS One, 2014;9(6):e90549.
    PMID: 24603481 DOI: 10.1371/journal.pone.0090549
    Species of Anoxybacillus are widespread in geothermal springs, manure, and milk-processing plants. The genus is composed of 22 species and two subspecies, but the relationship between its lifestyle and genome is little understood. In this study, two high-quality draft genomes were generated from Anoxybacillus spp. SK3-4 and DT3-1, isolated from Malaysian hot springs. De novo assembly and annotation were performed, followed by comparative genome analysis with the complete genome of Anoxybacillus flavithermus WK1 and two additional draft genomes, of A. flavithermus TNO-09.006 and A. kamchatkensis G10. The genomes of Anoxybacillus spp. are among the smaller of the family Bacillaceae. Despite having smaller genomes, their essential genes related to lifestyle adaptations at elevated temperature, extreme pH, and protection against ultraviolet are complete. Due to the presence of various competence proteins, Anoxybacillus spp. SK3-4 and DT3-1 are able to take up foreign DNA fragments, and some of these transferred genes are important for the survival of the cells. The analysis of intact putative prophage genomes shows that they are highly diversified. Based on the genome analysis using SEED, many of the annotated sequences are involved in carbohydrate metabolism. The presence of glycosyl hydrolases among the Anoxybacillus spp. was compared, and the potential applications of these unexplored enzymes are suggested here. This is the first study that compares Anoxybacillus genomes from the aspect of lifestyle adaptations, the capacity for horizontal gene transfer, and carbohydrate metabolism.
    Matched MeSH terms: Molecular Sequence Annotation
  20. Forde BM, Ben Zakour NL, Stanton-Cook M, Phan MD, Totsika M, Peters KM, et al.
    PLoS One, 2014;9(8):e104400.
    PMID: 25126841 DOI: 10.1371/journal.pone.0104400
    Escherichia coli ST131 is now recognised as a leading contributor to urinary tract and bloodstream infections in both community and clinical settings. Here we present the complete, annotated genome of E. coli EC958, which was isolated from the urine of a patient presenting with a urinary tract infection in the Northwest region of England and represents the most well characterised ST131 strain. Sequencing was carried out using the Pacific Biosciences platform, which provided sufficient depth and read-length to produce a complete genome without the need for other technologies. The discovery of spurious contigs within the assembly that correspond to site-specific inversions in the tail fibre regions of prophages demonstrates the potential for this technology to reveal dynamic evolutionary mechanisms. E. coli EC958 belongs to the major subgroup of ST131 strains that produce the CTX-M-15 extended spectrum β-lactamase, are fluoroquinolone resistant and encode the fimH30 type 1 fimbrial adhesin. This subgroup includes the Indian strain NA114 and the North American strain JJ1886. A comparison of the genomes of EC958, JJ1886 and NA114 revealed that differences in the arrangement of genomic islands, prophages and other repetitive elements in the NA114 genome are not biologically relevant and are due to misassembly. The availability of a high quality uropathogenic E. coli ST131 genome provides a reference for understanding this multidrug resistant pathogen and will facilitate novel functional, comparative and clinical studies of the E. coli ST131 clonal lineage.
    Matched MeSH terms: Molecular Sequence Annotation
Filters
Contact Us

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links