Displaying publications 21 - 40 of 92 in total

  1. Mat-Sharani S, Firdaus-Raih M
    BMC Bioinformatics, 2019 Feb 04;19(Suppl 13):551.
    PMID: 30717662 DOI: 10.1186/s12859-018-2550-2
    BACKGROUND: Small open reading frames (smORF/sORFs) that encode short protein sequences are often overlooked during the standard gene prediction process thus leading to many sORFs being left undiscovered and/or misannotated. For many genomes, a second round of sORF targeted gene prediction can complement the existing annotation. In this study, we specifically targeted the identification of ORFs encoding for 80 amino acid residues or less from 31 fungal genomes. We then compared the predicted sORFs and analysed those that are highly conserved among the genomes.

    RESULTS: A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.

    CONCLUSIONS: It is evident that numerous open reading frames that could potentially encode polypeptides of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. Our results show that additional targeted reannotation of genomes can complement standard genome annotation to identify sORFs. Because experimental validation is lacking and limited, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.

    Matched MeSH terms: Molecular Sequence Annotation/methods*
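The length-capped ORF search described in entry 1 can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `find_sorfs`, the ATG-only start rule, the three standard stop codons and the `min_codons` floor are all assumptions made for illustration; only the 80-codon ceiling comes from the abstract.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
MAX_CODONS = 80  # ceiling taken from the abstract; everything else is assumed


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sorfs(seq, max_codons=MAX_CODONS, min_codons=10):
    """Scan all six frames for ATG..stop ORFs of at most max_codons codons.

    Returns (strand, start, end, n_codons) tuples. Coordinates for the
    "-" strand refer to the reverse-complemented sequence.
    """
    hits = []
    seq = seq.upper()
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start codon opens the ORF
                elif start is not None and codon in STOP_CODONS:
                    n_codons = (i - start) // 3  # stop codon not counted
                    if min_codons <= n_codons <= max_codons:
                        hits.append((strand, start, i + 3, n_codons))
                    start = None
    return hits
```

A real workflow would also need intron handling and the conservation filter the abstract proposes; this sketch covers only the size criterion.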
  2. Ramachandran H, Shafie NAH, Sudesh K, Azizan MN, Majid MIA, Amirul AA
    Antonie Van Leeuwenhoek, 2018 Mar;111(3):361-372.
    PMID: 29022146 DOI: 10.1007/s10482-017-0958-8
    Bacterial classification on the basis of a polyphasic approach was conducted on three poly(3-hydroxybutyrate-co-4-hydroxybutyrate) [P(3HB-co-4HB)]-accumulating bacterial strains that were isolated from samples collected from Malaysian environments: Kulim Lake, Sg. Pinang river and Sg. Manik paddy field. The Gram-negative, rod-shaped, motile, non-sporulating and non-fermenting bacteria were shown to belong to the genus Cupriavidus of the Betaproteobacteria on the basis of their 16S rRNA gene sequence analyses. The sequence similarity value with their near phylogenetic neighbour, Cupriavidus pauculus LMG3413T, was 98.5%. However, the DNA-DNA hybridization values (8-58%) and ribotyping analysis both enabled these strains to be differentiated from related Cupriavidus species with validly published names. The RiboPrint patterns of the three strains also revealed that the strains were genetically related even though they displayed clonal diversity. The major cellular fatty acids detected in these strains included C15:0 ISO 2OH/C16:1 ω7c, hexadecanoic (16:0) and cis-11-octadecenoic (C18:1 ω7c). Their G+C contents ranged from 68.0 to 68.6 mol%, and their major isoprenoid quinone was Ubiquinone Q-8. Of these three strains, only strain USMAHM13 (= DSM 25816 = KCTC 32390) was discovered to exhibit yellow pigmentation that is characteristic of the carotenoid family. Their assembled genomes also showed that the three strains were not identical in genome size: 7.82, 7.95 and 8.70 Mb for strains USMAHM13, USMAA1020 and USMAA2-4, respectively, all slightly larger than that of Cupriavidus necator H16 (7.42 Mb). The average nucleotide identity (ANI) results indicated that the strains were genetically related and the genome pairs belong to the same species. On the basis of the results obtained in this study, the three strains are considered to represent a novel species for which the name Cupriavidus malaysiensis sp. nov. is proposed. The type strain of the species is USMAA1020T (= DSM 19416T = KCTC 32390T).
    Matched MeSH terms: Molecular Sequence Annotation
  3. Taheri S, Abdullah TL, Rafii MY, Harikrishna JA, Werbrouck SPO, Teo CH, et al.
    Sci Rep, 2019 Feb 28;9(1):3047.
    PMID: 30816255 DOI: 10.1038/s41598-019-39944-2
    Curcuma alismatifolia is widely used as an ornamental plant in Thailand and Cambodia. This herbaceous perennial species from the Zingiberaceae family includes cultivars with a wide range of colours and long postharvest life, and is used as an ornamental cut flower, as a potted plant, and in exterior landscapes. For further genetic improvement, however, little genomic information and no specific molecular markers are available. The present study used Illumina sequencing and de novo transcriptome assembly of two C. alismatifolia cvs, 'Chiang Mai Pink' and 'UB Snow 701', to develop simple sequence repeat markers for genetic diversity studies. After de novo assembly, 62,105 unigenes were generated and 48,813 (78.60%) showed significant similarities versus six functional protein databases. In addition, 9,351 expressed sequence tag-simple sequence repeats (EST-SSRs) were identified with a distribution frequency of 12.5% of total unigenes. Out of 8,955 designed EST-SSR primers, 150 primers were selected for the development of potential molecular markers. Among these markers, 17 EST-SSR markers presented a moderate level of genetic diversity among three C. alismatifolia cultivars, one hybrid, three Curcuma, and two Zingiber species. Three different genetic groups within these species were revealed using EST-SSR markers, indicating that the markers developed in this study can be effectively applied to the population genetic analysis of Curcuma and Zingiber species. This report describes the first analysis of transcriptome data of important ornamental ginger cultivars and also provides a valuable resource for gene discovery and marker development in the genus Curcuma.
    Matched MeSH terms: Molecular Sequence Annotation
  4. Ong WD, Voo LY, Kumar VS
    PLoS One, 2012;7(10):e46937.
    PMID: 23091603 DOI: 10.1371/journal.pone.0046937
    BACKGROUND: Pineapple (Ananas comosus var. comosus) is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanisms and processes underlying fruit ripening would enable scientists to improve quality traits such as flavor, texture, appearance and fruit sweetness. Although the pineapple is an important fruit, insufficient transcriptomic or genomic information is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed.

    METHODOLOGY/PRINCIPAL FINDINGS: To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 million Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against the non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown.

    CONCLUSIONS: The unique transcripts derived from this work have rapidly increased the number of pineapple fruit mRNA transcripts now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple.

    Matched MeSH terms: Molecular Sequence Annotation*
  5. Austin CM, Tan MH, Harrisson KA, Lee YP, Croft LJ, Sunnucks P, et al.
    Gigascience, 2017 08 01;6(8):1-6.
    PMID: 28873963 DOI: 10.1093/gigascience/gix063
    One of the most iconic Australian fish is the Murray cod, Maccullochella peelii (Mitchell 1838), a freshwater species that can grow to ∼1.8 metres in length and live to age ≥48 years. The Murray cod is of conservation concern as a result of strong population contractions, but it is also popular for recreational fishing and is of growing aquaculture interest. In this study, we report the whole genome sequence of the Murray cod to support ongoing population genetics, conservation, and management research, as well as to better understand the evolutionary ecology and history of the species. A draft Murray cod genome of 633 Mbp (N50 = 109 974 bp; BUSCO and CEGMA completeness of 94.2% and 91.9%, respectively) with an estimated 148 Mbp of putative repetitive sequences was assembled from the combined sequencing data of 2 fish individuals with an identical maternal lineage; 47.2 Gb of Illumina HiSeq data and 804 Mb of Nanopore data were generated from the first individual while 23.2 Gb of Illumina MiSeq data were generated from the second individual. The inclusion of Nanopore reads for scaffolding followed by subsequent gap-closing using Illumina data led to a 29% reduction in the number of scaffolds and a 55% and 54% increase in the scaffold and contig N50, respectively. We also report the first transcriptome of Murray cod that was subsequently used to annotate the Murray cod genome, leading to the identification of 26 539 protein-coding genes. We present the whole genome of the Murray cod and anticipate this will be a catalyst for a range of genetic, genomic, and phylogenetic studies of the Murray cod and more generally other fish species of the Percichthyidae family.
    Matched MeSH terms: Molecular Sequence Annotation
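Entry 5 reports scaffold and contig N50 values. As a reminder of what that statistic measures, here is a minimal sketch using the common definition (the length of the sequence at which the running total, taken from longest to shortest, first reaches half the assembly size). The function name `n50` is illustrative, and assemblers may differ in tie-breaking details.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first sequence that brings the total to >= 50%.
        if running * 2 >= total:
            return length
    raise ValueError("n50() needs at least one non-zero length")
```

Because N50 depends only on the length distribution, merging scaffolds (as the Nanopore scaffolding step in the abstract does) raises it without adding sequence.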
  6. Lau YL, Lee WC, Gudimella R, Zhang G, Ching XT, Razali R, et al.
    PLoS One, 2016;11(6):e0157901.
    PMID: 27355363 DOI: 10.1371/journal.pone.0157901
    Toxoplasmosis is a widespread parasitic infection by Toxoplasma gondii, a parasite with at least three distinct clonal lineages. This article reports the whole genome sequencing and de novo assembly of T. gondii RH (type I representative strain), as well as genome-wide comparison across major T. gondii lineages. Genomic DNA was extracted from tachyzoites of T. gondii RH strain and its identity was verified by PCR and LAMP. Subsequently, whole genome sequencing was performed, followed by sequence filtering, genome assembly, gene annotation assignments, clustering of gene orthologs and phylogenetic tree construction. Genome comparison was done with the already archived genomes of T. gondii. From this study, the genome size of T. gondii RH strain was found to be 69.35 Mb, with a mean GC content of 52%. The genome shares high similarity to the archived genomes of T. gondii GT1, ME49 and VEG strains. Nevertheless, 111 genes were found to be unique to T. gondii RH strain. Importantly, unique genes annotated to functions that are potentially critical for T. gondii virulence were found, which may explain the unique phenotypes of this particular strain. This report complements the genomic archive of T. gondii. Data obtained from this study contribute to better understanding of T. gondii and serve as a reference for future studies on this parasite.
    Matched MeSH terms: Molecular Sequence Annotation
  7. Kuan CS, Yew SM, Chan CL, Toh YF, Lee KW, Cheong WH, et al.
    Database (Oxford), 2016;2016.
    PMID: 26980516 DOI: 10.1093/database/baw008
    Many species of dematiaceous fungi are associated with allergic reactions and potentially fatal diseases in humans, especially in tropical climates. Over the past 10 years, we have isolated more than 400 dematiaceous fungi from various clinical samples. In this study, DemaDb, an integrated database, was designed to support the integration and analysis of dematiaceous fungal genomes. A total of 92 072 putative genes and 6527 pathways that were identified in eight dematiaceous fungi (Bipolaris papendorfii UM 226, Daldinia eschscholtzii UM 1400, D. eschscholtzii UM 1020, Pyrenochaeta unguis-hominis UM 256, Ochroconis mirabilis UM 578, Cladosporium sphaerospermum UM 843, Herpotrichiellaceae sp. UM 238 and Pleosporales sp. UM 1110) were deposited in DemaDb. DemaDb includes functional annotations for all predicted gene models in all genomes, such as Gene Ontology, EuKaryotic Orthologous Groups, Kyoto Encyclopedia of Genes and Genomes (KEGG), Pfam and InterProScan. All predicted protein models were further functionally annotated to Carbohydrate-Active enzymes, peptidases, secondary metabolites and virulence factors. The DemaDb Genome Browser enables users to browse and visualize entire genomes with annotation data including gene prediction, structure, orientation and custom feature tracks. The Pathway Browser, based on the KEGG pathway database, allows users to look into molecular interaction and reaction networks for all KEGG annotated genes. The availability of downloadable files containing assembly, nucleic acid, as well as protein data allows direct retrieval for further downstream work. DemaDb is a useful resource for the fungal research community, especially those involved in genome-scale analysis, functional genomics, genetics and disease studies of dematiaceous fungi. Database URL: http://fungaldb.um.edu.my.
    Matched MeSH terms: Molecular Sequence Annotation
  8. Ong WD, Voo CL, Kumar SV
    Mol Biol Rep, 2012 May;39(5):5889-96.
    PMID: 22207174 DOI: 10.1007/s11033-011-1400-3
    Improving the quality of the non-climacteric fruit, pineapple, is possible with information on the expression of genes that occur during the process of fruit ripening. This can be made known through the generation of partial mRNA transcript sequences known as expressed sequence tags (ESTs). ESTs are useful not only for gene discovery but also function as a resource for the identification of molecular markers, such as simple sequence repeats (SSRs). This paper reports on firstly, the construction of a normalized library of the mature green pineapple fruit and secondly, the mining of EST-SSR markers using the newly obtained pineapple ESTs as well as publicly available pineapple ESTs deposited in GenBank. Sequencing of the clones from the EST library resulted in 282 good sequences. Assembly of sequences generated 168 unique transcripts (UTs) consisting of 34 contigs and 134 singletons with an average length of ≈500 bp. Annotation of the UTs categorized the known protein transcripts into the three ontologies as: molecular function (34.88%), biological process (38.43%), and cellular component (26.69%). Approximately 7% (416) of the pineapple ESTs contained SSRs with an abundance of trinucleotide SSRs (48.3%) being identified. This was followed by dinucleotide and tetranucleotide SSRs with frequency of 46 and 57%, respectively. From these EST-containing SSRs, 355 (85.3%) matched to known proteins while 133 contained flanking regions for primer design. Both the sequenced ESTs and the mined EST-SSRs will be useful in the understanding of non-climacteric ripening and the screening of biomarkers linked to fruit quality traits.
    Matched MeSH terms: Molecular Sequence Annotation
  9. Teh SL, Chan WS, Abdullah JO, Namasivayam P
    Mol Biol Rep, 2011 Aug;38(6):3903-9.
    PMID: 21116862 DOI: 10.1007/s11033-010-0506-3
    Vanda Mimi Palmer (VMP) is a highly sought-after fragrant orchid hybrid in Malaysia. It is economically important in the cosmetic and beauty industries and is also a famous potted ornamental plant. To date, no work on fragrance-related genes of vandaceous orchids has been reported by other research groups, although floral fragrances and volatiles have been extensively analysed. An expressed sequence tag (EST) resource was developed for VMP principally to mine any potential fragrance-related expressed sequence tag-simple sequence repeat (EST-SSR) for future development as markers in the identification of fragrant vandaceous orchids endemic to Malaysia. Clustering, annotation and assembling of the ESTs identified 1,196 unigenes which defined 966 singletons and 230 contigs. The VMP dbEST was functionally classified by gene ontology (GO) into three groups: molecular functions (51.2%), cellular components (16.4%) and biological processes (24.6%) while the remaining 7.8% showed no hits with GO identifier. A total of 112 EST-SSRs (9.4%) were mined, in which at least five units of di-, tri-, tetra-, penta-, or hexa-nucleotide repeats were predicted. The di-nucleotide motif repeats appeared to be the most frequent repeats among the detected SSRs with the AT/TA types as the most abundant among the dimerics, while AAG/TTC, AGA/TCT-type were the most frequent trimerics. The mined EST-SSRs are believed to be useful in the development of EST-SSR markers that are applicable in the screening and characterization of fragrance-related transcripts in closely related species.
    Matched MeSH terms: Molecular Sequence Annotation
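Entry 9 mines SSRs defined as at least five units of a di- to hexa-nucleotide motif. The sketch below shows one way such a criterion could be expressed with a regular expression. It is an illustration only, not the tool used in the study; the names `SSR_PATTERN` and `find_ssrs` and the decision to skip homopolymer runs are assumptions.

```python
import re

# A 2-6 nt unit (shortest unit preferred) followed by at least four more
# copies of itself, i.e. five or more tandem units in total.
SSR_PATTERN = re.compile(r"([ACGT]{2,6}?)\1{4,}")


def find_ssrs(seq):
    """Return (start, unit, n_units) for each perfect SSR in seq."""
    hits = []
    for match in SSR_PATTERN.finditer(seq.upper()):
        unit = match.group(1)
        if len(set(unit)) == 1:
            continue  # assumed rule: ignore mononucleotide runs such as AAAA...
        hits.append((match.start(), unit, len(match.group(0)) // len(unit)))
    return hits
```

For example, `find_ssrs("GGG" + "AT" * 6 + "CCC")` reports one AT repeat of six units starting at position 3. Imperfect or compound repeats are outside the scope of this sketch.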
  10. Thio CL, Yusof R, Ashrafzadeh A, Bahari S, Abdul-Rahman PS, Karsani SA
    PLoS One, 2015;10(6):e0129033.
    PMID: 26083627 DOI: 10.1371/journal.pone.0129033
    The Chikungunya virus (CHIKV) is an arthropod-borne virus. In the last 50 years, it has been the cause of numerous outbreaks in tropical and temperate regions worldwide. There is limited understanding regarding the underlying molecular mechanisms involved in CHIKV replication and how the virus interacts with its host. In the present study, comparative proteomics was used to identify secreted host proteins that changed in abundance in response to early CHIKV infection. Two-dimensional gel electrophoresis was used to analyse and compare the secretome profiles of WRL-68 cells infected with CHIKV against mock control WRL-68 cells. The analysis identified 25 regulated proteins in CHIKV infected cells. STRING network analysis was then used to predict biological processes that may be affected by these proteins. The processes predicted to be affected include signal transduction, cellular component and extracellular matrix (ECM) organization, regulation of cytokine stimulus and immune response. These results provide an initial view of how CHIKV may affect the secretome of infected cells during early infection. The results presented here will complement earlier results from the study of late host response. However, functional characterization will be necessary to further enhance our understanding of the roles played by these proteins in the early stages of CHIKV infection in humans.
    Matched MeSH terms: Molecular Sequence Annotation
  11. Appunni S, Rubens M, Ramamoorthy V, Sharma H, Singh AK, Swarup V, et al.
    Malays J Med Sci, 2020 Dec;27(6):53-67.
    PMID: 33447134 DOI: 10.21315/mjms2020.27.6.6
    Background: Ischaemic stroke (IS), a multifactorial neurological disorder, is mediated by interplay between genes and the environment and, thus, blood-based IS biomarkers are of significant clinical value. Therefore, this study aimed to find global differentially expressed genes (DEGs) in-silico, to identify key enriched genes via gene set enrichment analysis (GSEA) and to determine the clinical significance of these genes in IS.

    Methods: Microarray expression dataset GSE22255 was retrieved from the Gene Expression Omnibus (GEO) database. It includes messenger ribonucleic acid (mRNA) expression data for the peripheral blood mononuclear cells of 20 controls and 20 IS patients. The bioconductor-package 'affy' was used to calculate expression and a pairwise t-test was applied to screen DEGs (P < 0.01). Further, GSEA was used to determine the enrichment of DEGs specific to gene ontology (GO) annotations.

    Results: GSEA analysis revealed 21 genes to be significantly plausible gene markers, enriched in multiple pathways among all the DEGs (n = 881). Ten gene sets were found to be core enriched in specific GO annotations. JunD, NCX3 and fibroblast growth factor receptor 4 (FGFR4) were under-represented and glycoprotein M6-B (GPM6B) was persistently over-represented.

    Conclusion: The identified genes are either associated with the pathophysiology of IS or they affect post-IS neuronal regeneration, thereby influencing clinical outcome. These genes should, therefore, be evaluated for their utility as suitable markers for predicting IS in clinical scenarios.

    Matched MeSH terms: Molecular Sequence Annotation
  12. Chin PS, Yu CY, Ang GY, Yin WF, Chan KG
    J Glob Antimicrob Resist, 2017 06;9:41-42.
    PMID: 28300643 DOI: 10.1016/j.jgar.2016.12.017
    OBJECTIVES: Salmonella spp. represent one of the main diarrhoeal pathogens that are transmitted via the food supply chain. Here we report the draft genome sequence of a multidrug-resistant Salmonella enterica serovar Brancaster (PS01) that was isolated from poultry meat in Malaysia.

    METHODS: Genomic DNA was extracted from Salmonella strain PS01 and was sequenced using an Illumina HiSeq 2000 platform. The generated reads were de novo assembled using CLC Genomics Workbench. The draft genome was annotated and the presence of antimicrobial resistance genes was identified.

    RESULTS: The 5 036 442 bp genome contains various antimicrobial resistance genes conferring resistance to aminoglycosides, fluoroquinolones, fosfomycin, macrolides, phenicols, sulphonamides, tetracyclines and trimethoprim. The β-lactamase gene blaTEM-176 encoding TEM-176 was also found in this strain.

    CONCLUSIONS: The genome sequence will aid in the understanding of drug resistance mechanisms in foodborne Salmonella Brancaster and highlights the need to ensure the judicious use of antibiotics in animal husbandry as well as the importance of implementing proper food handling and preparation practices.

    Matched MeSH terms: Molecular Sequence Annotation
  13. Rahman AY, Usharraj AO, Misra BB, Thottathil GP, Jayasekaran K, Feng Y, et al.
    BMC Genomics, 2013;14:75.
    PMID: 23375136 DOI: 10.1186/1471-2164-14-75
    Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876.
    Matched MeSH terms: Molecular Sequence Annotation
  14. Ng YL, Olivos-García A, Lim TK, Noordin R, Lin Q, Othman N
    Am J Trop Med Hyg, 2018 12;99(6):1518-1529.
    PMID: 30298805 DOI: 10.4269/ajtmh.18-0415
    Entamoeba histolytica is a protozoan parasite that causes amebiasis and poses a significant health risk for populations in endemic areas. The molecular mechanisms involved in the pathogenesis and regulation of the parasite are not well characterized. We aimed to identify and quantify the differentially abundant membrane proteins by comparing the membrane proteins of virulent and avirulent variants of E. histolytica HM-1:IMSS, and to investigate the potential associations among the differentially abundant membrane proteins. We performed quantitative proteomics analysis using isobaric tags for relative and absolute quantitation labeling, in combination with two mass spectrometry instruments, that is, nano-liquid chromatography (nanoLC)-matrix-assisted laser desorption/ionization-mass spectrometry/mass spectrometry and nanoLC-electrospray ionization tandem mass spectrometry. Overall, 37 membrane proteins were found to be differentially abundant, whereby 19 and 18 membrane proteins of the virulent variant of E. histolytica increased and decreased in abundance, respectively. Proteins that were differentially abundant include Rho family GTPase, calreticulin, a 70-kDa heat shock protein, and hypothetical proteins. Analysis by Protein ANalysis THrough Evolutionary Relationships database revealed that the differentially abundant membrane proteins were mainly involved in catalytic activities (29.7%) and metabolic processes (32.4%). Differentially abundant membrane proteins that were found to be involved mainly in the catalytic activities and the metabolic processes were highlighted together with their putative roles in relation to the virulence. Further investigations should be performed to elucidate the roles of these proteins in E. histolytica pathogenesis.
    Matched MeSH terms: Molecular Sequence Annotation
  15. Kang WT, Vellasamy KM, Vadivelu J
    Sci Rep, 2016 09 16;6:33528.
    PMID: 27634329 DOI: 10.1038/srep33528
    Burkholderia pseudomallei, the etiological agent for melioidosis, is known to secrete a type III secretion system (TTSS) protein into the host's internal milieu. One of the TTSS effector proteins, BipC, has been shown to play an important role in B. pseudomallei pathogenesis. To identify the host response profile that was directly or indirectly regulated by this protein, a genome-wide transcriptome approach was used to examine the gene expression profiles of infected mice. The transcriptome analysis of the liver and spleen revealed that a total of approximately 1,000 genes were transcriptionally affected by BipC. Genes involved in bacterial invasion, regulation of actin cytoskeleton, and MAPK signalling pathway were over-expressed and may be specifically regulated by BipC in vivo. These results suggest that BipC mainly targets pathways related to the cellular processes which could modulate the cellular trafficking processes. The host transcriptional response exhibited remarkable differences with and without the presence of the BipC protein. Overall, the detailed picture of this study provides new insights that BipC may have evolved to efficiently manipulate host-cell pathways which are crucial in the intracellular lifecycle of B. pseudomallei.
    Matched MeSH terms: Molecular Sequence Annotation
  16. Shettima A, Ishak IH, Abdul Rais SH, Abu Hasan H, Othman N
    PeerJ, 2021;9:e10863.
    PMID: 33717682 DOI: 10.7717/peerj.10863
    Background: Proteomic analyses have broadened the horizons of vector control measures by identifying proteins associated with different biological and physiological processes, giving further insight into mosquito biology, mechanisms of insecticide resistance and pathogen-mosquito interactions. Female Ae. aegypti ingest human blood to acquire the requisite nutrients to make eggs. During blood ingestion, female mosquitoes transmit different pathogens. Therefore, this study aimed to determine the best protein extraction method for mass spectrometry analysis, which will allow better proteome profiling for female mosquitoes.

    Methods: In the present study, two protein extraction methods were used to analyze the female Ae. aegypti proteome: the TCA acetone precipitation extraction method and a commercial protein extraction reagent, CytoBusterTM. Protein identification was then performed by LC-ESI-MS/MS, followed by functional protein annotation analysis.

    Results: The CytoBusterTM reagent gave the highest protein yield, with a mean of 475.90 µg compared with a mean of 283.15 µg for TCA acetone precipitation extraction. LC-ESI-MS/MS identified 1,290 and 890 proteins from the CytoBusterTM reagent and TCA acetone precipitation, respectively. When comparing the protein class categories in both methods, there were three additional categories for proteins identified using the CytoBusterTM reagent. The proteins were related to scaffold/adaptor protein (PC00226), protein binding activity modulator (PC00095) and intercellular signal molecule (PC00207). In conclusion, the CytoBusterTM protein extraction reagent showed better performance for the extraction of proteins in terms of protein yield, proteome coverage and extraction speed.

    Matched MeSH terms: Molecular Sequence Annotation
  17. Chan KL, Tatarinova TV, Rosli R, Amiruddin N, Azizi N, Halim MAA, et al.
    Biol. Direct, 2017 Sep 08;12(1):21.
    PMID: 28886750 DOI: 10.1186/s13062-017-0191-4
    BACKGROUND: Oil palm is an important source of edible oil. The importance of the crop, as well as its long breeding cycle (10-12 years) has led to the sequencing of its genome in 2013 to pave the way for genomics-guided breeding. Nevertheless, the first set of gene predictions, although useful, had many fragmented genes. Classification and characterization of genes associated with traits of interest, such as those for fatty acid biosynthesis and disease resistance, were also limited. Lipid-, especially fatty acid (FA)-related genes are of particular interest for the oil palm as they specify oil yields and quality. This paper presents the characterization of the oil palm genome using different gene prediction methods and comparative genomics analysis, identification of FA biosynthesis and disease resistance genes, and the development of an annotation database and bioinformatics tools.

    RESULTS: Using two independent gene-prediction pipelines, Fgenesh++ and Seqping, 26,059 oil palm genes with transcriptome and RefSeq support were identified from the oil palm genome. These coding regions of the genome have a characteristic broad distribution of GC3 (fraction of cytosine and guanine in the third position of a codon) with over half the GC3-rich genes (GC3 ≥ 0.75286) being intronless. In comparison, only one-seventh of the oil palm genes identified are intronless. Using comparative genomics analysis, characterization of conserved domains and active sites, and expression analysis, 42 key genes involved in FA biosynthesis in oil palm were identified. For three of them, namely EgFABF, EgFABH and EgFAD3, segmental duplication events were detected. Our analysis also identified 210 candidate resistance genes in six classes, grouped by their protein domain structures.

    CONCLUSIONS: We present an accurate and comprehensive annotation of the oil palm genome, focusing on analysis of important categories of genes (GC3-rich and intronless), as well as those associated with important functions, such as FA biosynthesis and disease resistance. The study demonstrated the advantages of having an integrated approach to gene prediction and developed a computational framework for combining multiple genome annotations. These results, available in the oil palm annotation database (http://palmxplore.mpob.gov.my), will provide important resources for studies on the genomes of oil palm and related crops.

    REVIEWERS: This article was reviewed by Alexander Kel, Igor Rogozin, and Vladimir A. Kuznetsov.

    Matched MeSH terms: Molecular Sequence Annotation*
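Entry 17 defines GC3 as the fraction of cytosine and guanine in the third codon position and uses GC3 ≥ 0.75286 to call a gene GC3-rich. A minimal sketch of that calculation follows. The function names `gc3` and `is_gc3_rich` are illustrative, and the input is assumed to be an in-frame coding sequence; only the definition and the threshold come from the abstract.

```python
GC3_RICH_THRESHOLD = 0.75286  # cut-off quoted in the abstract


def gc3(cds):
    """Return the G+C fraction at third codon positions of an in-frame CDS."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    third_bases = cds[2:usable:3]
    if not third_bases:
        raise ValueError("sequence is shorter than one codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)


def is_gc3_rich(cds, threshold=GC3_RICH_THRESHOLD):
    """Apply the abstract's GC3-rich cut-off to a single CDS."""
    return gc3(cds) >= threshold
```

For instance, `gc3("ATGGCCTTA")` looks at the third bases G, C and A and returns 2/3, which falls below the GC3-rich cut-off.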
  18. Hon KW, Ab-Mutalib NS, Abdullah NMA, Jamal R, Abu N
    Sci Rep, 2019 Nov 11;9(1):16497.
    PMID: 31712601 DOI: 10.1038/s41598-019-53063-y
    Chemo-resistance is associated with poor prognosis in colorectal cancer (CRC), with the absence of early biomarker. Exosomes are microvesicles released by body cells for intercellular communication. Circular RNAs (circRNAs) are non-coding RNAs with covalently closed loops and enriched in exosomes. Crosstalk between circRNAs in exosomes and chemo-resistance in CRC remains unknown. This research aims to identify exosomal circRNAs associated with FOLFOX-resistance in CRC. FOLFOX-resistant HCT116 CRC cells (HCT116-R) were generated from parental HCT116 cells (HCT116-P) using periodic drug induction. Exosomes were characterized using transmission electron microscopy (TEM), Zetasizer and Western blot. Our exosomes were translucent cup-shaped structures under TEM with differential expression of TSG101, CD9, and CD63. We performed circRNAs microarray using exosomal RNAs from HCT116-R and HCT116-P cells. We validated our microarray data using serum samples. We performed drug sensitivity assay and cell cycle analysis to characterize selected circRNA after siRNA-knockdown. Using fold change >2 and p 
    Matched MeSH terms: Molecular Sequence Annotation
  19. Tan MH, Austin CM, Hammer MP, Lee YP, Croft LJ, Gan HM
    Gigascience, 2018 03 01;7(3):1-6.
    PMID: 29342277 DOI: 10.1093/gigascience/gix137
    Background: Some of the most widely recognized coral reef fishes are clownfish or anemonefish, members of the family Pomacentridae (subfamily: Amphiprioninae). They are popular aquarium species due to their bright colours, adaptability to captivity, and fascinating behavior. Their breeding biology (sequential hermaphrodites) and symbiotic mutualism with sea anemones have attracted much scientific interest. Moreover, there are some curious geographic-based phenotypes that warrant investigation. Leveraging on the advancement in Nanopore long read technology, we report the first hybrid assembly of the clown anemonefish (Amphiprion ocellaris) genome utilizing Illumina and Nanopore reads, further demonstrating the substantial impact of modest long read sequencing data sets on improving genome assembly statistics.

    Results: We generated 43 Gb of short Illumina reads and 9 Gb of long Nanopore reads, representing approximate genome coverage of 54× and 11×, respectively, based on the range of estimated k-mer-predicted genome sizes of between 791 and 967 Mbp. The final assembled genome is contained in 6404 scaffolds with an accumulated length of 880 Mb (96.3% BUSCO-calculated genome completeness). Compared with the Illumina-only assembly, the hybrid approach generated 94% fewer scaffolds with an 18-fold increase in N50 length (401 kb) and increased the genome completeness by an additional 16%. A total of 27 240 high-quality protein-coding genes were predicted from the clown anemonefish, 26 211 (96%) of which were annotated functionally with information from either sequence homology or protein signature searches.

    Conclusions: We present the first genome of any anemonefish and demonstrate the value of low coverage (∼11×) long Nanopore read sequencing in improving both genome assembly contiguity and completeness. The near-complete assembly of the A. ocellaris genome will be an invaluable molecular resource for supporting a range of genetic, genomic, and phylogenetic studies specifically for clownfish and more generally for other related fish species of the family Pomacentridae.

    Matched MeSH terms: Molecular Sequence Annotation
  20. Dadaev T, Saunders EJ, Newcombe PJ, Anokian E, Leongamornlert DA, Brook MN, et al.
    Nat Commun, 2018 06 11;9(1):2256.
    PMID: 29892050 DOI: 10.1038/s41467-018-04109-8
    Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increases the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling.
    Matched MeSH terms: Molecular Sequence Annotation