MyMedR

Displaying publications 1 - 20 of 92 in total

Abstract:

Sort:

Fulltext A draft genome sequence of the elusive giant squid, Architeuthis dux

da Fonseca RR, Couto A, Machado AM, Brejova B, Albertin CB, Silva F, et al.

Gigascience, 2020 Jan 01;9(1).
PMID: 31942620 DOI: 10.1093/gigascience/giz152

BACKGROUND: The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked.
FINDINGS: We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
CONCLUSIONS: This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext A systems biology approach towards the identification of candidate therapeutic genes and potential biomarkers for Parkinson's disease

Sakharkar MK, Kashmir Singh SK, Rajamanickam K, Mohamed Essa M, Yang J, Chidambaram SB

PLoS One, 2019;14(9):e0220995.
PMID: 31487305 DOI: 10.1371/journal.pone.0220995

Parkinson's disease (PD) is an irreversible and incurable multigenic neurodegenerative disorder. It involves progressive loss of mid brain dopaminergic neurons in the substantia nigra pars compacta (SN). We compared brain gene expression profiles with those from the peripheral blood cells of a separate sample of PD patients to identify disease-associated genes. Here, we demonstrate the use of gene expression profiling of brain and blood for detecting valid targets and identifying early PD biomarkers. Implementing this systematic approach, we discovered putative PD risk genes in brain, delineated biological processes and molecular functions that may be particularly disrupted in PD and also identified several putative PD biomarkers in blood. 20 of the differentially expressed genes in SN were also found to be differentially expressed in the blood. Further application of this methodology to other brain regions and neurological disorders should facilitate the discovery of highly reliable and reproducible candidate risk genes and biomarkers for PD. The identification of valid peripheral biomarkers for PD may ultimately facilitate early identification, intervention, and prevention efforts as well.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext An expanded mammal mitogenome dataset from Southeast Asia

Mohd Salleh F, Ramos-Madrigal J, Peñaloza F, Liu S, Mikkel-Holger SS, Riddhi PP, et al.

Gigascience, 2017 08 01;6(8):1-8.
PMID: 28873965 DOI: 10.1093/gigascience/gix053

Southeast (SE) Asia is 1 of the most biodiverse regions in the world, and it holds approximately 20% of all mammal species. Despite this, the majority of SE Asia's genetic diversity is still poorly characterized. The growing interest in using environmental DNA to assess and monitor SE Asian species, in particular threatened mammals-has created the urgent need to expand the available reference database of mitochondrial barcode and complete mitogenome sequences. We have partially addressed this need by generating 72 new mitogenome sequences reconstructed from DNA isolated from a range of historical and modern tissue samples. Approximately 55 gigabases of raw sequence were generated. From this data, we assembled 72 complete mitogenome sequences, with an average depth of coverage of ×102.9 and ×55.2 for modern samples and historical samples, respectively. This dataset represents 52 species, of which 30 species had no previous mitogenome data available. The mitogenomes were geotagged to their sampling location, where known, to display a detailed geographical distribution of the species. Our new database of 52 taxa will strongly enhance the utility of environmental DNA approaches for monitoring mammals in SE Asia as it greatly increases the likelihoods that identification of metabarcoding sequencing reads can be assigned to reference sequences. This magnifies the confidence in species detections and thus allows more robust surveys and monitoring programmes of SE Asia's threatened mammal biodiversity. The extensive collections of historical samples from SE Asia in western and SE Asian museums should serve as additional valuable material to further enrich this reference database.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Analyses of hypomethylated oil palm gene space

Low ET, Rosli R, Jayanthi N, Mohd-Amin AH, Azizi N, Chan KL, et al.

PLoS One, 2014;9(1):e86728.
PMID: 24497974 DOI: 10.1371/journal.pone.0086728

Demand for palm oil has been increasing by an average of ∼8% the past decade and currently accounts for about 59% of the world's vegetable oil market. This drives the need to increase palm oil production. Nevertheless, due to the increasing need for sustainable production, it is imperative to increase productivity rather than the area cultivated. Studies on the oil palm genome are essential to help identify genes or markers that are associated with important processes or traits, such as flowering, yield and disease resistance. To achieve this, 294,115 and 150,744 sequences from the hypomethylated or gene-rich regions of Elaeis guineensis and E. oleifera genome were sequenced and assembled into contigs. An additional 16,427 shot-gun sequences and 176 bacterial artificial chromosomes (BAC) were also generated to check the quality of libraries constructed. Comparison of these sequences revealed that although the methylation-filtered libraries were sequenced at low coverage, they still tagged at least 66% of the RefSeq supported genes in the BAC and had a filtration power of at least 2.0. A total 33,752 microsatellites and 40,820 high-quality single nucleotide polymorphism (SNP) markers were identified. These represent the most comprehensive collection of microsatellites and SNPs to date and would be an important resource for genetic mapping and association studies. The gene models predicted from the assembled contigs were mined for genes of interest, and 242, 65 and 14 oil palm transcription factors, resistance genes and miRNAs were identified respectively. Examples of the transcriptional factors tagged include those associated with floral development and tissue culture, such as homeodomain proteins, MADS, Squamosa and Apetala2. The E. guineensis and E. oleifera hypomethylated sequences provide an important resource to understand the molecular mechanisms associated with important agronomic traits in oil palm.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Analysis of anoxybacillus genomes from the aspects of lifestyle adaptations, prophage diversity, and carbohydrate metabolism

Goh KM, Gan HM, Chan KG, Chan GF, Shahar S, Chong CS, et al.

PLoS One, 2014;9(6):e90549.
PMID: 24603481 DOI: 10.1371/journal.pone.0090549

Species of Anoxybacillus are widespread in geothermal springs, manure, and milk-processing plants. The genus is composed of 22 species and two subspecies, but the relationship between its lifestyle and genome is little understood. In this study, two high-quality draft genomes were generated from Anoxybacillus spp. SK3-4 and DT3-1, isolated from Malaysian hot springs. De novo assembly and annotation were performed, followed by comparative genome analysis with the complete genome of Anoxybacillus flavithermus WK1 and two additional draft genomes, of A. flavithermus TNO-09.006 and A. kamchatkensis G10. The genomes of Anoxybacillus spp. are among the smaller of the family Bacillaceae. Despite having smaller genomes, their essential genes related to lifestyle adaptations at elevated temperature, extreme pH, and protection against ultraviolet are complete. Due to the presence of various competence proteins, Anoxybacillus spp. SK3-4 and DT3-1 are able to take up foreign DNA fragments, and some of these transferred genes are important for the survival of the cells. The analysis of intact putative prophage genomes shows that they are highly diversified. Based on the genome analysis using SEED, many of the annotated sequences are involved in carbohydrate metabolism. The presence of glycosyl hydrolases among the Anoxybacillus spp. was compared, and the potential applications of these unexplored enzymes are suggested here. This is the first study that compares Anoxybacillus genomes from the aspect of lifestyle adaptations, the capacity for horizontal gene transfer, and carbohydrate metabolism.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Analysis of the leaf transcriptome of Musa acuminata during interaction with Mycosphaerella musicola: gene assembly, annotation and marker development

Passos MA, de Cruz VO, Emediato FL, de Teixeira CC, Azevedo VC, Brasileiro AC, et al.

BMC Genomics, 2013 Feb 05;14:78.
PMID: 23379821 DOI: 10.1186/1471-2164-14-78

BACKGROUND: Although banana (Musa sp.) is an important edible crop, contributing towards poverty alleviation and food security, limited transcriptome datasets are available for use in accelerated molecular-based breeding in this genus. 454 GS-FLX Titanium technology was employed to determine the sequence of gene transcripts in genotypes of Musa acuminata ssp. burmannicoides Calcutta 4 and M. acuminata subgroup Cavendish cv. Grande Naine, contrasting in resistance to the fungal pathogen Mycosphaerella musicola, causal organism of Sigatoka leaf spot disease. To enrich for transcripts under biotic stress responses, full length-enriched cDNA libraries were prepared from whole plant leaf materials, both uninfected and artificially challenged with pathogen conidiospores.
RESULTS: The study generated 846,762 high quality sequence reads, with an average length of 334 bp and totalling 283 Mbp. De novo assembly generated 36,384 and 35,269 unigene sequences for M. acuminata Calcutta 4 and Cavendish Grande Naine, respectively. A total of 64.4% of the unigenes were annotated through Basic Local Alignment Search Tool (BLAST) similarity analyses against public databases.Assembled sequences were functionally mapped to Gene Ontology (GO) terms, with unigene functions covering a diverse range of molecular functions, biological processes and cellular components. Genes from a number of defense-related pathways were observed in transcripts from each cDNA library. Over 99% of contig unigenes mapped to exon regions in the reference M. acuminata DH Pahang whole genome sequence. A total of 4068 genic-SSR loci were identified in Calcutta 4 and 4095 in Cavendish Grande Naine. A subset of 95 potential defense-related gene-derived simple sequence repeat (SSR) loci were validated for specific amplification and polymorphism across M. acuminata accessions. Fourteen loci were polymorphic, with alleles per polymorphic locus ranging from 3 to 8 and polymorphism information content ranging from 0.34 to 0.82.
CONCLUSIONS: A large set of unigenes were characterized in this study for both M. acuminata Calcutta 4 and Cavendish Grande Naine, increasing the number of public domain Musa ESTs. This transcriptome is an invaluable resource for furthering our understanding of biological processes elicited during biotic stresses in Musa. Gene-based markers will facilitate molecular breeding strategies, forming the basis of genetic linkage mapping and analysis of quantitative trait loci.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Annotated genome sequence of Mycobacterium massiliense strain M154, belonging to the recently created taxon Mycobacterium abscessus subsp. bolletii comb. nov

Choo SW, Wong YL, Tan JL, Ong CS, Wong GJ, Ng KP, et al.

J Bacteriol, 2012 Sep;194(17):4778.
PMID: 22887675 DOI: 10.1128/JB.01043-12

Mycobacterium massiliense has recently been proposed as a member of Mycobacterium abscessus subsp. bolletii comb. nov. Strain M154, a clinical isolate from the bronchoalveolar lavage fluid of a Malaysian patient presenting with lower respiratory tract infection, was subjected to shotgun DNA sequencing with the Illumina sequencing technology to obtain whole-genome sequence data for comparison with other genetically related strains within the M. abscessus species complex.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Announcing the Genome Atlas of Bamboo and Rattan (GABR) project: promoting research in evolution and in economically and ecologically beneficial plants

Zhao H, Zhao S, International Network for Bamboo and Rattan, Fei B, Liu H, Yang H, et al.

Gigascience, 2017 07 01;6(7):1-7.
PMID: 28637269 DOI: 10.1093/gigascience/gix046

Bamboo and rattan are widely grown for manufacturing, horticulture, and agroforestry. Bamboo and rattan production might help reduce poverty, boost economic growth, mitigate climate change, and protect the natural environment. Despite progress in research, sufficient molecular and genomic resources to study these species are lacking. We launched the Genome Atlas of Bamboo and Rattan (GABR) project, a comprehensive, coordinated international effort to accelerate understanding of bamboo and rattan genetics through genome analysis. GABR includes 2 core subprojects: Bamboo-T1K (Transcriptomes of 1000 Bamboos) and Rattan-G5 (Genomes of 5 Rattans), and several other subprojects. Here we describe the organization, directions, and status of GABR.

Matched MeSH terms: Molecular Sequence Annotation
Application of Spiroplasma melliferum proteogenomic profiling for the discovery of virulence factors and pathogenicity mechanisms in host-associated spiroplasmas

Alexeev D, Kostrjukova E, Aliper A, Popenko A, Bazaleev N, Tyakht A, et al.

J Proteome Res, 2012 Jan 1;11(1):224-36.
PMID: 22129229 DOI: 10.1021/pr2008626

To date, no genome of any of the species from the genus Spiroplasma has been completely sequenced. Long repetitive sequences similar to mobile units present a major obstacle for current genome sequencing technologies. Here, we report the assembly of the Spiroplasma melliferum KC3 genome into 4 contigs, followed by proteogenomic annotation and metabolic reconstruction based on the discovery of 521 expressed proteins and comprehensive metabolomic profiling. A systems approach allowed us to elucidate putative pathogenicity mechanisms and to discover major virulence factors, such as Chitinase utilization enzymes and toxins never before reported for insect pathogenic spiroplasmas.

Matched MeSH terms: Molecular Sequence Annotation
Bursal transcriptome profiling of different inbred chicken lines reveals key differentially expressed genes at 3 days post-infection with very virulent infectious bursal disease virus

Farhanah MI, Yasmin AR, Mat Isa N, Hair-Bejo M, Ideris A, Powers C, et al.

J Gen Virol, 2018 Jan;99(1):21-35.
PMID: 29058656 DOI: 10.1099/jgv.0.000956

Infectious bursal disease is a highly contagious disease in the poultry industry and causes immunosuppression in chickens. Genome-wide regulations of immune response genes of inbred chickens with different genetic backgrounds, following very virulent infectious bursal disease virus (vvIBDV) infection are poorly characterized. Therefore, this study aims to analyse the bursal tissue transcriptome of six inbred chicken lines 6, 7, 15, N, O and P following infection with vvIBDV strain UK661 using strand-specific next-generation sequencing, by highlighting important genes and pathways involved in the infected chicken during peak infection at 3 days post-infection. All infected chickens succumbed to the infection without major variations among the different lines. However, based on the viral loads and bursal lesion scoring, lines P and 6 can be considered as the most susceptible lines, while lines 15 and N were regarded as the least affected lines. Transcriptome profiling of the bursa identified 4588 genes to be differentially expressed, with 2985 upregulated and 1642 downregulated genes, in which these genes were commonly or uniquely detected in all or several infected lines. Genes that were upregulated are primarily pro-inflammatory cytokines, chemokines and IFN-related. Various genes that are associated with B-cell functions and genes related to apoptosis were downregulated, together with the genes involved in p53 signalling. In conclusion, bursal transcriptome profiles of different inbred lines showed differential expressions of pro-inflammatory cytokines and chemokines, Th1 cytokines, JAK-STAT signalling genes, MAPK signalling genes, and their related pathways following vvIBDV infection.

Matched MeSH terms: Molecular Sequence Annotation
CATH-Gene3D: Generation of the Resource and Its Use in Obtaining Structural and Functional Annotations for Protein Sequences

Dawson NL, Sillitoe I, Lees JG, Lam SD, Orengo CA

Methods Mol Biol, 2017;1558:79-110.
PMID: 28150234 DOI: 10.1007/978-1-4939-6783-4_4

This chapter describes the generation of the data in the CATH-Gene3D online resource and how it can be used to study protein domains and their evolutionary relationships. Methods will be presented for: comparing protein structures, recognizing homologs, predicting domain structures within protein sequences, and subclassifying superfamilies into functionally pure families, together with a guide on using the webpages.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext CATH: increased structural coverage of functional space

Sillitoe I, Bordin N, Dawson N, Waman VP, Ashford P, Scholes HM, et al.

Nucleic Acids Res, 2021 Jan 08;49(D1):D266-D273.
PMID: 33237325 DOI: 10.1093/nar/gkaa1079

CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext COGNAC: a web server for searching and annotating hydrogen-bonded base interactions in RNA three-dimensional structures

Firdaus-Raih M, Hamdani HY, Nadzirin N, Ramlan EI, Willett P, Artymiuk PJ

Nucleic Acids Res, 2014 Jul;42(Web Server issue):W382-8.
PMID: 24831543 DOI: 10.1093/nar/gku438

Hydrogen bonds are crucial factors that stabilize a complex ribonucleic acid (RNA) molecule's three-dimensional (3D) structure. Minute conformational changes can result in variations in the hydrogen bond interactions in a particular structure. Furthermore, networks of hydrogen bonds, especially those found in tight clusters, may be important elements in structure stabilization or function and can therefore be regarded as potential tertiary motifs. In this paper, we describe a graph theoretical algorithm implemented as a web server that is able to search for unbroken networks of hydrogen-bonded base interactions and thus provide an accounting of such interactions in RNA 3D structures. This server, COGNAC (COnnection tables Graphs for Nucleic ACids), is also able to compare the hydrogen bond networks between two structures and from such annotations enable the mapping of atomic level differences that may have resulted from conformational changes due to mutations or binding events. The COGNAC server can be accessed at http://mfrlab.org/grafss/cognac.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Characterisation of full-length cDNA sequences provides insights into the Eimeria tenella transcriptome

Amiruddin N, Lee XW, Blake DP, Suzuki Y, Tay YL, Lim LS, et al.

BMC Genomics, 2012 Jan 13;13:21.
PMID: 22244352 DOI: 10.1186/1471-2164-13-21

BACKGROUND: Eimeria tenella is an apicomplexan parasite that causes coccidiosis in the domestic fowl. Infection with this parasite is diagnosed frequently in intensively reared poultry and its control is usually accorded a high priority, especially in chickens raised for meat. Prophylactic chemotherapy has been the primary method used for the control of coccidiosis. However, drug efficacy can be compromised by drug-resistant parasites and the lack of new drugs highlights demands for alternative control strategies including vaccination. In the long term, sustainable control of coccidiosis will most likely be achieved through integrated drug and vaccination programmes. Characterisation of the E. tenella transcriptome may provide a better understanding of the biology of the parasite and aid in the development of a more effective control for coccidiosis.
RESULTS: More than 15,000 partial sequences were generated from the 5' and 3' ends of clones randomly selected from an E. tenella second generation merozoite full-length cDNA library. Clustering of these sequences produced 1,529 unique transcripts (UTs). Based on the transcript assembly and subsequently primer walking, 433 full-length cDNA sequences were successfully generated. These sequences varied in length, ranging from 441 bp to 3,083 bp, with an average size of 1,647 bp. Simple sequence repeat (SSR) analysis identified CAG as the most abundant trinucleotide motif, while codon usage analysis revealed that the ten most infrequently used codons in E. tenella are UAU, UGU, GUA, CAU, AUA, CGA, UUA, CUA, CGU and AGU. Subsequent analysis of the E. tenella complete coding sequences identified 25 putative secretory and 60 putative surface proteins, all of which are now rational candidates for development as recombinant vaccines or drug targets in the effort to control avian coccidiosis.
CONCLUSIONS: This paper describes the generation and characterisation of full-length cDNA sequences from E. tenella second generation merozoites and provides new insights into the E. tenella transcriptome. The data generated will be useful for the development and validation of diagnostic and control strategies for coccidiosis and will be of value in annotation of the E. tenella genome sequence.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Comparative analyses identify genomic features potentially involved in the evolution of birds-of-paradise

Prost S, Armstrong EE, Nylander J, Thomas GWC, Suh A, Petersen B, et al.

Gigascience, 2019 May 01;8(5).
PMID: 30689847 DOI: 10.1093/gigascience/giz003

The diverse array of phenotypes and courtship displays exhibited by birds-of-paradise have long fascinated scientists and nonscientists alike. Remarkably, almost nothing is known about the genomics of this iconic radiation. There are 41 species in 16 genera currently recognized within the birds-of-paradise family (Paradisaeidae), most of which are endemic to the island of New Guinea. In this study, we sequenced genomes of representatives from all five major clades within this family to characterize genomic changes that may have played a role in the evolution of the group's extensive phenotypic diversity. We found genes important for coloration, morphology, and feather and eye development to be under positive selection. In birds-of-paradise with complex lekking systems and strong sexual dimorphism, the core birds-of-paradise, we found Gene Ontology categories for "startle response" and "olfactory receptor activity" to be enriched among the gene families expanding significantly faster compared to the other birds in our study. Furthermore, we found novel families of retrovirus-like retrotransposons active in all three de novo genomes since the early diversification of the birds-of-paradise group, which might have played a role in the evolution of this fascinating group of birds.

Matched MeSH terms: Molecular Sequence Annotation
Comparative genomic and phylogenetic analysis of a toxigenic clinical isolate of Corynebacterium diphtheriae strain B-D-16-78 from Malaysia

Hong KW, Asmah Hani AW, Nurul Aina Murni CA, Pusparani RR, Chong CK, Verasahib K, et al.

Infect Genet Evol, 2017 Oct;54:263-270.
PMID: 28711373 DOI: 10.1016/j.meegid.2017.07.015

In this study, we report the comparative genomics and phylogenetic analysis of Corynebacterium diphtheriae strain B-D-16-78 that was isolated from a clinical specimen in 2016. The complete genome of C. diphtheriae strain B-D-16-78 was sequenced using PacBio Single Molecule, Real-Time sequencing technology and consists of a 2,474,151-bp circular chromosome with an average GC content of 53.56%. The core genome of C. diphtheriae was also deduced from a total of 74 strains with complete or draft genome sequences and the core genome-based phylogenetic analysis revealed close genetic relationship among strains that shared the same MLST allelic profile. In the context of CRISPR-Cas system, which confers adaptive immunity against re-invading DNA, 73 out of 86 spacer sequences were found to be unique to Malaysian strains which harboured only type-II-C and/or type-I-E-a systems. A total of 48 tox genes which code for the diphtheria toxin were retrieved from the 74 genomes and with the exception of one truncated gene, only nucleotide substitutions were detected when compared to the tox gene sequence of PW8. More than half were synonymous substitution and only two were nonsynonymous substitutions whereby H24Y was predicted to have a damaging effect on the protein function whilst T262V was predicted to be tolerated. Both toxigenic and non-toxigenic toxin-gene bearing strains have been isolated in Malaysia but the repeated isolation of toxigenic strains with the same MLST profile suggests the possibility of some of these strains may be circulating in the population. Hence, efforts to increase herd immunity should be continued and supported by an effective monitoring and surveillance system to track, manage and control outbreak of cases.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Comparative sequence and structure analysis reveals the conservation and diversity of nucleotide positions and their associated tertiary interactions in the riboswitches

Appasamy SD, Ramlan EI, Firdaus-Raih M

PLoS One, 2013;8(9):e73984.
PMID: 24040136 DOI: 10.1371/journal.pone.0073984

The tertiary motifs in complex RNA molecules play vital roles to either stabilize the formation of RNA 3D structure or to provide important biological functionality to the molecule. In order to better understand the roles of these tertiary motifs in riboswitches, we examined 11 representative riboswitch PDB structures for potential agreement of both motif occurrences and conservations. A total of 61 unique tertiary interactions were found in the reference structures. In addition to the expected common A-minor motifs and base-triples mainly involved in linking distant regions the riboswitch structures three highly conserved variants of A-minor interactions called G-minors were found in the SAM-I and FMN riboswitches where they appear to be involved in the recognition of the respective ligand's functional groups. From our structural survey as well as corresponding structure and sequence alignments, the agreement between motif occurrences and conservations are very prominent across the representative riboswitches. Our analysis provide evidence that some of these tertiary interactions are essential components to form the structure where their sequence positions are conserved despite a high degree of diversity in other parts of the respective riboswitches sequences. This is indicative of a vital role for these tertiary interactions in determining the specific biological function of riboswitch.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Comparative transcriptome analysis to identify candidate genes involved in 2-methoxy-1,4-naphthoquinone (MNQ) biosynthesis in Impatiens balsamina L

Foong LC, Chai JY, Ho ASH, Yeo BPH, Lim YM, Tam SM

Sci Rep, 2020 09 30;10(1):16123.
PMID: 32999341 DOI: 10.1038/s41598-020-72997-2

Impatiens balsamina L. is a tropical ornamental and traditional medicinal herb rich in natural compounds, especially 2-methoxy-1,4-naphthoquinone (MNQ) which is a bioactive compound with tested anticancer activities. Characterization of key genes involved in the shikimate and 1,4-dihydroxy-2-naphthoate (DHNA) pathways responsible for MNQ biosynthesis and their expression profiles in I. balsamina will facilitate adoption of genetic/metabolic engineering or synthetic biology approaches to further increase production for pre-commercialization. In this study, HPLC analysis showed that MNQ was present in significantly higher quantities in the capsule pericarps throughout three developmental stages (early-, mature- and postbreaker stages) whilst its immediate precursor, 2-hydroxy-1,4-naphthoquinone (lawsone) was mainly detected in mature leaves. Transcriptomes of I. balsamina derived from leaf, flower, and three capsule developmental stages were generated, totalling 59.643 Gb of raw reads that were assembled into 94,659 unigenes (595,828 transcripts). A total of 73.96% of unigenes were functionally annotated against seven public databases and 50,786 differentially expressed genes (DEGs) were identified. Expression profiles of 20 selected genes from four major secondary metabolism pathways were studied and validated using qRT-PCR method. Majority of the DHNA pathway genes were found to be significantly upregulated in early stage capsule compared to flower and leaf, suggesting tissue-specific synthesis of MNQ. Correlation analysis identified 11 candidate unigenes related to three enzymes (NADH-quinone oxidoreductase, UDP-glycosyltransferases and S-adenosylmethionine-dependent O-methyltransferase) important in the final steps of MNQ biosynthesis based on genes expression profiles consistent with MNQ content. This study provides the first molecular insight into the dynamics of MNQ biosynthesis and accumulation across different tissues of I. balsamina and serves as a valuable resource to facilitate further manipulation to increase production of MNQ.

Matched MeSH terms: Molecular Sequence Annotation/methods
Fulltext Comparison of eight complete chloroplast genomes of the endangered Aquilaria tree species (Thymelaeaceae) and their phylogenetic relationships

Hishamuddin MS, Lee SY, Ng WL, Ramlee SI, Lamasudin DU, Mohamed R

Sci Rep, 2020 Aug 03;10(1):13034.
PMID: 32747724 DOI: 10.1038/s41598-020-70030-0

Aquilaria tree species are naturally distributed in the Indomalesian region and are protected against over-exploitation. They produce a fragrant non-timber product of high economic value, agarwood. Ambiguous species delimitation and limited genetic information within Aquilaria are among the impediments to conservation efforts. In this study, we conducted comparative analysis on eight Aquilaria species complete chloroplast (cp) genomes, of which seven were newly sequenced using Illumina HiSeq X Ten platform followed by de novo assembly. Aquilaria cp genomes possess a typical quadripartite structure including gene order and genomic structure. The length of each of the cp genome is about 174 kbp and encoded between 89 and 92 proteins, 38 tRNAs, and 8 rRNAs, with 27 duplicated in the IR (inverted repeat) region. Besides, 832 repeats (forward, reverse, palindrome and complement repeats) and nine highly variable regions were also identified. The phylogenetic analysis suggests that the topology structure of Aquilaria cp genomes were well presented with strong support values based on the cp genomes data set and matches their geographic distribution pattern. In summary, the complete cp genomes will facilitate development of species-specific molecular tools to discriminate Aquilaria species and resolve the evolutionary relationships of members of the Thymelaeaceae family.

Matched MeSH terms: Molecular Sequence Annotation
Fulltext Complete mitochondrial genomes and phylogenetic relationships of the genera Nephila and Trichonephila (Araneae, Araneoidea)

Yong HS, Song SL, Chua KO, Wayan Suana I, Eamsobhana P, Tan J, et al.

Sci Rep, 2021 May 21;11(1):10680.
PMID: 34021208 DOI: 10.1038/s41598-021-90162-1

Spiders of the genera Nephila and Trichonephila are large orb-weaving spiders. In view of the lack of study on the mitogenome of these genera, and the conflicting systematic status, we sequenced (by next generation sequencing) and annotated the complete mitogenomes of N. pilipes, T. antipodiana and T. vitiana (previously N. vitiana) to determine their features and phylogenetic relationship. Most of the tRNAs have aberrant clover-leaf secondary structure. Based on 13 protein-coding genes (PCGs) and 15 mitochondrial genes (13 PCGs and two rRNA genes), Nephila and Trichonephila form a clade distinctly separated from the other araneid subfamilies/genera. T. antipodiana forms a lineage with T. vitiana in the subclade containing also T. clavata, while N. pilipes forms a sister clade to Trichonephila. The taxon vitiana is therefore a member of the genus Trichonephila and not Nephila as currently recognized. Studies on the mitogenomes of other Nephila and Trichonephila species and related taxa are needed to provide a potentially more robust phylogeny and systematics.

Matched MeSH terms: Molecular Sequence Annotation

Filters

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links