RESULTS: A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.
CONCLUSIONS: It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.
METHOD: By using the keywords "acute lymphoblastic leukemia", and "microarray", a total of 280 and 275 microarray datasets were found listed in Gene Expression Omnibus database GEO and ArrayExpress database respectively. Further manual inspection found that only three studies (GSE18497, GSE28460, GSE3910) were focused on gene expression profiling of paired diagnosis-relapsed pediatric B-ALL. These three datasets which comprised of a total of 108 matched diagnosis-relapsed pediatric B-ALL samples were then included for this meta-analysis using RankProd approach.
RESULTS: Our analysis identified a total of 1795 upregulated probes which corresponded to 1527 genes (pfp 1), and 1493 downregulated probes which corresponded to 1214 genes (pfp gene (pfp gene ontology biological process annotation, the upregulated genes were most enriched in cell cycle processes (enrichment score = 15.3), whilst the downregulated genes were clustered in transcription regulation (enrichment score = 12.6). Elevated expression of cell cycle regulators (e.g kinesins, AURKA, CDKs) was the key genetic defect implicated in relapsed ALL, and serve as attractive targets for therapeutic intervention.
CONCLUSION: We identified S100A8 as the most overexpressed gene, and the cell cycle pathway as the most promising biomarker and therapeutic target for relapsed childhood B-ALL. The validity of the results warrants further investigation.
RESULTS: In this study, the alignment analysis based on structural similarity allows the prediction of 48 potential interactions between 27 human RPs and the EBV proteins EBNA1, LMP1, LMP2A, and LMP2B. Gene ontology analysis of the putative protein-protein interactions (PPIs) reveals their probable involvement in RNA binding, ribosome biogenesis, metabolic and biosynthetic processes, and gene regulation. Pathway analysis shows their possible participation in viral infection strategies (viral translation), as well as oncogenesis (Wnt and EGFR signalling pathways). Finally, our molecular docking assay predicts the functional interactions of EBNA1 with four RPs individually: EBNA1-eS10, EBNA1-eS25, EBNA1-uL10 and EBNA1-uL11.
CONCLUSION: These interactions have never been revealed previously via either experimental or in silico approach. We envisage that the calculated interactions between the ribosomal and EBV proteins herein would provide a hypothetical model for future experimental studies on the functional relationship between ribosomal proteins and EBV infection.
RESULTS: A total of 75,350,240 sequence reads were obtained via Hi-seq 2500 sequencing technology. A total of 5473 significant differentially expressed genes were called. Gene ontology functional categorisation showed that cellular process, catalytic activity, and cell part categories had the highest number of expressed genes, while the metabolic pathways category possessed the highest number of expressed genes in the KEGG pathway analysis. The additional sequence dataset will further enrich existing M. fascicularis transcriptome assemblies, and provide a dataset for further downstream studies.