RESULTS: A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.
CONCLUSIONS: It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.
METHOD: By using the keywords "acute lymphoblastic leukemia", and "microarray", a total of 280 and 275 microarray datasets were found listed in Gene Expression Omnibus database GEO and ArrayExpress database respectively. Further manual inspection found that only three studies (GSE18497, GSE28460, GSE3910) were focused on gene expression profiling of paired diagnosis-relapsed pediatric B-ALL. These three datasets which comprised of a total of 108 matched diagnosis-relapsed pediatric B-ALL samples were then included for this meta-analysis using RankProd approach.
RESULTS: Our analysis identified a total of 1795 upregulated probes which corresponded to 1527 genes (pfp 1), and 1493 downregulated probes which corresponded to 1214 genes (pfp gene (pfp gene ontology biological process annotation, the upregulated genes were most enriched in cell cycle processes (enrichment score = 15.3), whilst the downregulated genes were clustered in transcription regulation (enrichment score = 12.6). Elevated expression of cell cycle regulators (e.g kinesins, AURKA, CDKs) was the key genetic defect implicated in relapsed ALL, and serve as attractive targets for therapeutic intervention.
CONCLUSION: We identified S100A8 as the most overexpressed gene, and the cell cycle pathway as the most promising biomarker and therapeutic target for relapsed childhood B-ALL. The validity of the results warrants further investigation.