RESULTS: A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.
CONCLUSIONS: It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.
Methods: Combination of throat and nasal swab specimens was subjected to viral RNA extraction. For screening, the extracted RNA was subjected to real-time RT-PCR targeting upstream of E gene, open reading frame 1b and open reading frame 1a. For confirmation, the RNA was subjected to RT-PCR targeting partial part of the RNA-dependent RNA polymerase and nucleocapsid, followed by amplification of complete N gene region. Nucleotide sequencing of the first Malaysian case of MERS-CoV was performed following the confirmation with real-time RT-PCR detection.
Results: Initial analysis of partial RNA-dependent RNA polymerase and N gene revealed that the nucleotides had high similarity to Jeddah_1_2013 strain. Analysis of complete N gene region (1 242 nucleotides) from the case showed high similarity and yet distinct to the nucleotide sequences of camel-derived MERS-CoV.
Conclusions: From the finding, there are possibilities that the patient acquired the infection from zoonotic transmission from dromedary camels.