Ureaplasma parvum colonizes human mucosal surfaces, primarily in the respiratory and urogenital tracts, causing a wide spectrum of diseases, from non-gonococcal urethritis to pneumonitis in immunocompromised hosts. Although the basis for these diverse clinical outcomes is not yet understood, more severe disease may be associated with strains harboring a certain set of strain-specific genes. To investigate this, whole genome DNA macroarrays were constructed and used to assess genomic diversity in 10 U. parvum clinical strains. We found that 7.6% of U. parvum genes were dispersed into one or more strains, thus defining a minimal functional core of 538 U. parvum genes. Most of the strain-specific genes (79%) were of unknown function and were unique to U. parvum. Four hypervariable plasticity regions were identified in the genome containing 93% of the variability in the gene pool (UU32-UU33, UU145-UU170, UU440-UU447 and UU527-UU529). We hypothesized that one of them (UU145-UU170) was a pathogenicity island in U. parvum and we characterized it. Thus, we propose that the clinical outcome of U. parvum infection is probably associated with this newly identified pathogenicity island.
To date, no genome of any of the species from the genus Spiroplasma has been completely sequenced. Long repetitive sequences similar to mobile units present a major obstacle for current genome sequencing technologies. Here, we report the assembly of the Spiroplasma melliferum KC3 genome into 4 contigs, followed by proteogenomic annotation and metabolic reconstruction based on the discovery of 521 expressed proteins and comprehensive metabolomic profiling. A systems approach allowed us to elucidate putative pathogenicity mechanisms and to discover major virulence factors, such as Chitinase utilization enzymes and toxins never before reported for insect pathogenic spiroplasmas.