A molecular epidemiological investigation conducted among injecting drug users in eastern Peninsular Malaysia in 2007 identified a cluster of sequences (n = 3) located outside any known HIV-1 genotype. Analyses of near full-length nucleotide sequences of these strains from individuals with no recognizable linkage revealed that they have an identical subtype structure comprised of CRF01_AE and subtype B', distinct from any known circulating recombinant forms (CRFs). This novel CRF, designated CRF48_01B, is closely related to CRF33_01B, previously identified in Kuala Lumpur. Phylogenetic analysis of multiple CRF48_01B genome regions showed that CRF48_01B forms a monophyletic cluster within CRF33_01B, suggesting that this new recombinant is very likely a descendant of CRF33_01B. CRF48_01B thus represents one of the first examples of a "second-generation" CRF, generated by additional crossover with pre-existing CRFs. Corroborating these results, Bayesian molecular clock analyses indicated that CRF48_01B emerged in approximately 2001, approximately approximately 8 years after the emergence of CRF33_01B.
In the search for natural reservoirs of hepatitis C virus (HCV), a broad diversity of non-human viruses within the Hepacivirus genus has been uncovered. However, the evolutionary dynamics that shaped the diversity and timescale of hepaciviruses evolution remain elusive. To gain further insights into the origins and evolution of this genus, we screened a large dataset of wild mammal samples (n = 1,672) from Africa and Asia, and generated 34 full-length hepacivirus genomes. Phylogenetic analysis of these data together with publicly available genomes emphasizes the importance of rodents as hepacivirus hosts and we identify 13 rodent species and 3 rodent genera (in Cricetidae and Muridae families) as novel hosts of hepaciviruses. Through co-phylogenetic analyses, we demonstrate that hepacivirus diversity has been affected by cross-species transmission events against the backdrop of detectable signal of virus-host co-divergence in the deep evolutionary history. Using a Bayesian phylogenetic multidimensional scaling approach, we explore the extent to which host relatedness and geographic distances have structured present-day hepacivirus diversity. Our results provide evidence for a substantial structuring of mammalian hepacivirus diversity by host as well as geography, with a somewhat more irregular diffusion process in geographic space. Finally, using a mechanistic model that accounts for substitution saturation, we provide the first formal estimates of the timescale of hepacivirus evolution and estimate the origin of the genus to be about 22 million years ago. Our results offer a comprehensive overview of the micro- and macroevolutionary processes that have shaped hepacivirus diversity and enhance our understanding of the long-term evolution of the Hepacivirus genus.