
  1. Sivaprakasam S, Mohd Azim Khan NA, Yee Fan T, Kumarasan Y, Sicheritz-Pontén T, Petersen B, et al.
    Microbiol Resour Announc, 2024 Apr 11;13(4):e0113723.
    PMID: 38506531 DOI: 10.1128/mra.01137-23
    Here, we present the complete genome of a plant growth-promoting strain, Bacillus stratosphericus AIMST-CREST02 isolated from the bulk soil of a high-yielding paddy plot. The genome is 3,840,451 bp in size with a GC content of 41.25%. Annotation predicted the presence of 3,907 coding sequences, including genes involved in auxin biosynthesis regulation and gamma-aminobutyric acid (GABA) metabolism.
  2. Stiller J, Feng S, Chowdhury AA, Rivas-González I, Duchêne DA, Fang Q, et al.
    Nature, 2024 Apr 01.
    PMID: 38560995 DOI: 10.1038/s41586-024-07323-1
    Despite tremendous efforts in the past decades, relationships among main avian lineages remain heavily debated without a clear resolution. Discrepancies have been attributed to diversity of species sampled, phylogenetic method, and the choice of genomic regions [1-3]. Here, we address these issues by analyzing genomes of 363 bird species [4] (218 taxonomic families, 92% of total). Using intergenic regions and coalescent methods, we present a well-supported tree but also a remarkable degree of discordance. The tree confirms that Neoaves experienced rapid radiation at or near the Cretaceous-Paleogene (K-Pg) boundary. Sufficient loci rather than extensive taxon sampling were more effective in resolving difficult nodes. Remaining recalcitrant nodes involve species that challenge modeling due to extreme GC content, variable substitution rates, incomplete lineage sorting, or complex evolutionary events such as ancient hybridization. Assessment of the impacts of different genomic partitions showed high heterogeneity across the genome. We discovered sharp increases in effective population size, substitution rates, and relative brain size following the K-Pg extinction event, supporting the hypothesis that emerging ecological opportunities catalyzed the diversification of modern birds. The resulting phylogenetic estimate offers novel insights into the rapid radiation of modern birds and provides a taxon-rich backbone tree for future comparative studies.
  3. Aad G, Abbott B, Abeling K, Abicht NJ, Abidi SH, Aboulhorma A, et al.
    Phys Rev Lett, 2024 Jan 12;132(2):021803.
    PMID: 38277607 DOI: 10.1103/PhysRevLett.132.021803
    The first evidence for the Higgs boson decay to a Z boson and a photon is presented, with a statistical significance of 3.4 standard deviations. The result is derived from a combined analysis of the searches performed by the ATLAS and CMS Collaborations with proton-proton collision datasets collected at the CERN Large Hadron Collider (LHC) from 2015 to 2018. These correspond to integrated luminosities of around 140 fb^{-1} for each experiment, at a center-of-mass energy of 13 TeV. The measured signal yield is 2.2±0.7 times the standard model prediction, and agrees with the theoretical expectation within 1.9 standard deviations.
  4. Reeve AH, Kennedy JD, Pujolar JM, Petersen B, Blom MPK, Alström P, et al.
    Nat Commun, 2023 Dec 11;14(1):8215.
    PMID: 38081809 DOI: 10.1038/s41467-023-43964-y
    The processes generating the earth's montane biodiversity remain a matter of debate. Two contrasting hypotheses have been advanced to explain how montane populations form: via direct colonization from other mountains, or, alternatively, via upslope range shifts from adjacent lowland areas. We seek to reconcile these apparently conflicting hypotheses by asking whether a species' ancestral geographic origin determines its mode of mountain colonization. Island-dwelling passerine birds at the faunal crossroads between Eurasia and Australo-Papua provide an ideal study system. We recover the phylogenetic relationships of the region's montane species and reconstruct their ancestral geographic ranges, elevational ranges, and migratory behavior. We also perform genomic population studies of three super-dispersive montane species/clades with broad island distributions. Eurasian-origin species populated archipelagos via direct colonization between mountains. This mode of colonization appears related to ancestral adaptations to cold and seasonal climates, specifically short-distance migration. Australo-Papuan-origin mountain populations, by contrast, evolved from lowland ancestors, and highland distribution mostly precludes their further colonization of island mountains. Our study explains much of the distributional variation within a complex biological system, and provides a synthesis of two seemingly discordant hypotheses for montane community formation.
  5. Sánchez-Barreiro F, De Cahsan B, Westbury MV, Sun X, Margaryan A, Fontsere C, et al.
    Mol Biol Evol, 2023 Sep 01;40(9).
    PMID: 37561011 DOI: 10.1093/molbev/msad180
    The black rhinoceros (Diceros bicornis L.) is a critically endangered species historically distributed across sub-Saharan Africa. Hunting and habitat disturbance have diminished both its numbers and distribution since the 19th century, but a poaching crisis in the late 20th century drove them to the brink of extinction. Genetic and genomic assessments can greatly increase our knowledge of the species and inform management strategies. However, when a species has been severely reduced, with the extirpation and artificial admixture of several populations, it is extremely challenging to obtain an accurate understanding of historic population structure and evolutionary history from extant samples. Therefore, we generated and analyzed whole genomes from 63 black rhinoceros museum specimens collected between 1775 and 1981. Results showed that the black rhinoceros could be genetically structured into six major historic populations (Central Africa, East Africa, Northwestern Africa, Northeastern Africa, Ruvuma, and Southern Africa) within which were nested four further subpopulations (Maasailand, southwestern, eastern rift, and northern rift), largely mirroring geography, with a punctuated north-south cline. However, we detected varying degrees of admixture among groups and found that several geographical barriers, most prominently the Zambezi River, drove population discontinuities. Genomic diversity was high in the middle of the range and decayed toward the periphery. This comprehensive historic portrait also allowed us to ascertain the ancestry of 20 resequenced genomes from extant populations. Lastly, using insights gained from this unique temporal data set, we suggest management strategies, some of which require urgent implementation, for the conservation of the remaining black rhinoceros diversity.
  6. Battlay P, Wilson J, Bieker VC, Lee C, Prapas D, Petersen B, et al.
    Nat Commun, 2023 Mar 27;14(1):1717.
    PMID: 36973251 DOI: 10.1038/s41467-023-37303-4
    Adaptation is the central feature and leading explanation for the evolutionary diversification of life. Adaptation is also notoriously difficult to study in nature, owing to its complexity and logistically prohibitive timescale. Here, we leverage extensive contemporary and historical collections of Ambrosia artemisiifolia (an aggressively invasive weed and primary cause of pollen-induced hayfever) to track the phenotypic and genetic causes of recent local adaptation across its native and invasive ranges in North America and Europe, respectively. Large haploblocks (indicative of chromosomal inversions) contain a disproportionate share (26%) of genomic regions conferring parallel adaptation to local climates between ranges, are associated with rapidly adapting traits, and exhibit dramatic frequency shifts over space and time. These results highlight the importance of large-effect standing variants in rapid adaptation, which have been critical to A. artemisiifolia's global spread across vast climatic gradients.
  7. Mutusamy P, Banga Singh KK, Su Yin L, Petersen B, Sicheritz-Ponten T, Clokie MRJ, et al.
    Int J Mol Sci, 2023 Feb 12;24(4).
    PMID: 36835084 DOI: 10.3390/ijms24043678
    Salmonella infections across the globe are becoming more challenging to control due to the emergence of multidrug-resistant (MDR) strains. Lytic phages may be suitable alternatives for treating these multidrug-resistant Salmonella infections. Most Salmonella phages to date were collected from human-impacted environments. To further explore the Salmonella phage space, and to potentially identify phages with novel characteristics, we characterized Salmonella-specific phages isolated from the Penang National Park, a conserved rainforest. Four phages with a broad lytic spectrum (kills >5 Salmonella serovars) were further characterized; they have isometric heads and cone-shaped tails, and genomes of ~39,900 bp, encoding 49 CDSs. As the genomes share a <95% sequence similarity to known genomes, the phages were classified as a new species within the genus Kayfunavirus. Interestingly, the phages displayed obvious differences in their lytic spectrum and pH stability, despite having a high sequence similarity (~99% ANI). Subsequent analysis revealed that the phages differed in the nucleotide sequence in the tail spike proteins, tail tubular proteins, and portal proteins, suggesting that the SNPs were responsible for their differing phenotypes. Our findings highlight the diversity of novel Salmonella bacteriophages from rainforest regions, which can be explored as an antimicrobial agent against MDR-Salmonella strains.
  8. Reeve AH, Gower G, Pujolar JM, Smith BT, Petersen B, Olsson U, et al.
    Evol Lett, 2023 Feb 01;7(1):24-36.
    PMID: 37065434 DOI: 10.1093/evlett/qrac006
    Tropical islands are renowned as natural laboratories for evolutionary study. Lineage radiations across tropical archipelagos are ideal systems for investigating how colonization, speciation, and extinction processes shape biodiversity patterns. The expansion of the island thrush across the Indo-Pacific represents one of the largest yet most perplexing island radiations of any songbird species. The island thrush exhibits a complex mosaic of pronounced plumage variation across its range and is arguably the world's most polytypic bird. It is a sedentary species largely restricted to mountain forests, yet it has colonized a vast island region spanning a quarter of the globe. We conducted a comprehensive sampling of island thrush populations and obtained genome-wide SNP data, which we used to reconstruct its phylogeny, population structure, gene flow, and demographic history. The island thrush evolved from migratory Palearctic ancestors and radiated explosively across the Indo-Pacific during the Pleistocene, with numerous instances of gene flow between populations. Its bewildering plumage variation masks a biogeographically intuitive stepping stone colonization path from the Philippines through the Greater Sundas, Wallacea, and New Guinea to Polynesia. The island thrush's success in colonizing Indo-Pacific mountains can be understood in light of its ancestral mobility and adaptation to cool climates; however, shifts in elevational range, degree of plumage variation and apparent dispersal rates in the eastern part of its range raise further intriguing questions about its biology.
  9. Gopalakrishnan S, Ebenesersdóttir SS, Lundstrøm IKC, Turner-Walker G, Moore KHS, Luisi P, et al.
    Curr Biol, 2022 Nov 07;32(21):4743-4751.e6.
    PMID: 36182700 DOI: 10.1016/j.cub.2022.09.023
    Human populations have been shaped by catastrophes that may have left long-lasting signatures in their genomes. One notable example is the second plague pandemic that entered Europe in ca. 1347 CE and repeatedly returned for over 300 years, with typical village and town mortality estimated at 10%-40% [1]. It is assumed that this high mortality affected the gene pools of these populations. First, local population crashes reduced genetic diversity. Second, a change in frequency is expected for sequence variants that may have affected survival or susceptibility to the etiologic agent (Yersinia pestis) [2]. Third, mass mortality might alter the local gene pools through its impact on subsequent migration patterns. We explored these factors using the Norwegian city of Trondheim as a model, by sequencing 54 genomes spanning three time periods: (1) prior to the plague striking Trondheim in 1349 CE, (2) the 17th-19th century, and (3) the present. We find that the pandemic period shaped the gene pool by reducing long distance immigration, in particular from the British Isles, and inducing a bottleneck that reduced genetic diversity. Although we also observe an excess of large FST values at multiple loci in the genome, these are shaped by reference biases introduced by mapping our relatively low genome coverage degraded DNA to the reference genome. This implies that attempts to detect selection using ancient DNA (aDNA) datasets that vary by read length and depth of sequencing coverage may be particularly challenging until methods have been developed to account for the impact of differential reference bias on test statistics.
  10. Høie MH, Kiehl EN, Petersen B, Nielsen M, Winther O, Nielsen H, et al.
    Nucleic Acids Res, 2022 Jul 05;50(W1):W510-W515.
    PMID: 35648435 DOI: 10.1093/nar/gkac439
    Recent advances in machine learning and natural language processing have made it possible to profoundly advance our ability to accurately predict protein structures and their functions. While such improvements are significantly impacting the fields of biology and biotechnology at large, such methods have the downside of high demands in terms of computing power and runtime, hampering their applicability to large datasets. Here, we present NetSurfP-3.0, a tool for predicting solvent accessibility, secondary structure, structural disorder and backbone dihedral angles for each residue of an amino acid sequence. This NetSurfP update exploits recent advances in pre-trained protein language models to drastically improve the runtime of its predecessor by two orders of magnitude, while displaying similar prediction performance. We assessed the accuracy of NetSurfP-3.0 on several independent test datasets and found it to consistently produce state-of-the-art predictions for each of its output features, with a runtime that is up to 600 times faster than the most commonly available methods performing the same tasks. The tool is freely available as a web server with a user-friendly interface to navigate the results, as well as a standalone downloadable package.
  11. Cerca J, Armstrong EE, Vizueta J, Fernández R, Dimitrov D, Petersen B, et al.
    Genome Biol Evol, 2021 Dec 01;13(12).
    PMID: 34849853 DOI: 10.1093/gbe/evab262
    Spiders (Araneae) have a diverse spectrum of morphologies, behaviors, and physiologies. Attempts to understand the genomic basis of this diversity are often hindered by their large, heterozygous, and AT-rich genomes with high repeat content resulting in highly fragmented, poor-quality assemblies. As a result, the key attributes of spider genomes, including gene family evolution, repeat content, and gene function, remain poorly understood. Here, we used Illumina and Dovetail Chicago technologies to sequence the genome of the long-jawed spider Tetragnatha kauaiensis, producing an assembly distributed along 3,925 scaffolds with an N50 of ∼2 Mb. Using comparative genomics tools, we explore genome evolution across available spider assemblies. Our findings suggest that the previously reported and vast genome size variation in spiders is linked to the different representation and number of transposable elements. Using statistical tools to uncover gene-family level evolution, we find expansions associated with the sensory perception of taste, immunity, and metabolism. In addition, we report strikingly different histories of chemosensory, venom, and silk gene families, with the first two evolving much earlier, affected by the ancestral whole genome duplication in Arachnopulmonata (∼450 Ma) and exhibiting higher numbers. Together, our findings reveal that spider genomes are highly variable and that genomic novelty may have been driven by the burst of an ancient whole genome duplication, followed by gene family and transposable element expansion.
  12. Mutusamy P, Jaya Jothi S, Lee SY, Petersen B, Sicheritz-Ponten T, Clokie MRJ, et al.
    Microbiol Resour Announc, 2021 Nov 24;10(47):e0096521.
    PMID: 34817216 DOI: 10.1128/MRA.00965-21
    We characterized the complete genome sequence of the lytic Salmonella enterica bacteriophage PRF-SP1, isolated from Penang National Park, a conserved rainforest in northern Malaysia. The novel phage species from the Autographiviridae family has a 39,966-bp double-stranded DNA (dsDNA) genome containing 49 protein-encoding genes and shares 90.96% similarity with Escherichia phage DY1.
  13. Sinding MS, Ciucani MM, Ramos-Madrigal J, Carmagnini A, Rasmussen JA, Feng S, et al.
    iScience, 2021 Nov 19;24(11):103226.
    PMID: 34712923 DOI: 10.1016/j.isci.2021.103226
    The evolution of the genera Bos and Bison, and the nature of gene flow between wild and domestic species, is poorly understood, with genomic data of wild species being limited. We generated two genomes from the likely extinct kouprey (Bos sauveli) and analyzed them alongside other Bos and Bison genomes. We found that B. sauveli possessed genomic signatures characteristic of an independent species closely related to Bos javanicus and Bos gaurus. We found evidence for extensive incomplete lineage sorting across the three species, consistent with a polytomic diversification of the major ancestry in the group, potentially followed by secondary gene flow. Finally, we detected significant gene flow from an unsampled Asian Bos-like source into East Asian zebu cattle, demonstrating both that the full genomic diversity and evolutionary history of the Bos complex has yet to be elucidated and that museum specimens and ancient DNA are valuable resources to do so.
  14. Alanin KWS, Jørgensen TS, Browne PD, Petersen B, Riber L, Kot W, et al.
    Plasmid, 2021 May;115:102576.
    PMID: 33872684 DOI: 10.1016/j.plasmid.2021.102576
    Mobile genetic elements (MGEs) are instrumental in natural prokaryotic genome editing, permitting genome plasticity and allowing microbes to accumulate genetic diversity. MGEs serve as a vast communal gene pool and include DNA elements such as plasmids and bacteriophages (phages) among others. These mobile DNA elements represent a human health risk as they can introduce new traits, such as antibiotic resistance or virulence, to a bacterial strain. Sequencing libraries targeting environmental circular MGEs, referred to as metamobilomes, may broaden our current understanding of the mechanisms behind the mobility, prevalence and content of these elements. However, metamobilomics is affected by a severe bias towards small circular elements, introduced by multiple displacement amplification (MDA). MDA is typically used to overcome limiting DNA quantities after the removal of non-circular DNA during library preparations. By examining the relationship between sequencing coverage and the size of circular MGEs in paired metamobilome datasets with and without MDA, we show that larger circular elements are lost when using MDA. This study is the first to systematically demonstrate that MDA is detrimental to detecting larger-sized plasmids if small plasmids are present. It is also the first to show that MDA can be omitted when using enzyme-based DNA fragmentation and PCR in library preparation kits such as Nextera XT® from Illumina.
  15. Feng S, Stiller J, Deng Y, Armstrong J, Fang Q, Reeve AH, et al.
    Nature, 2021 Apr;592(7856):E24.
    PMID: 33833441 DOI: 10.1038/s41586-021-03473-8
  16. Jorquera R, González C, Clausen PTLC, Petersen B, Holmes DS
    Database (Oxford), 2021 Jan 28;2021.
    PMID: 33507271 DOI: 10.1093/database/baab002
    Single-exon coding sequences (CDSs), also known as 'single-exon genes' (SEGs), are defined as nuclear, protein-coding genes that lack introns in their CDSs. They have been studied not only to determine their origin and evolution but also because their expression has been linked to several types of human cancers and neurological/developmental disorders, and many exhibit tissue-specific transcription. We developed SinEx DB that houses DNA and protein sequence information of SEGs from 10 mammalian genomes including human. SinEx DB includes their functional predictions (KOG (euKaryotic Orthologous Groups)) and the relative distribution of these functions within species. Here, we report SinEx 2.0, a major update of SinEx DB that includes information of the occurrence, distribution and functional prediction of SEGs from 60 completely sequenced eukaryotic genomes, representing animals, fungi, protists and plants. The information is stored in a relational database built with MySQL Server 5.7, and the complete dataset of SEG sequences and their GO (Gene Ontology) functional assignations are available for downloading. SinEx DB 2.0 was built with a novel pipeline that helps disambiguate single-exon isoforms from SEGs. SinEx DB 2.0 is the largest available database for SEGs and provides a rich source of information for advancing our understanding of the evolution, function of SEGs and their associations with disorders including cancers and neurological and developmental diseases. Database URL: http://v2.sinex.cl/.
  17. Ramos-Madrigal J, Sinding MS, Carøe C, Mak SST, Niemann J, Samaniego Castruita JA, et al.
    Curr Biol, 2021 Jan 11;31(1):198-206.e8.
    PMID: 33125870 DOI: 10.1016/j.cub.2020.10.002
    Extant Canis lupus genetic diversity can be grouped into three phylogenetically distinct clades: Eurasian and American wolves and domestic dogs [1]. Genetic studies have suggested these groups trace their origins to a wolf population that expanded during the last glacial maximum (LGM) [1-3] and replaced local wolf populations [4]. Moreover, ancient genomes from the Yana basin and the Taimyr peninsula provided evidence of at least one extinct wolf lineage that dwelled in Siberia during the Pleistocene [3, 5]. Previous studies have suggested that Pleistocene Siberian canids can be classified into two groups based on cranial morphology. Wolves in the first group are most similar to present-day populations, although those in the second group possess intermediate features between dogs and wolves [6, 7]. However, whether this morphological classification represents distinct genetic groups remains unknown. To investigate this question and the relationships between Pleistocene canids, present-day wolves, and dogs, we resequenced the genomes of four Pleistocene canids from Northeast Siberia dated between >50 and 14 ka old, including samples from the two morphological categories. We found these specimens cluster with the two previously sequenced Pleistocene wolves, which are genetically more similar to Eurasian wolves. Our results show that, though the four specimens represent extinct wolf lineages, they do not form a monophyletic group. Instead, each Pleistocene Siberian canid branched off the lineage that gave rise to present-day wolves and dogs. Finally, our results suggest the two previously described morphological groups could represent independent lineages similarly related to present-day wolves and dogs.
  18. Feng S, Stiller J, Deng Y, Armstrong J, Fang Q, Reeve AH, et al.
    Nature, 2020 Nov;587(7833):252-257.
    PMID: 33177665 DOI: 10.1038/s41586-020-2873-9
    Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity [1-4]. Sparse taxon sampling has previously been proposed to confound phylogenetic inference [5], and captures only a fraction of the genomic diversity. Here we report a substantial step towards the dense representation of avian phylogenetic and molecular diversity, by analysing 363 genomes from 92.4% of bird families, including 267 newly sequenced genomes produced for phase II of the Bird 10,000 Genomes (B10K) Project. We use this comparative genome dataset in combination with a pipeline that leverages a reference-free whole-genome alignment to identify orthologous regions in greater numbers than has previously been possible and to recognize genomic novelties in particular bird lineages. The densely sampled alignment provides a single-base-pair map of selection, has more than doubled the fraction of bases that are confidently predicted to be under conservation and reveals extensive patterns of weak selection in predominantly non-coding DNA. Our results demonstrate that increasing the diversity of genomes used in comparative studies can reveal more shared and lineage-specific variation, and improve the investigation of genomic characteristics. We anticipate that this genomic resource will offer new perspectives on evolutionary processes in cross-species comparative analyses and assist in efforts to conserve species.
  19. Sinding MS, Gopalakrishnan S, Ramos-Madrigal J, de Manuel M, Pitulko VV, Kuderna L, et al.
    Science, 2020 Jun 26;368(6498):1495-1499.
    PMID: 32587022 DOI: 10.1126/science.aaz8599
    Although sled dogs are one of the most specialized groups of dogs, their origin and evolution has received much less attention than many other dog groups. We applied a genomic approach to investigate their spatiotemporal emergence by sequencing the genomes of 10 modern Greenland sled dogs, an ~9500-year-old Siberian dog associated with archaeological evidence for sled technology, and an ~33,000-year-old Siberian wolf. We found noteworthy genetic similarity between the ancient dog and modern sled dogs. We detected gene flow from Pleistocene Siberian wolves, but not modern American wolves, to present-day sled dogs. The results indicate that the major ancestry of modern sled dogs traces back to Siberia, where sled dog-specific haplotypes of genes that potentially relate to Arctic adaptation were established by 9500 years ago.
  20. Gelabert P, Sandoval-Velasco M, Serres A, de Manuel M, Renom P, Margaryan A, et al.
    Curr Biol, 2020 Jan 06;30(1):108-114.e5.
    PMID: 31839456 DOI: 10.1016/j.cub.2019.10.066
    As the only endemic neotropical parrot to have recently lived in the northern hemisphere, the Carolina parakeet (Conuropsis carolinensis) was an iconic North American bird. The last surviving specimen died in the Cincinnati Zoo in 1918 [1]. The cause of its extinction remains contentious: besides excessive mortality associated to habitat destruction and active hunting, their survival could have been negatively affected by its range having become increasingly patchy [2] or by the exposure to poultry pathogens [3, 4]. In addition, the Carolina parakeet showed a predilection for cockleburs, an herbaceous plant that contains a powerful toxin, carboxyatractyloside, or CAT [5], which did not seem to affect them but made the birds notoriously toxic to most predators [3]. To explore the demographic history of this bird, we generated the complete genomic sequence of a preserved specimen held in a private collection in Espinelves (Girona, Spain), as well as of a close extant relative, Aratinga solstitialis. We identified two non-synonymous genetic changes in two highly conserved proteins known to interact with CAT that could underlie a specific dietary adaptation to this toxin. Our genomic analyses did not reveal evidence of a dramatic past demographic decline in the Carolina parakeet; also, its genome did not exhibit the long runs of homozygosity that are signals of recent inbreeding and are typically found in endangered species. As such, our results suggest its extinction was an abrupt process and thus likely solely attributable to human causes.