MyMedR

Displaying all 4 publications

Complete vertebrate mitogenomes reveal widespread repeats and gene duplications

Formenti G, Rhie A, Balacco J, Haase B, Mountcastle J, Fedrigo O, et al.

Genome Biol, 2021 04 29;22(1):120.
PMID: 33910595 DOI: 10.1186/s13059-021-02336-9

BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly.
RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100-300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization.
CONCLUSIONS: Our results indicate that even in the "simple" case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone.

Matched MeSH terms: Vertebrates/genetics*

Towards complete and error-free genome assemblies of all vertebrate species

Rhie A, McCarthy SA, Fedrigo O, Damas J, Formenti G, Koren S, et al.

Nature, 2021 Apr;592(7856):737-746.
PMID: 33911273 DOI: 10.1038/s41586-021-03451-0

High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species [1-4]. To address this issue, the international Genome 10K (G10K) consortium [5,6] has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.

Matched MeSH terms: Vertebrates/genetics*

Phylogenetic analyses uncover a novel clade of transferrin in nonmammalian vertebrates

Mohd-Padil H, Mohd-Adnan A, Gabaldón T

Mol Biol Evol, 2013 Apr;30(4):894-905.
PMID: 23258311 DOI: 10.1093/molbev/mss325

Transferrin is a protein super-family involved in iron transport, a central process in cellular homeostasis. Throughout the evolution of vertebrates, transferrin members have diversified into distinct subfamilies including serotransferrin, ovotransferrin, lactoferrin, melanotransferrin, the inhibitor of carbonic anhydrase, pacifastin, and the major yolk protein in sea urchin. Previous phylogenetic analyses have established the branching order of the diverse transferrin subfamilies but were mostly focused on the transferrin repertoire present in mammals. Here, we conduct a comprehensive phylogenetic analysis of transferrin protein sequences in sequenced vertebrates, placing a special focus on the less-studied nonmammalian vertebrates. Our analyses uncover a novel transferrin clade present across fish, sauropsid, and amphibian genomes but strikingly absent from mammals. Our reconstructed scenario implies that this novel class emerged through a duplication event at the vertebrate ancestor, and that it was subsequently lost in the lineage leading to mammals. We detect footprints of accelerated evolution following the duplication event, which suggest positive selection and early functional divergence of this novel clade. Interestingly, the loss of this novel class of transferrin in mammals coincided with the divergence by duplication of lactoferrin and serotransferrin in this lineage. Altogether, our results provide novel insights on the evolution of iron-binding proteins in the various vertebrate groups.

Matched MeSH terms: Vertebrates/genetics

Molecular characterization of a novel proto-type antimicrobial protein galectin-1 from striped murrel

Arasu A, Kumaresan V, Sathyamoorthi A, Chaurasia MK, Bhatt P, Gnanam AJ, et al.

Microbiol Res, 2014 Nov;169(11):824-34.
PMID: 24780642 DOI: 10.1016/j.micres.2014.03.005

In this study, we report the molecular characterization of a novel proto-type galectin-1 from the striped murrel Channa striatus (named CsGal-1). The full-length CsGal-1 was identified from an established striped murrel cDNA library, and the sequence was confirmed by cloning. The complete cDNA sequence of CsGal-1 is 590 base pairs (bp) in length and its coding region encodes a polypeptide of 135 amino acids. The polypeptide contains a galactoside-binding lectin domain at residues 4-135. The domain carries a sugar-binding site at residues 45-74 along with its signatures (H(45)-X-Asn(47)-X-Arg(49) and Trp(69)-X-X-Glu(72)-X-Arg(74)). CsGal-1 shares a highly conserved carbohydrate recognition domain (CRD) with proto-type galectin-1 from other teleosts. The mRNA expression of CsGal-1 was examined using qRT-PCR in tissues of healthy C. striatus and of fish injected with various immune stimulants, including Aphanomyces invadans, Aeromonas hydrophila, Escherichia coli lipopolysaccharide and poly I:C. CsGal-1 mRNA is highly expressed in kidney and is up-regulated by the different immune stimulants at various time points. To understand its biological activity, the coding region of the CsGal-1 gene was expressed in an E. coli BL21 (DE3) cloning system and its recombinant protein was purified. The recombinant CsGal-1 protein agglutinated mouse erythrocytes at a concentration of 4 μg/mL in a calcium-independent manner. CsGal-1 activity was inhibited by d-galactose at 25 mM(-1) and by d-glucose and d-fructose at 100 mM(-1). The microbial binding assay showed that the recombinant CsGal-1 protein agglutinated only Gram-negative bacteria; no agglutination of Gram-positive bacteria was observed. Overall, the study showed that CsGal-1 is an important immune gene involved in the recognition and elimination of pathogens in C. striatus.

Matched MeSH terms: Vertebrates/genetics