FINDINGS: We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
CONCLUSIONS: This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.
Results: We generated 43 Gb of short Illumina reads and 9 Gb of long Nanopore reads, representing approximate genome coverage of 54× and 11×, respectively, based on the range of estimated k-mer-predicted genome sizes of between 791 and 967 Mbp. The final assembled genome is contained in 6404 scaffolds with an accumulated length of 880 Mb (96.3% BUSCO-calculated genome completeness). Compared with the Illumina-only assembly, the hybrid approach generated 94% fewer scaffolds with an 18-fold increase in N50 length (401 kb) and increased the genome completeness by an additional 16%. A total of 27 240 high-quality protein-coding genes were predicted from the clown anemonefish, 26 211 (96%) of which were annotated functionally with information from either sequence homology or protein signature searches.
Conclusions: We present the first genome of any anemonefish and demonstrate the value of low coverage (∼11×) long Nanopore read sequencing in improving both genome assembly contiguity and completeness. The near-complete assembly of the A. ocellaris genome will be an invaluable molecular resource for supporting a range of genetic, genomic, and phylogenetic studies specifically for clownfish and more generally for other related fish species of the family Pomacentridae.
RESULTS: A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized.
CONCLUSIONS: It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.
METHODOLOGY/PRINCIPAL FINDINGS: We profiled gene expression changes in the airway wall using a large animal model of physical injury comprising bronchial brush biopsy in anaesthetised sheep. The experimental design featured sequential studies in the same animals over the course of a week and yielded data relating to the response at 6 hours, and 1, 3 and 7 days after injury. Notable features of the transcriptional response included the early and sustained preponderance of down-regulated genes associated with angiogenesis and immune cell activation, selection and differentiation. Later features of the response included the up-regulation of cell cycle genes at d1 and d3, and the latter pronounced up-regulation of extracellular matrix-related genes at d3 and d7.
CONCLUSIONS/SIGNIFICANCE: It is possible to follow the airway wall response to physical injury in the same animal over the course of time. Transcriptional changes featured coordinate expression of functionally related genes in a reproducible manner both within and between animals. This characterisation will provide a foundation against which to assess the perturbations that accompany airway disease pathologies of comparative relevance.
METHODS: Genomic DNA was extracted from Salmonella strain PS01 and was sequenced using an Illumina HiSeq 2000 platform. The generated reads were de novo assembled using CLC Genomics Workbench. The draft genome was annotated and the presence of antimicrobial resistance genes was identified.
RESULTS: The 5 036 442bp genome contains various antimicrobial resistance genes conferring resistance to aminoglycosides, fluoroquinolones, fosfomycin, macrolides, phenicols, sulphonamides, tetracyclines and trimethoprim. The β-lactamase gene blaTEM-176 encoding TEM-176 was also found in this strain.
CONCLUSIONS: The genome sequence will aid in the understanding of drug resistance mechanisms in foodborne Salmonella Brancaster and highlights the need to ensure the judicious use of antibiotics in animal husbandry as well as the importance of implementing proper food handling and preparation practices.