RESULTS: Sequence data were obtained for both A. dorsata and H. itama. The raw sequence data for A. dorsata was 5 Mb, which was assembled into 5 contigs with a size of 6,098,728 bp, an N50 of 15,534, and a GC average of 57.42. Similarly, the raw sequence data for H. itama was 6.3 Mb, which was assembled into 11 contigs with a size of 7,642,048 bp, an N50 of 17,180, and a GC average of 55.38. In the honey sample of A. dorsata, we identified five different plant/pollen species, with only one of the five species exhibiting a relative abundance of less than 1%. For H. itama, we identified seven different plant/pollen species, with only three of the species exhibiting a relative abundance of less than 1%. All of the identified plant species were native to Peninsular Malaysia, especially the East Coast area of Terengganu.
DATA DESCRIPTION: Our data offers valuable insights into honey's geographical and botanical origin and authenticity. Metagenomic studies could help identify the plant species that honeybees forage and provide preliminary data for researchers studying the biological development of A. dorsata and H. itama. The identification of various flowers from the eDNA of honey that are known for their medicinal properties could aid in regional honey with accurate product origin labeling, which is crucial for guaranteeing product authenticity to consumers.
METHODS: The fructophilic characteristics of strain Sy-1 were determined, and the genome was sequenced using Illumina iSeq100 and Oxford Nanopore. The average nucleotide identity and phylogenetic analyses based on 16S rRNA, 92 core genes, and whole-genome sequence were performed to unravel the phylogenetic position of strain Sy-1. NCBI Prokaryotic Genome Annotation Pipeline annotated the genome, while the EggNOG-mapper, BLASTKoala, and GHOSTKoala were used to add functional genes and pathways information.
RESULTS: Strain Sy-1 prefers D-fructose over D-glucose and actively metabolizes D-glucose in the presence of electron acceptors. Genomic annotation of strain Sy-1 revealed few genes involved in carbohydrate transport and metabolism, and partial deletion of adhE gene, in line with the characteristic of FLAB. The 16S rRNA gene sequence of strain Sy-1 showed the highest similarity to unknown LAB species isolated from the gut of honeybees. The phylogenetic analyses discovered that strain Sy-1 belonged to the Lactobacillaceae family and formed a separate branch closer to type strain from the genera of Acetilactobacillus and Apilactobacillus. The ANI analysis showed the similarity of the closest relative, Apilactobacillus micheneri Hlig3T. The assembled genome of Sy-1 contains 3 contigs with 2.03 Mbp and a 41% GC content. A total of 1,785 genes were identified, including 1,685 protein-coding genes, 68 tRNA, and 15 rRNA. Interestingly, strain Sy-1 encoded complete genes for the biosynthesis of folate and riboflavin. High-performance liquid chromatography analysis further confirmed the high production of folic acid (1.346 mg/L) by Sy-1.
DISCUSSION: Based on phylogenetic and biochemical characteristics, strain Sy-1 should be classified as a novel genus in the family of Lactobacillaceae and a new member of FLAB. The genome information coupled with experimental studies supported the ability of strain Sy-1 to produce high folic acid. Our collective findings support the suitable application of FLAB strain Sy-1 in the functional food and pharmaceutical industries.