This Data Descriptor announces the submission to public repositories of the monoterpene indole alkaloid database (MIADB), a cumulative collection of 172 tandem mass spectrometry (MS/MS) spectra from multiple research projects conducted in eight natural product chemistry laboratories since the 1960s. All data have been annotated and organized to promote reuse by the community. Being a unique collection of these complex natural products, these data can be used to guide the dereplication and targeting of new related monoterpene indole alkaloids within complex mixtures when applying computer-based approaches, such as molecular networking. Each spectrum has its own accession number from CCMSLIB00004679916 to CCMSLIB00004680087 on the GNPS. The MIADB is available for download from MetaboLights under the identifier: MTBLS142 ( https://www.ebi.ac.uk/metabolights/MTBLS142 ).
Mycobacterium tuberculosis (MTB) is commonly used as a model to study pathogenicity and multiple drug resistance in bacteria. These MTB characteristics are highly dependent on the evolution and phylogeography of the bacterium. In this paper, we describe 15 new genomes of multidrug-resistant MTB (MDRTB) from Malaysia. The assessments and annotations on the genome assemblies suggest that strain differences are due to lineages and horizontal gene transfer during the course of evolution. The genomes show mutations listed in current drug resistance databases and global MTB collections. This genome data will augment existing information available for comparative genomic studies to understand MTB drug resistance mechanisms and evolution.
The FLUXNET2015 dataset provides ecosystem-scale data on CO2, water, and energy exchange between the biosphere and the atmosphere, and other meteorological and biological measurements, from 212 sites around the globe (over 1500 site-years, up to and including year 2014). These sites, independently managed and operated, voluntarily contributed their data to create global datasets. Data were quality controlled and processed using uniform methods, to improve consistency and intercomparability across sites. The dataset is already being used in a number of applications, including ecophysiology studies, remote sensing studies, and development of ecosystem and Earth system models. FLUXNET2015 includes derived-data products, such as gap-filled time series, ecosystem respiration and photosynthetic uptake estimates, estimation of uncertainties, and metadata about the measurements, presented for the first time in this paper. In addition, 206 of these sites are for the first time distributed under a Creative Commons (CC-BY 4.0) license. This paper details this enhanced dataset and the processing methods, now made available as open-source codes, making the dataset more accessible, transparent, and reproducible.
Thermal discomfort is one of the main triggers for occupants' interactions with components of the built environment such as adjustments of thermostats and/or opening windows and strongly related to the energy use in buildings. Understanding causes for thermal (dis-)comfort is crucial for design and operation of any type of building. The assessment of human thermal perception through rating scales, for example in post-occupancy studies, has been applied for several decades; however, long-existing assumptions related to these rating scales had been questioned by several researchers. The aim of this study was to gain deeper knowledge on contextual influences on the interpretation of thermal perception scales and their verbal anchors by survey participants. A questionnaire was designed and consequently applied in 21 language versions. These surveys were conducted in 57 cities in 30 countries resulting in a dataset containing responses from 8225 participants. The database offers potential for further analysis in the areas of building design and operation, psycho-physical relationships between human perception and the built environment, and linguistic analyses.
This study developed 0.05° × 0.05° land-only datasets of daily maximum and minimum temperatures in the densely populated Central North region of Egypt (CNE) for the period 1981-2017. Existing coarse-resolution datasets were evaluated to find the best dataset for the study area to use as a base of the new datasets. The Climate Prediction Centre (CPC) global temperature dataset was found to be the best. The CPC data were interpolated to a spatial resolution of 0.05° latitude/longitude using linear interpolation technique considering the flat topography of the study area. The robust kernel density distribution mapping method was used to correct the bias using observations, and WorldClim v.2 temperature climatology was used to adjust the spatial variability in temperature. The validation of CNE datasets using probability density function skill score and hot and cold extremes tail skill scores showed remarkable improvement in replicating the spatial and temporal variability in observed temperature. Because CNE datasets are the best available high-resolution estimate of daily temperatures, they will be beneficial for climatic and hydrological studies.
Forest biomass is an essential indicator for monitoring the Earth's ecosystems and climate. It is a critical input to greenhouse gas accounting, estimation of carbon losses and forest degradation, assessment of renewable energy potential, and for developing climate change mitigation policies such as REDD+, among others. Wall-to-wall mapping of aboveground biomass (AGB) is now possible with satellite remote sensing (RS). However, RS methods require extant, up-to-date, reliable, representative and comparable in situ data for calibration and validation. Here, we present the Forest Observation System (FOS) initiative, an international cooperation to establish and maintain a global in situ forest biomass database. AGB and canopy height estimates with their associated uncertainties are derived at a 0.25 ha scale from field measurements made in permanent research plots across the world's forests. All plot estimates are geolocated and have a size that allows for direct comparison with many RS measurements. The FOS offers the potential to improve the accuracy of RS-based biomass products while developing new synergies between the RS and ground-based ecosystem research communities.
Complete genomes of xenobiotic-degrading microorganisms provide valuable resources for researchers to understand molecular mechanisms involved in bioremediation. Despite the well-known ability of Sphingomonas paucimobilis to degrade persistent xenobiotic compounds, a complete genome sequencing is lacking for this organism. In line with this, we report the first complete genome sequence of Sphingomonas paucimobilis (strain AIMST S2), an organophosphate and hydrocarbon-degrading bacterium isolated from oil-polluted soil at Kedah, Malaysia. The genome was derived from a hybrid assembly of short and long reads generated by Illumina HiSeq and MinION, respectively. The assembly resulted in a single contig of 4,005,505 bases which consisted of 3,612 CDS and 56 tRNAs. An array of genes involved in xenobiotic degradation and plant-growth promoters were identified, suggesting its' potential role as an effective microorganism in bioremediation and agriculture. Having reported the first complete genome of the species, this study will serve as a stepping stone for comparative genome analysis of Sphingomonas strains and other xenobiotic-degrading microorganisms as well as gene expression studies in organophosphate biodegradation.