Ganoderma boninense is known to be the causal agent for basal stem rot (BSR) affecting the oil palm industry worldwide thus cumulating to high economic losses every year. Several reports have shown that a compatible monokaryon pair needs to mate; producing dikaryotic mycelia to initiate the infection towards the oil palm. However, the molecular events occurs during mating process are not well understood. We performed transcriptome sequencing using Illumina RNA-seq technology and de novo assembly of the transcripts from monokaryon, mating junction and dikaryon mycelia of G. boninense. Raw reads from these three libraries were deposited in the NCBI database with accession number SRR1745787, SRR1745773 and SRR1745777, respectively.
Hevea brasiliensis is exploited for its latex production, and it is the only viable source of natural rubber worldwide. The demand for natural rubber remains high due its high-quality properties, which synthetic rubber cannot compete with. In this paper, we present transcriptomic data and analysis of three H. brasiliensis clones using tissue from latex and bark tissues collected from 10-year-old plant. The combined, assembled transcripts were mapped onto an H. brasiliensis draft genome. Gene ontology analysis showed that the most abundant transcripts related to molecular functions, followed by biological processes and cellular components. Simple sequence repeats (SSR) and single nucleotide polymorphisms (SNP) were also identified, and these can be useful for selection of parental and new clones in a breeding program. Data generated by RNA sequencing were deposited in the NCBI public repository under accession number PRJNA629890.
Eucheuma denticulatum or commonly known as "Spinosum", is an economically important red alga that naturally grows on coral reefs with moderately strong currents in tropical and sub-tropical areas. This species is the primary source of iota-carrageenan which has high demands in the food, pharmaceutical and manufacturing industries, and as such it has been widely cultivated. The increasing global demand for carrageenan has led to extensive commercial cultivation of carrageenophytes mainly in the tropics. The carrageenophyte seaweeds including E. denticulatum are indigenous to Sabah, Malaysia. To enrich the information on the genes involved in carrageenan biosynthesis, RNA sequencing has been performed and transcriptomic dataset has been generated using Illumina HiSeq™ 2000 sequencer. The raw data and transcriptomic data have been deposited in NCBI database with the accession number PRJNA477734. These data will provide valuable resources for functional genomics annotation and investigation of mechanisms underlying the regulations of genes in this algal species.
Vibriosis disease by Vibrio spp. greatly reduced productivity of aquaculture, such as brown-marbled grouper (Epinephelus fuscoguttatus), which is an economically important fish species in Malaysia. Preventive measures and immediate treatment are critical to reduce the mortality of E. fuscoguttatus from vibriosis. To investigate the molecular mechanisms associated with immune response and host-bacteria interaction, a transcriptomic analysis was performed to compare between healthy and Vibrio-infected groupers. This permits the discovery of immune-related genes, specifically the resistance genes upon infection. Herein, we provide the raw transcriptome data from Illumina HiSeq. 4000 that have been deposited into NCBI SRA database with the BioProject accession number PRJNA396437. A total of 493,403,076 raw sequences of 74.5 Gb were obtained. Trimming of the raw data produced 437,186,232 clean reads of ~58 Gb. These datasets will be useful to elucidate the defence mechanisms of E. fuscoguttatus against Vibrio vulnificus infection for future development of effective prevention and treatment of vibriosis.
In 2019, 10 million new cases of tuberculosis have been reported worldwide. Our data reports genetic analyses of a Mycobacterium tuberculosis strain SBH321 isolated from a 31-year-old female with pulmonary tuberculosis. The genomic DNA of the strain was extracted from pure culture and subjected to sequencing using Illumina platform. M. tuberculosis strain SBH321 consists of 4,374,895 bp with G+C content of 65.59%. The comparative analysis by SNP-based phylogenetic analysis using maximum-likelihood method showed that our strain belonging to sublineage of the Ural family of Europe-America-Africa lineage (Lineage 4) and clustered with M. tuberculosis strain OFXR-4 from Taiwan. The whole genome sequence is deposited at DDBJ/ENA/GenBank under the accession WCJH00000000 (SRR10230353).
The research aims to study the correlations between attitudes of users, social norms, perceived behavioural control, and purchase behaviours. The research population consists of social media users in Malaysia. The data was collected from 205 respondents via a self-administered online survey. The theory of planned behaviour acts as the underlying theory in the research. Next, descriptive and hypothesis-testing quantitative analysis were adopted to probe the relationships between the variables. Moreover, G*Power was used to identify the minimum sample size, and SPSS v.22 was employed to examine the datasets.
This article presents internet and mobile technology (IMT) usage during the pandemic and examines its impact on Malaysian workers' work-life balance. This study also included the Malaysian local value, namely Ugahari, and its role in shaping individual work and personal/family life behavior. The operationalization of variables was developed based on the work-life balance from permeability and flexibility, type of work-life balance consequences, and the theory of planned behavior utilized for local wisdom to characterize the respondents. Data were collected through online surveys and distributed to industries and government agencies in the Urban Area of Malaysia. There are 466 valid and complete questionnaires. The data set has been collected as a reference source for further research regarding the role of local value "Ugahari, especially on work-life balance.
White threads fish Holothuria (Mertensiothuria) leucospilota (Brandt, 1835) or locally known as bat puntil is a neritic marine organism, and it is widely distributed in Indo Pacific. They serve many important roles in ecosystem services and were discovered to contain many bioactive compounds that are useful for medicinal value. However, despite its abundance in Malaysian seawater, there is still a lack of records on H. leucospilota mitochondrial genome (mitogenome) from Malaysia. The mitogenome of H. leucospilota originating from Sedili Kechil, Kota Tinggi, Johor, Malaysia, is presented here. Whole genome sequencing was successfully sequenced using Illumina NovaSEQ6000 sequencing system and the mitochondrial-derived contigs were assembled using de novo approach. The size of the mitogenome is 15,982 bp which consists of 13 protein-coding genes (PCGs), 21 transfer RNAs, and 2 ribosomal RNAs. The overall composition of nucleotide bases was estimated to be 25.8% for T, 25.9% for C, 31.8% for A and 16.5% for G (with A + T content of 57.6%). Maximum likelihood phylogenetic tree analysis revealed that the mitochondrial Protein-Coding Genes (PCGs) sequence data from our H. leucospilota is closely related to H. leucospilota from accession number MK940237 and H. leucospilota from accession number MN594790, followed by H. leucospilota from accession number MN276190, forming sister group with H. hilla (MN163001), known as Tiger tail sea cucumber. The mitogenome of H. leucospilota will be valuable for genetic research, mitogenome reference and future conservation management of sea cucumber in Malaysia. The mitogenome data of H. leucospilota from Sedili Kechil, Kota Tinggi, Johor, Malaysia is available in the GenBank database repository with accession number ON584426.
The Javan mahseer (Tor tambra) is one of the most valuable freshwater fish found in Tor species. To date, other than mitogenomic data (BioProject: PRJNA422829), genomic and transcriptomic resources for this species are still lacking which is crucial to understand the molecular mechanisms associated with important traits such as growth, immune response, reproduction and sex determination. For the first time, we sequenced the transcriptome from a whole juvenile fish using Illumina NovaSEQ6000 generating raw paired-end reads. De novo transcriptome assembly generated a draft transcriptome (BUSCO5 completeness of 91.2% [Actinopterygii_odb10 database]) consisting of 259,403 putative transcripts with a total and N50 length of 333,881,215 bp and 2283 bp, respectively. A total count of 77,503 non-redundant protein coding sequences were predicted from the transcripts and used for functional annotation. We mapped the predicted proteins to 304 known KEGG pathways with signal transduction cluster having the highest representation followed by immune system and endocrine system. In addition, transcripts exhibiting significant similarity to previously published growth-and immune-related genes were identified which will facilitate future molecular breeding of Tor tambra.
Roselle (Hibiscus sabdariffa L.) is recognized for its phytochemical compounds such as anthocyanins, which possess pharmacological potentials in the treatments of hypertension, diabetes, cancer, hyperlipidaemia and hyperglycaemia. The calyx is the most commercially valuable part of the roselle and usually harvested at maturation. However, genetic study to understand the transcriptome changes in the calyx during maturation has yet to be explored. In this study, we sequenced the transcriptomes of roselle calyces at maturation stages III and IV using Illumina NextSeq 500 platform. These are the two most critical maturation stages in roselle, as these stages are often associated with the quality of the calyx. Over 200 million good quality paired-end reads were generated and de novo assembled into a reference transcriptome consisting of 221,334 transcripts with N50 score of 491bp. Among these transcripts, 92,974 transcripts (42%) were successfully annotated. The total number of significantly differentially expressed genes (DEGs) and the top five most significantly regulated genes in each of the maturation stage were presented. Twenty-one genes implicated in the biosynthesis of anthocyanins and their relative expressions in the calyx tissues at the two maturation stages were reported. Two secondary metabolites biosynthesis pathways that attained a relatively higher number of DEG mappings compared to other pathways were also reported. The findings from this work provide novel insights to better understand the transcriptional changes in roselle during calyx maturation, and the data made available here is intended for continued genetic study on roselle. The work is registered under NCBI Bioproject PRJNA664826. The raw sequencing reads are available in Short Read Archive with the accession numbers SRX9171161, SRX9171162, SRX9171163, SRX9171164, SRX9171165 and SRX9171166.
Heterotrigona itama is a species of stingless bee recently domesticated (or reared) for honey production in a few Southeast Asian countries namely Malaysia and Indonesia. Being categorized in the clade Corbiculata together with the honeybees (Apis spp.) and bumble bees (Bombus spp.), the stingless bees are highly social in which the colony members are subjected to labor division where a queen functions as the reproductive caste. In this data article, we provide a resource encompassing a transcriptome profile (de novo assembled) from H. itama queen larva - the first report of transcriptome assembly for this species. The generated data is pivotal for the characterization of important genes and biological pathways in order to further improve our understanding on the developmental biology, behavior, social structure and ecological needs of this eusocial hymenopteran insect from the molecular aspect. The raw RNA sequencing data is available at NCBI Sequence Read Archive (SAR) under the accession number SRP230250 and the assembled reads are deposited at DDBJ/EMBL/Genbank as Transcriptome Shotgun Assembly (TSA) under the accession GIIH00000000.
The spotted seahorse, Hippocampus kuda population is exponentially decreasing globally due to habitat loss contributed by massive coastal urbanization as well as its large exploitation for Chinese herbal medicine. Genomic data would be highly useful to improve biomonitoring of seahorse populations in Malaysia via the usage of non-invasive approaches such as water environmental DNA. Here we report the first complete mitogenome of two H. kuda individuals originating from Malaysia, generated using BGISEQ-500RS sequencer. The lengths of both mitogenomes are 16,529bp, consisting of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a control region. The overall base composition was 32.46% for A, 29.40% for T, 14.73% for G and 23.41% for C with AT rich features (61.86%). The gene organization of Malaysian H. kuda were similar to that of most teleost species. A phylogenetic analysis of the genome against mtDNA data from other Hippocampus species showed that Malaysian H. kuda samples clustered with H. capensis, H. reidi and H. kuda. Notably however, analysis of the data using BLASTn revealed they had 99.18% similarity to H. capensis, and only 97.66% to H. kuda and H. reidi, which are all part of the unresolved H. kuda complex. The mitogenomes are deposited in Genbank under the accession number MT221436 (HK1) and MT221436 (HK2).
Periplaneta americana is a cosmopolitan pest cockroach endemic to tropical and subtropical climates. It occurs frequently in urban sewer and wastewater system and transit in human proximities, spreading pathogens that causes serious public health concerns such as asthma, allergies, and others. By using the Next-generation Sequencing (NGS) known as Illumina NovaSeq 6000, this article documents for the draft genome data set of P. americana collected in Penang Island, Malaysia. This article displays the pair-end 150 bp genome dataset and results on the sequence quality. This genome dataset presents the information for further understanding of P. americana populations at molecular level and the opportunity to develop effective control and management strategies for the species. This dataset is available under Sequence Read Archive (SRA) databases with the SRR23867103.
The datasets designated in this article were obtained as a result of 10 months of data collection, carried out between august 2017 and May 2018 in Kuala Lumpur, Malaysia. The aim of study was to evaluate the factors Influencing adoption of battery electric vehicles (BEVs) in Malaysia. The data were collected from University Technology Malaysia (Kuala Lumpur campus) and five different companies in Kuala Lumpur. A total of 500 questionnaires were distributed and 322 questionnaires were obtained. After data screening, 312 datasets were analysed. Respondents of the study were postgraduate students, university lecturers and top to low level managers.
This dataset describes two main variables, technological pedagogical and content knowledge (TPACK) and Beliefs on ICT, which may affect pre-service teachers' (PSTs) use of ICT (UICT) during teaching practices. TPACK assumes that PSTs should actively combine some domains of knowledge to design good quality of ICT-integrated courses lessons. Beliefs on ICT in this study consist of a mix of behavioral, normative, and control beliefs on ICT integration in education. In addition, UICT is defined as ICT used by PSTs during teaching practices. Three approaches were applied for the purification of the dataset; development of instruments, survey, and exploratory factor analysis (EFA). The dataset consists of demographic information, TPACK, Beliefs on ICT, and UICT. The dataset is beneficial to teacher educators in designing effective programs that best nurture PSTs' UICT during teaching practices. Researchers sharing similar sample characteristics in developing countries may adapt this dataset for more rigorous statistical analyses.
This study intended to validate customer inspiration (CI)in Malaysian/developing country context. Data were collected from two different respondents for two studies - from Millennial customers of the auto industry and Generation Z customers of the smartphone industry. The survey conducted through a standardized and structured questionnaire. The variables of the both studies were customer-defined market orientation (MO) (customer orientation, competitor orientation, and interfunctional coordination), CI (inspired-by and inspired-to), and customer loyalty (CL). This research strategy, in terms of quantity, is descriptive and correlational. Statistical analysis of the data was carried out, using ADANCO 2.0. The finding of the study suggests all results of data 1 and data 2 were significant, and CI mediates the sub-constructs of MO with CL.
This dataset contains demographic information of 355 respondents and a validated 32-items Stages of Concerns Questionnaire (SoCQ). The SoCQ questionnaire was developed based on the Concerns-Based Adoption Model (CBAM) which measures seven stages of concerns as the variables. They are unconcerned, informational, personal, management, consequence, collaboration and refocusing. The data was firstly tested with normality, followed by validity checking using confirmatory factor analysis (CFA). It is useful for policy makers and stakeholders to have a thorough understanding about teachers' concerns on the use of the e-learning platform and thus, design suitable interventions to smoothen the adoption process of using the technology. This set of data could be used in a multi-racial developing country for more complex analyses.
Rhynchophorus ferrugineus or red palm weevil (RPW) is a destructive insect pest of major cultivated palms such as coconut, date and oil palm. One of the control management of RPW is trunk injection using monocrotophos or methamidophos, but these chemicals are found to affect ecosystems and human health. Thus, we aimed to determine a bio-pesticide to replace these synthetic chemicals. We tested the antifeedant activity of three eugenol-based compounds as potential control agent against RPW larvae in vitro condition for two weeks. All these compounds show significant effect as feeding deterrent agent on 4th instar larvae, while WN16 (4-allyl-2-methoxy-1-(4-trifluoromethyl-benzyloxy)-benzene) shows the highest feeding deterrent index (FDI = 64.42%). Here we present the data regarding the biological aspect on treated RPW larvae as well as antifeedant activity index of these eugenol derived compounds.
This article presents data on factors that depict [pre-service] teacher beliefs about educational research. A cross-sectional survey was used to collect the data that involved 352 final-year students working on their undergraduate theses. The students were registered in the faculty of teacher training and education in one of the private universities with an A (excellent) accreditation in Jakarta, Indonesia. The participants were selected conveniently. Exploratory factor analysis was performed to reveal the factors comprised in the data. 49 out of 72 items formed five factors: negative views about educational research, positive views about educational research, knowledge about educational research, open-mindedness, and accessibility.
Here, we present the complete mitochondrial genome of Pachliopta aristolochiae, a Common Rose butterfly from Malaysia. The sequence was generated using Illumina NovaSeq 6000 sequencing platform. The mitogenome is 15,235bp long, consisting of 13 protein-coding genes, 22 transfer RNAs, two ribosomal RNAs, and two D-loop regions. The total base composition was (81.6%), with A (39.3%), T (42.3%), C (11.0%) and G (7.3%). The gene order of the three tRNAs was trnM-trnI-trnQ, which differs from the ancestral insect gene order trnI-trnQ-trnM. Phylogenetic tree analysis revealed that the sequenced Pachliopta aristolochiae in this data is closely related to Losaria neptunus (NC 037868), with highly supported ML and BI analysis. The data presented in this work can provide useful resources for other researchers to study deeper into the phylogenetic relationships of Lepidoptera and the diversification of the Pachliopta species. Also, as one of the bioindicator species, this data can be used to assess environmental changes in the terrestrial and aquatic ecosystem via enviromental DNA approahes. The mitogenome of Pachliopta aristolochiae is available in GenBank under the accession number MZ781228.