OBJECTIVE: The objectives of the study are to explore the determinants, motives, and barriers to healthy eating behaviors in online communities and provide insight into YouTube video commenters' perceptions and sentiments of healthy eating through text mining techniques.
METHODS: This paper applied text mining techniques to identify and categorize meaningful healthy eating determinants. These determinants were then incorporated into hypothetically defined constructs that reflect their thematic and sentimental nature in order to test our proposed model using a variance-based structural equation modeling procedure.
RESULTS: With a dataset of 4654 comments extracted from YouTube videos in the context of Malaysia, we apply a text mining method to analyze the perceptions and behavior of healthy eating. There were 10 clusters identified with regard to food ingredients, food price, food choice, food portion, well-being, cooking, and culture in the concept of healthy eating. The structural equation modeling results show that clusters are positively associated with healthy eating with all P values less than .001, indicating a statistical significance of the study results. People hold complex and multifaceted beliefs about healthy eating in the context of YouTube videos. Fruits and vegetables are the epitome of healthy foods. Despite having a favorable perception of healthy eating, people may not purchase commonly recognized healthy food if it has a premium price. People associate healthy eating with weight concerns. Food taste, variety, and availability are identified as reasons why Malaysians cannot act on eating healthily.
CONCLUSIONS: This study offers significant value to the existing literature of health-related studies by investigating the rich and diverse social media data gleaned from YouTube. This research integrated text mining analytics with predictive modeling techniques to identify thematic constructs and analyze the sentiments of healthy eating.
METHODS: We use Non-linear Iterative Partial Least Squares to perform the data dimensionality reduction, Self-Organizing Map technique for clustering task and ensembles of Neuro-Fuzzy Inference System for predicting the hepatitis disease. We also use decision trees for the selection of most important features in the experimental dataset. We test our method on a real-world dataset and present our results in comparison with the latest results of previous studies.
RESULTS: The results of our analyses on the dataset demonstrated that our method performance is superior to the Neural Network, ANFIS, K-Nearest Neighbors and Support Vector Machine.
CONCLUSIONS: The method has potential to be used as an intelligent learning system for hepatitis disease diagnosis in the healthcare.