MyMedR

Displaying publications 81 - 100 of 325 in total

Abstract:

Sort:

A novel classical machine learning framework for early sepsis prediction using electronic health record data from ICU patients

Prithula J, Islam KR, Kumar J, Tan TL, Reaz MBI, Rahman T, et al.

Comput Biol Med, 2025 Jan;184:109284.
PMID: 39579661 DOI: 10.1016/j.compbiomed.2024.109284

Sepsis, a life-threatening condition triggered by the body's response to infection, remains a significant global health challenge, annually affecting millions in the United States alone with substantial mortality and healthcare costs. Early prediction of sepsis is critical for timely intervention and improved patient outcomes. This study introduces an innovative predictive model leveraging machine learning techniques and a specific data-splitting approach on highly imbalanced electronic health records (EHRs). Using PhysioNet/CinC Challenge 2019 data from 40,336 patients, including vital signs, lab values, and demographics. Preliminary assessments using classical and stacked ML models with Synthetic Minority Oversampling Technique (SMOTE) augmentation were conducted, showing improved performance. It is found that stacking ML models enhances overall accuracy but faces limitations in precision, recall, and F1 score for positive class prediction. A novel data-splitting approach with 5-fold cross-validation and SMOTE and COPULA augmentation techniques demonstrated promise, with F1 scores ranging from 93 % to 94 % using the COPULA technique. COPULA excelled at predictions for different hours' onsets compared to the SMOTE technique. The proposed model outperformed existing studies, suggesting clinical viability for early sepsis prediction.

Matched MeSH terms: Machine Learning*
Fulltext Latest clinical frontiers related to autism diagnostic strategies

Cortese S, Bellato A, Gabellone A, Marzulli L, Matera E, Parlatini V, et al.

Cell Rep Med, 2025 Feb 18;6(2):101916.
PMID: 39879991 DOI: 10.1016/j.xcrm.2024.101916

The diagnosis of autism is currently based on the developmental history, direct observation of behavior, and reported symptoms, supplemented by rating scales/interviews/structured observational evaluations-which is influenced by the clinician's knowledge and experience-with no established diagnostic biomarkers. A growing body of research has been conducted over the past decades to improve diagnostic accuracy. Here, we provide an overview of the current diagnostic assessment process as well as of recent and ongoing developments to support diagnosis in terms of genetic evaluation, telemedicine, digital technologies, use of machine learning/artificial intelligence, and research on candidate diagnostic biomarkers. Genetic testing can meaningfully contribute to the assessment process, but caution is required when interpreting negative results, and more work is needed to strengthen the transferability of genetic information into clinical practice. Digital diagnostic and machine-learning-based analyses are emerging as promising approaches, but larger and more robust studies are needed. To date, there are no available diagnostic biomarkers. Moving forward, international collaborations may help develop multimodal datasets to identify biomarkers, ensure reproducibility, and support clinical translation.

Matched MeSH terms: Machine Learning*
Fulltext An efficient detection of Sinkhole attacks using machine learning: Impact on energy and security

Hasan MZ, Hanapi ZM, Zukarnain ZA, Huyop FH, Abdullah MDH

PLoS One, 2025;20(3):e0309532.
PMID: 40096085 DOI: 10.1371/journal.pone.0309532

In the realm of Wireless Sensor Networks (WSNs), the detection and mitigation of sinkhole attacks remain pivotal for ensuring network integrity and efficiency. This paper introduces SFlexCrypt, an innovative approach tailored to address these security challenges while optimizing energy consumption in WSNs. SFlexCrypt stands out by seamlessly integrating advanced machine learning algorithms to achieve high-precision detection and effective mitigation of sinkhole attacks. Employing a dataset from Contiki-Cooja, SFlexCrypt has been rigorously tested, demonstrating a detection accuracy of 100% and a mitigation rate of 97.31%. This remarkable performance not only bolsters network security but also significantly extends network longevity and reduces energy expenditure, crucial factors in the sustainability of WSNs. The study contributes substantially to the field of IoT security, offering a comprehensive and efficient framework for implementing Internet-based security strategies. The results affirm that SFlexCrypt is a robust solution, capable of enhancing the resilience of WSNs against sinkhole attacks while maintaining optimal energy efficiency.

Matched MeSH terms: Machine Learning*
Prediction of recombinant protein overexpression in Escherichia coli using a machine learning based model (RPOLP)

Habibi N, Norouzi A, Mohd Hashim SZ, Shamsir MS, Samian R

Comput Biol Med, 2015 Nov 1;66:330-6.
PMID: 26476414 DOI: 10.1016/j.compbiomed.2015.09.015

Recombinant protein overexpression, an important biotechnological process, is ruled by complex biological rules which are mostly unknown, is in need of an intelligent algorithm so as to avoid resource-intensive lab-based trial and error experiments in order to determine the expression level of the recombinant protein. The purpose of this study is to propose a predictive model to estimate the level of recombinant protein overexpression for the first time in the literature using a machine learning approach based on the sequence, expression vector, and expression host. The expression host was confined to Escherichia coli which is the most popular bacterial host to overexpress recombinant proteins. To provide a handle to the problem, the overexpression level was categorized as low, medium and high. A set of features which were likely to affect the overexpression level was generated based on the known facts (e.g. gene length) and knowledge gathered from related literature. Then, a representative sub-set of features generated in the previous objective was determined using feature selection techniques. Finally a predictive model was developed using random forest classifier which was able to adequately classify the multi-class imbalanced small dataset constructed. The result showed that the predictive model provided a promising accuracy of 80% on average, in estimating the overexpression level of a recombinant protein.

Matched MeSH terms: Machine Learning
Integration of machine learning-based prediction for enhanced Model's generalization: Application in photocatalytic polishing of palm oil mill effluent (POME)

Ng KH, Gan YS, Cheng CK, Liu KH, Liong ST

Environ Pollut, 2020 Dec;267:115500.
PMID: 33254722 DOI: 10.1016/j.envpol.2020.115500

In predicting palm oil mill effluent (POME) degradation efficiency, previous developed quadratic model quantitatively evaluated the effects of O2 flowrate, TiO2 loadings and initial concentration of POME in labscale photocatalytic system, which however suffered from low generalization due to the overfitting behaviour. Evidently, high RMSE (131.61) and low R2 (-630.49) obtained indicates its insufficiency in describing POME degradation at unseen factor ranges, hence verified the fact of poor generalization. To overcome this issue, several models were developed via machine learning-assisted techniques, namely Gaussian Process Regression (GPR), Linear Regression (LR), Decision Tree (DT), Supported Vector Machine (SVM) and Regression Tree Ensemble (RTE), subsequently being assessed systematically. To achieve high generalization, all models were subjected to 'train-all-test-all' strategy, 5-fold and 10-fold cross validation. Specifically, GPR model was furnished with high accuracy in 'train-all-test-all' strategy, judging from its low RMSE (1.0394) and high R2 (0.9962), which however menaced by the risk of overfitting. In contrast, despite relatively poorer RMSE and R2 (1.7964 and 0.9886) obtained in 5-fold cross validation, GPR model was rendered with highest generalization, while sufficiently preserving its accuracy in development process. Besides, SVM and RTE models were also demonstrated promising R2 (0.9372 and 0.9208), which however shadowed by their high RMSEs (4.2174 and 4.7366). Furthermore, the extraordinary generalization of GPR model was coincidentally verified in 10-fold cross validation. The lowest RMSE (2.1624) and highest R2 (0.9835) obtained with feature number of 36 asserted its sufficiency in both generalization and accuracy prospect. Other models were all rendered with slight lower R2 (> 0.9), plausibly due to the higher RMSE (> 4.0). According to GPR model, optimized POME degradation (52.52%) can be obtained at 70 mL/min of O2, 70.0 g/L of TiO2 and 250 ppm of POME concentration, with only ∼3% error as compared to the actual data.

Matched MeSH terms: Machine Learning
Fulltext 3D texture-based face recognition system using fine-tuned deep residual networks

Zheng S, Rahmat RWO, Khalid F, Nasharuddin NA

PeerJ Comput Sci, 2019;5:e236.
PMID: 33816889 DOI: 10.7717/peerj-cs.236

As the technology for 3D photography has developed rapidly in recent years, an enormous amount of 3D images has been produced, one of the directions of research for which is face recognition. Improving the accuracy of a number of data is crucial in 3D face recognition problems. Traditional machine learning methods can be used to recognize 3D faces, but the face recognition rate has declined rapidly with the increasing number of 3D images. As a result, classifying large amounts of 3D image data is time-consuming, expensive, and inefficient. The deep learning methods have become the focus of attention in the 3D face recognition research. In our experiment, the end-to-end face recognition system based on 3D face texture is proposed, combining the geometric invariants, histogram of oriented gradients and the fine-tuned residual neural networks. The research shows that when the performance is evaluated by the FRGC-v2 dataset, as the fine-tuned ResNet deep neural network layers are increased, the best Top-1 accuracy is up to 98.26% and the Top-2 accuracy is 99.40%. The framework proposed costs less iterations than traditional methods. The analysis suggests that a large number of 3D face data by the proposed recognition framework could significantly improve recognition decisions in realistic 3D face scenarios.

Matched MeSH terms: Machine Learning
Fulltext Classification of botnet attacks in IoT smart factory using honeypot combined with machine learning

Lee S, Abdullah A, Jhanjhi N, Kok S

PeerJ Comput Sci, 2021;7:e350.
PMID: 33817000 DOI: 10.7717/peerj-cs.350

The Industrial Revolution 4.0 began with the breakthrough technological advances in 5G, and artificial intelligence has innovatively transformed the manufacturing industry from digitalization and automation to the new era of smart factories. A smart factory can do not only more than just produce products in a digital and automatic system, but also is able to optimize the production on its own by integrating production with process management, service distribution, and customized product requirement. A big challenge to the smart factory is to ensure that its network security can counteract with any cyber attacks such as botnet and Distributed Denial of Service, They are recognized to cause serious interruption in production, and consequently economic losses for company producers. Among many security solutions, botnet detection using honeypot has shown to be effective in some investigation studies. It is a method of detecting botnet attackers by intentionally creating a resource within the network with the purpose of closely monitoring and acquiring botnet attacking behaviors. For the first time, a proposed model of botnet detection was experimented by combing honeypot with machine learning to classify botnet attacks. A mimicking smart factory environment was created on IoT device hardware configuration. Experimental results showed that the model performance gave a high accuracy of above 96%, with very fast time taken of just 0.1 ms and false positive rate at 0.24127 using random forest algorithm with Weka machine learning program. Hence, the honeypot combined machine learning model in this study was proved to be highly feasible to apply in the security network of smart factory to detect botnet attacks.

Matched MeSH terms: Machine Learning
Fulltext Automatic COVID-19 Detection Using Exemplar Hybrid Deep Features with X-ray Images

Barua PD, Muhammad Gowdh NF, Rahmat K, Ramli N, Ng WL, Chan WY, et al.

Int J Environ Res Public Health, 2021 07 29;18(15).
PMID: 34360343 DOI: 10.3390/ijerph18158052

COVID-19 and pneumonia detection using medical images is a topic of immense interest in medical and healthcare research. Various advanced medical imaging and machine learning techniques have been presented to detect these respiratory disorders accurately. In this work, we have proposed a novel COVID-19 detection system using an exemplar and hybrid fused deep feature generator with X-ray images. The proposed Exemplar COVID-19FclNet9 comprises three basic steps: exemplar deep feature generation, iterative feature selection and classification. The novelty of this work is the feature extraction using three pre-trained convolutional neural networks (CNNs) in the presented feature extraction phase. The common aspects of these pre-trained CNNs are that they have three fully connected layers, and these networks are AlexNet, VGG16 and VGG19. The fully connected layer of these networks is used to generate deep features using an exemplar structure, and a nine-feature generation method is obtained. The loss values of these feature extractors are computed, and the best three extractors are selected. The features of the top three fully connected features are merged. An iterative selector is used to select the most informative features. The chosen features are classified using a support vector machine (SVM) classifier. The proposed COVID-19FclNet9 applied nine deep feature extraction methods by using three deep networks together. The most appropriate deep feature generation model selection and iterative feature selection have been employed to utilise their advantages together. By using these techniques, the image classification ability of the used three deep networks has been improved. The presented model is developed using four X-ray image corpora (DB1, DB2, DB3 and DB4) with two, three and four classes. The proposed Exemplar COVID-19FclNet9 achieved a classification accuracy of 97.60%, 89.96%, 98.84% and 99.64% using the SVM classifier with 10-fold cross-validation for four datasets, respectively. Our developed Exemplar COVID-19FclNet9 model has achieved high classification accuracy for all four databases and may be deployed for clinical application.

Matched MeSH terms: Machine Learning
Automatic colonic polyp detection using integration of modified deep residual convolutional neural network and ensemble learning approaches

Liew WS, Tang TB, Lin CH, Lu CK

Comput Methods Programs Biomed, 2021 Jul;206:106114.
PMID: 33984661 DOI: 10.1016/j.cmpb.2021.106114

BACKGROUND AND OBJECTIVE: The increased incidence of colorectal cancer (CRC) and its mortality rate have attracted interest in the use of artificial intelligence (AI) based computer-aided diagnosis (CAD) tools to detect polyps at an early stage. Although these CAD tools have thus far achieved a good accuracy level to detect polyps, they still have room to improve further (e.g. sensitivity). Therefore, a new CAD tool is developed in this study to detect colonic polyps accurately.
METHODS: In this paper, we propose a novel approach to distinguish colonic polyps by integrating several techniques, including a modified deep residual network, principal component analysis and AdaBoost ensemble learning. A powerful deep residual network architecture, ResNet-50, was investigated to reduce the computational time by altering its architecture. To keep the interference to a minimum, median filter, image thresholding, contrast enhancement, and normalisation techniques were exploited on the endoscopic images to train the classification model. Three publicly available datasets, i.e., Kvasir, ETIS-LaribPolypDB, and CVC-ClinicDB, were merged to train the model, which included images with and without polyps.
RESULTS: The proposed approach trained with a combination of three datasets achieved Matthews Correlation Coefficient (MCC) of 0.9819 with accuracy, sensitivity, precision, and specificity of 99.10%, 98.82%, 99.37%, and 99.38%, respectively.
CONCLUSIONS: These results show that our method could repeatedly classify endoscopic images automatically and could be used to effectively develop computer-aided diagnostic tools for early CRC detection.

Matched MeSH terms: Machine Learning
Fulltext Identification of significant climatic risk factors and machine learning models in dengue outbreak prediction

Yavari Nejad F, Varathan KD

BMC Med Inform Decis Mak, 2021 04 30;21(1):141.
PMID: 33931058 DOI: 10.1186/s12911-021-01493-y

BACKGROUND: Dengue fever is a widespread viral disease and one of the world's major pandemic vector-borne infections, causing serious hazard to humanity. The World Health Organisation (WHO) reported that the incidence of dengue fever has increased dramatically across the world in recent decades. WHO currently estimates an annual incidence of 50-100 million dengue infections worldwide. To date, no tested vaccine or treatment is available to stop or prevent dengue fever. Thus, the importance of predicting dengue outbreaks is significant. The current issue that should be addressed in dengue outbreak prediction is accuracy. A limited number of studies have conducted an in-depth analysis of climate factors in dengue outbreak prediction.
METHODS: The most important climatic factors that contribute to dengue outbreaks were identified in the current work. Correlation analyses were performed in order to determine these factors and these factors were used as input parameters for machine learning models. Top five machine learning classification models (Bayes network (BN) models, support vector machine (SVM), RBF tree, decision table and naive Bayes) were chosen based on past research. The models were then tested and evaluated on the basis of 4-year data (January 2010 to December 2013) collected in Malaysia.
RESULTS: This research has two major contributions. A new risk factor, called the TempeRain factor (TRF), was identified and used as an input parameter for the model of dengue outbreak prediction. Moreover, TRF was applied to demonstrate its strong impact on dengue outbreaks. Experimental results showed that the Bayes Network model with the new meteorological risk factor identified in this study increased accuracy to 92.35% for predicting dengue outbreaks.
CONCLUSIONS: This research explored the factors used in dengue outbreak prediction systems. The major contribution of this study is identifying new significant factors that contribute to dengue outbreak prediction. From the evaluation result, we obtained a significant improvement in the accuracy of a machine learning model for dengue outbreak prediction.

Matched MeSH terms: Machine Learning
Fulltext Using SVMs for classification of cross-document relationships

Kumar, Yogan Jaya, Naomie Salim, Ahmed Hamza Osman, Abuobieda, Albaraa

Pertanika Journal of Science & Technology, 2013;21(1):239-246.
MyJurnal

Cross-document Structure Theory (CST) has recently been proposed to facilitate tasks related to multidocument analysis. Classifying and identifying the CST relationships between sentences across topically related documents have since been proven as necessary. However, there have not been sufficient studies presented in literature to automatically identify these CST relationships. In this study, a supervised machine learning technique, i.e. Support Vector Machines (SVMs), was applied to identify four types of CST relationships, namely “Identity”, “Overlap”, “Subsumption”, and “Description” on the datasets obtained from CSTBank corpus. The performance of the SVMs classification was measured using Precision, Recall and F-measure. In addition, the results obtained using SVMs were also compared with those from the previous literature using boosting classification algorithm. It was found that SVMs yielded better results in classifying the four CST relationships.

Matched MeSH terms: Supervised Machine Learning
Fulltext Empirical investigation of feature sets effectiveness in product review sentiment classification

Nurfadhlina Mohd Sharef, Rozilah Rosli

Pertanika Journal of Science & Technology, 2017;25(106):125-132.
MyJurnal

Sentiment analysis classification has been typically performed by combining features that represent the dataset at hand. Existing works have employed various features individually such as the syntactical, lexical and machine learning, and some have hybridized to reach optimistic results. Since the debate on the best combination is still unresolved this paper addresses the empirical investigation of the combination of features for product review classification. Results indicate the Support Vector Machine classification model combined with any of the observed lexicon namely MPQA, BingLiu and General Inquirer and either the unigram or inte-gration of unigram and bigram features is the top performer.

Matched MeSH terms: Machine Learning
Arrhythmia detection using deep convolutional neural network with long duration ECG signals

Yıldırım Ö, Pławiak P, Tan RS, Acharya UR

Comput Biol Med, 2018 11 01;102:411-420.
PMID: 30245122 DOI: 10.1016/j.compbiomed.2018.09.009

This article presents a new deep learning approach for cardiac arrhythmia (17 classes) detection based on long-duration electrocardiography (ECG) signal analysis. Cardiovascular disease prevention is one of the most important tasks of any health care system as about 50 million people are at risk of heart disease in the world. Although automatic analysis of ECG signal is very popular, current methods are not satisfactory. The goal of our research was to design a new method based on deep learning to efficiently and quickly classify cardiac arrhythmias. Described research are based on 1000 ECG signal fragments from the MIT - BIH Arrhythmia database for one lead (MLII) from 45 persons. Approach based on the analysis of 10-s ECG signal fragments (not a single QRS complex) is applied (on average, 13 times less classifications/analysis). A complete end-to-end structure was designed instead of the hand-crafted feature extraction and selection used in traditional methods. Our main contribution is to design a new 1D-Convolutional Neural Network model (1D-CNN). The proposed method is 1) efficient, 2) fast (real-time classification) 3) non-complex and 4) simple to use (combined feature extraction and selection, and classification in one stage). Deep 1D-CNN achieved a recognition overall accuracy of 17 cardiac arrhythmia disorders (classes) at a level of 91.33% and classification time per single sample of 0.015 s. Compared to the current research, our results are one of the best results to date, and our solution can be implemented in mobile devices and cloud computing.

Matched MeSH terms: Machine Learning
Fulltext Knowledge Preserving OSELM Model for Wi-Fi-Based Indoor Localization

Al-Khaleefa AS, Ahmad MR, Isa AAM, Esa MRM, Aljeroudi Y, Jubair MA, et al.

Sensors (Basel), 2019 May 25;19(10).
PMID: 31130657 DOI: 10.3390/s19102397

Wi-Fi has shown enormous potential for indoor localization because of its wide utilization and availability. Enabling the use of Wi-Fi for indoor localization necessitates the construction of a fingerprint and the adoption of a learning algorithm. The goal is to enable the use of the fingerprint in training the classifiers for predicting locations. Existing models of machine learning Wi-Fi-based localization are brought from machine learning and modified to accommodate for practical aspects that occur in indoor localization. The performance of these models varies depending on their effectiveness in handling and/or considering specific characteristics and the nature of indoor localization behavior. One common behavior in the indoor navigation of people is its cyclic dynamic nature. To the best of our knowledge, no existing machine learning model for Wi-Fi indoor localization exploits cyclic dynamic behavior for improving localization prediction. This study modifies the widely popular online sequential extreme learning machine (OSELM) to exploit cyclic dynamic behavior for achieving improved localization results. Our new model is called knowledge preserving OSELM (KP-OSELM). Experimental results conducted on the two popular datasets TampereU and UJIndoorLoc conclude that KP-OSELM outperforms benchmark models in terms of accuracy and stability. The last achieved accuracy was 92.74% for TampereU and 72.99% for UJIndoorLoc.

Matched MeSH terms: Machine Learning
Fulltext Performance of SVM with multiple kernel learning for classification tasks of imbalanced datasets

Saeed, Sana, Ong, Hong Choon

Pertanika Journal of Science & Technology, 2019;27(1):527-545.
MyJurnal

Support vector machine (SVM) is one of the most popular algorithms in machine learning
and data mining. However, its reduced efficiency is usually observed for imbalanced
datasets. To improve the performance of SVM for binary imbalanced datasets, a new scheme
based on oversampling and the hybrid algorithm were introduced. Besides the use of a
single kernel function, SVM was applied with multiple kernel learning (MKL). A weighted
linear combination was defined based on the linear kernel function, radial basis function
(RBF kernel), and sigmoid kernel function for MKL. By generating the synthetic samples
in the minority class, searching the best choices of the SVM parameters and identifying
the weights of MKL by minimizing the objective function, the improved performance of
SVM was observed. To prove the strength of the proposed scheme, an experimental study,
including noisy borderline and real imbalanced datasets was conducted. SVM was applied
with linear kernel function, RBF kernel, sigmoid kernel function and MKL on all datasets.
The performance of SVM with all kernel functions was evaluated by using sensitivity,
G Mean, and F measure. A significantly improved performance of SVM with MKL was
observed by applying the proposed scheme.

Matched MeSH terms: Machine Learning
Computer Vision and Machine Learning Analysis of Commercial Rice Grains: A Potential Digital Approach for Consumer Perception Studies

Aznan A, Gonzalez Viejo C, Pang A, Fuentes S

Sensors (Basel), 2021 Sep 23;21(19).
PMID: 34640673 DOI: 10.3390/s21196354

Rice quality assessment is essential for meeting high-quality standards and consumer demands. However, challenges remain in developing cost-effective and rapid techniques to assess commercial rice grain quality traits. This paper presents the application of computer vision (CV) and machine learning (ML) to classify commercial rice samples based on dimensionless morphometric parameters and color parameters extracted using CV algorithms from digital images obtained from a smartphone camera. The artificial neural network (ANN) model was developed using nine morpho-colorimetric parameters to classify rice samples into 15 commercial rice types. Furthermore, the ANN models were deployed and evaluated on a different imaging system to simulate their practical applications under different conditions. Results showed that the best classification accuracy was obtained using the Bayesian Regularization (BR) algorithm of the ANN with ten hidden neurons at 91.6% (MSE = <0.01) and 88.5% (MSE = 0.01) for the training and testing stages, respectively, with an overall accuracy of 90.7% (Model 2). Deployment also showed high accuracy (93.9%) in the classification of the rice samples. The adoption by the industry of rapid, reliable, and accurate methods, such as those presented here, may allow the incorporation of different morpho-colorimetric traits in rice with consumer perception studies.

Matched MeSH terms: Machine Learning
Fulltext Rapid Detection of Fraudulent Rice Using Low-Cost Digital Sensing Devices and Machine Learning

Aznan A, Gonzalez Viejo C, Pang A, Fuentes S

Sensors (Basel), 2022 Nov 09;22(22).
PMID: 36433249 DOI: 10.3390/s22228655

Rice fraud is one of the common threats to the rice industry. Conventional methods to detect rice adulteration are costly, time-consuming, and tedious. This study proposes the quantitative prediction of rice adulteration levels measured through the packaging using a handheld near-infrared (NIR) spectrometer and electronic nose (e-nose) sensors measuring directly on samples and paired with machine learning (ML) algorithms. For these purposes, the samples were prepared by mixing rice at different ratios from 0% to 100% with a 10% increment based on the rice's weight, consisting of (i) rice from different origins, (ii) premium with regular rice, (iii) aromatic with non-aromatic, and (iv) organic with non-organic rice. Multivariate data analysis was used to explore the sample distribution and its relationship with the e-nose sensors for parameter engineering before ML modeling. Artificial neural network (ANN) algorithms were used to predict the adulteration levels of the rice samples using the e-nose sensors and NIR absorbances readings as inputs. Results showed that both sensing devices could detect rice adulteration at different mixing ratios with high correlation coefficients through direct (e-nose; R = 0.94-0.98) and non-invasive measurement through the packaging (NIR; R = 0.95-0.98). The proposed method uses low-cost, rapid, and portable sensing devices coupled with ML that have shown to be reliable and accurate to increase the efficiency of rice fraud detection through the rice production chain.

Matched MeSH terms: Machine Learning
Fulltext Machine Learning-Based Epileptic Seizure Detection Methods Using Wavelet and EMD-Based Decomposition Techniques: A Review

Thangarajoo RG, Reaz MBI, Srivastava G, Haque F, Ali SHM, Bakar AAA, et al.

Sensors (Basel), 2021 Dec 20;21(24).
PMID: 34960577 DOI: 10.3390/s21248485

Epileptic seizures are temporary episodes of convulsions, where approximately 70 percent of the diagnosed population can successfully manage their condition with proper medication and lead a normal life. Over 50 million people worldwide are affected by some form of epileptic seizures, and their accurate detection can help millions in the proper management of this condition. Increasing research in machine learning has made a great impact on biomedical signal processing and especially in electroencephalogram (EEG) data analysis. The availability of various feature extraction techniques and classification methods makes it difficult to choose the most suitable combination for resource-efficient and correct detection. This paper intends to review the relevant studies of wavelet and empirical mode decomposition-based feature extraction techniques used for seizure detection in epileptic EEG data. The articles were chosen for review based on their Journal Citation Report, feature selection methods, and classifiers used. The high-dimensional EEG data falls under the category of '3N' biosignals-nonstationary, nonlinear, and noisy; hence, two popular classifiers, namely random forest and support vector machine, were taken for review, as they are capable of handling high-dimensional data and have a low risk of over-fitting. The main metrics used are sensitivity, specificity, and accuracy; hence, some papers reviewed were excluded due to insufficient metrics. To evaluate the overall performances of the reviewed papers, a simple mean value of all metrics was used. This review indicates that the system that used a Stockwell transform wavelet variant as a feature extractor and SVM classifiers led to a potentially better result.

Matched MeSH terms: Machine Learning
A novel machine learning scheme for classification of medicinal herbs based on 2D-FTIR fingerprints

Yoon TL, Yeap ZQ, Tan CS, Chen Y, Chen J, Yam MF

Spectrochim Acta A Mol Biomol Spectrosc, 2022 Feb 05;266:120440.
PMID: 34627017 DOI: 10.1016/j.saa.2021.120440

A proof-of-concept medicinal herbs identification scheme using machine learning classifiers is proposed in the form of an automated computational package. The scheme makes use of two-dimensional correlation Fourier Transformed Infrared (FTIR) fingerprinting maps derived from the FTIR of raw herb spectra as digital input. The prototype package admits a collection of 11 machine learning classifiers to form a voting pool. A common set of oversampled dataset containing 5 different herbal classes is used to train the pool of classifiers on a one-verses-others manner. The collections of trained models, dubbed the voting classifiers, are deployed in a collective manner to cast their votes to support or against a given inference fingerprint whether it belongs to a particular class. By collecting the votes casted by all voting classifiers, a logically designed scoring system will select out the most probable guess of the identity of the inference fingerprint. The same scoring system is also capable of discriminating an inference fingerprint that does not belong to any of the classes the voting classifiers are trained for as the 'others' type. The proposed classification scheme is stress-tested to evaluate its performance and expected consistency. Our experimental runs show that, by and large, a satisfactory performance of the classification scheme of up to 90 % accuracy is achieved, providing a proof-of-concept viability that the proposed scheme is a feasible, practical, and convenient tool for herbal classification. The scheme is implemented in the form of a packaged Python code, dubbed the "Collective Voting" (CV) package, which is easily scalable, maintained and used in practice.

Matched MeSH terms: Machine Learning
Feature selection and risk prediction for patients with coronary artery disease using data mining

Md Idris N, Chiam YK, Varathan KD, Wan Ahmad WA, Chee KH, Liew YM

Med Biol Eng Comput, 2020 Dec;58(12):3123-3140.
PMID: 33155096 DOI: 10.1007/s11517-020-02268-9

Coronary artery disease (CAD) is an important cause of mortality across the globe. Early risk prediction of CAD would be able to reduce the death rate by allowing early and targeted treatments. In healthcare, some studies applied data mining techniques and machine learning algorithms on the risk prediction of CAD using patient data collected by hospitals and medical centers. However, most of these studies used all the attributes in the datasets which might reduce the performance of prediction models due to data redundancy. The objective of this research is to identify significant features to build models for predicting the risk level of patients with CAD. In this research, significant features were selected using three methods (i.e., Chi-squared test, recursive feature elimination, and Embedded Decision Tree). Synthetic Minority Over-sampling Technique (SMOTE) oversampling technique was implemented to address the imbalanced dataset issue. The prediction models were built based on the identified significant features and eight machine learning algorithms, utilizing Acute Coronary Syndrome (ACS) datasets provided by National Cardiovascular Disease Database (NCVD) Malaysia. The prediction models were evaluated and compared using six performance evaluation metrics, and the top-performing models have achieved AUC more than 90%. Graphical abstract.

Matched MeSH terms: Machine Learning

Filters

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links