MyMedR

Displaying publications 41 - 60 of 269 in total

Abstract:

Sort:

Analysing the accuracy of machine learning techniques to develop an integrated influent time series model: case study of a sewage treatment plant, Malaysia

Ansari M, Othman F, Abunama T, El-Shafie A

Environ Sci Pollut Res Int, 2018 Apr;25(12):12139-12149.
PMID: 29455350 DOI: 10.1007/s11356-018-1438-z

The function of a sewage treatment plant is to treat the sewage to acceptable standards before being discharged into the receiving waters. To design and operate such plants, it is necessary to measure and predict the influent flow rate. In this research, the influent flow rate of a sewage treatment plant (STP) was modelled and predicted by autoregressive integrated moving average (ARIMA), nonlinear autoregressive network (NAR) and support vector machine (SVM) regression time series algorithms. To evaluate the models' accuracy, the root mean square error (RMSE) and coefficient of determination (R2) were calculated as initial assessment measures, while relative error (RE), peak flow criterion (PFC) and low flow criterion (LFC) were calculated as final evaluation measures to demonstrate the detailed accuracy of the selected models. An integrated model was developed based on the individual models' prediction ability for low, average and peak flow. An initial assessment of the results showed that the ARIMA model was the least accurate and the NAR model was the most accurate. The RE results also prove that the SVM model's frequency of errors above 10% or below - 10% was greater than the NAR model's. The influent was also forecasted up to 44 weeks ahead by both models. The graphical results indicate that the NAR model made better predictions than the SVM model. The final evaluation of NAR and SVM demonstrated that SVM made better predictions at peak flow and NAR fit well for low and average inflow ranges. The integrated model developed includes the NAR model for low and average influent and the SVM model for peak inflow.

Matched MeSH terms: Machine Learning*
Fulltext Ridge regression and its applications in genetic studies

Arashi M, Roozbeh M, Hamzah NA, Gasparini M

PLoS One, 2021;16(4):e0245376.
PMID: 33831027 DOI: 10.1371/journal.pone.0245376

With the advancement of technology, analysis of large-scale data of gene expression is feasible and has become very popular in the era of machine learning. This paper develops an improved ridge approach for the genome regression modeling. When multicollinearity exists in the data set with outliers, we consider a robust ridge estimator, namely the rank ridge regression estimator, for parameter estimation and prediction. On the other hand, the efficiency of the rank ridge regression estimator is highly dependent on the ridge parameter. In general, it is difficult to provide a satisfactory answer about the selection for the ridge parameter. Because of the good properties of generalized cross validation (GCV) and its simplicity, we use it to choose the optimum value of the ridge parameter. The GCV function creates a balance between the precision of the estimators and the bias caused by the ridge estimation. It behaves like an improved estimator of risk and can be used when the number of explanatory variables is larger than the sample size in high-dimensional problems. Finally, some numerical illustrations are given to support our findings.

Matched MeSH terms: Machine Learning*
Fulltext Improved accuracy and less fault prediction errors via modified sequential minimal optimization algorithm

Asim Shahid M, Alam MM, Mohd Su'ud M

PLoS One, 2023;18(4):e0284209.
PMID: 37053173 DOI: 10.1371/journal.pone.0284209

The benefits and opportunities offered by cloud computing are among the fastest-growing technologies in the computer industry. Additionally, it addresses the difficulties and issues that make more users more likely to accept and use the technology. The proposed research comprised of machine learning (ML) algorithms is Naïve Bayes (NB), Library Support Vector Machine (LibSVM), Multinomial Logistic Regression (MLR), Sequential Minimal Optimization (SMO), K Nearest Neighbor (KNN), and Random Forest (RF) to compare the classifier gives better results in accuracy and less fault prediction. In this research, the secondary data results (CPU-Mem Mono) give the highest percentage of accuracy and less fault prediction on the NB classifier in terms of 80/20 (77.01%), 70/30 (76.05%), and 5 folds cross-validation (74.88%), and (CPU-Mem Multi) in terms of 80/20 (89.72%), 70/30 (90.28%), and 5 folds cross-validation (92.83%). Furthermore, on (HDD Mono) the SMO classifier gives the highest percentage of accuracy and less fault prediction fault in terms of 80/20 (87.72%), 70/30 (89.41%), and 5 folds cross-validation (88.38%), and (HDD-Multi) in terms of 80/20 (93.64%), 70/30 (90.91%), and 5 folds cross-validation (88.20%). Whereas, primary data results found RF classifier gives the highest percentage of accuracy and less fault prediction in terms of 80/20 (97.14%), 70/30 (96.19%), and 5 folds cross-validation (95.85%) in the primary data results, but the algorithm complexity (0.17 seconds) is not good. In terms of 80/20 (95.71%), 70/30 (95.71%), and 5 folds cross-validation (95.71%), SMO has the second highest accuracy and less fault prediction, but the algorithm complexity is good (0.3 seconds). The difference in accuracy and less fault prediction between RF and SMO is only (.13%), and the difference in time complexity is (14 seconds). We have decided that we will modify SMO. Finally, the Modified Sequential Minimal Optimization (MSMO) Algorithm method has been proposed to get the highest accuracy & less fault prediction errors in terms of 80/20 (96.42%), 70/30 (96.42%), & 5 fold cross validation (96.50%).

Matched MeSH terms: Machine Learning*
Fulltext Extreme learning machine based optimal embedding location finder for image steganography

Atee HA, Ahmad R, Noor NM, Rahma AM, Aljeroudi Y

PLoS One, 2017;12(2):e0170329.
PMID: 28196080 DOI: 10.1371/journal.pone.0170329

In image steganography, determining the optimum location for embedding the secret message precisely with minimum distortion of the host medium remains a challenging issue. Yet, an effective approach for the selection of the best embedding location with least deformation is far from being achieved. To attain this goal, we propose a novel approach for image steganography with high-performance, where extreme learning machine (ELM) algorithm is modified to create a supervised mathematical model. This ELM is first trained on a part of an image or any host medium before being tested in the regression mode. This allowed us to choose the optimal location for embedding the message with best values of the predicted evaluation metrics. Contrast, homogeneity, and other texture features are used for training on a new metric. Furthermore, the developed ELM is exploited for counter over-fitting while training. The performance of the proposed steganography approach is evaluated by computing the correlation, structural similarity (SSIM) index, fusion matrices, and mean square error (MSE). The modified ELM is found to outperform the existing approaches in terms of imperceptibility. Excellent features of the experimental results demonstrate that the proposed steganographic approach is greatly proficient for preserving the visual information of an image. An improvement in the imperceptibility as much as 28% is achieved compared to the existing state of the art methods.

Matched MeSH terms: Machine Learning*
Fulltext Machine Learning-Based Performance Comparison to Diagnose Anterior Cruciate Ligament Tears

Awan MJ, Mohd Rahim MS, Salim N, Rehman A, Nobanee H

J Healthc Eng, 2022;2022:2550120.
PMID: 35444781 DOI: 10.1155/2022/2550120

In recent times, knee joint pains have become severe enough to make daily tasks difficult. Knee osteoarthritis is a type of arthritis and a leading cause of disability worldwide. The middle of the knee contains a vital portion, the anterior cruciate ligament (ACL). It is necessary to diagnose the ACL ruptured tears early to avoid surgery. The study aimed to perform a comparative analysis of machine learning models to identify the condition of three ACL tears. In contrast to previous studies, this study also considers imbalanced data distributions as machine learning techniques struggle to deal with this problem. The paper applied and analyzed four machine learning classification models, namely, random forest (RF), categorical boosting (Cat Boost), light gradient boosting machines (LGBM), and highly randomized classifier (ETC) on the balanced, structured dataset of ACL. After oversampling a hyperparameter adjustment, the above four models have achieved an average accuracy of 95.72%, 94.98%, 94.98%, and 98.26%. There are 2070 observations and eight features in the collection of three diagnosis ACL classes after oversampling. The area under curve value was approximately 0.998, respectively. Experiments were performed using twelve machine learning algorithms with imbalanced and balanced datasets. However, the accuracy of the imbalanced dataset has remained under 76% for all twelve models. After oversampling, the proposed model may contribute to the investigation of ACL tears on magnetic resonance imaging and other knee ligaments efficiently and automatically without involving radiologists.

Matched MeSH terms: Machine Learning
Modelling gully-erosion susceptibility in a semi-arid region, Iran: Investigation of applicability of certainty factor and maximum entropy models

Azareh A, Rahmati O, Rafiei-Sardooi E, Sankey JB, Lee S, Shahabi H, et al.

Sci Total Environ, 2019 Mar 10;655:684-696.
PMID: 30476849 DOI: 10.1016/j.scitotenv.2018.11.235

Gully erosion susceptibility mapping is a fundamental tool for land-use planning aimed at mitigating land degradation. However, the capabilities of some state-of-the-art data-mining models for developing accurate maps of gully erosion susceptibility have not yet been fully investigated. This study assessed and compared the performance of two different types of data-mining models for accurately mapping gully erosion susceptibility at a regional scale in Chavar, Ilam, Iran. The two methods evaluated were: Certainty Factor (CF), a bivariate statistical model; and Maximum Entropy (ME), an advanced machine learning model. Several geographic and environmental factors that can contribute to gully erosion were considered as predictor variables of gully erosion susceptibility. Based on an existing differential GPS survey inventory of gully erosion, a total of 63 eroded gullies were spatially randomly split in a 70:30 ratio for use in model calibration and validation, respectively. Accuracy assessments completed with the receiver operating characteristic curve method showed that the ME-based regional gully susceptibility map has an area under the curve (AUC) value of 88.6% whereas the CF-based map has an AUC of 81.8%. According to jackknife tests that were used to investigate the relative importance of predictor variables, aspect, distance to river, lithology and land use are the most influential factors for the spatial distribution of gully erosion susceptibility in this region of Iran. The gully erosion susceptibility maps produced in this study could be useful tools for land managers and engineers tasked with road development, urbanization and other future development.

Matched MeSH terms: Machine Learning
Fulltext Determining hypertensive patients' beliefs towards medication and associations with medication adherence using machine learning methods

Aziz F, Malek S, Mhd Ali A, Wong MS, Mosleh M, Milow P

PeerJ, 2020;8:e8286.
PMID: 32206445 DOI: 10.7717/peerj.8286

Background: This study assesses the feasibility of using machine learning methods such as Random Forests (RF), Artificial Neural Networks (ANN), Support Vector Regression (SVR) and Self-Organizing Feature Maps (SOM) to identify and determine factors associated with hypertensive patients' adherence levels. Hypertension is the medical term for systolic and diastolic blood pressure higher than 140/90 mmHg. A conventional medication adherence scale was used to identify patients' adherence to their prescribed medication. Using machine learning applications to predict precise numeric adherence scores in hypertensive patients has not yet been reported in the literature.
Methods: Data from 160 hypertensive patients from a tertiary hospital in Kuala Lumpur, Malaysia, were used in this study. Variables were ranked based on their significance to adherence levels using the RF variable importance method. The backward elimination method was then performed using RF to obtain the variables significantly associated with the patients' adherence levels. RF, SVR and ANN models were developed to predict adherence using the identified significant variables. Visualizations of the relationships between hypertensive patients' adherence levels and variables were generated using SOM.
Result: Machine learning models constructed using the selected variables reported RMSE values of 1.42 for ANN, 1.53 for RF, and 1.55 for SVR. The accuracy of the dichotomised scores, calculated based on a percentage of correctly identified adherence values, was used as an additional model performance measure, resulting in accuracies of 65% (ANN), 78% (RF) and 79% (SVR), respectively. The Wilcoxon signed ranked test reported that there was no significant difference between the predictions of the machine learning models and the actual scores. The significant variables identified from the RF variable importance method were educational level, marital status, General Overuse, monthly income, and Specific Concern.
Conclusion: This study suggests an effective alternative to conventional methods in identifying the key variables to understand hypertensive patients' adherence levels. This can be used as a tool to educate patients on the importance of medication in managing hypertension.

Matched MeSH terms: Machine Learning
Fulltext Short- and long-term mortality prediction after an acute ST-elevation myocardial infarction (STEMI) in Asians: A machine learning approach

Aziz F, Malek S, Ibrahim KS, Raja Shariff RE, Wan Ahmad WA, Ali RM, et al.

PLoS One, 2021;16(8):e0254894.
PMID: 34339432 DOI: 10.1371/journal.pone.0254894

BACKGROUND: Conventional risk score for predicting short and long-term mortality following an ST-segment elevation myocardial infarction (STEMI) is often not population specific.
OBJECTIVE: Apply machine learning for the prediction and identification of factors associated with short and long-term mortality in Asian STEMI patients and compare with a conventional risk score.
METHODS: The National Cardiovascular Disease Database for Malaysia registry, of a multi-ethnic, heterogeneous Asian population was used for in-hospital (6299 patients), 30-days (3130 patients), and 1-year (2939 patients) model development. 50 variables were considered. Mortality prediction was analysed using feature selection methods with machine learning algorithms and compared to Thrombolysis in Myocardial Infarction (TIMI) score. Invasive management of varying degrees was selected as important variables that improved mortality prediction.
RESULTS: Model performance using a complete and reduced variable produced an area under the receiver operating characteristic curve (AUC) from 0.73 to 0.90. The best machine learning model for in-hospital, 30 days, and 1-year outperformed TIMI risk score (AUC = 0.88, 95% CI: 0.846-0.910; vs AUC = 0.81, 95% CI:0.772-0.845, AUC = 0.90, 95% CI: 0.870-0.935; vs AUC = 0.80, 95% CI: 0.746-0.838, AUC = 0.84, 95% CI: 0.798-0.872; vs AUC = 0.76, 95% CI: 0.715-0.802, p < 0.0001 for all). TIMI score underestimates patients' risk of mortality. 90% of non-survival patients are classified as high risk (>50%) by machine learning algorithm compared to 10-30% non-survival patients by TIMI. Common predictors identified for short- and long-term mortality were age, heart rate, Killip class, fasting blood glucose, prior primary PCI or pharmaco-invasive therapy and diuretics. The final algorithm was converted into an online tool with a database for continuous data archiving for algorithm validation.
CONCLUSIONS: In a multi-ethnic population, patients with STEMI were better classified using the machine learning method compared to TIMI scoring. Machine learning allows for the identification of distinct factors in individual Asian populations for better mortality prediction. Ongoing continuous testing and validation will allow for better risk stratification and potentially alter management and outcomes in the future.

Matched MeSH terms: Machine Learning*
Computer Vision and Machine Learning Analysis of Commercial Rice Grains: A Potential Digital Approach for Consumer Perception Studies

Aznan A, Gonzalez Viejo C, Pang A, Fuentes S

Sensors (Basel), 2021 Sep 23;21(19).
PMID: 34640673 DOI: 10.3390/s21196354

Rice quality assessment is essential for meeting high-quality standards and consumer demands. However, challenges remain in developing cost-effective and rapid techniques to assess commercial rice grain quality traits. This paper presents the application of computer vision (CV) and machine learning (ML) to classify commercial rice samples based on dimensionless morphometric parameters and color parameters extracted using CV algorithms from digital images obtained from a smartphone camera. The artificial neural network (ANN) model was developed using nine morpho-colorimetric parameters to classify rice samples into 15 commercial rice types. Furthermore, the ANN models were deployed and evaluated on a different imaging system to simulate their practical applications under different conditions. Results showed that the best classification accuracy was obtained using the Bayesian Regularization (BR) algorithm of the ANN with ten hidden neurons at 91.6% (MSE = <0.01) and 88.5% (MSE = 0.01) for the training and testing stages, respectively, with an overall accuracy of 90.7% (Model 2). Deployment also showed high accuracy (93.9%) in the classification of the rice samples. The adoption by the industry of rapid, reliable, and accurate methods, such as those presented here, may allow the incorporation of different morpho-colorimetric traits in rice with consumer perception studies.

Matched MeSH terms: Machine Learning
Fulltext Rapid Detection of Fraudulent Rice Using Low-Cost Digital Sensing Devices and Machine Learning

Aznan A, Gonzalez Viejo C, Pang A, Fuentes S

Sensors (Basel), 2022 Nov 09;22(22).
PMID: 36433249 DOI: 10.3390/s22228655

Rice fraud is one of the common threats to the rice industry. Conventional methods to detect rice adulteration are costly, time-consuming, and tedious. This study proposes the quantitative prediction of rice adulteration levels measured through the packaging using a handheld near-infrared (NIR) spectrometer and electronic nose (e-nose) sensors measuring directly on samples and paired with machine learning (ML) algorithms. For these purposes, the samples were prepared by mixing rice at different ratios from 0% to 100% with a 10% increment based on the rice's weight, consisting of (i) rice from different origins, (ii) premium with regular rice, (iii) aromatic with non-aromatic, and (iv) organic with non-organic rice. Multivariate data analysis was used to explore the sample distribution and its relationship with the e-nose sensors for parameter engineering before ML modeling. Artificial neural network (ANN) algorithms were used to predict the adulteration levels of the rice samples using the e-nose sensors and NIR absorbances readings as inputs. Results showed that both sensing devices could detect rice adulteration at different mixing ratios with high correlation coefficients through direct (e-nose; R = 0.94-0.98) and non-invasive measurement through the packaging (NIR; R = 0.95-0.98). The proposed method uses low-cost, rapid, and portable sensing devices coupled with ML that have shown to be reliable and accurate to increase the efficiency of rice fraud detection through the rice production chain.

Matched MeSH terms: Machine Learning
Fulltext Bioactive Molecule Prediction Using Extreme Gradient Boosting

Babajide Mustapha I, Saeed F

Molecules, 2016 Jul 28;21(8).
PMID: 27483216 DOI: 10.3390/molecules21080983

Following the explosive growth in chemical and biological data, the shift from traditional methods of drug discovery to computer-aided means has made data mining and machine learning methods integral parts of today's drug discovery process. In this paper, extreme gradient boosting (Xgboost), which is an ensemble of Classification and Regression Tree (CART) and a variant of the Gradient Boosting Machine, was investigated for the prediction of biological activity based on quantitative description of the compound's molecular structure. Seven datasets, well known in the literature were used in this paper and experimental results show that Xgboost can outperform machine learning algorithms like Random Forest (RF), Support Vector Machines (LSVM), Radial Basis Function Neural Network (RBFN) and Naïve Bayes (NB) for the prediction of biological activities. In addition to its ability to detect minority activity classes in highly imbalanced datasets, it showed remarkable performance on both high and low diversity datasets.

Matched MeSH terms: Machine Learning
Machine learning approaches in diagnosing tuberculosis through biomarkers - A systematic review

Balakrishnan V, Kherabi Y, Ramanathan G, Paul SA, Tiong CK

Prog Biophys Mol Biol, 2023 May;179:16-25.
PMID: 36931609 DOI: 10.1016/j.pbiomolbio.2023.03.001

Biomarker-based tests may facilitate Tuberculosis (TB) diagnosis, accelerate treatment initiation, and thus improve outcomes. This review synthesizes the literature on biomarker-based detection for TB diagnosis using machine learning. The systematic review approach follows the PRISMA guideline. Articles were sought using relevant keywords from Web of Science, PubMed, and Scopus, resulting in 19 eligible studies after a meticulous screening. All the studies were found to have focused on the supervised learning approach, with Support Vector Machine (SVM) and Random Forest emerging as the top two algorithms, with the highest accuracy, sensitivity and specificity reported to be 97.0%, 99.2%, and 98.0%, respectively. Further, protein-based biomarkers were widely explored, followed by gene-based such as RNA sequence and, Spoligotypes. Publicly available datasets were observed to be popularly used by the studies reviewed whilst studies targeting specific cohorts such as HIV patients or children gathering their own data from healthcare facilities, leading to smaller datasets. Of these, most studies used the leave one out cross validation technique to mitigate overfitting. The review shows that machine learning is increasingly assessed in research to improve TB diagnosis through biomarkers, as promising results were shown in terms of model's detection performance. This provides insights on the possible application of machine learning approaches to diagnose TB using biomarkers as opposed to the traditional methods that can be time consuming. Low-middle income settings, where access to basic biomarkers could be provided as compared to sputum-based tests that are not always available, could be a major application of such models.

Matched MeSH terms: Machine Learning
Fulltext Automatic COVID-19 Detection Using Exemplar Hybrid Deep Features with X-ray Images

Barua PD, Muhammad Gowdh NF, Rahmat K, Ramli N, Ng WL, Chan WY, et al.

Int J Environ Res Public Health, 2021 07 29;18(15).
PMID: 34360343 DOI: 10.3390/ijerph18158052

COVID-19 and pneumonia detection using medical images is a topic of immense interest in medical and healthcare research. Various advanced medical imaging and machine learning techniques have been presented to detect these respiratory disorders accurately. In this work, we have proposed a novel COVID-19 detection system using an exemplar and hybrid fused deep feature generator with X-ray images. The proposed Exemplar COVID-19FclNet9 comprises three basic steps: exemplar deep feature generation, iterative feature selection and classification. The novelty of this work is the feature extraction using three pre-trained convolutional neural networks (CNNs) in the presented feature extraction phase. The common aspects of these pre-trained CNNs are that they have three fully connected layers, and these networks are AlexNet, VGG16 and VGG19. The fully connected layer of these networks is used to generate deep features using an exemplar structure, and a nine-feature generation method is obtained. The loss values of these feature extractors are computed, and the best three extractors are selected. The features of the top three fully connected features are merged. An iterative selector is used to select the most informative features. The chosen features are classified using a support vector machine (SVM) classifier. The proposed COVID-19FclNet9 applied nine deep feature extraction methods by using three deep networks together. The most appropriate deep feature generation model selection and iterative feature selection have been employed to utilise their advantages together. By using these techniques, the image classification ability of the used three deep networks has been improved. The presented model is developed using four X-ray image corpora (DB1, DB2, DB3 and DB4) with two, three and four classes. The proposed Exemplar COVID-19FclNet9 achieved a classification accuracy of 97.60%, 89.96%, 98.84% and 99.64% using the SVM classifier with 10-fold cross-validation for four datasets, respectively. Our developed Exemplar COVID-19FclNet9 model has achieved high classification accuracy for all four databases and may be deployed for clinical application.

Matched MeSH terms: Machine Learning
Artificial neural network and convolutional neural network for prediction of dental caries

Basri KN, Yazid F, Mohd Zain MN, Md Yusof Z, Abdul Rani R, Zoolfakar AS

Spectrochim Acta A Mol Biomol Spectrosc, 2024 May 05;312:124063.
PMID: 38394882 DOI: 10.1016/j.saa.2024.124063

Dental caries has high prevalence among kids and adults thus it has become one of the global health concerns. The current modern dentistry focused on the preventives measures to reduce the number of dental caries cases. The employment of machine learning coupled with UV spectroscopy plays a crucial role to detect the early stage of caries. Artificial neural network with hyperparameter tuning was employed to train spectral data for the classification based on the International Caries Detection and Assesment System (ICDAS). Spectra preprocessing namely mean center (MC), autoscale (AS) and Savitzky Golay smoothing (SG) were applied on the data for spectra correction. The best performance of ANN model obtained has accuracy of 0.85 with precision of 1.00. Convolutional neural network (CNN) combined with Savitzky Golay smoothing performed on the spectral data has accuracy, precision, sensitivity and specificity for validation data of 1.00 respectively. The result obtained shows that the application of ANN and CNN capable to produce robust model to be used as an early screening of dental caries.

Matched MeSH terms: Machine Learning
Parkinson's disease: Cause factors, measurable indicators, and early diagnosis

Bhat S, Acharya UR, Hagiwara Y, Dadmehr N, Adeli H

Comput Biol Med, 2018 11 01;102:234-241.
PMID: 30253869 DOI: 10.1016/j.compbiomed.2018.09.008

Parkinson's disease (PD) is a neurodegenerative disease of the central nervous system caused due to the loss of dopaminergic neurons. It is classified under movement disorder as patients with PD present with tremor, rigidity, postural changes, and a decrease in spontaneous movements. Comorbidities including anxiety, depression, fatigue, and sleep disorders are observed prior to the diagnosis of PD. Gene mutations, exposure to toxic substances, and aging are considered as the causative factors of PD even though its genesis is unknown. This paper reviews PD etiologies, progression, and in particular measurable indicators of PD such as neuroimaging and electrophysiology modalities. In addition to gene therapy, neuroprotective, pharmacological, and neural transplantation treatments, researchers are actively aiming at identifying biological markers of PD with the goal of early diagnosis. Neuroimaging modalities used together with advanced machine learning techniques offer a promising path for the early detection and intervention in PD patients.

Matched MeSH terms: Machine Learning
Driving style recognition using machine learning and smartphones

Bin Jamal Mohd Lokman EH, Goh VT, Yap TTV, Ng H

F1000Res, 2022;11:57.
PMID: 37082303 DOI: 10.12688/f1000research.73134.1

Background: The lack of real-time monitoring is one of the reasons for the lack of awareness among drivers of their dangerous driving behavior. This work aims to develop a driver profiling system where a smartphone's built-in sensors are used alongside machine learning algorithms to classify different driving behaviors. Methods: We attempt to determine the optimal combination of smartphone sensors such as accelerometer, gyroscope, and GPS in order to develop an accurate machine learning algorithm capable of identifying different driving events (e.g. turning, accelerating, or braking). Results: In our preliminary studies, we encountered some difficulties in obtaining consistent driving events, which had the potential to add "noise" to the observations, thus reducing the accuracy of the classification. However, after some pre-processing, which included manual elimination of extraneous and erroneous events, and with the use of the Convolutional Neural Networks (CNN), we have been able to distinguish different driving events with an accuracy of about 95%. Conclusions: Based on the results of preliminary studies, we have determined that proposed approach is effective in classifying different driving events, which in turn will allow us to determine driver's driving behavior.

Matched MeSH terms: Machine Learning
Medical student knowledge and critical appraisal of machine learning: a multicentre international cross-sectional study

Blacketer C, Parnis R, B Franke K, Wagner M, Wang D, Tan Y, et al.

Intern Med J, 2021 Sep;51(9):1539-1542.
PMID: 34541769 DOI: 10.1111/imj.15479

To utilise effectively tools that employ machine learning (ML) in clinical practice medical students and doctors will require a degree of understanding of ML models. To evaluate current levels of understanding, a formative examination and survey was conducted across three centres in Australia, New Zealand and the United States. Of the 245 individuals who participated in the study (response rate = 45.4%), the majority had difficulty with identifying weaknesses in model performance analysis. Further studies examining educational interventions addressing such ML topics are warranted.

Matched MeSH terms: Machine Learning
Using artificial intelligence methods for systematic review in health sciences: A systematic review

Blaizot A, Veettil SK, Saidoung P, Moreno-Garcia CF, Wiratunga N, Aceves-Martins M, et al.

Res Synth Methods, 2022 May;13(3):353-362.
PMID: 35174972 DOI: 10.1002/jrsm.1553

The exponential increase in published articles makes a thorough and expedient review of literature increasingly challenging. This review delineated automated tools and platforms that employ artificial intelligence (AI) approaches and evaluated the reported benefits and challenges in using such methods. A search was conducted in 4 databases (Medline, Embase, CDSR, and Epistemonikos) up to April 2021 for systematic reviews and other related reviews implementing AI methods. To be included, the review must use any form of AI method, including machine learning, deep learning, neural network, or any other applications used to enable the full or semi-autonomous performance of one or more stages in the development of evidence synthesis. Twelve reviews were included, using nine different tools to implement 15 different AI methods. Eleven methods were used in the screening stages of the review (73%). The rest were divided: two in data extraction (13%) and two in risk of bias assessment (13%). The ambiguous benefits of the data extractions, combined with the reported advantages from 10 reviews, indicating that AI platforms have taken hold with varying success in evidence synthesis. However, the results are qualified by the reliance on the self-reporting of the review authors. Extensive human validation still appears required at this stage in implementing AI methods, though further evaluation is required to define the overall contribution of such platforms in enhancing efficiency and quality in evidence synthesis.

Matched MeSH terms: Machine Learning
Fulltext Leaf venation networks of Bornean trees: images and hand-traced segmentations

Blonder B, Both S, Jodra M, Majalap N, Burslem D, Teh YA, et al.

Ecology, 2019 Nov;100(11):e02844.
PMID: 31336398 DOI: 10.1002/ecy.2844

The data set contains images of leaf venation networks obtained from tree species in Malaysian Borneo. The data set contains 726 leaves from 295 species comprising 50 families, sampled from eight forest plots in Sabah. Image extents are approximately 1 × 1 cm, or 50 megapixels. All images contain a region of interest in which all veins have been hand traced. The complete data set includes over 30 billion pixels, of which more than 600 million have been validated by hand tracing. These images are suitable for morphological characterization of these species, as well as for training of machine-learning algorithms that segment biological networks from images. Data are made available under the Open Data Commons Attribution License. You are free to copy, distribute, and use the database; to produce works from the database; and to modify, transform, and build upon the database. You must attribute any public use of the database, or works produced from the database, in the manner specified in the license. For any use or redistribution of the database, or works produced from it, you must make clear to others the license of the database and keep intact any notices on the original database.

Matched MeSH terms: Machine Learning
Groundwater level forecasting with machine learning models: A review

Boo KBW, El-Shafie A, Othman F, Khan MMH, Birima AH, Ahmed AN

Water Res, 2024 Mar 15;252:121249.
PMID: 38330715 DOI: 10.1016/j.watres.2024.121249

Groundwater, the world's most abundant source of freshwater, is rapidly depleting in many regions due to a variety of factors. Accurate forecasting of groundwater level (GWL) is essential for effective management of this vital resource, but it remains a complex and challenging task. In recent years, there has been a notable increase in the use of machine learning (ML) techniques to model GWL, with many studies reporting exceptional results. In this paper, we present a comprehensive review of 142 relevant articles indexed by the Web of Science from 2017 to 2023, focusing on key ML models, including artificial neural networks (ANN), adaptive neuro-fuzzy inference systems (ANFIS), support vector regression (SVR), evolutionary computing (EC), deep learning (DL), ensemble learning (EN), and hybrid-modeling (HM). We also discussed key modeling concepts such as dataset size, data splitting, input variable selection, forecasting time-step, performance metrics (PM), study zones, and aquifers, highlighting best practices for optimal GWL forecasting with ML. This review provides valuable insights and recommendations for researchers and water management agencies working in the field of groundwater management and hydrology.

Matched MeSH terms: Machine Learning

Filters

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links