The Electrocardiogram (ECG) is the P-QRS-T wave depicting the cardiac activity of the heart. The subtle changes in the electric potential patterns of repolarization and depolarization are indicative of the disease afflicting the patient. These clinical time domain features of the ECG waveform can be used in cardiac health diagnosis. Due to the presence of noise and minute morphological parameter values, it is very difficult to identify the ECG classes accurately by the naked eye. Various computer aided cardiac diagnosis (CACD) systems, analysis methods, challenges addressed and the future of cardiovascular disease screening are reviewed in this paper. Methods developed for time domain, frequency transform domain, and time-frequency domain analysis, such as the wavelet transform, cannot by themselves represent the inherent distinguishing features accurately. Hence, nonlinear methods which can capture the small variations in the ECG signal and provide improved accuracy in the presence of noise are discussed in greater detail in this review. A CACD system exploiting these nonlinear features can help clinicians to diagnose cardiovascular disease more accurately.
This paper presents a study on gene knockout strategies to identify candidate genes to be knocked out for improving the production of succinic acid in Escherichia coli. Succinic acid is widely used as a precursor for many chemicals, for example production of antibiotics, therapeutic proteins and food. However, the chemical syntheses of succinic acid using the traditional methods usually result in the production that is far below their theoretical maximums. In silico gene knockout strategies are commonly implemented to delete the gene in E. coli to overcome this problem. In this paper, a hybrid of Ant Colony Optimization (ACO) and Minimization of Metabolic Adjustment (MoMA) is proposed to identify gene knockout strategies to improve the production of succinic acid in E. coli. As a result, the hybrid algorithm generated a list of knockout genes, succinic acid production rate and growth rate for E. coli after gene knockout. The results of the hybrid algorithm were compared with the previous methods, OptKnock and MOMAKnock. It was found that the hybrid algorithm performed better than OptKnock and MOMAKnock in terms of the production rate. The information from the results produced from the hybrid algorithm can be used in wet laboratory experiments to increase the production of succinic acid in E. coli.
Many biological research areas such as drug design require gene regulatory networks to provide clear insight and understanding of the cellular process in living cells. This is because interactions among the genes and their products play an important role in many molecular processes. A gene regulatory network can act as a blueprint for the researchers to observe the relationships among genes. Due to its importance, several computational approaches have been proposed to infer gene regulatory networks from gene expression data. In this review, six inference approaches are discussed: Boolean network, probabilistic Boolean network, ordinary differential equation, neural network, Bayesian network, and dynamic Bayesian network. These approaches are discussed in terms of introduction, methodology and recent applications of these approaches in gene regulatory network construction. These approaches are also compared in the discussion section. Furthermore, the strengths and weaknesses of these computational approaches are described.
A computer-aided detection auto-probing (CADAP) system is presented for detecting breast lesions using dynamic contrast enhanced magnetic resonance imaging, through a spatial-based discrete Fourier transform. The stand-alone CADAP system reduces noise, refines region of interest (ROI) automatically, and detects the breast lesion with minimal false positive detection. The lesions are then classified and colourised according to their characteristics, whether benign, suspicious or malignant. To enhance the visualisation, the entire analysed ROI is constructed into a 3-D image, so that the user can diagnose based on multiple views on the ROI. The proposed method has been applied to 101 sets of digital images, and the results compared with the biopsy results done by radiologists. The proposed scheme is able to identify breast cancer regions accurately and efficiently.
Psoriasis is an incurable skin disorder affecting 2-3% of the world population. The scaliness of psoriasis is a key assessment parameter of the Psoriasis Area and Severity Index (PASI). Dermatologists typically use visual and tactile senses in PASI scaliness assessment. However, the assessment can be subjective resulting in inter- and intra-rater variability in the scores. This paper proposes an assessment method that incorporates 3D surface roughness with standard clustering techniques to objectively determine the PASI scaliness score for psoriasis lesions. A surface roughness algorithm using structured light projection has been applied to 1999 3D psoriasis lesion surfaces. The algorithm has been validated with an accuracy of 94.12%. Clustering algorithms were used to classify the surface roughness measured using the proposed assessment method for PASI scaliness scoring. The reliability of the developed PASI scaliness algorithm was high with kappa coefficients>0.84 (almost perfect agreement).
The structural comparison of proteins is a vital step in structural biology that is used to predict and analyse a new unknown protein function. Although a number of different techniques have been explored, the study to develop new alternative methods is still an active research area. The present paper introduces a text modelling-based technique for the structural comparison of proteins. The method models the secondary and tertiary structure of proteins in two linear sequences and then applies them to the comparison of two structures. The technique used for pairwise comparison of the sequences has been adopted from computational linguistics and its well-known techniques for analysing and quantifying textual sequences. To this end, an n-gram modelling technique is used to capture regularities between sequences, and then, the cross-entropy concept is employed to measure their similarities. Several experiments are conducted to evaluate the performance of the method and compare it with other commonly used programs. The assessments for information retrieval evaluation demonstrate that the technique has a high running speed, which is similar to other linear encoding methods, such as 3D-BLAST, SARST, and TS-AMIR, whereas its accuracy is comparable to CE and TM-align, which are high accuracy comparison tools. Accordingly, the results demonstrate that the algorithm has high efficiency compared with other state-of-the-art methods.
Diabetes mellitus (DM) affects considerable number of people in the world and the number of cases is increasing every year. Due to a strong link to the genetic basis of the disease, it is extremely difficult to cure. However, it can be controlled to prevent severe consequences, such as organ damage. Therefore, diabetes diagnosis and monitoring of its treatment is very important. In this paper, we have proposed a non-invasive diagnosis support system for DM. The system determines whether or not diabetes is present by determining the cardiac health of a patient using heart rate variability (HRV) analysis. This analysis was based on nine nonlinear features namely: Approximate Entropy (ApEn), largest Lyapunov exponet (LLE), detrended fluctuation analysis (DFA) and recurrence quantification analysis (RQA). Clinically significant measures were used as input to classification algorithms, namely AdaBoost, decision tree (DT), fuzzy Sugeno classifier (FSC), k-nearest neighbor algorithm (k-NN), probabilistic neural network (PNN) and support vector machine (SVM). Ten-fold stratified cross-validation was used to select the best classifier. AdaBoost, with least squares (LS) as weak learner, performed better than the other classifiers, yielding an average accuracy of 90%, sensitivity of 92.5% and specificity of 88.7%.
Wavelet packet transform decomposes a signal into a set of orthonormal bases (nodes) and provides opportunities to select an appropriate set of these bases for feature extraction. In this paper, multi-level basis selection (MLBS) is proposed to preserve the most informative bases of a wavelet packet decomposition tree through removing less informative bases by applying three exclusion criteria: frequency range, noise frequency, and energy threshold. MLBS achieved an accuracy of 97.56% for classifying normal heart sound, aortic stenosis, mitral regurgitation, and aortic regurgitation. MLBS is a promising basis selection to be suggested for signals with a small range of frequencies.
A drastic improvement in the analysis of gene expression has lead to new discoveries in bioinformatics research. In order to analyse the gene expression data, fuzzy clustering algorithms are widely used. However, the resulting analyses from these specific types of algorithms may lead to confusion in hypotheses with regard to the suggestion of dominant function for genes of interest. Besides that, the current fuzzy clustering algorithms do not conduct a thorough analysis of genes with low membership values. Therefore, we present a novel computational framework called the "multi-stage filtering-Clustering Functional Annotation" (msf-CluFA) for clustering gene expression data. The framework consists of four components: fuzzy c-means clustering (msf-CluFA-0), achieving dominant cluster (msf-CluFA-1), improving confidence level (msf-CluFA-2) and combination of msf-CluFA-0, msf-CluFA-1 and msf-CluFA-2 (msf-CluFA-3). By employing double filtering in msf-CluFA-1 and apriori algorithms in msf-CluFA-2, our new framework is capable of determining the dominant clusters and improving the confidence level of genes with lower membership values by means of which the unknown genes can be predicted.
The heart is a sophisticated functional organ that plays a crucial role in the blood circulatory system. Hemodynamics within the heart chamber can be indicative of exert cardiac health. Due to the limitations of current cardiac imaging modalities, computational fluid dynamics (CFD) have been widely used for the purposes of cardiac function assessment and heart disease diagnosis, as they provide detailed insights into the cardiac flow field. An understanding of ventricular hemodynamics and pathological severities can be gained through studies that employ the CFD method. In this research the hemodynamics of two common myocardial diseases, dilated cardiomyopathy (DCM) and myocardial infarction (MI) were investigated, during both the filling phase and the whole cardiac cycle, through a prescribed geometry and fluid structure interaction (FSI) approach. The results of the research indicated that early stage disease identification and the improvement of cardiac assisting devices and therapeutic procedures can be facilitated through the use of the CFD method.
Age-related Macular Degeneration (AMD) is an irreversible and chronic medical condition characterized by drusen, Choroidal Neovascularization (CNV) and Geographic Atrophy (GA). AMD is one of the major causes of visual loss among elderly people. It is caused by the degeneration of cells in the macula which is responsible for central vision. AMD can be dry or wet type, however dry AMD is most common. It is classified into early, intermediate and late AMD. The early detection and treatment may help one to stop the progression of the disease. Automated AMD diagnosis may reduce the screening time of the clinicians. In this work, we have introduced LCP to characterize normal and AMD classes using fundus images. Linear Configuration Coefficients (CC) and Pattern Occurrence (PO) features are extracted from fundus images. These extracted features are ranked using p-value of the t-test and fed to various supervised classifiers viz. Decision Tree (DT), Nearest Neighbour (k-NN), Naive Bayes (NB), Probabilistic Neural Network (PNN) and Support Vector Machine (SVM) to classify normal and AMD classes. The performance of the system is evaluated using both private (Kasturba Medical Hospital, Manipal, India) and public domain datasets viz. Automated Retinal Image Analysis (ARIA) and STructured Analysis of the Retina (STARE) using ten-fold cross validation. The proposed approach yielded best performance with a highest average accuracy of 97.78%, sensitivity of 98.00% and specificity of 97.50% for STARE dataset using 22 significant features. Hence, this system can be used as an aiding tool to the clinicians during mass eye screening programs to diagnose AMD.
Myocardial Infarction (MI) or acute MI (AMI) is one of the leading causes of death worldwide. Precise and timely identification of MI and extent of muscle damage helps in early treatment and reduction in the time taken for further tests. MI diagnosis using 2D echocardiography is prone to inter-/intra-observer variability in the assessment. Therefore, a computerised scheme based on image processing and artificial intelligent techniques can reduce the workload of clinicians and improve the diagnosis accuracy. A Computer-Aided Diagnosis (CAD) of infarcted and normal ultrasound images will be useful for clinicians. In this study, the performance of CAD approach using Discrete Wavelet Transform (DWT), second order statistics calculated from Gray-Level Co-Occurrence Matrix (GLCM) and Higher-Order Spectra (HOS) texture descriptors are compared. The proposed system is validated using 400 MI and 400 normal ultrasound images, obtained from 80 patients with MI and 80 normal subjects. The extracted features are ranked based on t-value and fed to the Support Vector Machine (SVM) classifier to obtain the best performance using minimum number of features. The features extracted from DWT coefficients obtained an accuracy of 99.5%, sensitivity of 99.75% and specificity of 99.25%; GLCM have achieved an accuracy of 85.75%, sensitivity of 90.25% and specificity of 81.25%; and HOS obtained an accuracy of 93.0%, sensitivity of 94.75% and specificity of 91.25%. Among the three techniques presented DWT yielded the highest classification accuracy. Thus, the proposed CAD approach may be used as a complementary tool to assist cardiologists in making a more accurate diagnosis for the presence of MI.
An estimated 6.5 million patients in the United States are affected by chronic wounds, with more than US$25 billion and countless hours spent annually for all aspects of chronic wound care. There is a need for an intelligent software tool to analyze wound images, characterize wound tissue composition, measure wound size, and monitor changes in wound in between visits. Performed manually, this process is very time-consuming and subject to intra- and inter-reader variability. In this work, our objective is to develop methods to segment, measure and characterize clinically presented chronic wounds from photographic images. The first step of our method is to generate a Red-Yellow-Black-White (RYKW) probability map, which then guides the segmentation process using either optimal thresholding or region growing. The red, yellow and black probability maps are designed to handle the granulation, slough and eschar tissues, respectively; while the white probability map is to detect the white label card for measurement calibration purposes. The innovative aspects of this work include defining a four-dimensional probability map specific to wound characteristics, a computationally efficient method to segment wound images utilizing the probability map, and auto-calibration of wound measurements using the content of the image. These methods were applied to 80 wound images, captured in a clinical setting at the Ohio State University Comprehensive Wound Center, with the ground truth independently generated by the consensus of at least two clinicians. While the mean inter-reader agreement between the readers varied between 67.4% and 84.3%, the computer achieved an average accuracy of 75.1%.
Age-related Macular Degeneration (AMD) affects the central vision of aged people. It can be diagnosed due to the presence of drusen, Geographic Atrophy (GA) and Choroidal Neovascularization (CNV) in the fundus images. It is labor intensive and time-consuming for the ophthalmologists to screen these images. An automated digital fundus photography based screening system can overcome these drawbacks. Such a safe, non-contact and cost-effective platform can be used as a screening system for dry AMD. In this paper, we are proposing a novel algorithm using Radon Transform (RT), Discrete Wavelet Transform (DWT) coupled with Locality Sensitive Discriminant Analysis (LSDA) for automated diagnosis of AMD. First the image is subjected to RT followed by DWT. The extracted features are subjected to dimension reduction using LSDA and ranked using t-test. The performance of various supervised classifiers namely Decision Tree (DT), Support Vector Machine (SVM), Probabilistic Neural Network (PNN) and k-Nearest Neighbor (k-NN) are compared to automatically discriminate to normal and AMD classes using ranked LSDA components. The proposed approach is evaluated using private and public datasets such as ARIA and STARE. The highest classification accuracy of 99.49%, 96.89% and 100% are reported for private, ARIA and STARE datasets. Also, AMD index is devised using two LSDA components to distinguish two classes accurately. Hence, this proposed system can be extended for mass AMD screening.
Age-related Macular Degeneration (AMD) is one of the major causes of vision loss and blindness in ageing population. Currently, there is no cure for AMD, however early detection and subsequent treatment may prevent the severe vision loss or slow the progression of the disease. AMD can be classified into two types: dry and wet AMDs. The people with macular degeneration are mostly affected by dry AMD. Early symptoms of AMD are formation of drusen and yellow pigmentation. These lesions are identified by manual inspection of fundus images by the ophthalmologists. It is a time consuming, tiresome process, and hence an automated diagnosis of AMD screening tool can aid clinicians in their diagnosis significantly. This study proposes an automated dry AMD detection system using various entropies (Shannon, Kapur, Renyi and Yager), Higher Order Spectra (HOS) bispectra features, Fractional Dimension (FD), and Gabor wavelet features extracted from greyscale fundus images. The features are ranked using t-test, Kullback-Lieber Divergence (KLD), Chernoff Bound and Bhattacharyya Distance (CBBD), Receiver Operating Characteristics (ROC) curve-based and Wilcoxon ranking methods in order to select optimum features and classified into normal and AMD classes using Naive Bayes (NB), k-Nearest Neighbour (k-NN), Probabilistic Neural Network (PNN), Decision Tree (DT) and Support Vector Machine (SVM) classifiers. The performance of the proposed system is evaluated using private (Kasturba Medical Hospital, Manipal, India), Automated Retinal Image Analysis (ARIA) and STructured Analysis of the Retina (STARE) datasets. The proposed system yielded the highest average classification accuracies of 90.19%, 95.07% and 95% with 42, 54 and 38 optimal ranked features using SVM classifier for private, ARIA and STARE datasets respectively. This automated AMD detection system can be used for mass fundus image screening and aid clinicians by making better use of their expertise on selected images that require further examination.
Early expansion of infarcted zone after Acute Myocardial Infarction (AMI) has serious short and long-term consequences and contributes to increased mortality. Thus, identification of moderate and severe phases of AMI before leading to other catastrophic post-MI medical condition is most important for aggressive treatment and management. Advanced image processing techniques together with robust classifier using two-dimensional (2D) echocardiograms may aid for automated classification of the extent of infarcted myocardium. Therefore, this paper proposes novel algorithms namely Curvelet Transform (CT) and Local Configuration Pattern (LCP) for an automated detection of normal, moderately infarcted and severely infarcted myocardium using 2D echocardiograms. The methodology extracts the LCP features from CT coefficients of echocardiograms. The obtained features are subjected to Marginal Fisher Analysis (MFA) dimensionality reduction technique followed by fuzzy entropy based ranking method. Different classifiers are used to differentiate ranked features into three classes normal, moderate and severely infarcted based on the extent of damage to myocardium. The developed algorithm has achieved an accuracy of 98.99%, sensitivity of 98.48% and specificity of 100% for Support Vector Machine (SVM) classifier using only six features. Furthermore, we have developed an integrated index called Myocardial Infarction Risk Index (MIRI) to detect the normal, moderately and severely infarcted myocardium using a single number. The proposed system may aid the clinicians in faster identification and quantification of the extent of infarcted myocardium using 2D echocardiogram. This system may also aid in identifying the person at risk of developing heart failure based on the extent of infarcted myocardium.
Cross-sectional view echocardiography is an efficient non-invasive diagnostic tool for characterizing Myocardial Infarction (MI) and stages of expansion leading to heart failure. An automated computer-aided technique of cross-sectional echocardiography feature assessment can aid clinicians in early and more reliable detection of MI patients before subsequent catastrophic post-MI medical conditions. Therefore, this paper proposes a novel Myocardial Infarction Index (MII) to discriminate infarcted and normal myocardium using features extracted from apical cross-sectional views of echocardiograms. The cross-sectional view of normal and MI echocardiography images are represented as textons using Maximum Responses (MR8) filter banks. Fractal Dimension (FD), Higher-Order Statistics (HOS), Hu's moments, Gabor Transform features, Fuzzy Entropy (FEnt), Energy, Local binary Pattern (LBP), Renyi's Entropy (REnt), Shannon's Entropy (ShEnt), and Kapur's Entropy (KEnt) features are extracted from textons. These features are ranked using t-test and fuzzy Max-Relevancy and Min-Redundancy (mRMR) ranking methods. Then, combinations of highly ranked features are used in the formulation and development of an integrated MII. This calculated novel MII is used to accurately and quickly detect infarcted myocardium by using one numerical value. Also, the highly ranked features are subjected to classification using different classifiers for the characterization of normal and MI LV ultrasound images using a minimum number of features. Our current technique is able to characterize MI with an average accuracy of 94.37%, sensitivity of 91.25% and specificity of 97.50% with 8 apical four chambers view features extracted from only single frame per patient making this a more reliable and accurate classification.
Cancer classification and gene selection in high-dimensional data have been popular research topics in genetics and molecular biology. Recently, adaptive regularized logistic regression using the elastic net regularization, which is called the adaptive elastic net, has been successfully applied in high-dimensional cancer classification to tackle both estimating the gene coefficients and performing gene selection simultaneously. The adaptive elastic net originally used elastic net estimates as the initial weight, however, using this weight may not be preferable for certain reasons: First, the elastic net estimator is biased in selecting genes. Second, it does not perform well when the pairwise correlations between variables are not high. Adjusted adaptive regularized logistic regression (AAElastic) is proposed to address these issues and encourage grouping effects simultaneously. The real data results indicate that AAElastic is significantly consistent in selecting genes compared to the other three competitor regularization methods. Additionally, the classification performance of AAElastic is comparable to the adaptive elastic net and better than other regularization methods. Thus, we can conclude that AAElastic is a reliable adaptive regularized logistic regression method in the field of high-dimensional cancer classification.
Diabetic Macular Edema (DME) is caused by accumulation of extracellular fluid from hyperpermeable capillaries within the macula. DME is one of the leading causes of blindness among Diabetes Mellitus (DM) patients. Early detection followed by laser photocoagulation can save the visual loss. This review discusses various imaging modalities viz. biomicroscopy, Fluorescein Angiography (FA), Optical Coherence Tomography (OCT) and colour fundus photographs used for diagnosis of DME. Various automated DME grading systems using retinal fundus images, associated retinal image processing techniques for fovea, exudate detection and segmentation are presented. We have also compared various imaging modalities and automated screening methods used for DME grading. The reviewed literature indicates that FA and OCT identify DME related changes accurately. FA is an invasive method, which uses fluorescein dye, and OCT is an expensive imaging method compared to fundus photographs. Moreover, using fundus images DME can be identified and automated. DME grading algorithms can be implemented for telescreening. Hence, fundus imaging based DME grading is more suitable and affordable method compared to biomicroscopy, FA, and OCT modalities.
Recombinant protein overexpression, an important biotechnological process, is ruled by complex biological rules which are mostly unknown, is in need of an intelligent algorithm so as to avoid resource-intensive lab-based trial and error experiments in order to determine the expression level of the recombinant protein. The purpose of this study is to propose a predictive model to estimate the level of recombinant protein overexpression for the first time in the literature using a machine learning approach based on the sequence, expression vector, and expression host. The expression host was confined to Escherichia coli which is the most popular bacterial host to overexpress recombinant proteins. To provide a handle to the problem, the overexpression level was categorized as low, medium and high. A set of features which were likely to affect the overexpression level was generated based on the known facts (e.g. gene length) and knowledge gathered from related literature. Then, a representative sub-set of features generated in the previous objective was determined using feature selection techniques. Finally a predictive model was developed using random forest classifier which was able to adequately classify the multi-class imbalanced small dataset constructed. The result showed that the predictive model provided a promising accuracy of 80% on average, in estimating the overexpression level of a recombinant protein.