Displaying publications 21 - 37 of 37 in total

Abstract:
Sort:
  1. Chan KL, Rosli R, Tatarinova TV, Hogan M, Firdaus-Raih M, Low EL
    BMC Bioinformatics, 2017 Jan 27;18(Suppl 1):1426.
    PMID: 28466793 DOI: 10.1186/s12859-016-1426-6
    BACKGROUND: Gene prediction is one of the most important steps in the genome annotation process. A large number of software tools and pipelines developed by various computing techniques are available for gene prediction. However, these systems have yet to accurately predict all or even most of the protein-coding regions. Furthermore, none of the currently available gene-finders has a universal Hidden Markov Model (HMM) that can perform gene prediction for all organisms equally well in an automatic fashion.

    RESULTS: We present an automated gene prediction pipeline, Seqping that uses self-training HMM models and transcriptomic data. The pipeline processes the genome and transcriptome sequences of the target species using GlimmerHMM, SNAP, and AUGUSTUS pipelines, followed by MAKER2 program to combine predictions from the three tools in association with the transcriptomic evidence. Seqping generates species-specific HMMs that are able to offer unbiased gene predictions. The pipeline was evaluated using the Oryza sativa and Arabidopsis thaliana genomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the pipeline was able to identify at least 95% of BUSCO's plantae dataset. Our evaluation shows that Seqping was able to generate better gene predictions compared to three HMM-based programs (MAKER2, GlimmerHMM and AUGUSTUS) using their respective available HMMs. Seqping had the highest accuracy in rice (0.5648 for CDS, 0.4468 for exon, and 0.6695 nucleotide structure) and A. thaliana (0.5808 for CDS, 0.5955 for exon, and 0.8839 nucleotide structure).

    CONCLUSIONS: Seqping provides researchers a seamless pipeline to train species-specific HMMs and predict genes in newly sequenced or less-studied genomes. We conclude that the Seqping pipeline predictions are more accurate than gene predictions using the other three approaches with the default or available HMMs.

    Matched MeSH terms: Markov Chains
  2. Juhan N, Zubairi YZ, Khalid ZM, Mahmood Zuhdi AS
    Iran J Public Health, 2020 Sep;49(9):1642-1649.
    PMID: 33643938 DOI: 10.18502/ijph.v49i9.4080
    Background: Identifying risk factors associated with mortality is important in providing better prognosis to patients. Consistent with that, Bayesian approach offers a great advantage where it rests on the assumption that all model parameters are random quantities and hence can incorporate prior knowledge. Therefore, we aimed to develop a reliable model to identify risk factors associated with mortality among ST-Elevation Myocardial Infarction (STEMI) male patients using Bayesian approach.

    Methods: A total of 7180 STEMI male patients from the National Cardiovascular Disease Database-Acute Coronary Syndrome (NCVD-ACS) registry for the years 2006-2013 were enrolled. In the development of univariate and multivariate logistic regression model for the STEMI patients, Bayesian Markov Chain Monte Carlo (MCMC) simulation approach was applied. The performance of the model was assessed through convergence diagnostics, overall model fit, model calibration and discrimination.

    Results: A set of six risk factors for cardiovascular death among STEMI male patients were identified from the Bayesian multivariate logistic model namely age, diabetes mellitus, family history of CVD, Killip class, chronic lung disease and renal disease respectively. Overall model fit, model calibration and discrimination were considered good for the proposed model.

    Conclusion: Bayesian risk prediction model for CVD male patients identified six risk factors associated with mortality. Among the highest risks were Killip class (OR=18.0), renal disease (2.46) and age group (OR=2.43) respectively.

    Matched MeSH terms: Markov Chains
  3. Zourmand A, Ting HN, Mirhassani SM
    J Voice, 2013 Mar;27(2):201-9.
    PMID: 23473455 DOI: 10.1016/j.jvoice.2012.12.006
    Speech is one of the prevalent communication mediums for humans. Identifying the gender of a child speaker based on his/her speech is crucial in telecommunication and speech therapy. This article investigates the use of fundamental and formant frequencies from sustained vowel phonation to distinguish the gender of Malay children aged between 7 and 12 years. The Euclidean minimum distance and multilayer perceptron were used to classify the gender of 360 Malay children based on different combinations of fundamental and formant frequencies (F0, F1, F2, and F3). The Euclidean minimum distance with normalized frequency data achieved a classification accuracy of 79.44%, which was higher than that of the nonnormalized frequency data. Age-dependent modeling was used to improve the accuracy of gender classification. The Euclidean distance method obtained 84.17% based on the optimal classification accuracy for all age groups. The accuracy was further increased to 99.81% using multilayer perceptron based on mel-frequency cepstral coefficients.
    Matched MeSH terms: Markov Chains
  4. Mustapha I, Mohd Ali B, Rasid MF, Sali A, Mohamad H
    Sensors (Basel), 2015;15(8):19783-818.
    PMID: 26287191 DOI: 10.3390/s150819783
    It is well-known that clustering partitions network into logical groups of nodes in order to achieve energy efficiency and to enhance dynamic channel access in cognitive radio through cooperative sensing. While the topic of energy efficiency has been well investigated in conventional wireless sensor networks, the latter has not been extensively explored. In this paper, we propose a reinforcement learning-based spectrum-aware clustering algorithm that allows a member node to learn the energy and cooperative sensing costs for neighboring clusters to achieve an optimal solution. Each member node selects an optimal cluster that satisfies pairwise constraints, minimizes network energy consumption and enhances channel sensing performance through an exploration technique. We first model the network energy consumption and then determine the optimal number of clusters for the network. The problem of selecting an optimal cluster is formulated as a Markov Decision Process (MDP) in the algorithm and the obtained simulation results show convergence, learning and adaptability of the algorithm to dynamic environment towards achieving an optimal solution. Performance comparisons of our algorithm with the Groupwise Spectrum Aware (GWSA)-based algorithm in terms of Sum of Square Error (SSE), complexity, network energy consumption and probability of detection indicate improved performance from the proposed approach. The results further reveal that an energy savings of 9% and a significant Primary User (PU) detection improvement can be achieved with the proposed approach.
    Matched MeSH terms: Markov Chains
  5. Aziz F, Arof H, Mokhtar N, Mubin M
    J Neural Eng, 2014 Oct;11(5):056018.
    PMID: 25188730 DOI: 10.1088/1741-2560/11/5/056018
    This paper presents a wheelchair navigation system based on a hidden Markov model (HMM), which we developed to assist those with restricted mobility. The semi-autonomous system is equipped with obstacle/collision avoidance sensors and it takes the electrooculography (EOG) signal traces from the user as commands to maneuver the wheelchair. The EOG traces originate from eyeball and eyelid movements and they are embedded in EEG signals collected from the scalp of the user at three different locations. Features extracted from the EOG traces are used to determine whether the eyes are open or closed, and whether the eyes are gazing to the right, center, or left. These features are utilized as inputs to a few support vector machine (SVM) classifiers, whose outputs are regarded as observations to an HMM. The HMM determines the state of the system and generates commands for navigating the wheelchair accordingly. The use of simple features and the implementation of a sliding window that captures important signatures in the EOG traces result in a fast execution time and high classification rates. The wheelchair is equipped with a proximity sensor and it can move forward and backward in three directions. The asynchronous system achieved an average classification rate of 98% when tested with online data while its average execution time was less than 1 s. It was also tested in a navigation experiment where all of the participants managed to complete the tasks successfully without collisions.
    Matched MeSH terms: Markov Chains*
  6. Khalid R, Nawawi MK, Kawsar LA, Ghani NA, Kamil AA, Mustafa A
    PLoS One, 2013;8(4):e58402.
    PMID: 23560037 DOI: 10.1371/journal.pone.0058402
    M/G/C/C state dependent queuing networks consider service rates as a function of the number of residing entities (e.g., pedestrians, vehicles, and products). However, modeling such dynamic rates is not supported in modern Discrete Simulation System (DES) software. We designed an approach to cater this limitation and used it to construct the M/G/C/C state-dependent queuing model in Arena software. Using the model, we have evaluated and analyzed the impacts of various arrival rates to the throughput, the blocking probability, the expected service time and the expected number of entities in a complex network topology. Results indicated that there is a range of arrival rates for each network where the simulation results fluctuate drastically across replications and this causes the simulation results and analytical results exhibit discrepancies. Detail results that show how tally the simulation results and the analytical results in both abstract and graphical forms and some scientific justifications for these have been documented and discussed.
    Matched MeSH terms: Markov Chains
  7. Alyousifi Y, Othman M, Husin A, Rathnayake U
    Ecotoxicol Environ Saf, 2021 Dec 20;227:112875.
    PMID: 34717219 DOI: 10.1016/j.ecoenv.2021.112875
    Fuzzy time series (FTS) forecasting models show a great performance in predicting time series, such as air pollution time series. However, they have caused major issues by utilizing random partitioning of the universe of discourse and ignoring repeated fuzzy sets. In this study, a novel hybrid forecasting model by integrating fuzzy time series to Markov chain and C-Means clustering techniques with an optimal number of clusters is presented. This hybridization contributes to generating effective lengths of intervals and thus, improving the model accuracy. The proposed model was verified and validated with real time series data sets, which are the benchmark data of actual trading of Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX) and PM10 concentration data from Melaka, Malaysia. In addition, a comparison was made with some existing fuzzy time series models. Furthermore, the mean absolute percentage error, mean squared error and Theil's U statistic were calculated as evaluation criteria to illustrate the performance of the proposed model. The empirical analysis shows that the proposed model handles the time series data sets more efficiently and provides better overall forecasting results than existing FTS models. The results prove that the proposed model has greatly improved the prediction accuracy, for which it outperforms several fuzzy time series models. Therefore, it can be concluded that the proposed model is a better option for forecasting air pollution parameters and any kind of random parameters.
    Matched MeSH terms: Markov Chains
  8. Permsuwan U, Chaiyakunapruk N, Dilokthornsakul P, Thavorn K, Saokaew S
    Appl Health Econ Health Policy, 2016 Jun;14(3):281-92.
    PMID: 26961276 DOI: 10.1007/s40258-016-0228-3
    BACKGROUND: Even though Insulin glargine (IGlar) has been available and used in other countries for more than a decade, it has not been adopted into Thai national formulary. This study aimed to evaluate the long-term cost effectiveness of IGlar versus neutral protamine Hagedorn (NPH) insulin in type 2 diabetes from the perspective of Thai Health Care System.

    METHODS: A validated computer simulation model (the IMS CORE Diabetes Model) was used to estimate the long-term projection of costs and clinical outcomes. The model was populated with published characteristics of Thai patients with type 2 diabetes. Baseline risk factors were obtained from Thai cohort studies, while relative risk reduction was derived from a meta-analysis study conducted by the Canadian Agency for Drugs and Technology in Health. Only direct costs were taken into account. Costs of diabetes management and complications were obtained from hospital databases in Thailand. Both costs and outcomes were discounted at 3 % per annum and presented in US dollars in terms of 2014 dollar value. Incremental cost-effectiveness ratio (ICER) was calculated. One-way and probabilistic sensitivity analyses were also performed.

    RESULTS: IGlar is associated with a slight gain in quality-adjusted life years (0.488 QALYs), an additional life expectancy (0.677 life years), and an incremental cost of THB119,543 (US$3522.19) compared with NPH insulin. The ICERs were THB244,915/QALY (US$7216.12/QALY) and THB176,525/life-year gained (LYG) (US$5201.09/LYG). The ICER was sensitive to discount rates and IGlar cost. At the acceptable willingness to pay of THB160,000/QALY (US$4714.20/QALY), the probability that IGlar was cost effective was less than 20 %.

    CONCLUSIONS: Compared to treatment with NPH insulin, treatment with IGlar in type 2 diabetes patients who had uncontrolled blood glucose with oral anti-diabetic drugs did not represent good value for money at the acceptable threshold in Thailand.

    Matched MeSH terms: Markov Chains
  9. Suhana O, Nazni WA, Apandi Y, Farah H, Lee HL, Sofian-Azirun M
    Heliyon, 2019 Dec;5(12):e02682.
    PMID: 31867449 DOI: 10.1016/j.heliyon.2019.e02682
    Chikungunya virus (CHIKV) is maintained in the sylvatic cycle in West Africa and is transmitted by Aedes mosquito species to monkeys. In 2006, four verified CHIKV isolates were obtained during a survey of arboviruses in monkeys (Macaca fascicularis) in Pahang state, Peninsular Malaysia. RNA was extracted from the CHIKV isolates and used in reverse transcription polymerase chain reactions (RT-PCR) to amplify PCR fragments for sequencing. Nucleic acid primers were designed to generate overlapping PCR fragments that covered the whole viral sequence. A total of 11,238 base pairs (bp) corresponding to open reading frames (ORFs) from our isolates and 47 other registered isolates in the National Center for Biotechnology Information (NCBI) were used to elucidate sequences, amino acids, and phylogenetic relationships and to estimate divergence times by using MEGA 7.0 and the Bayesian Markov chain Monte Carlo method. Phylogenetic analysis revealed that all CHIKV isolates could be classified into the Asian genotype and clustered with Bagan Panchor clades, which are associated with the chikungunya outbreak reported in 2006, with sequence and amino acid similarities of 99.9% and 99.7%, respectively. Minor amino acid differences were found between human and non-human primate isolates. Amino acid analysis showed a unique amino acid at position 221 in the nsP1region, at which a glycine (G) was found only in monkey isolates, whereas arginine (R) was found at the same position only in human isolates. The time to the most recent common ancestor (MRCA) estimation indicated that CHIKV probably started to diverge from human to non-human primates in approximately 2004 in Malaysia. The results suggested that CHIKV in non-human primates probably resulted from the spillover of the virus from humans. The study will be helpful in understanding the movement and evolution of CHIKV in Malaysia and globally.
    Matched MeSH terms: Markov Chains
  10. Tay BA
    PMID: 25215723
    We study a series of N oscillators, each coupled to its nearest neighbors, and linearly to a phonon field through the oscillator's number operator. We show that the Hamiltonian of a pair of adjacent oscillators, or a dimer, within the series of oscillators can be transformed into a form in which they are collectively coupled to the phonon field as a composite unit. In the weak coupling and rotating-wave approximation, the system behaves effectively as the trilinear boson model in the one excitation subspace of the dimer subsystem. The reduced dynamics of the one excitation subspace of the dimer subsystem coupled weakly to a phonon bath is similar to that of a two-level system, with a metastable state against the vacuum. The decay constant of the subsystem is proportional to the dephasing rate of the individual oscillator in a phonon bath, attenuated by a factor that depends on site asymmetry, intersite coupling, and the resonance frequency between the transformed oscillator modes, or excitons. As a result of the collective effect, the excitation relaxation lifetime is prolonged over the dephasing lifetime of an individual oscillator coupled to the same bath.
    Matched MeSH terms: Markov Chains
  11. Cheong HT, Ng KT, Ong LY, Chook JB, Chan KG, Takebe Y, et al.
    PLoS One, 2014;9(10):e111236.
    PMID: 25340817 DOI: 10.1371/journal.pone.0111236
    A novel HIV-1 recombinant clade (CRF51_01B) was recently identified among men who have sex with men (MSM) in Singapore. As cases of sexually transmitted HIV-1 infection increase concurrently in two socioeconomically intimate countries such as Malaysia and Singapore, cross transmission of HIV-1 between said countries is highly probable. In order to investigate the timeline for the emergence of HIV-1 CRF51_01B in Singapore and its possible introduction into Malaysia, 595 HIV-positive subjects recruited in Kuala Lumpur from 2008 to 2012 were screened. Phylogenetic relationship of 485 amplified polymerase gene sequences was determined through neighbour-joining method. Next, near-full length sequences were amplified for genomic sequences inferred to be CRF51_01B and subjected to further analysis implemented through Bayesian Markov chain Monte Carlo (MCMC) sampling and maximum likelihood methods. Based on the near full length genomes, two isolates formed a phylogenetic cluster with CRF51_01B sequences of Singapore origin, sharing identical recombination structure. Spatial and temporal information from Bayesian MCMC coalescent and maximum likelihood analysis of the protease, gp120 and gp41 genes suggest that Singapore is probably the country of origin of CRF51_01B (as early as in the mid-1990s) and featured a Malaysian who acquired the infection through heterosexual contact as host for its ancestral lineages. CRF51_01B then spread rapidly among the MSM in Singapore and Malaysia. Although the importation of CRF51_01B from Singapore to Malaysia is supported by coalescence analysis, the narrow timeframe of the transmission event indicates a closely linked epidemic. Discrepancies in the estimated divergence times suggest that CRF51_01B may have arisen through multiple recombination events from more than one parental lineage. We report the cross transmission of a novel CRF51_01B lineage between countries that involved different sexual risk groups. Understanding the cross-border transmission of HIV-1 involving sexual networks is crucial for effective intervention strategies in the region.
    Matched MeSH terms: Markov Chains
  12. Teh Sin Yin, Ong Ker Hsin, Soh Keng Lin, Khoo Michael Boon Chong, Teoh Wei Li
    Sains Malaysiana, 2015;44:1067-1075.
    The existing optimal design of the fixed sampling interval S2-EWMA control chart to monitor the sample variance of a process is based on the average run length (ARL) criterion. Since the shape of the run length distribution changes with the magnitude of the shift in the variance, the median run length (MRL) gives a more meaningful explanation about the in-control and out-of-control performances of a control chart. This paper proposes the optimal design of the S2-EWMA chart, based on the MRL. The Markov chain technique is employed to compute the MRLs. The performances of the S2-EWMA chart, double sampling (DS) S2 chart and S chart are evaluated and compared. The MRL results indicated that the S2-EWMA chart gives better performance for detecting small and moderate variance shifts, while maintaining almost the same sensitivity as the DS S2 and S charts toward large variance shifts, especially when the sample size increases.
    Matched MeSH terms: Markov Chains
  13. Annazirin Eli, Mardhiyyah Shaffie, Wan Zawiah W
    Sains Malaysiana, 2012;41:1403-1410.
    Statistical modeling of extreme rainfall is essential since the results can often facilitate civil engineers and planners to estimate the ability of building structures to survive under the utmost extreme conditions. Data comprising of annual maximum series (AMS) of extreme rainfall in Alor Setar were fitted to Generalized Extreme Value (GEV) distribution using method of maximum likelihood (ML) and Bayesian Markov Chain Monte Carlo (MCMC) simulations. The weakness of ML method in handling small sample is hoped to be tackled by means of Bayesian MCMC simulations in this study. In order to obtain the posterior densities, non-informative and independent priors were employed. Performances of parameter estimations were verified by conducting several goodness-of-fit tests. The results showed that Bayesian MCMC method was slightly better than ML method in estimating GEV parameters.
    Matched MeSH terms: Markov Chains
  14. Soon SS, Chia WK, Chan ML, Ho GF, Jian X, Deng YH, et al.
    PLoS One, 2014;9(9):e107866.
    PMID: 25250815 DOI: 10.1371/journal.pone.0107866
    Recent observational studies showed that post-operative aspirin use reduces cancer relapse and death in the earliest stages of colorectal cancer. We sought to evaluate the cost-effectiveness of aspirin as an adjuvant therapy in Stage I and II colorectal cancer patients aged 65 years and older.
    Matched MeSH terms: Markov Chains*
  15. El-Sharnouby S, Fischer B, Magbanua JP, Umans B, Flower R, Choo SW, et al.
    PLoS One, 2017;12(3):e0172725.
    PMID: 28282436 DOI: 10.1371/journal.pone.0172725
    It is now well established that eukaryote genomes have a common architectural organization into topologically associated domains (TADs) and evidence is accumulating that this organization plays an important role in gene regulation. However, the mechanisms that partition the genome into TADs and the nature of domain boundaries are still poorly understood. We have investigated boundary regions in the Drosophila genome and find that they can be identified as domains of very low H3K27me3. The genome-wide H3K27me3 profile partitions into two states; very low H3K27me3 identifies Depleted (D) domains that contain housekeeping genes and their regulators such as the histone acetyltransferase-containing NSL complex, whereas domains containing moderate-to-high levels of H3K27me3 (Enriched or E domains) are associated with regulated genes, irrespective of whether they are active or inactive. The D domains correlate with the boundaries of TADs and are enriched in a subset of architectural proteins, particularly Chromator, BEAF-32, and Z4/Putzig. However, rather than being clustered at the borders of these domains, these proteins bind throughout the H3K27me3-depleted regions and are much more strongly associated with the transcription start sites of housekeeping genes than with the H3K27me3 domain boundaries. While we have not demonstrated causality, we suggest that the D domain chromatin state, characterised by very low or absent H3K27me3 and established by housekeeping gene regulators, acts to separate topological domains thereby setting up the domain architecture of the genome.
    Matched MeSH terms: Markov Chains
  16. Chong SY, Tiňo P, He J, Yao X
    Evol Comput, 2019;27(2):195-228.
    PMID: 29155606 DOI: 10.1162/evco_a_00218
    Studying coevolutionary systems in the context of simplified models (i.e., games with pairwise interactions between coevolving solutions modeled as self plays) remains an open challenge since the rich underlying structures associated with pairwise-comparison-based fitness measures are often not taken fully into account. Although cyclic dynamics have been demonstrated in several contexts (such as intransitivity in coevolutionary problems), there is no complete characterization of cycle structures and their effects on coevolutionary search. We develop a new framework to address this issue. At the core of our approach is the directed graph (digraph) representation of coevolutionary problems that fully captures structures in the relations between candidate solutions. Coevolutionary processes are modeled as a specific type of Markov chains-random walks on digraphs. Using this framework, we show that coevolutionary problems admit a qualitative characterization: a coevolutionary problem is either solvable (there is a subset of solutions that dominates the remaining candidate solutions) or not. This has an implication on coevolutionary search. We further develop our framework that provides the means to construct quantitative tools for analysis of coevolutionary processes and demonstrate their applications through case studies. We show that coevolution of solvable problems corresponds to an absorbing Markov chain for which we can compute the expected hitting time of the absorbing class. Otherwise, coevolution will cycle indefinitely and the quantity of interest will be the limiting invariant distribution of the Markov chain. We also provide an index for characterizing complexity in coevolutionary problems and show how they can be generated in a controlled manner.
    Matched MeSH terms: Markov Chains
  17. Alyousifi Y, Ibrahim K, Kang W, Zin WZW
    Environ Monit Assess, 2020 Oct 21;192(11):719.
    PMID: 33083907 DOI: 10.1007/s10661-020-08666-8
    An environmental problem which is of concern across the globe nowadays is air pollution. The extent of air pollution is often studied based on data on the observed level of air pollution. Although the analysis of air pollution data that is available in the literature is numerous, studies on the dynamics of air pollution with the allowance for spatial interaction effects through the use of the Markov chain model are very limited. Accordingly, this study aims to explore the potential impact of spatial dependence over time and space on the distribution of air pollution based on the spatial Markov chain (SMC) model using the longitudinal air pollution index (API) data. This SMC model is pertinent to be applied since the daily data of API from 2012 to 2014 that have been gathered from 37 different air quality stations in Peninsular Malaysia is found to exhibit the property of spatial autocorrelation. Based on the spatial transition probability matrices found from the SMC model, specific characteristics of air pollution are studied in the regional context. These characteristics are the long-run proportion and the mean first passage time for each state of air pollution. It is found that the probability for a particular station's state to remain good is 0.814 if its neighbors are in a good state of air pollution and 0.7082 if its neighbors are in a moderate state. For a particular station having neighbors in a good state of air pollution, the proportion of time for it to continue being in a good state is 0.6. This proportion reduces to 0.4, 0.01, and 0 for the cell of moderate, unhealthy, and very unhealthy states, respectively. In addition, there exists a significant spatial dependence of API, indicating that air pollution for a particular station is dependent on the states of the neighboring stations.
    Matched MeSH terms: Markov Chains
Related Terms
Filters
Contact Us

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links