Displaying publications 81 - 100 of 116 in total

Abstract:
Sort:
  1. Tsuchida N, Nakashima M, Kato M, Heyman E, Inui T, Haginoya K, et al.
    Clin Genet, 2018 03;93(3):577-587.
    PMID: 28940419 DOI: 10.1111/cge.13144
    Epilepsies are common neurological disorders and genetic factors contribute to their pathogenesis. Copy number variations (CNVs) are increasingly recognized as an important etiology of many human diseases including epilepsy. Whole-exome sequencing (WES) is becoming a standard tool for detecting pathogenic mutations and has recently been applied to detecting CNVs. Here, we analyzed 294 families with epilepsy using WES, and focused on 168 families with no causative single nucleotide variants in known epilepsy-associated genes to further validate CNVs using 2 different CNV detection tools using WES data. We confirmed 18 pathogenic CNVs, and 2 deletions and 2 duplications at chr15q11.2 of clinically unknown significance. Of note, we were able to identify small CNVs less than 10 kb in size, which might be difficult to detect by conventional microarray. We revealed 2 cases with pathogenic CNVs that one of the 2 CNV detection tools failed to find, suggesting that using different CNV tools is recommended to increase diagnostic yield. Considering a relatively high discovery rate of CNVs (18 out of 168 families, 10.7%) and successful detection of CNV with <10 kb in size, CNV detection by WES may be able to surrogate, or at least complement, conventional microarray analysis.
    Matched MeSH terms: Computational Biology/methods
  2. Kumar S, Fazil MHUT, Ahmad K, Tripathy M, Rajapakse JC, Verma NK
    Methods Mol Biol, 2019;1930:149-156.
    PMID: 30610609 DOI: 10.1007/978-1-4939-9036-8_18
    Analysis of protein-protein interactions is important for better understanding of molecular mechanisms involved in immune regulation and has potential for elaborating avenues for drug discovery targeting T-cell motility. Currently, only a small fraction of protein-protein interactions have been characterized in T-lymphocytes although there are several detection methods available. In this regard, computational approaches garner importance, with the continued explosion of genomic and proteomic data, for handling protein modeling and protein-protein interactions in large scale. Here, we describe a computational method to identify protein-protein interactions based on in silico protein design.
    Matched MeSH terms: Computational Biology/methods*
  3. Yan CZY, Austin CM, Ayub Q, Rahman S, Gan HM
    FEMS Microbiol Lett, 2019 09 01;366(17).
    PMID: 31589302 DOI: 10.1093/femsle/fnz211
    The Malaysian and global shrimp aquaculture production has been significantly impacted by acute hepatopancreatic necrosis disease (AHPND) typically caused by Vibrio parahaemolyticus harboring the pVA plasmid containing the pirAVp and pirBVp genes, which code for Photorhabdus insect-related (Pir) toxin. The limited genomic resource for V. parahaemolyticus strains from Malaysian aquaculture farms precludes an in-depth understanding of their diversity and evolutionary relationships. In this study, we isolated shrimp-associated and environmental (rearing water) V. parahaemolyticus from three aquaculture farms located in Northern and Central Malaysia followed by whole-genome sequencing of 40 randomly selected isolates on the Illumina MiSeq. Phylogenomic analysis and multilocus sequence typing (MLST) reveal distinct lineages of V. parahaemolyticus that harbor the pirABVp genes. The recovery of pVA plasmid backbone devoid of pirAVp or pirABVp in some V. parahaemolyticus isolates suggests that the toxin genes are prone to deletion. The new insight gained from phylogenomic analysis of Asian V. parahaemolyticus, in addition to the observed genomic instability of pVa plasmid, will have implications for improvements in aquaculture practices to diagnose, treat or limit the impacts of this disease.
    Matched MeSH terms: Computational Biology/methods
  4. Rengganaten V, Huang CJ, Tsai PH, Wang ML, Yang YP, Lan YT, et al.
    Int J Mol Sci, 2020 Oct 23;21(21).
    PMID: 33114016 DOI: 10.3390/ijms21217864
    Spheroidal cancer cell cultures have been used to enrich cancer stem cells (CSC), which are thought to contribute to important clinical features of tumors. This study aimed to map the regulatory networks driven by circular RNAs (circRNAs) in CSC-enriched colorectal cancer (CRC) spheroid cells. The spheroid cells established from two CRC cell lines acquired stemness properties in pluripotency gene expression and multi-lineage differentiation capacity. Genome-wide sequencing identified 1503 and 636 circRNAs specific to the CRC parental and spheroid cells, respectively. In the CRC spheroids, algorithmic analyses unveiled a core network of mRNAs involved in modulating stemness-associated signaling pathways, driven by a circRNA-microRNA (miRNA)-mRNA axis. The two major circRNAs, hsa_circ_0066631 and hsa_circ_0082096, in this network were significantly up-regulated in expression levels in the spheroid cells. The two circRNAs were predicted to target and were experimentally shown to down-regulate miR-140-3p, miR-224, miR-382, miR-548c-3p and miR-579, confirming circRNA sponging of the targeted miRNAs. Furthermore, the affected miRNAs were demonstrated to inhibit degradation of six mRNA targets, viz. ACVR1C/ALK7, FZD3, IL6ST/GP130, SKIL/SNON, SMAD2 and WNT5, in the CRC spheroid cells. These mRNAs encode proteins that are reported to variously regulate the GP130/Stat, Activin/Nodal, TGF-β/SMAD or Wnt/β-catenin signaling pathways in controlling various aspects of CSC stemness. Using the CRC spheroid cell model, the novel circRNA-miRNA-mRNA axis mapped in this work forms the foundation for the elucidation of the molecular mechanisms of the complex cellular and biochemical processes that determine CSC stemness properties of cancer cells, and possibly for designing therapeutic strategies for CRC treatment by targeting CSC.
    Matched MeSH terms: Computational Biology/methods
  5. Sillitoe I, Bordin N, Dawson N, Waman VP, Ashford P, Scholes HM, et al.
    Nucleic Acids Res, 2021 01 08;49(D1):D266-D273.
    PMID: 33237325 DOI: 10.1093/nar/gkaa1079
    CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt.
    Matched MeSH terms: Computational Biology/methods
  6. Hosseinpoor AR, Nambiar D, Schlotheuber A, Reidpath D, Ross Z
    BMC Med Res Methodol, 2016 10 19;16(1):141.
    PMID: 27760520
    BACKGROUND: It is widely recognised that the pursuit of sustainable development cannot be accomplished without addressing inequality, or observed differences between subgroups of a population. Monitoring health inequalities allows for the identification of health topics where major group differences exist, dimensions of inequality that must be prioritised to effect improvements in multiple health domains, and also population subgroups that are multiply disadvantaged. While availability of data to monitor health inequalities is gradually improving, there is a commensurate need to increase, within countries, the technical capacity for analysis of these data and interpretation of results for decision-making. Prior efforts to build capacity have yielded demand for a toolkit with the computational ability to display disaggregated data and summary measures of inequality in an interactive and customisable fashion that would facilitate interpretation and reporting of health inequality in a given country.

    METHODS: To answer this demand, the Health Equity Assessment Toolkit (HEAT), was developed between 2014 and 2016. The software, which contains the World Health Organization's Health Equity Monitor database, allows the assessment of inequalities within a country using over 30 reproductive, maternal, newborn and child health indicators and five dimensions of inequality (economic status, education, place of residence, subnational region and child's sex, where applicable).

    RESULTS/CONCLUSION: HEAT was beta-tested in 2015 as part of ongoing capacity building workshops on health inequality monitoring. This is the first and only application of its kind; further developments are proposed to introduce an upload data feature, translate it into different languages and increase interactivity of the software. This article will present the main features and functionalities of HEAT and discuss its relevance and use for health inequality monitoring.

    Matched MeSH terms: Computational Biology/methods*
  7. Rahman F, Hassan M, Rosli R, Almousally I, Hanano A, Murphy DJ
    PLoS One, 2018;13(5):e0196669.
    PMID: 29771926 DOI: 10.1371/journal.pone.0196669
    Bioinformatics analyses of caleosin/peroxygenases (CLO/PXG) demonstrated that these genes are present in the vast majority of Viridiplantae taxa for which sequence data are available. Functionally active CLO/PXG proteins with roles in abiotic stress tolerance and lipid droplet storage are present in some Trebouxiophycean and Chlorophycean green algae but are absent from the small number of sequenced Prasinophyceaen genomes. CLO/PXG-like genes are expressed during dehydration stress in Charophyte algae, a sister clade of the land plants (Embryophyta). CLO/PXG-like sequences are also present in all of the >300 sequenced Embryophyte genomes, where some species contain as many as 10-12 genes that have arisen via selective gene duplication. Angiosperm genomes harbour at least one copy each of two distinct CLO/PX isoforms, termed H (high) and L (low), where H-forms contain an additional C-terminal motif of about 30-50 residues that is absent from L-forms. In contrast, species in other Viridiplantae taxa, including green algae, non-vascular plants, ferns and gymnosperms, contain only one (or occasionally both) of these isoforms per genome. Transcriptome and biochemical data show that CLO/PXG-like genes have complex patterns of developmental and tissue-specific expression. CLO/PXG proteins can associate with cytosolic lipid droplets and/or bilayer membranes. Many of the analysed isoforms also have peroxygenase activity and are involved in oxylipin metabolism. The distribution of CLO/PXG-like genes is consistent with an origin >1 billion years ago in at least two of the earliest diverging groups of the Viridiplantae, namely the Chlorophyta and the Streptophyta, after the Viridiplantae had already diverged from other Archaeplastidal groups such as the Rhodophyta and Glaucophyta. While algal CLO/PXGs have roles in lipid packaging and stress responses, the Embryophyte proteins have a much wider spectrum of roles and may have been instrumental in the colonisation of terrestrial habitats and the subsequent diversification as the major land flora.
    Matched MeSH terms: Computational Biology/methods
  8. Hong KW, Asmah Hani AW, Nurul Aina Murni CA, Pusparani RR, Chong CK, Verasahib K, et al.
    Infect Genet Evol, 2017 Oct;54:263-270.
    PMID: 28711373 DOI: 10.1016/j.meegid.2017.07.015
    In this study, we report the comparative genomics and phylogenetic analysis of Corynebacterium diphtheriae strain B-D-16-78 that was isolated from a clinical specimen in 2016. The complete genome of C. diphtheriae strain B-D-16-78 was sequenced using PacBio Single Molecule, Real-Time sequencing technology and consists of a 2,474,151-bp circular chromosome with an average GC content of 53.56%. The core genome of C. diphtheriae was also deduced from a total of 74 strains with complete or draft genome sequences and the core genome-based phylogenetic analysis revealed close genetic relationship among strains that shared the same MLST allelic profile. In the context of CRISPR-Cas system, which confers adaptive immunity against re-invading DNA, 73 out of 86 spacer sequences were found to be unique to Malaysian strains which harboured only type-II-C and/or type-I-E-a systems. A total of 48 tox genes which code for the diphtheria toxin were retrieved from the 74 genomes and with the exception of one truncated gene, only nucleotide substitutions were detected when compared to the tox gene sequence of PW8. More than half were synonymous substitution and only two were nonsynonymous substitutions whereby H24Y was predicted to have a damaging effect on the protein function whilst T262V was predicted to be tolerated. Both toxigenic and non-toxigenic toxin-gene bearing strains have been isolated in Malaysia but the repeated isolation of toxigenic strains with the same MLST profile suggests the possibility of some of these strains may be circulating in the population. Hence, efforts to increase herd immunity should be continued and supported by an effective monitoring and surveillance system to track, manage and control outbreak of cases.
    Matched MeSH terms: Computational Biology/methods
  9. Wongrattanakamon P, Lee VS, Nimmanpipug P, Sirithunyalug B, Chansakaow S, Jiranusornkul S
    Toxicol. Mech. Methods, 2017 May;27(4):253-271.
    PMID: 27996361 DOI: 10.1080/15376516.2016.1273428
    In this work, molecular docking, pharmacophore modeling and molecular dynamics (MD) simulation were rendered for the mouse P-glycoprotein (P-gp) (code: 4Q9H) and bioflavonoids; amorphigenin, chrysin, epigallocatechin, formononetin and rotenone including a positive control; verapamil to identify protein-ligand interaction features including binding affinities, interaction characteristics, hot-spot amino acid residues and complex stabilities. These flavonoids occupied the same binding site with high binding affinities and shared the same key residues for their binding interactions and the binding region of the flavonoids was revealed that overlapped the ATP binding region with hydrophobic and hydrophilic interactions suggesting a competitive inhibition mechanism of the compounds. Root mean square deviations (RMSDs) analysis of MD trajectories of the protein-ligand complexes and NBD2 residues, and ligands pointed out these residues were stable throughout the duration of MD simulations. Thus, the applied preliminary structure-based molecular modeling approach of interactions between NBD2 and flavonoids may be gainful to realize the intimate inhibition mechanism of P-gp at NBD2 level and on the basis of the obtained data, it can be concluded that these bioflavonoids have the potential to cause herb-drug interactions or be used as lead molecules for the inhibition of P-gp (as anti-multidrug resistance agents) via the NBD2 blocking mechanism in future.
    Matched MeSH terms: Computational Biology/methods*
  10. Alballa M, Aplop F, Butler G
    PLoS One, 2020;15(1):e0227683.
    PMID: 31935244 DOI: 10.1371/journal.pone.0227683
    Transporters mediate the movement of compounds across the membranes that separate the cell from its environment and across the inner membranes surrounding cellular compartments. It is estimated that one third of a proteome consists of membrane proteins, and many of these are transport proteins. Given the increase in the number of genomes being sequenced, there is a need for computational tools that predict the substrates that are transported by the transmembrane transport proteins. In this paper, we present TranCEP, a predictor of the type of substrate transported by a transmembrane transport protein. TranCEP combines the traditional use of the amino acid composition of the protein, with evolutionary information captured in a multiple sequence alignment (MSA), and restriction to important positions of the alignment that play a role in determining the specificity of the protein. Our experimental results show that TranCEP significantly outperforms the state-of-the-art predictors. The results quantify the contribution made by each type of information used.
    Matched MeSH terms: Computational Biology/methods*
  11. Abdullah-Zawawi MR, Ahmad-Nizammuddin NF, Govender N, Harun S, Mohd-Assaad N, Mohamed-Hussein ZA
    Sci Rep, 2021 10 04;11(1):19678.
    PMID: 34608238 DOI: 10.1038/s41598-021-99206-y
    Transcription factors (TFs) form the major class of regulatory genes and play key roles in multiple plant stress responses. In most eukaryotic plants, transcription factor (TF) families (WRKY, MADS-box and MYB) activate unique cellular-level abiotic and biotic stress-responsive strategies, which are considered as key determinants for defense and developmental processes. Arabidopsis and rice are two important representative model systems for dicot and monocot plants, respectively. A comprehensive comparative study on 101 OsWRKY, 34 OsMADS box and 122 OsMYB genes (rice genome) and, 71 AtWRKY, 66 AtMADS box and 144 AtMYB genes (Arabidopsis genome) showed various relationships among TFs across species. The phylogenetic analysis clustered WRKY, MADS-box and MYB TF family members into 10, 7 and 14 clades, respectively. All clades in WRKY and MYB TF families and almost half of the total number of clades in the MADS-box TF family are shared between both species. Chromosomal and gene structure analysis showed that the Arabidopsis-rice orthologous TF gene pairs were unevenly localized within their chromosomes whilst the distribution of exon-intron gene structure and motif conservation indicated plausible functional similarity in both species. The abiotic and biotic stress-responsive cis-regulatory element type and distribution patterns in the promoter regions of Arabidopsis and rice WRKY, MADS-box and MYB orthologous gene pairs provide better knowledge on their role as conserved regulators in both species. Co-expression network analysis showed the correlation between WRKY, MADs-box and MYB genes in each independent rice and Arabidopsis network indicating their role in stress responsiveness and developmental processes.
    Matched MeSH terms: Computational Biology/methods
  12. Ramachandran H, Shafie NAH, Sudesh K, Azizan MN, Majid MIA, Amirul AA
    Antonie Van Leeuwenhoek, 2018 Mar;111(3):361-372.
    PMID: 29022146 DOI: 10.1007/s10482-017-0958-8
    Bacterial classification on the basis of a polyphasic approach was conducted on three poly(3 hydroxybutyrate-co-4-hydroxybutyrate) [P(3HB-co-4HB)] accumulating bacterial strains that were isolated from samples collected from Malaysian environments; Kulim Lake, Sg. Pinang river and Sg. Manik paddy field. The Gram-negative, rod-shaped, motile, non-sporulating and non-fermenting bacteria were shown to belong to the genus Cupriavidus of the Betaproteobacteria on the basis of their 16S rRNA gene sequence analyses. The sequence similarity value with their near phylogenetic neighbour, Cupriavidus pauculus LMG3413T, was 98.5%. However, the DNA-DNA hybridization values (8-58%) and ribotyping analysis both enabled these strains to be differentiated from related Cupriavidus species with validly published names. The RiboPrint patterns of the three strains also revealed that the strains were genetically related even though they displayed a clonal diversity. The major cellular fatty acids detected in these strains included C15:0 ISO 2OH/C16:1 ω7c, hexadecanoic (16:0) and cis-11-octadecenoic (C18:1 ω7c). Their G+C contents ranged from 68.0  to 68.6 mol%, and their major isoprenoid quinone was Ubiquinone Q-8. Of these three strains, only strain USMAHM13 (= DSM 25816 = KCTC 32390) was discovered to exhibit yellow pigmentation that is characteristic of the carotenoid family. Their assembled genomes also showed that the three strains were not identical in terms of their genome sizes that were 7.82, 7.95 and 8.70 Mb for strains USMAHM13, USMAA1020 and USMAA2-4, respectively, which are slightly larger than that of Cupriavidus necator H16 (7.42 Mb). The average nucleotide identity (ANI) results indicated that the strains were genetically related and the genome pairs belong to the same species. On the basis of the results obtained in this study, the three strains are considered to represent a novel species for which the name Cupriavidus malaysiensis sp. nov. is proposed. The type strain of the species is USMAA1020T (= DSM 19416T = KCTC 32390T).
    Matched MeSH terms: Computational Biology/methods
  13. Li YY, Liu H, Fu SH, Li XL, Guo XF, Li MH, et al.
    Infect Genet Evol, 2017 11;55:48-55.
    PMID: 28827175 DOI: 10.1016/j.meegid.2017.08.016
    Getah virus (GETV) was first isolated in Malaysia in 1955. Since then, epidemics in horses and pigs caused by GETV have resulted in huge economic losses. At present, GETV has spread across Eurasia and Southeast Asia, including mainland China, Korea, Japan, Mongolia, and Russia. Data show that the Most Recent Common Ancestor (MRCA) of GETV existed about 145years ago (95% HPD: 75-244) and gradually evolved into four distinct evolutionary populations: Groups I-IV. The MRCA of GETVs in Group III, which includes all GETVs isolated from mosquitoes, pigs, horses, and other animals since the 1960s (from latitude 19°N to 60°N), existed about 51years ago (95% HPD: 51-72). Group III is responsible for most viral epidemics among domestic animals. An analysis of the GETV E2 protein sequence and structure revealed seven common amino acid mutation sites. These sites are responsible for the structural and electrostatic differences detected between widespread Group III isolates and the prototype strain MM2021. These differences may account for the recent geographical radiation of the virus. Considering the economic significance of GETV infection in pigs and horses, we recommend the implementation of strict viral screening and monitoring programs.
    Matched MeSH terms: Computational Biology/methods
  14. Loh SY, Jahans-Price T, Greenwood MP, Greenwood M, Hoe SZ, Konopacka A, et al.
    eNeuro, 2017 12 21;4(6).
    PMID: 29279858 DOI: 10.1523/ENEURO.0243-17.2017
    The supraoptic nucleus (SON) is a group of neurons in the hypothalamus responsible for the synthesis and secretion of the peptide hormones vasopressin and oxytocin. Following physiological cues, such as dehydration, salt-loading and lactation, the SON undergoes a function related plasticity that we have previously described in the rat at the transcriptome level. Using the unsupervised graphical lasso (Glasso) algorithm, we reconstructed a putative network from 500 plastic SON genes in which genes are the nodes and the edges are the inferred interactions. The most active nodal gene identified within the network was Caprin2. Caprin2 encodes an RNA-binding protein that we have previously shown to be vital for the functioning of osmoregulatory neuroendocrine neurons in the SON of the rat hypothalamus. To test the validity of the Glasso network, we either overexpressed or knocked down Caprin2 transcripts in differentiated rat pheochromocytoma PC12 cells and showed that these manipulations had significant opposite effects on the levels of putative target mRNAs. These studies suggest that the predicative power of the Glasso algorithm within an in vivo system is accurate, and identifies biological targets that may be important to the functional plasticity of the SON.
    Matched MeSH terms: Computational Biology/methods*
  15. Charoenkwan P, Chotpatiwetchkul W, Lee VS, Nantasenamat C, Shoombuatong W
    Sci Rep, 2021 Dec 10;11(1):23782.
    PMID: 34893688 DOI: 10.1038/s41598-021-03293-w
    Owing to their ability to maintain a thermodynamically stable fold at extremely high temperatures, thermophilic proteins (TTPs) play a critical role in basic research and a variety of applications in the food industry. As a result, the development of computation models for rapidly and accurately identifying novel TTPs from a large number of uncharacterized protein sequences is desirable. In spite of existing computational models that have already been developed for characterizing thermophilic proteins, their performance and interpretability remain unsatisfactory. We present a novel sequence-based thermophilic protein predictor, termed SCMTPP, for improving model predictability and interpretability. First, an up-to-date and high-quality dataset consisting of 1853 TPPs and 3233 non-TPPs was compiled from published literature. Second, the SCMTPP predictor was created by combining the scoring card method (SCM) with estimated propensity scores of g-gap dipeptides. Benchmarking experiments revealed that SCMTPP had a cross-validation accuracy of 0.883, which was comparable to that of a support vector machine-based predictor (0.906-0.910) and 2-17% higher than that of commonly used machine learning models. Furthermore, SCMTPP outperformed the state-of-the-art approach (ThermoPred) on the independent test dataset, with accuracy and MCC of 0.865 and 0.731, respectively. Finally, the SCMTPP-derived propensity scores were used to elucidate the critical physicochemical properties for protein thermostability enhancement. In terms of interpretability and generalizability, comparative results showed that SCMTPP was effective for identifying and characterizing TPPs. We had implemented the proposed predictor as a user-friendly online web server at http://pmlabstack.pythonanywhere.com/SCMTPP in order to allow easy access to the model. SCMTPP is expected to be a powerful tool for facilitating community-wide efforts to identify TPPs on a large scale and guiding experimental characterization of TPPs.
    Matched MeSH terms: Computational Biology/methods*
  16. Gan HM, Grandjean F, Jenkins TL, Austin CM
    BMC Genomics, 2019 May 03;20(1):335.
    PMID: 31053062 DOI: 10.1186/s12864-019-5704-3
    BACKGROUND: The recently published complete mitogenome of the European lobster (Homarus gammarus) that was generated using long-range PCR exhibits unusual gene composition (missing nad2) and gene rearrangements among decapod crustaceans with strong implications in crustacean phylogenetics. Such atypical mitochondrial features will benefit greatly from validation with emerging long read sequencing technologies such as Oxford Nanopore that can more accurately identify structural variation.

    RESULTS: We re-sequenced the H. gammarus mitogenome on an Oxford Nanopore Minion flowcell and performed a long-read only assembly, generating a complete mitogenome assembly for H. gammarus. In contrast to previous reporting, we found an intact mitochondrial nad2 gene in the H. gammarus mitogenome and showed that its gene organization is broadly similar to that of the American lobster (H. americanus) except for the presence of a large tandemly duplicated region with evidence of pseudogenization in one of each duplicated protein-coding genes.

    CONCLUSIONS: Using the European lobster as an example, we demonstrate the value of Oxford Nanopore long read technology in resolving problematic mitogenome assemblies. The increasing accessibility of Oxford Nanopore technology will make it an attractive and useful tool for evolutionary biologists to verify new and existing unusual mitochondrial gene rearrangements recovered using first and second generation sequencing technologies, particularly those used to make phylogenetic inferences of evolutionary scenarios.

    Matched MeSH terms: Computational Biology/methods*
  17. Chong LC, Gandhi G, Lee JM, Yeo WWY, Choi SB
    Int J Mol Sci, 2021 Aug 20;22(16).
    PMID: 34445667 DOI: 10.3390/ijms22168962
    Spinal muscular atrophy (SMA), one of the leading inherited causes of child mortality, is a rare neuromuscular disease arising from loss-of-function mutations of the survival motor neuron 1 (SMN1) gene, which encodes the SMN protein. When lacking the SMN protein in neurons, patients suffer from muscle weakness and atrophy, and in the severe cases, respiratory failure and death. Several therapeutic approaches show promise with human testing and three medications have been approved by the U.S. Food and Drug Administration (FDA) to date. Despite the shown promise of these approved therapies, there are some crucial limitations, one of the most important being the cost. The FDA-approved drugs are high-priced and are shortlisted among the most expensive treatments in the world. The price is still far beyond affordable and may serve as a burden for patients. The blooming of the biomedical data and advancement of computational approaches have opened new possibilities for SMA therapeutic development. This article highlights the present status of computationally aided approaches, including in silico drug repurposing, network driven drug discovery as well as artificial intelligence (AI)-assisted drug discovery, and discusses the future prospects.
    Matched MeSH terms: Computational Biology/methods
  18. Horne HN, Chung CC, Zhang H, Yu K, Prokunina-Olsson L, Michailidou K, et al.
    PLoS One, 2016;11(8):e0160316.
    PMID: 27556229 DOI: 10.1371/journal.pone.0160316
    The Cancer Genetic Markers of Susceptibility genome-wide association study (GWAS) originally identified a single nucleotide polymorphism (SNP) rs11249433 at 1p11.2 associated with breast cancer risk. To fine-map this locus, we genotyped 92 SNPs in a 900kb region (120,505,799-121,481,132) flanking rs11249433 in 45,276 breast cancer cases and 48,998 controls of European, Asian and African ancestry from 50 studies in the Breast Cancer Association Consortium. Genotyping was done using iCOGS, a custom-built array. Due to the complicated nature of the region on chr1p11.2: 120,300,000-120,505,798, that lies near the centromere and contains seven duplicated genomic segments, we restricted analyses to 429 SNPs excluding the duplicated regions (42 genotyped and 387 imputed). Per-allelic associations with breast cancer risk were estimated using logistic regression models adjusting for study and ancestry-specific principal components. The strongest association observed was with the original identified index SNP rs11249433 (minor allele frequency (MAF) 0.402; per-allele odds ratio (OR) = 1.10, 95% confidence interval (CI) 1.08-1.13, P = 1.49 x 10-21). The association for rs11249433 was limited to ER-positive breast cancers (test for heterogeneity P≤8.41 x 10-5). Additional analyses by other tumor characteristics showed stronger associations with moderately/well differentiated tumors and tumors of lobular histology. Although no significant eQTL associations were observed, in silico analyses showed that rs11249433 was located in a region that is likely a weak enhancer/promoter. Fine-mapping analysis of the 1p11.2 breast cancer susceptibility locus confirms this region to be limited to risk to cancers that are ER-positive.
    Matched MeSH terms: Computational Biology/methods
  19. Kumar S
    BMC Res Notes, 2015;8:9.
    PMID: 25595103 DOI: 10.1186/s13104-015-0976-4
    Cytochrome P450s (CYPs) are important heme-containing proteins, well known for their monooxygenase reaction. The human cytochrome P450 4X1 (CYP4X1) is categorized as "orphan" CYP because of its unknown function. In recent studies it is found that this enzyme is expressed in neurovascular functions of the brain. Also, various studies have found the expression and activity of orphan human cytochrome P450 4X1 in cancer. It is found to be a potential drug target for cancer therapy. However, three-dimensional structure, the active site topology and substrate specificity of CYP4X1 remain unclear.
    Matched MeSH terms: Computational Biology/methods*
  20. Mirsafian H, Mat Ripen A, Merican AF, Bin Mohamad S
    ScientificWorldJournal, 2014;2014:482463.
    PMID: 25254246 DOI: 10.1155/2014/482463
    Beta-amyloid precursor protein cleavage enzyme 1 (BACE1) and beta-amyloid precursor protein cleavage enzyme 2 (BACE2), members of aspartyl protease family, are close homologues and have high similarity in their protein crystal structures. However, their enzymatic properties differ leading to disparate clinical consequences. In order to identify the residues that are responsible for such differences, we used evolutionary trace (ET) method to compare the amino acid conservation patterns of BACE1 and BACE2 in several mammalian species. We found that, in BACE1 and BACE2 structures, most of the ligand binding sites are conserved which indicate their enzymatic property of aspartyl protease family members. The other conserved residues are more or less randomly localized in other parts of the structures. Four group-specific residues were identified at the ligand binding site of BACE1 and BACE2. We postulated that these residues would be essential for selectivity of BACE1 and BACE2 biological functions and could be sites of interest for the design of selective inhibitors targeting either BACE1 or BACE2.
    Matched MeSH terms: Computational Biology/methods*
Filters
Contact Us

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links