MyMedR

Drug Discovery of Spinal Muscular Atrophy (SMA) from the Computational Perspective: A Comprehensive Review

Chong LC, Gandhi G, Lee JM, Yeo WWY, Choi SB

Int J Mol Sci, 2021 Aug 20;22(16).
PMID: 34445667 DOI: 10.3390/ijms22168962

Spinal muscular atrophy (SMA), one of the leading inherited causes of child mortality, is a rare neuromuscular disease arising from loss-of-function mutations of the survival motor neuron 1 (SMN1) gene, which encodes the SMN protein. When neurons lack the SMN protein, patients suffer from muscle weakness and atrophy and, in severe cases, respiratory failure and death. Several therapeutic approaches show promise in human testing, and three medications have been approved by the U.S. Food and Drug Administration (FDA) to date. Despite the promise of these approved therapies, there are some crucial limitations, one of the most important being cost. The FDA-approved drugs are among the most expensive treatments in the world; their price remains far from affordable and may be a burden for patients. The growth of biomedical data and the advancement of computational approaches have opened new possibilities for SMA therapeutic development. This article highlights the present status of computationally aided approaches, including in silico drug repurposing, network-driven drug discovery, and artificial intelligence (AI)-assisted drug discovery, and discusses future prospects.

Matched MeSH terms: Computational Biology/methods
Immunoinformatics-based potential multi-peptide vaccine designing against Jamestown Canyon Virus (JCV) capable of eliciting cellular and humoral immune responses

Shahab M, Aiman S, Alshammari A, Alasmari AF, Alharbi M, Khan A, et al.

Int J Biol Macromol, 2023 Dec 31;253(Pt 2):126678.
PMID: 37666399 DOI: 10.1016/j.ijbiomac.2023.126678

Jamestown Canyon virus (JCV) causes a deadly viral infection and is transmitted by various mosquito species. This mosquito-borne virus belongs to the Bunyaviridae family and poses a high public health threat in the tropical regions of the United States, causing encephalitis in humans. Common symptoms of JCV infection include fever, headache, stiff neck, photophobia, nausea, vomiting, and seizures. Despite the availability of resources, there is currently no vaccine or drug available to combat JCV. The purpose of this study was to develop an epitope-based vaccine using immunoinformatics approaches. The vaccine was intended to be safe, efficient, biocompatible, and capable of stimulating both innate and adaptive immune responses. In this study, the protein sequence of JCV was obtained from the NCBI database. Various bioinformatics methods, including toxicity evaluation, antigenicity testing, conservancy analysis, and allergenicity assessment, were used to identify the most promising epitopes. Suitable linkers and adjuvant sequences were used in the design of the vaccine construct. A 50S ribosomal protein sequence was used as an adjuvant at the N-terminus of the construct. A total of 5 CTL, 5 HTL, and 5 linear B cell epitopes were selected based on non-allergenicity, immunological potential, and antigenicity scores to design a highly immunogenic multi-peptide vaccine construct. Strong interactions between the proposed vaccine and human immune receptors, i.e., TLR-2 and TLR-4, were revealed in a docking study using ClusPro software, suggesting their possible relevance in the immunological response to the vaccine. Assessment of immunological and physicochemical properties showed that the proposed vaccine had high immunogenicity, solubility, and thermostability. Molecular dynamics simulations confirmed the strong binding affinities, as well as the dynamic and structural stability, of the proposed vaccine. Immune simulation suggests that the vaccine has the potential to effectively stimulate cellular and humoral immune responses to combat JCV infection. Experimental and clinical assays are required to validate the results of this study.

Matched MeSH terms: Computational Biology/methods
Computational Analysis and In silico Predictive Modeling for Inhibitors of PhoP Regulon in S. typhi on High-Throughput Screening Bioassay Dataset

Kaur H, Ahmad M, Scaria V

Interdiscip Sci, 2016 Mar;8(1):95-101.
PMID: 26298582 DOI: 10.1007/s12539-015-0273-x

Multidrug-resistant Salmonella enterica serotype Typhi has emerged in pandemic proportions throughout the world, so there is a need to speed up the discovery of novel molecules that have different modes of action, are less affected by resistance formation, and could be used as drugs for the treatment of salmonellosis, particularly typhoid fever. The PhoP regulon is well studied and has been shown to be a critical regulator of the expression of a number of genes required for the intracellular survival of S. enterica and for the pathophysiology of diseases such as typhoid. The evident roles of the products regulated by the two-component PhoP/PhoQ system in Salmonella virulence have motivated attempts to target them therapeutically. Although the discovery of biologically active compounds for the treatment of typhoid relies on hit-finding procedures, high-throughput screening alone is very expensive and time-consuming when performed at large scale. With recent advances in combinatorial chemistry and contemporary compound-synthesis techniques, more and more compounds are available, giving rise to large and diverse compound libraries, but the time and effort required to screen these massive, unfocused libraries have been only slightly reduced in recent years. Hence, there is a demand to improve hit quality and the success rate of high-throughput screening, which requires compound libraries that are focused and biased toward the particular target. Therefore, we still need an advantageous and expedient method to prioritize the molecules to be used in biological screens, one that saves time and is inexpensive. In this context, in silico methods such as machine learning are widely applicable techniques for building computational models for high-throughput virtual screens that prioritize molecules for further study. Furthermore, we extended our computational analysis to identify common enriched structural entities among the biologically active compounds, with the aim of finding privileged scaffolds.

Matched MeSH terms: Computational Biology/methods*
Comparative genomic and phylogenetic analysis of a toxigenic clinical isolate of Corynebacterium diphtheriae strain B-D-16-78 from Malaysia

Hong KW, Asmah Hani AW, Nurul Aina Murni CA, Pusparani RR, Chong CK, Verasahib K, et al.

Infect Genet Evol, 2017 Oct;54:263-270.
PMID: 28711373 DOI: 10.1016/j.meegid.2017.07.015

In this study, we report the comparative genomics and phylogenetic analysis of Corynebacterium diphtheriae strain B-D-16-78 that was isolated from a clinical specimen in 2016. The complete genome of C. diphtheriae strain B-D-16-78 was sequenced using PacBio Single Molecule, Real-Time sequencing technology and consists of a 2,474,151-bp circular chromosome with an average GC content of 53.56%. The core genome of C. diphtheriae was also deduced from a total of 74 strains with complete or draft genome sequences, and the core genome-based phylogenetic analysis revealed a close genetic relationship among strains that shared the same MLST allelic profile. In the context of the CRISPR-Cas system, which confers adaptive immunity against re-invading DNA, 73 out of 86 spacer sequences were found to be unique to Malaysian strains, which harboured only type-II-C and/or type-I-E-a systems. A total of 48 tox genes, which code for the diphtheria toxin, were retrieved from the 74 genomes and, with the exception of one truncated gene, only nucleotide substitutions were detected when compared to the tox gene sequence of PW8. More than half were synonymous substitutions and only two were nonsynonymous substitutions, whereby H24Y was predicted to have a damaging effect on the protein function whilst T262V was predicted to be tolerated. Both toxigenic and non-toxigenic toxin-gene bearing strains have been isolated in Malaysia, but the repeated isolation of toxigenic strains with the same MLST profile suggests that some of these strains may be circulating in the population. Hence, efforts to increase herd immunity should be continued and supported by an effective monitoring and surveillance system to track, manage and control outbreaks.

Matched MeSH terms: Computational Biology/methods
From discovery to spread: The evolution and phylogeny of Getah virus

Li YY, Liu H, Fu SH, Li XL, Guo XF, Li MH, et al.

Infect Genet Evol, 2017 Nov;55:48-55.
PMID: 28827175 DOI: 10.1016/j.meegid.2017.08.016

Getah virus (GETV) was first isolated in Malaysia in 1955. Since then, epidemics in horses and pigs caused by GETV have resulted in huge economic losses. At present, GETV has spread across Eurasia and Southeast Asia, including mainland China, Korea, Japan, Mongolia, and Russia. Data show that the Most Recent Common Ancestor (MRCA) of GETV existed about 145 years ago (95% HPD: 75-244) and gradually evolved into four distinct evolutionary populations: Groups I-IV. The MRCA of GETVs in Group III, which includes all GETVs isolated from mosquitoes, pigs, horses, and other animals since the 1960s (from latitude 19°N to 60°N), existed about 51 years ago (95% HPD: 51-72). Group III is responsible for most viral epidemics among domestic animals. An analysis of the GETV E2 protein sequence and structure revealed seven common amino acid mutation sites. These sites are responsible for the structural and electrostatic differences detected between widespread Group III isolates and the prototype strain MM2021. These differences may account for the recent geographical radiation of the virus. Considering the economic significance of GETV infection in pigs and horses, we recommend the implementation of strict viral screening and monitoring programs.

Matched MeSH terms: Computational Biology/methods
In silico identification and characterization of a putative phosphatidylinositol 4-phosphate 5-kinase (PIP5K) gene in Eimeria tenella

Ling KH, Loo SS, Rosli R, Shamsudin MN, Mohamed R, Wan KL

In Silico Biol, 2007;7(1):115-21.
PMID: 17688436

Phosphatidylinositol 4-phosphate 5-kinases (PIP5Ks) play diverse roles in the cellular biology of many organisms, including signal transduction, secretion and vesicular trafficking, and regulation of cytoskeleton assembly. Discovery of the PIP5K gene in Eimeria tenella may shed light on its role in the biology of this avian protozoan, and afford further understanding of the cell-host interaction, particularly during the invasion process. In this study, we report the identification of the PIP5K coding region in the genome sequence of Eimeria tenella using in silico gene prediction approaches. Prediction of the PIP5K coding sequence was confirmed by mapping the full-length cDNA sequence, generated via the Rapid Amplification of cDNA Ends (RACE) method, to the genomic sequence. The putative PIP5K gene of Eimeria tenella is located on the complementary strand of the E1080B12.b1 contig, and comprises 12 exons. Further analysis showed that the coding region spans from exon 1 to exon 7, with all exons obeying the adopted 'gt...ag' splicing rule of intronic sequences. The consensus of the hexameric 5' donor-splice site was deduced as GTRDBB... and the consensus of the 3' acceptor-splice site as ...BHDYAG. The gene encodes a 252-amino acid residue protein. Domain search and protein fold recognition analyses provide compelling evidence that the deduced protein is a PIP5K.
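The two splice-site consensus strings in this abstract (GTRDBB for the donor, BHDYAG for the acceptor) are written in IUPAC nucleotide ambiguity codes. A minimal sketch of how such a consensus could be checked against an intron sequence follows; the function names, the minimum intron length, and the example sequences are hypothetical and are not taken from the paper.

```python
import re

# Standard IUPAC nucleotide ambiguity codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def consensus_to_regex(consensus):
    """Compile an IUPAC consensus string into a regular expression."""
    return re.compile("".join(IUPAC[base] for base in consensus.upper()))


# Consensus strings as reported in the abstract.
DONOR = consensus_to_regex("GTRDBB")     # first six bases of the intron
ACCEPTOR = consensus_to_regex("BHDYAG")  # last six bases of the intron


def intron_matches_consensus(intron):
    """Return True if the intron starts with the donor consensus and ends
    with the acceptor consensus. Requiring at least 12 bases (so the two
    hexamers do not overlap) is an assumption of this sketch."""
    intron = intron.upper()
    if len(intron) < 12:
        return False
    return bool(DONOR.fullmatch(intron[:6]) and ACCEPTOR.fullmatch(intron[-6:]))


# Hypothetical intron sequences, for illustration only.
print(intron_matches_consensus("GTAAGTCCCCTCTCAG"))
print(intron_matches_consensus("GTAAGACCCCTCTCAG"))
```

Translating the ambiguity codes to character classes keeps the consensus readable in its published form while letting an ordinary regex engine do the matching.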

Matched MeSH terms: Computational Biology/methods*
Large scale multifactorial likelihood quantitative analysis of BRCA1 and BRCA2 variants: An ENIGMA resource to support clinical variant classification

Parsons MT, Tudini E, Li H, Hahnen E, Wappenschmidt B, Feliubadaló L, et al.

Hum Mutat, 2019 Sep;40(9):1557-1578.
PMID: 31131967 DOI: 10.1002/humu.23818

The multifactorial likelihood analysis method has demonstrated utility for quantitative assessment of variant pathogenicity for multiple cancer syndrome genes. Independent data types currently incorporated in the model for assessing BRCA1 and BRCA2 variants include clinically calibrated prior probability of pathogenicity based on variant location and bioinformatic prediction of variant effect, co-segregation, family cancer history profile, co-occurrence with a pathogenic variant in the same gene, breast tumor pathology, and case-control information. Research and clinical data for multifactorial likelihood analysis were collated for 1,395 BRCA1/2 predominantly intronic and missense variants, enabling classification based on posterior probability of pathogenicity for 734 variants: 447 variants were classified as (likely) benign, and 94 as (likely) pathogenic; and 248 classifications were new or considerably altered relative to ClinVar submissions. Classifications were compared with information not yet included in the likelihood model, and evidence strengths aligned to those recommended for ACMG/AMP classification codes. Altered mRNA splicing or function relative to known nonpathogenic variant controls were moderately to strongly predictive of variant pathogenicity. Variant absence in population datasets provided supporting evidence for variant pathogenicity. These findings have direct relevance for BRCA1 and BRCA2 variant evaluation, and justify the need for gene-specific calibration of evidence types used for variant classification.
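The abstract describes combining a prior probability of pathogenicity with evidence from independent data types to obtain a posterior probability. A minimal sketch of that Bayesian combination is given below, assuming each data type contributes a likelihood ratio and that the ratios are independent; the function name and the numeric inputs are hypothetical and do not come from the paper.

```python
from math import prod
from typing import Sequence


def posterior_probability(prior: float, likelihood_ratios: Sequence[float]) -> float:
    """Combine a prior probability with independent likelihood ratios.

    posterior odds = prior odds * product of likelihood ratios
    posterior probability = posterior odds / (1 + posterior odds)
    """
    if not 0.0 < prior < 1.0:
        raise ValueError("prior must lie strictly between 0 and 1")
    prior_odds = prior / (1.0 - prior)
    posterior_odds = prior_odds * prod(likelihood_ratios)
    return posterior_odds / (1.0 + posterior_odds)


# Hypothetical inputs: a prior of 0.1 and two lines of evidence
# (for example co-segregation and tumour pathology) with LRs of 2.0 and 4.5.
print(posterior_probability(0.1, [2.0, 4.5]))
```

Working in odds rather than probabilities is what allows independent lines of evidence to be multiplied together; a likelihood ratio below 1 counts against pathogenicity, and one above 1 counts in favour.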

Matched MeSH terms: Computational Biology/methods*
An expanded mammal mitogenome dataset from Southeast Asia

Mohd Salleh F, Ramos-Madrigal J, Peñaloza F, Liu S, Mikkel-Holger SS, Riddhi PP, et al.

Gigascience, 2017 Aug 01;6(8):1-8.
PMID: 28873965 DOI: 10.1093/gigascience/gix053

Southeast (SE) Asia is one of the most biodiverse regions in the world, and it holds approximately 20% of all mammal species. Despite this, the majority of SE Asia's genetic diversity is still poorly characterized. The growing interest in using environmental DNA to assess and monitor SE Asian species, in particular threatened mammals, has created an urgent need to expand the available reference database of mitochondrial barcode and complete mitogenome sequences. We have partially addressed this need by generating 72 new mitogenome sequences reconstructed from DNA isolated from a range of historical and modern tissue samples. Approximately 55 gigabases of raw sequence were generated. From these data, we assembled 72 complete mitogenome sequences, with an average depth of coverage of ×102.9 and ×55.2 for modern samples and historical samples, respectively. This dataset represents 52 species, of which 30 species had no previous mitogenome data available. The mitogenomes were geotagged to their sampling location, where known, to display a detailed geographical distribution of the species. Our new database of 52 taxa will strongly enhance the utility of environmental DNA approaches for monitoring mammals in SE Asia, as it greatly increases the likelihood that metabarcoding sequencing reads can be assigned to reference sequences. This increases the confidence in species detections and thus allows more robust surveys and monitoring programmes of SE Asia's threatened mammal biodiversity. The extensive collections of historical samples from SE Asia in western and SE Asian museums should serve as additional valuable material to further enrich this reference database.

Matched MeSH terms: Computational Biology/methods
An efficient and robust laboratory workflow and tetrapod database for larger scale environmental DNA studies

Axtner J, Crampton-Platt A, Hörig LA, Mohamed A, Xu CCY, Yu DW, et al.

Gigascience, 2019 Apr 01;8(4).
PMID: 30997489 DOI: 10.1093/gigascience/giz029

BACKGROUND: The use of environmental DNA for species detection via metabarcoding is growing rapidly. We present a co-designed lab workflow and bioinformatic pipeline to mitigate the 2 most important risks of environmental DNA use: sample contamination and taxonomic misassignment. These risks arise from the need for polymerase chain reaction (PCR) amplification to detect the trace amounts of DNA combined with the necessity of using short target regions due to DNA degradation.
FINDINGS: Our high-throughput workflow minimizes these risks via a 4-step strategy: (i) technical replication with 2 PCR replicates and 2 extraction replicates; (ii) using multi-markers (12S,16S,CytB); (iii) a "twin-tagging," 2-step PCR protocol; and (iv) use of the probabilistic taxonomic assignment method PROTAX, which can account for incomplete reference databases. Because annotation errors in the reference sequences can result in taxonomic misassignment, we supply a protocol for curating sequence datasets. For some taxonomic groups and some markers, curation resulted in >50% of sequences being deleted from public reference databases, owing to (i) limited overlap between our target amplicon and reference sequences, (ii) mislabelling of reference sequences, and (iii) redundancy. Finally, we provide a bioinformatic pipeline to process amplicons and conduct PROTAX assignment and tested it on an invertebrate-derived DNA dataset from 1,532 leeches from Sabah, Malaysia. Twin-tagging allowed us to detect and exclude sequences with non-matching tags. The smallest DNA fragment (16S) amplified most frequently for all samples but was less powerful for discriminating at species rank. Using a stringent and lax acceptance criterion we found 162 (stringent) and 190 (lax) vertebrate detections of 95 (stringent) and 109 (lax) leech samples.
CONCLUSIONS: Our metabarcoding workflow should help research groups increase the robustness of their results and therefore facilitate wider use of environmental and invertebrate-derived DNA, which is turning into a valuable source of ecological and conservation information on tetrapods.
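The abstract notes that twin-tagging made it possible to detect and exclude sequences with non-matching tags. A minimal sketch of that filtering step is shown below, assuming each read has already been demultiplexed into a forward tag, a reverse tag (both in the same orientation), and an amplicon sequence; the `TaggedRead` type, the function name, and the example tags are hypothetical and are not part of the authors' pipeline.

```python
from typing import Iterable, List, NamedTuple, Tuple


class TaggedRead(NamedTuple):
    forward_tag: str  # tag read from the forward primer end
    reverse_tag: str  # tag read from the reverse primer end, same orientation
    sequence: str     # amplicon sequence with tags and primers removed


def split_by_tag_match(
    reads: Iterable[TaggedRead],
) -> Tuple[List[TaggedRead], List[TaggedRead]]:
    """Separate reads whose two tags agree from reads whose tags differ.

    Under a twin-tag design both ends of an amplicon carry the same tag,
    so a mismatch is treated here as a sign of tag jumping or contamination.
    """
    kept, rejected = [], []
    for read in reads:
        if read.forward_tag == read.reverse_tag:
            kept.append(read)
        else:
            rejected.append(read)
    return kept, rejected


# Hypothetical reads, for illustration only.
reads = [
    TaggedRead("ACGTAC", "ACGTAC", "TTGACC"),
    TaggedRead("ACGTAC", "GGATCA", "TTGACC"),
]
kept, rejected = split_by_tag_match(reads)
print(len(kept), len(rejected))
```

Returning the rejected reads alongside the kept ones makes it easy to report how much of a run was lost to tag mismatches.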

Matched MeSH terms: Computational Biology/methods
A draft genome sequence of the elusive giant squid, Architeuthis dux

da Fonseca RR, Couto A, Machado AM, Brejova B, Albertin CB, Silva F, et al.

Gigascience, 2020 Jan 01;9(1).
PMID: 31942620 DOI: 10.1093/gigascience/giz152

BACKGROUND: The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked.
FINDINGS: We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
CONCLUSIONS: This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.
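The assembly above is summarized by a scaffold N50 of 4.8 Mb. As a reminder of what that statistic measures, here is a minimal sketch of the usual N50 definition: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly. The function name and the scaffold lengths are illustrative and are not data from the paper.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one length is required")
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # Stop at the first scaffold that brings the running sum
        # to at least half of the total assembly length.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable for non-empty input")


# Illustrative scaffold lengths (arbitrary units).
print(n50([2, 2, 2, 3, 3, 4, 8, 8]))
```

Comparing `2 * running` with `total` avoids floating-point division when the total length is odd.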

Matched MeSH terms: Computational Biology/methods
Complete vertebrate mitogenomes reveal widespread repeats and gene duplications

Formenti G, Rhie A, Balacco J, Haase B, Mountcastle J, Fedrigo O, et al.

Genome Biol, 2021 Apr 29;22(1):120.
PMID: 33910595 DOI: 10.1186/s13059-021-02336-9

BACKGROUND: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly.
RESULTS: As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100-300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization.
CONCLUSIONS: Our results indicate that even in the "simple" case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone.

Matched MeSH terms: Computational Biology/methods
Identification of hub glycogenes and their nsSNP analysis from mouse RNA-Seq data

Firoz A, Malik A, Singh SK, Jha V, Ali A

Gene, 2015 Dec 15;574(2):235-46.
PMID: 26260015 DOI: 10.1016/j.gene.2015.08.012

Glycogenes regulate a large number of biological processes such as cancer and development. In this work, we created an interaction network of 923 glycogenes to detect potential hubs from different mouse tissues using RNA-Seq data. DAVID functional cluster analysis revealed enrichment of immune response, glycoprotein and cholesterol metabolic processes. We also explored nsSNPs that may modify the expression and function of identified hubs using computational methods. We observe that the number of nsSNPs predicted by any two methods to affect protein function is 4, 7 and 2 for FLT1, NID2 and TNFRSF1B, respectively. Residues in the native and mutant proteins were analyzed for solvent accessibility and secondary structure change. Analysis of hubs can help in determining their degree of conservation and understanding their functions in biological processes. The nsSNPs proposed in this work may be further targeted through experimental methods for understanding structural and functional relationships of hub mutants.

Matched MeSH terms: Computational Biology/methods
In silico study of missense variants of FANCA, FANCC and FANCG genes reveals high risk deleterious alleles predisposing to Fanconi anemia pathogenesis

Shahid M, Azfaralariff A, Zubair M, Abdulkareem Najm A, Khalili N, Law D, et al.

Gene, 2022 Feb 20;812:146104.
PMID: 34864095 DOI: 10.1016/j.gene.2021.146104

Among the 22 reported Fanconi anemia (FA) genes, 90% of mutational spectra were found in three genes, namely FANCA (64%), FANCC (12%) and FANCG (8%). Therefore, this study aimed to identify the high-risk deleterious variants in these three genes (FANCA, FANCC, and FANCG) through various computational approaches. The missense variant datasets retrieved from the UCSC genome browser were analyzed for their pathogenicity, stability, and phylogenetic conservancy. A total of 23 alterations, of which 16 were in FANCA, 6 in FANCC and one in FANCG, were found to be highly deleterious. The native and mutant structures were generated, which demonstrated a profound impact on the respective proteins. In addition, pathway analysis predicted many other pathways in addition to the Fanconi anemia, homologous recombination, and mismatch repair pathways. Hence, this is the first comprehensive study that can be useful for understanding the genetic signatures in the development of FA.

Matched MeSH terms: Computational Biology/methods*
Genome-Wide Transcription and Functional Analyses Reveal Heterogeneous Molecular Mechanisms Driving Pyrethroids Resistance in the Major Malaria Vector Anopheles funestus Across Africa

Riveron JM, Ibrahim SS, Mulamba C, Djouaka R, Irving H, Wondji MJ, et al.

G3 (Bethesda), 2017 Jun 07;7(6):1819-1832.
PMID: 28428243 DOI: 10.1534/g3.117.040147

Pyrethroid resistance in the malaria vector An. funestus is increasingly reported across Africa, threatening the sustainability of pyrethroid-based control interventions, including long lasting insecticidal nets (LLINs). Managing this problem requires an understanding of the molecular basis of the resistance in different regions of the continent, to establish whether it is being driven by a single or independent selective events. Here, using genome-wide transcription profiling of pyrethroid-resistant populations from southern (Malawi), East (Uganda), and West Africa (Benin), we investigated the molecular basis of resistance, revealing strong differences between the different African regions. The duplicated cytochrome P450 genes (CYP6P9a and CYP6P9b), which were highly overexpressed in southern Africa, are not the most upregulated in other regions, where other genes are more overexpressed, including GSTe2 in West (Benin) and CYP9K1 in East (Uganda). The lack of directional selection on both CYP6P9a and CYP6P9b in Uganda, in contrast to southern Africa, further supports the limited role of these genes outside southern Africa. However, other genes such as the P450 CYP9J11 are commonly overexpressed in all countries across Africa. Here, CYP9J11 is functionally characterized and shown to confer resistance to pyrethroids and moderate cross-resistance to carbamates (bendiocarb). The consistent overexpression of GSTe2 in Benin is coupled with a role of allelic variation at this gene, as GAL4-UAS transgenic expression in Drosophila flies showed that the resistant 119F allele is more efficient in conferring both DDT and permethrin resistance than the L119 allele. The heterogeneity in the molecular basis of resistance and cross-resistance to insecticides in An. funestus populations throughout sub-Saharan Africa should be taken into account in designing resistance management strategies.

Matched MeSH terms: Computational Biology/methods
Development of a Bioinformatics Framework for Identification and Validation of Genomic Biomarkers and Key Immunopathology Processes and Controllers in Infectious and Non-infectious Severe Inflammatory Response Syndrome

Tong DL, Kempsell KE, Szakmany T, Ball G

Front Immunol, 2020;11:380.
PMID: 32318053 DOI: 10.3389/fimmu.2020.00380

Sepsis is defined as dysregulated host response caused by systemic infection, leading to organ failure. It is a life-threatening condition, often requiring admission to an intensive care unit (ICU). The causative agents and processes involved are multifactorial but are characterized by an overarching inflammatory response, sharing elements in common with severe inflammatory response syndrome (SIRS) of non-infectious origin. Sepsis presents with a range of pathophysiological and genetic features which make clinical differentiation from SIRS very challenging. This may reflect a poor understanding of the key gene inter-activities and/or pathway associations underlying these disease processes. Improved understanding is critical for early differential recognition of sepsis and SIRS and to improve patient management and clinical outcomes. Judicious selection of gene biomarkers suitable for development of diagnostic tests/testing could make differentiation of sepsis and SIRS feasible. Here we describe a methodologic framework for the identification and validation of biomarkers in SIRS, sepsis and septic shock patients, using a 2-tier gene screening, artificial neural network (ANN) data mining technique, using previously published gene expression datasets. Eight key hub markers have been identified which may delineate distinct, core disease processes and which show potential for informing underlying immunological and pathological processes and thus patient stratification and treatment. These do not show sufficient fold change differences between the different disease states to be useful as primary diagnostic biomarkers, but are instrumental in identifying candidate pathways and other associated biomarkers for further exploration.

Matched MeSH terms: Computational Biology/methods*
Potential shared pathogenic mechanisms between endometriosis and inflammatory bowel disease indicate a strong initial effect of immune factors

Zhang H, Mo Y, Wang L, Zhang H, Wu S, Sandai D, et al.

Front Immunol, 2024;15:1339647.
PMID: 38660311 DOI: 10.3389/fimmu.2024.1339647

INTRODUCTION: Over the past decades, numerous studies have consistently demonstrated that immune dysregulation is a common characteristic of endometriosis (EM) and Inflammatory Bowel Disease (IBD). However, the underlying pathological mechanisms remain unknown. In this study, bioinformatics techniques were used to screen large-scale gene expression data for plausible correlations at the molecular level in order to identify common pathogenic pathways between EM and IBD.
METHODS: Based on the EM transcriptomic datasets GSE7305 and GSE23339, as well as the IBD transcriptomic datasets GSE87466 and GSE126124, differential gene analysis was performed using the limma package in the R environment. Co-expressed differentially expressed genes were identified, and a protein-protein interaction (PPI) network for the differentially expressed genes was constructed using version 11.5 of the STRING database. The MCODE tool in Cytoscape was used to filter out protein interaction subnetworks. Key genes in the PPI network were identified through two topological analysis algorithms (MCC and Degree) from the CytoHubba plugin. UpSet was used for visualization of these key genes. The diagnostic value of the expression levels of these key genes was assessed using the Receiver Operating Characteristic (ROC) curve and Area Under the Curve (AUC). The CIBERSORT algorithm determined the infiltration status of 22 immune cell subtypes, exploring differences between EM and IBD patients in both control and disease groups. Finally, different gene expression trends shared by EM and IBD were input into CMap to identify small molecule compounds with potential therapeutic effects.
RESULTS: 113 differentially expressed genes (DEGs) that were co-expressed in EM and IBD were identified, comprising 28 down-regulated genes and 86 up-regulated genes. In the functional enrichment analyses, the co-expressed DEGs of EM and IBD were concentrated in immune response activation, circulating immunoglobulin-mediated humoral immune response and humoral immune response. Five hub genes (SERPING1, VCAM1, CLU, C3, CD55) were identified through the protein-protein interaction network and MCODE. High Area Under the Curve (AUC) values of the Receiver Operating Characteristic (ROC) curves for the 5 hub genes indicate their predictive ability for disease occurrence. These hub genes could be used as potential biomarkers for the development of EM and IBD. Furthermore, the CMap database identified a total of 9 small molecule compounds (TTNPB, CAY-10577, PD-0325901, etc.) targeting therapeutic genes for EM and IBD.
DISCUSSION: Our research revealed common pathogenic mechanisms between EM and IBD, particularly emphasizing immune regulation and cell signalling, indicating the significance of immune factors in the occurrence and progression of both diseases. By elucidating shared mechanisms, our study provides novel avenues for the prevention and treatment of EM and IBD.
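The Methods above assess each hub gene's diagnostic value through the ROC curve and its AUC. One way to see what an AUC measures is the rank-based formulation: the probability that a randomly chosen disease sample has a higher expression value than a randomly chosen control, with ties counted as one half. The sketch below is illustrative only; the function name and expression values are hypothetical, and it is not the procedure used in the paper.

```python
from typing import Sequence


def roc_auc(disease_values: Sequence[float], control_values: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Counts, over all (disease, control) pairs, how often the disease value
    is higher; ties contribute 0.5. This equals the area under the
    empirical ROC curve when higher values indicate disease.
    """
    if not disease_values or not control_values:
        raise ValueError("both groups must contain at least one value")
    wins = 0.0
    for d in disease_values:
        for c in control_values:
            if d > c:
                wins += 1.0
            elif d == c:
                wins += 0.5
    return wins / (len(disease_values) * len(control_values))


# Hypothetical expression values for one gene in disease and control samples.
print(roc_auc([7.9, 8.4, 6.1], [5.2, 6.5, 5.9]))
```

An AUC of 0.5 corresponds to no discrimination between groups, while values approaching 1.0 indicate that the expression level separates disease from control samples almost perfectly. The pairwise loop is quadratic in sample count, which is acceptable for the small cohorts typical of such datasets.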

Matched MeSH terms: Computational Biology/methods
Genomic characterization of Vibrio parahaemolyticus from Pacific white shrimp and rearing water in Malaysia reveals novel sequence types and structural variation in genomic regions containing the Photorhabdus insect-related (Pir) toxin-like genes

Yan CZY, Austin CM, Ayub Q, Rahman S, Gan HM

FEMS Microbiol Lett, 2019 09 01;366(17).
PMID: 31589302 DOI: 10.1093/femsle/fnz211

The Malaysian and global shrimp aquaculture production has been significantly impacted by acute hepatopancreatic necrosis disease (AHPND) typically caused by Vibrio parahaemolyticus harboring the pVA plasmid containing the pirAVp and pirBVp genes, which code for Photorhabdus insect-related (Pir) toxin. The limited genomic resource for V. parahaemolyticus strains from Malaysian aquaculture farms precludes an in-depth understanding of their diversity and evolutionary relationships. In this study, we isolated shrimp-associated and environmental (rearing water) V. parahaemolyticus from three aquaculture farms located in Northern and Central Malaysia followed by whole-genome sequencing of 40 randomly selected isolates on the Illumina MiSeq. Phylogenomic analysis and multilocus sequence typing (MLST) reveal distinct lineages of V. parahaemolyticus that harbor the pirABVp genes. The recovery of pVA plasmid backbone devoid of pirAVp or pirABVp in some V. parahaemolyticus isolates suggests that the toxin genes are prone to deletion. The new insight gained from phylogenomic analysis of Asian V. parahaemolyticus, in addition to the observed genomic instability of pVA plasmid, will have implications for improvements in aquaculture practices to diagnose, treat or limit the impacts of this disease.

Matched MeSH terms: Computational Biology/methods
Fulltext The challenges of designing a benchmark strategy for bioinformatics pipelines in the identification of antimicrobial resistance determinants using next generation sequencing technologies

Angers-Loustau A, Petrillo M, Bengtsson-Palme J, Berendonk T, Blais B, Chan KG, et al.

F1000Res, 2018;7.
PMID: 30026930 DOI: 10.12688/f1000research.14509.2

Next-Generation Sequencing (NGS) technologies are expected to play a crucial role in the surveillance of infectious diseases, with their unprecedented capabilities for the characterisation of genetic information underlying the virulence and antimicrobial resistance (AMR) properties of microorganisms. In the implementation of any novel technology for regulatory purposes, important considerations such as harmonisation, validation and quality assurance need to be addressed. NGS technologies pose unique challenges in these regards, in part due to their reliance on bioinformatics for the processing and proper interpretation of the data produced. Well-designed benchmark resources are thus needed to evaluate, validate and ensure continued quality control over the bioinformatics component of the process. This concept was explored as part of a workshop on "Next-generation sequencing technologies and antimicrobial resistance" held October 4-5 2017. Challenges involved in the development of such a benchmark resource, with a specific focus on identifying the molecular determinants of AMR, were identified. For each of the challenges, sets of unsolved questions that will need to be tackled for them to be properly addressed were compiled. These take into consideration the requirement for monitoring of AMR bacteria in humans, animals, food and the environment, which is aligned with the principles of a "One Health" approach.

Matched MeSH terms: Computational Biology/methods*
Allele Mining Strategies: Principles and Utilisation for Blast Resistance Genes in Rice (Oryza sativa L.)

Ashkani S, Yusop MR, Shabanimofrad M, Azady A, Ghasemzadeh A, Azizi P, et al.

Curr Issues Mol Biol, 2015;17:57-73.
PMID: 25706446

Allele mining is a promising way to dissect naturally occurring allelic variants of candidate genes with essential agronomic qualities. With the identification, isolation and characterisation of blast resistance genes in rice, it is now possible to dissect the actual allelic variants of these genes within an array of rice cultivars via allele mining. Multiple alleles from the complex locus serve as a reservoir of variation to generate functional genes. The routine sequence exchange is one of the main mechanisms of R gene evolution and development. Allele mining for resistance genes can be an important method to identify additional resistance alleles and new haplotypes along with the development of allele-specific markers for use in marker-assisted selection. Allele mining can be visualised as a vital link between effective utilisation of genetic and genomic resources in genomics-driven modern plant breeding. This review studies the actual concepts and potential of mining approaches for the discovery of alleles and their utilisation for blast resistance genes in rice. The details provided here will be important to provide the rice breeder with a worthwhile introduction to allele mining and its methodology for breakthrough discovery of fresh alleles hidden in hereditary diversity, which is vital for crop improvement.

Matched MeSH terms: Computational Biology/methods
A review on the computational approaches for gene regulatory network construction

Chai LE, Loh SK, Low ST, Mohamad MS, Deris S, Zakaria Z

Comput Biol Med, 2014 May;48:55-65.
PMID: 24637147 DOI: 10.1016/j.compbiomed.2014.02.011

Many biological research areas such as drug design require gene regulatory networks to provide clear insight and understanding of the cellular process in living cells. This is because interactions among the genes and their products play an important role in many molecular processes. A gene regulatory network can act as a blueprint for the researchers to observe the relationships among genes. Due to its importance, several computational approaches have been proposed to infer gene regulatory networks from gene expression data. In this review, six inference approaches are discussed: Boolean network, probabilistic Boolean network, ordinary differential equation, neural network, Bayesian network, and dynamic Bayesian network. These approaches are discussed in terms of introduction, methodology and recent applications of these approaches in gene regulatory network construction. These approaches are also compared in the discussion section. Furthermore, the strengths and weaknesses of these computational approaches are described.
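
As an illustration of the simplest model listed in this abstract, the Boolean network, here is a minimal sketch. The three genes and their update rules are hypothetical and not taken from the review. Each gene is either on (1) or off (0), and all genes are updated synchronously from the previous state. This only simulates a given network forward; inferring the rules from expression data, which is the review's subject, is a separate problem.

```python
def step(state):
    """One synchronous update of a hypothetical three-gene Boolean network."""
    a, b, c = state
    next_a = int(not c)    # gene A is repressed by gene C
    next_b = a             # gene B is activated by gene A
    next_c = int(a and b)  # gene C requires both A and B
    return (next_a, next_b, next_c)


def trajectory(state, n_steps):
    """Return the list of states visited, starting from `state`."""
    states = [state]
    for _ in range(n_steps):
        state = step(state)
        states.append(state)
    return states


# Start with only gene A switched on and follow five updates.
path = trajectory((1, 0, 0), 5)
for t, s in enumerate(path):
    print(t, s)
```

With these rules the network returns to its starting state after five updates, a simple example of the cyclic attractors that Boolean network models can exhibit.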

Matched MeSH terms: Computational Biology/methods*
