MyMedR

Displaying publications 1 - 20 of 104 in total

Analysis and functional annotation of expressed sequence tags (ESTs) from multiple tissues of oil palm (Elaeis guineensis Jacq.)

Ho CL, Kwan YY, Choi MC, Tee SS, Ng WH, Lim KA, et al.

BMC Genomics, 2007;8:381.
PMID: 17953740

Oil palm is the second largest source of edible oil which contributes to approximately 20% of the world's production of oils and fats. In order to understand the molecular biology involved in in vitro propagation, flowering, efficient utilization of nitrogen sources and root diseases, we have initiated an expressed sequence tag (EST) analysis on oil palm.

Matched MeSH terms: Databases, Genetic
Genomic species identification of Acinetobacter of clinical isolates by 16S rDNA sequencing

Misbah S, Hassan H, Yusof MY, Hanifah YA, AbuBakar S

Singapore Med J, 2005 Sep;46(9):461-4.
PMID: 16123830

This study aims to identify Acinetobacter of clinical isolates from the University of Malaya Medical Centre (UMMC), Kuala Lumpur, to the species level by 16S rDNA sequencing.

Matched MeSH terms: Databases, Genetic
Immunoinformatics study on highly expressed Mycobacterium tuberculosis genes during infection

Nguyen Thi le T, Sarmiento ME, Calero R, Camacho F, Reyes F, Hossain MM, et al.

Tuberculosis (Edinb), 2014 Sep;94(5):475-81.
PMID: 25034135 DOI: 10.1016/j.tube.2014.06.004

The most important targets for vaccine development are the proteins that are highly expressed by the microorganisms during infection in-vivo. A number of Mycobacterium tuberculosis (Mtb) proteins are also reported to be expressed in-vivo at different phases of infection. In the present study, we analyzed multiple published databases of gene expression profiles of Mtb in-vivo at different phases of infection in animals and humans and selected 38 proteins that are highly expressed in the active, latent and reactivation phases. We predicted T- and B-cell epitopes from the selected proteins using HLAPred for T-cell epitope prediction and BCEPred combined with ABCPred for B-cell epitope prediction. For each selected protein, regions containing both T- and B-cell epitopes were identified, which might be considered as important candidates for vaccine design against tuberculosis.

Matched MeSH terms: Databases, Genetic
Analysis and construction of pathogenicity island regulatory pathways in Salmonella enterica serovar Typhi

Ong SY, Ng FL, Badai SS, Yuryev A, Alam M

J Integr Bioinform, 2010;7(1).
PMID: 20861532 DOI: 10.2390/biecoll-jib-2010-145

Signal transduction through protein-protein interactions and protein modifications are the main mechanisms controlling many biological processes. Here we describe the implementation of MedScan information extraction technology and Pathway Studio software (Ariadne Genomics Inc.) to create a Salmonella specific molecular interaction database. Using the database, we have constructed several signal transduction pathways in Salmonella enterica serovar Typhi which causes Typhoid Fever, a major health threat especially in developing countries. S. Typhi has several pathogenicity islands that control rapid switching between different phenotypes including adhesion and colonization, invasion, intracellular survival, proliferation, and biofilm formation in response to environmental changes. Understanding of the detailed mechanism for S. Typhi survival in host cells is necessary for development of efficient detection and treatment of this pathogen. The constructed pathways were validated using publicly available gene expression microarray data for Salmonella.

Matched MeSH terms: Databases, Genetic
Identification of hub glycogenes and their nsSNP analysis from mouse RNA-Seq data

Firoz A, Malik A, Singh SK, Jha V, Ali A

Gene, 2015 Dec 15;574(2):235-46.
PMID: 26260015 DOI: 10.1016/j.gene.2015.08.012

Glycogenes regulate a large number of biological processes such as cancer and development. In this work, we created an interaction network of 923 glycogenes to detect potential hubs from different mouse tissues using RNA-Seq data. DAVID functional cluster analysis revealed enrichment of immune response, glycoprotein and cholesterol metabolic processes. We also explored nsSNPs that may modify the expression and function of identified hubs using computational methods. We observe that the number of nsSNPs predicted by any two methods to affect protein function is 4, 7 and 2 for FLT1, NID2 and TNFRSF1B. Residues in the native and mutant proteins were analyzed for solvent accessibility and secondary structure change. Analysis of hubs can help in determining their degree of conservation and understanding their functions in biological processes. The nsSNPs proposed in this work may be further targeted through experimental methods for understanding structural and functional relationships of hub mutants.

Matched MeSH terms: Databases, Genetic
Genome-wide meta-analysis identifies five new susceptibility loci for pancreatic cancer

Klein AP, Wolpin BM, Risch HA, Stolzenberg-Solomon RZ, Mocci E, Zhang M, et al.

Nat Commun, 2018 02 08;9(1):556.
PMID: 29422604 DOI: 10.1038/s41467-018-02942-5

In 2020, 146,063 deaths due to pancreatic cancer are estimated to occur in Europe and the United States combined. To identify common susceptibility alleles, we performed the largest pancreatic cancer GWAS to date, including 9040 patients and 12,496 controls of European ancestry from the Pancreatic Cancer Cohort Consortium (PanScan) and the Pancreatic Cancer Case-Control Consortium (PanC4). Here, we find significant evidence of a novel association at rs78417682 (7p12/TNS3, P = 4.35 × 10^-8). Replication of 10 promising signals in up to 2737 patients and 4752 controls from the PANcreatic Disease ReseArch (PANDoRA) consortium yields new genome-wide significant loci: rs13303010 at 1p36.33 (NOC2L, P = 8.36 × 10^-14), rs2941471 at 8q21.11 (HNF4G, P = 6.60 × 10^-10), rs4795218 at 17q12 (HNF1B, P = 1.32 × 10^-8), and rs1517037 at 18q21.32 (GRP, P = 3.28 × 10^-8). rs78417682 is not statistically significantly associated with pancreatic cancer in PANDoRA. Expression quantitative trait locus analysis in three independent pancreatic data sets provides molecular support of NOC2L as a pancreatic cancer susceptibility gene.

Matched MeSH terms: Databases, Genetic
A Transcriptome-Wide Association Study Identifies Novel Candidate Susceptibility Genes for Pancreatic Cancer

Zhong J, Jermusyk A, Wu L, Hoskins JW, Collins I, Mocci E, et al.

J Natl Cancer Inst, 2020 Oct 01;112(10):1003-1012.
PMID: 31917448 DOI: 10.1093/jnci/djz246

BACKGROUND: Although 20 pancreatic cancer susceptibility loci have been identified through genome-wide association studies in individuals of European ancestry, much of its heritability remains unexplained and the genes responsible largely unknown.
METHODS: To discover novel pancreatic cancer risk loci and possible causal genes, we performed a pancreatic cancer transcriptome-wide association study in Europeans using three approaches: FUSION, MetaXcan, and Summary-MulTiXcan. We integrated genome-wide association studies summary statistics from 9040 pancreatic cancer cases and 12 496 controls, with gene expression prediction models built using transcriptome data from histologically normal pancreatic tissue samples (NCI Laboratory of Translational Genomics [n = 95] and Genotype-Tissue Expression v7 [n = 174] datasets) and data from 48 different tissues (Genotype-Tissue Expression v7, n = 74-421 samples).
RESULTS: We identified 25 genes whose genetically predicted expression was statistically significantly associated with pancreatic cancer risk (false discovery rate < .05), including 14 candidate genes at 11 novel loci (1p36.12: CELA3B; 9q31.1: SMC2, SMC2-AS1; 10q23.31: RP11-80H5.9; 12q13.13: SMUG1; 14q32.33: BTBD6; 15q23: HEXA; 15q26.1: RCCD1; 17q12: PNMT, CDK12, PGAP3; 17q22: SUPT4H1; 18q11.22: RP11-888D10.3; and 19p13.11: PGPEP1) and 11 at six known risk loci (5p15.33: TERT, CLPTM1L, ZDHHC11B; 7p14.1: INHBA; 9q34.2: ABO; 13q12.2: PDX1; 13q22.1: KLF5; and 16q23.1: WDR59, CFDP1, BCAR1, TMEM170A). The association for 12 of these genes (CELA3B, SMC2, and PNMT at novel risk loci and TERT, CLPTM1L, INHBA, ABO, PDX1, KLF5, WDR59, CFDP1, and BCAR1 at known loci) remained statistically significant after Bonferroni correction.
CONCLUSIONS: By integrating gene expression and genotype data, we identified novel pancreatic cancer risk loci and candidate functional genes that warrant further investigation.

Matched MeSH terms: Databases, Genetic
Phylogeny of the most species-rich freshwater bivalve family (Bivalvia: Unionida: Unionidae): Defining modern subfamilies and tribes

Lopes-Lima M, Froufe E, Do VT, Ghamizi M, Mock KE, Kebapçı Ü, et al.

Mol Phylogenet Evol, 2017 01;106:174-191.
PMID: 27621130 DOI: 10.1016/j.ympev.2016.08.021

Freshwater mussels of the order Unionida are key elements of freshwater habitats and are responsible for important ecological functions and services. Unfortunately, these bivalves are among the most threatened freshwater taxa in the world. However, conservation planning and management are hindered by taxonomic problems and a lack of detailed ecological data. This highlights the urgent need for advances in the areas of systematics and evolutionary relationships within the Unionida. This study presents the most comprehensive phylogeny to date of the larger Unionida family, i.e., the Unionidae. The phylogeny is based on a combined dataset of 1032 bp (COI+28S) of 70 species in 46 genera, with 7 of these genera being sequenced for the first time. The resulting phylogeny divided the Unionidae into 6 supported subfamilies and 18 tribes, three of which are here named for the first time (i.e., Chamberlainiini nomen novum, Cristariini nomen novum and Lanceolariini nomen novum). Molecular analyses were complemented by investigations of selected morphological, anatomical and behavioral characters used in traditional phylogenetic studies. No single morphological, anatomical or behavioral character was diagnostic at the subfamily level and few were useful at the tribe level. However, within subfamilies, many tribes can be recognized based on a subset of these characters. The geographical distribution of each of the subfamilies and tribes is also presented. The present study provides important advances in the systematics of these extraordinary taxa with implications for future ecological and conservation studies.

Matched MeSH terms: Databases, Genetic
Potential shared pathogenic mechanisms between endometriosis and inflammatory bowel disease indicate a strong initial effect of immune factors

Zhang H, Mo Y, Wang L, Zhang H, Wu S, Sandai D, et al.

Front Immunol, 2024;15:1339647.
PMID: 38660311 DOI: 10.3389/fimmu.2024.1339647

INTRODUCTION: Over the past decades, immune dysregulation has been consistently demonstrated to be a common characteristic of endometriosis (EM) and Inflammatory Bowel Disease (IBD) in numerous studies. However, the underlying pathological mechanisms remain unknown. In this study, bioinformatics techniques were used to screen large-scale gene expression data for plausible correlations at the molecular level in order to identify common pathogenic pathways between EM and IBD.
METHODS: Based on the EM transcriptomic datasets GSE7305 and GSE23339, as well as the IBD transcriptomic datasets GSE87466 and GSE126124, differential gene analysis was performed using the limma package in the R environment. Co-expressed differentially expressed genes were identified, and a protein-protein interaction (PPI) network for the differentially expressed genes was constructed using version 11.5 of the STRING database. The MCODE tool in Cytoscape facilitated filtering out protein interaction subnetworks. Key genes in the PPI network were identified through two topological analysis algorithms (MCC and Degree) from the CytoHubba plugin. Upset was used for visualization of these key genes. The diagnostic value of gene expression levels for these key genes was assessed using the Receiver Operating Characteristic (ROC) curve and Area Under the Curve (AUC). The CIBERSORT algorithm determined the infiltration status of 22 immune cell subtypes, exploring differences between EM and IBD patients in both control and disease groups. Finally, different gene expression trends shared by EM and IBD were input into CMap to identify small molecule compounds with potential therapeutic effects.
RESULTS: 113 differentially expressed genes (DEGs) that were co-expressed in EM and IBD have been identified, comprising 28 down-regulated genes and 86 up-regulated genes. In the functional enrichment analyses, the co-expressed DEGs of EM and IBD were concentrated in immune response activation, circulating immunoglobulin-mediated humoral immune response and humoral immune response. Five hub genes (SERPING1, VCAM1, CLU, C3, CD55) were identified through the protein-protein interaction network and MCODE. High Area Under the Curve (AUC) values of Receiver Operating Characteristic (ROC) curves for the 5 hub genes indicate their predictive ability for disease occurrence. These hub genes could be used as potential biomarkers for the development of EM and IBD. Furthermore, the CMap database identified a total of 9 small molecule compounds (TTNPB, CAY-10577, PD-0325901, etc.) targeting therapeutic genes for EM and IBD.
DISCUSSION: Our research revealed common pathogenic mechanisms between EM and IBD, particularly emphasizing immune regulation and cell signalling, indicating the significance of immune factors in the occurrence and progression of both diseases. By elucidating shared mechanisms, our study provides novel avenues for the prevention and treatment of EM and IBD.

Matched MeSH terms: Databases, Genetic
DeSigN: connecting gene expression with therapeutics for drug repurposing and development

Lee BK, Tiong KH, Chang JK, Liew CS, Abdul Rahman ZA, Tan AC, et al.

BMC Genomics, 2017 01 25;18(Suppl 1):934.
PMID: 28198666 DOI: 10.1186/s12864-016-3260-7

BACKGROUND: The drug discovery and development pipeline is a long and arduous process that inevitably hampers rapid drug development. Therefore, strategies to improve the efficiency of drug development are urgently needed to enable effective drugs to enter the clinic. Precision medicine has demonstrated that genetic features of cancer cells can be used for predicting drug response, and emerging evidence suggests that gene-drug connections could be predicted more accurately by exploring the cumulative effects of many genes simultaneously.
RESULTS: We developed DeSigN, a web-based tool for predicting drug efficacy against cancer cell lines using gene expression patterns. The algorithm correlates phenotype-specific gene signatures derived from differentially expressed genes with pre-defined gene expression profiles associated with drug response data (IC50) from 140 drugs. DeSigN successfully predicted the right drug sensitivity outcome in four published GEO studies. Additionally, it predicted bosutinib, a Src/Abl kinase inhibitor, as a sensitive inhibitor for oral squamous cell carcinoma (OSCC) cell lines. In vitro validation of bosutinib in OSCC cell lines demonstrated that indeed, these cell lines were sensitive to bosutinib with IC50 of 0.8-1.2 μM. As further confirmation, we demonstrated experimentally that bosutinib has anti-proliferative activity in OSCC cell lines, demonstrating that DeSigN was able to robustly predict drugs that could be beneficial for tumour control.
CONCLUSIONS: DeSigN is a robust method that is useful for the identification of candidate drugs using an input gene signature obtained from gene expression analysis. This user-friendly platform could be used to identify drugs with unanticipated efficacy against cancer cell lines of interest, and therefore could be used for the repurposing of drugs, thus improving the efficiency of drug development.

Matched MeSH terms: Databases, Genetic
GENIPAC: A Genomic Information Portal for Head and Neck Cancer Cell Systems

Lee BKB, Gan CP, Chang JK, Tan JL, Fadlullah MZ, Abdul Rahman ZA, et al.

J Dent Res, 2018 07;97(8):909-916.
PMID: 29512401 DOI: 10.1177/0022034518759038

Head and neck cancer (HNC)-derived cell lines represent fundamental models for studying the biological mechanisms underlying cancer development and precision therapies. However, mining the genomic information of HNC cells from available databases requires knowledge of bioinformatics and computational skill sets. Here, we developed a user-friendly web resource for exploring, visualizing, and analyzing genomics information of commonly used HNC cell lines. We populated the current version of GENIPAC with 44 HNC cell lines from 3 studies: ORL Series, OPC-22, and H Series. Specifically, the mRNA expressions for all the 3 studies were derived with RNA-seq. The copy number alterations analysis of ORL Series was performed on the Genome Wide Human Cytoscan HD array, while copy number alterations for OPC-22 were derived from whole exome sequencing. Mutations from ORL Series and H Series were derived from RNA-seq information, while OPC-22 was based on whole exome sequencing. All genomic information was preprocessed with customized scripts and underwent data validation and correction through data set validator tools provided by cBioPortal. The clinical and genomic information of 44 HNC cell lines are easily accessible in GENIPAC. The functional utility of GENIPAC was demonstrated with some of the genomic alterations that are commonly reported in HNC, such as TP53, EGFR, CCND1, and PIK3CA. We showed that these genomic alterations as reported in The Cancer Genome Atlas database were recapitulated in the HNC cell lines in GENIPAC. Importantly, genomic alterations within pathways could be simultaneously visualized. We developed GENIPAC to create access to genomic information on HNC cell lines. This cancer omics initiative will help the research community to accelerate better understanding of HNC and the development of new precision therapeutic options for HNC treatment. GENIPAC is freely available at http://genipac.cancerresearch.my/.

Matched MeSH terms: Databases, Genetic*
Singapore Genome Variation Project: a haplotype map of three Southeast Asian populations

Teo YY, Sim X, Ong RT, Tan AK, Chen J, Tantoso E, et al.

Genome Res, 2009 Nov;19(11):2154-62.
PMID: 19700652 DOI: 10.1101/gr.095000.109

The Singapore Genome Variation Project (SGVP) provides a publicly available resource of 1.6 million single nucleotide polymorphisms (SNPs) genotyped in 268 individuals from the Chinese, Malay, and Indian population groups in Southeast Asia. This online database catalogs information and summaries on genotype and phased haplotype data, including allele frequencies, assessment of linkage disequilibrium (LD), and recombination rates in a format similar to the International HapMap Project. Here, we introduce this resource and describe the analysis of human genomic variation upon agglomerating data from the HapMap and the Human Genome Diversity Project, providing useful insights into the population structure of the three major population groups in Asia. In addition, this resource also surveyed across the genome for variation in regional patterns of LD between the HapMap and SGVP populations, and for signatures of positive natural selection using two well-established metrics: iHS and XP-EHH. The raw and processed genetic data, together with all population genetic summaries, are publicly available for download and browsing through a web browser modeled with the Generic Genome Browser.

Matched MeSH terms: Databases, Genetic*
FusoBase: an online Fusobacterium comparative genomic analysis platform

Ang MY, Heydari H, Jakubovics NS, Mahmud MI, Dutta A, Wee WY, et al.

Database (Oxford), 2014;2014.
PMID: 25149689 DOI: 10.1093/database/bau082

Fusobacterium are anaerobic gram-negative bacteria that have been associated with a wide spectrum of human infections and diseases. As the biology of Fusobacterium is still not well understood, comparative genomic analysis on members of this species will provide further insights on their taxonomy, phylogeny, pathogenicity and other information that may contribute to better management of infections and diseases. To facilitate the ongoing genomic research on Fusobacterium, a specialized database with easy-to-use analysis tools is necessary. Here we present FusoBase, an online database providing access to genome-wide annotated sequences of Fusobacterium strains as well as bioinformatics tools, to support the expanding scientific community. Using our custom-developed Pairwise Genome Comparison tool, we demonstrate how differences between two user-defined genomes and how insertion of putative prophages can be identified. In addition, Pathogenomics Profiling Tool is capable of clustering predicted genes across Fusobacterium strains and visualizing the results in the form of a heat map with dendrogram.

Matched MeSH terms: Databases, Genetic*
StaphyloBase: a specialized genomic resource for the staphylococcal research community

Heydari H, Mutha NV, Mahmud MI, Siow CC, Wee WY, Wong GJ, et al.

Database (Oxford), 2014;2014:bau010.
PMID: 24578355 DOI: 10.1093/database/bau010

With the advent of high-throughput sequencing technologies, many staphylococcal genomes have been sequenced. Comparative analysis of these strains will provide better understanding of their biology, phylogeny, virulence and taxonomy, which may contribute to better management of diseases caused by staphylococcal pathogens. We developed StaphyloBase with the goal of having a one-stop genomic resource platform for the scientific community to access, retrieve, download, browse, search, visualize and analyse the staphylococcal genomic data and annotations. We anticipate this resource platform will facilitate the analysis of staphylococcal genomic data, particularly in comparative analyses. StaphyloBase currently has a collection of 754 032 protein-coding sequences (CDSs), 19 258 rRNAs and 15 965 tRNAs from 292 genomes of different staphylococcal species. Information about these features is also included, such as putative functions, subcellular localizations and gene/protein sequences. Our web implementation supports diverse query types and the exploration of CDS- and RNA-type information in detail using an AJAX-based real-time search system. JBrowse has also been incorporated to allow rapid and seamless browsing of staphylococcal genomes. The Pairwise Genome Comparison tool is designed for comparative genomic analysis, for example, to reveal the relationships between two user-defined staphylococcal genomes. A newly designed Pathogenomics Profiling Tool (PathoProT) is also included in this platform to facilitate comparative pathogenomics analysis of staphylococcal strains. In conclusion, StaphyloBase offers access to a range of staphylococcal genomic resources as well as analysis tools for comparative analyses. Database URL: http://staphylococcus.um.edu.my/.

Matched MeSH terms: Databases, Genetic*
CoryneBase: Corynebacterium genomic resources and analysis tools at your fingertips

Heydari H, Siow CC, Tan MF, Jakubovics NS, Wee WY, Mutha NV, et al.

PLoS One, 2014;9(1):e86318.
PMID: 24466021 DOI: 10.1371/journal.pone.0086318

Corynebacteria are used for a wide variety of industrial purposes but some species are associated with human diseases. With an increasing number of corynebacterial genomes having been sequenced, comparative analysis of these strains may provide better understanding of their biology, phylogeny, virulence and taxonomy that may lead to the discoveries of beneficial industrial strains or contribute to better management of diseases. To facilitate the ongoing research of corynebacteria, a specialized central repository and analysis platform for the corynebacterial research community is needed to host the fast-growing amount of genomic data and facilitate the analysis of these data. Here we present CoryneBase, a genomic database for Corynebacterium with diverse functionality for the analysis of genomes aimed to provide: (1) annotated genome sequences of Corynebacterium where 165,918 coding sequences and 4,180 RNAs can be found in 27 species; (2) access to comprehensive Corynebacterium data through the use of advanced web technologies for interactive web interfaces; and (3) advanced bioinformatic analysis tools consisting of standard BLAST for homology search, VFDB BLAST for sequence homology search against the Virulence Factor Database (VFDB), Pairwise Genome Comparison (PGC) tool for comparative genomic analysis, and a newly designed Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomic analysis. CoryneBase offers access to a range of Corynebacterium genomic resources as well as analysis tools for comparative genomics and pathogenomics. It is publicly available at http://corynebacterium.um.edu.my/.

Matched MeSH terms: Databases, Genetic*
MabsBase: a Mycobacterium abscessus genome and annotation database

Heydari H, Wee WY, Lokanathan N, Hari R, Mohamed Yusoff A, Beh CY, et al.

PLoS One, 2013;8(4):e62443.
PMID: 23658631 DOI: 10.1371/journal.pone.0062443

Mycobacterium abscessus is a rapidly growing non-tuberculous mycobacterial species that has been associated with a wide spectrum of human infections. As the classification and biology of this organism are still not well understood, comparative genomic analysis on members of this species may provide further insights on their taxonomy, phylogeny, pathogenicity and other information that may contribute to better management of infections. The MabsBase described in this paper is a user-friendly database providing access to whole-genome sequences of newly discovered M. abscessus strains as well as resources for whole-genome annotations and computational predictions, to support the expanding scientific community interested in M. abscessus research. The MabsBase is freely available at http://mabscessus.um.edu.my.

Matched MeSH terms: Databases, Genetic*
PGD: a pangolin genome hub for the research community

Tan TK, Tan KY, Hari R, Mohamed Yusoff A, Wong GJ, Siow CC, et al.

Database (Oxford), 2016;2016.
PMID: 27616775 DOI: 10.1093/database/baw063

Pangolins (order Pholidota) are the only mammals covered by scales. We have recently sequenced and analyzed the genomes of two critically endangered Asian pangolin species, namely the Malayan pangolin (Manis javanica) and the Chinese pangolin (Manis pentadactyla). These complete genome sequences will serve as reference sequences for future research to address issues of species conservation and to advance knowledge in mammalian biology and evolution. To further facilitate the global research effort in pangolin biology, we developed the Pangolin Genome Database (PGD), as a future hub for hosting pangolin genomic and transcriptomic data and annotations, and with useful analysis tools for the research community. Currently, the PGD provides the reference pangolin genome and transcriptome data, gene sequences and functional information, expressed transcripts, pseudogenes, genomic variations, organ-specific expression data and other useful annotations. We anticipate that the PGD will be an invaluable platform for researchers who are interested in pangolin and mammalian research. We will continue updating this hub by including more data, annotation and analysis tools particularly from our research consortium. Database URL: http://pangolin-genome.um.edu.my.

Matched MeSH terms: Databases, Genetic*
YersiniaBase: a genomic resource and analysis platform for comparative analysis of Yersinia

Tan SY, Dutta A, Jakubovics NS, Ang MY, Siow CC, Mutha NV, et al.

BMC Bioinformatics, 2015;16:9.
PMID: 25591325 DOI: 10.1186/s12859-014-0422-y

Yersinia is a genus of Gram-negative bacteria that includes serious pathogens such as Yersinia pestis, which causes plague, Yersinia pseudotuberculosis and Yersinia enterocolitica. The remaining species are generally considered non-pathogenic to humans, although there is evidence that at least some of these species can cause occasional infections using distinct mechanisms from the more pathogenic species. With the advances in sequencing technologies, many genomes of Yersinia have been sequenced. However, there is currently no specialized platform to hold the rapidly-growing Yersinia genomic data and to provide analysis tools particularly for comparative analyses, which are required to provide improved insights into their biology, evolution and pathogenicity.

Matched MeSH terms: Databases, Genetic*
Initiating a Human Variome Project Country Node

AlAama J, Smith TD, Lo A, Howard H, Kline AA, Lange M, et al.

Hum Mutat, 2011 May;32(5):501-6.
PMID: 21305654 DOI: 10.1002/humu.21463

Genetic diseases are a pressing global health problem that requires comprehensive access to basic clinical and genetic data to counter. The creation of regional and international databases that can be easily accessed by clinicians and diagnostic labs will greatly improve our ability to accurately diagnose and treat patients with genetic disorders. The Human Variome Project is currently working in conjunction with human genetics societies to achieve this by establishing systems to collect every mutation reported by a diagnostic laboratory, clinic, or research laboratory in a country and store these within a national repository, or HVP Country Node. Nodes have already been initiated in Australia, Belgium, China, Egypt, Malaysia, and Kuwait. Each is examining how to systematically collect and share genetic, clinical, and biochemical information in a country-specific manner that is sensitive to local ethical and cultural issues. This article gathers cases of genetic data collection within countries and takes recommendations from the global community to develop a procedure for countries wishing to establish their own collection system as part of the Human Variome Project. We hope this may lead to standard practices to facilitate global collection of data and allow efficient use in clinical practice, research and therapy.

Matched MeSH terms: Databases, Genetic*
Missing-Values Imputation Algorithms for Microarray Gene Expression Data

Moorthy K, Jaber AN, Ismail MA, Ernawan F, Mohamad MS, Deris S

Methods Mol Biol, 2019;1986:255-266.
PMID: 31115893 DOI: 10.1007/978-1-4939-9442-7_12

In gene expression studies, missing values are a common problem with important consequences for the interpretation of the final data (Satija et al., Nat Biotechnol 33(5):495, 2015). Numerous bioinformatics examination tools are used for cancer prediction, including the data set matrix (Bailey et al., Cell 173(2):371-385, 2018); thus, it is necessary to resolve the problem of missing-values imputation. This chapter presents a review of the research on missing-values imputation approaches for gene expression data. By using local and global correlation of the data, we were able to focus mostly on the differences between the algorithms. We classified the algorithms as global, hybrid, local, or knowledge-based techniques. Additionally, this chapter presents suitable assessments of the different approaches. The purpose of this review is to focus on developments in the current techniques for scientists rather than applying different or newly developed algorithms with identical functional goals. The aim was to adapt the algorithms to the characteristics of the data.

Matched MeSH terms: Databases, Genetic*
