Displaying publications 1 - 20 of 104 in total

  1. Zhong J, Jermusyk A, Wu L, Hoskins JW, Collins I, Mocci E, et al.
    J Natl Cancer Inst, 2020 Oct 01;112(10):1003-1012.
    PMID: 31917448 DOI: 10.1093/jnci/djz246
    BACKGROUND: Although 20 pancreatic cancer susceptibility loci have been identified through genome-wide association studies in individuals of European ancestry, much of its heritability remains unexplained and the genes responsible largely unknown.

    METHODS: To discover novel pancreatic cancer risk loci and possible causal genes, we performed a pancreatic cancer transcriptome-wide association study in Europeans using three approaches: FUSION, MetaXcan, and Summary-MulTiXcan. We integrated genome-wide association studies summary statistics from 9040 pancreatic cancer cases and 12 496 controls, with gene expression prediction models built using transcriptome data from histologically normal pancreatic tissue samples (NCI Laboratory of Translational Genomics [n = 95] and Genotype-Tissue Expression v7 [n = 174] datasets) and data from 48 different tissues (Genotype-Tissue Expression v7, n = 74-421 samples).

    RESULTS: We identified 25 genes whose genetically predicted expression was statistically significantly associated with pancreatic cancer risk (false discovery rate < .05), including 14 candidate genes at 11 novel loci (1p36.12: CELA3B; 9q31.1: SMC2, SMC2-AS1; 10q23.31: RP11-80H5.9; 12q13.13: SMUG1; 14q32.33: BTBD6; 15q23: HEXA; 15q26.1: RCCD1; 17q12: PNMT, CDK12, PGAP3; 17q22: SUPT4H1; 18q11.22: RP11-888D10.3; and 19p13.11: PGPEP1) and 11 at six known risk loci (5p15.33: TERT, CLPTM1L, ZDHHC11B; 7p14.1: INHBA; 9q34.2: ABO; 13q12.2: PDX1; 13q22.1: KLF5; and 16q23.1: WDR59, CFDP1, BCAR1, TMEM170A). The association for 12 of these genes (CELA3B, SMC2, and PNMT at novel risk loci and TERT, CLPTM1L, INHBA, ABO, PDX1, KLF5, WDR59, CFDP1, and BCAR1 at known loci) remained statistically significant after Bonferroni correction.

    CONCLUSIONS: By integrating gene expression and genotype data, we identified novel pancreatic cancer risk loci and candidate functional genes that warrant further investigation.

    Matched MeSH terms: Databases, Genetic
  2. Zhao H, Zhao S, International Network for Bamboo and Rattan, Fei B, Liu H, Yang H, et al.
    Gigascience, 2017 07 01;6(7):1-7.
    PMID: 28637269 DOI: 10.1093/gigascience/gix046
    Bamboo and rattan are widely grown for manufacturing, horticulture, and agroforestry. Bamboo and rattan production might help reduce poverty, boost economic growth, mitigate climate change, and protect the natural environment. Despite progress in research, sufficient molecular and genomic resources to study these species are lacking. We launched the Genome Atlas of Bamboo and Rattan (GABR) project, a comprehensive, coordinated international effort to accelerate understanding of bamboo and rattan genetics through genome analysis. GABR includes 2 core subprojects: Bamboo-T1K (Transcriptomes of 1000 Bamboos) and Rattan-G5 (Genomes of 5 Rattans), and several other subprojects. Here we describe the organization, directions, and status of GABR.
    Matched MeSH terms: Databases, Genetic*
  3. Zhang Y, Miao G, Fazhan H, Waiho K, Zheng H, Li S, et al.
    Physiol Genomics, 2018 05 01;50(5):393-405.
    PMID: 29570432 DOI: 10.1152/physiolgenomics.00016.2018
    The crucifix crab, Charybdis feriatus, which mainly inhabits the Indo-Pacific region, is regarded as one of the most high-potential species for domestication and incorporation into the aquaculture sector. However, the regulatory mechanisms of sex determination and differentiation of this species remain unclear. To identify candidate genes involved in sex determination and differentiation, high-throughput sequencing of the testis and ovary transcriptomes of C. feriatus was performed on the Illumina platform. After removing adaptor primers, low-quality sequences and very short (<50 nt) reads, we obtained 80.9 million and 66.2 million clean reads from testis and ovary, respectively. A total of 86,433 unigenes were assembled, and ~43% (37,500 unigenes) were successfully annotated to the NR, NT, Swiss-Prot, KEGG, COG, and GO databases. By comparing the testis and ovary libraries, we obtained 27,636 differentially expressed genes. Some candidate genes involved in the sex determination and differentiation of C. feriatus were identified, such as vasa, pgds, vgr, hsp90, dsx-f, fem-1, and gpr. In addition, 88,608 simple sequence repeats were obtained, and 61,929 and 77,473 single nucleotide polymorphisms from testis and ovary were detected, respectively. The transcriptome profiling was validated by quantitative real-time PCR of 30 selected genes, which showed good consistency. The present study is the first high-throughput transcriptome sequencing of C. feriatus. These findings will be useful for future functional analysis of sex-associated genes and molecular marker-assisted selection in C. feriatus.
    Matched MeSH terms: Databases, Genetic
  4. Zhang H, Mo Y, Wang L, Zhang H, Wu S, Sandai D, et al.
    Front Immunol, 2024;15:1339647.
    PMID: 38660311 DOI: 10.3389/fimmu.2024.1339647
    INTRODUCTION: Over the past decades, numerous studies have consistently demonstrated that immune dysregulation is a common characteristic of endometriosis (EM) and Inflammatory Bowel Disease (IBD). However, the underlying pathological mechanisms remain unknown. In this study, bioinformatics techniques were used to screen large-scale gene expression data for plausible correlations at the molecular level in order to identify common pathogenic pathways between EM and IBD.

    METHODS: Based on the EM transcriptomic datasets GSE7305 and GSE23339, as well as the IBD transcriptomic datasets GSE87466 and GSE126124, differential gene analysis was performed using the limma package in the R environment. Co-expressed differentially expressed genes were identified, and a protein-protein interaction (PPI) network for the differentially expressed genes was constructed using version 11.5 of the STRING database. The MCODE tool in Cytoscape facilitated filtering out protein interaction subnetworks. Key genes in the PPI network were identified through two topological analysis algorithms (MCC and Degree) from the CytoHubba plugin. Upset was used for visualization of these key genes. The diagnostic value of gene expression levels for these key genes was assessed using the Receiver Operating Characteristic (ROC) curve and Area Under the Curve (AUC). The CIBERSORT algorithm determined the infiltration status of 22 immune cell subtypes, exploring differences between EM and IBD patients in both control and disease groups. Finally, different gene expression trends shared by EM and IBD were input into CMap to identify small molecule compounds with potential therapeutic effects.

    RESULTS: 113 differentially expressed genes (DEGs) that were co-expressed in EM and IBD have been identified, comprising 28 down-regulated genes and 86 up-regulated genes. The co-expression differential gene of EM and IBD in the functional enrichment analyses focused on immune response activation, circulating immunoglobulin-mediated humoral immune response and humoral immune response. Five hub genes (SERPING1, VCAM1, CLU, C3, CD55) were identified through the Protein-protein Interaction network and MCODE. High Area Under the Curve (AUC) values of Receiver Operating Characteristic (ROC) curves for 5 hub genes indicate the predictive ability for disease occurrence. These hub genes could be used as potential biomarkers for the development of EM and IBD. Furthermore, the CMap database identified a total of 9 small molecule compounds (TTNPB, CAY-10577, PD-0325901, etc.) targeting therapeutic genes for EM and IBD.

    DISCUSSION: Our research revealed common pathogenic mechanisms between EM and IBD, particularly emphasizing immune regulation and cell signalling, indicating the significance of immune factors in the occurrence and progression of both diseases. By elucidating shared mechanisms, our study provides novel avenues for the prevention and treatment of EM and IBD.

    Matched MeSH terms: Databases, Genetic
  5. Zhang C, Gao Y, Ning Z, Lu Y, Zhang X, Liu J, et al.
    Genome Biol, 2019 10 22;20(1):215.
    PMID: 31640808 DOI: 10.1186/s13059-019-1838-5
    Despite the tremendous growth of the DNA sequencing data in the last decade, our understanding of the human genome is still in its infancy. To understand the implications of genetic variants in the light of population genetics and molecular evolution, we developed a database, PGG.SNV ( https://www.pggsnv.org ), which gives much higher weight to previously under-investigated indigenous populations in Asia. PGG.SNV archives 265 million SNVs across 220,147 present-day genomes and 1018 ancient genomes, including 1009 newly sequenced genomes, representing 977 global populations. Moreover, estimation of population genetic diversity and evolutionary parameters is available in PGG.SNV, a unique feature compared with other databases.
    Matched MeSH terms: Databases, Genetic*
  6. Zeng C, Guo X, Long J, Kuchenbaecker KB, Droit A, Michailidou K, et al.
    Breast Cancer Res, 2016 06 21;18(1):64.
    PMID: 27459855 DOI: 10.1186/s13058-016-0718-0
    BACKGROUND: Multiple recent genome-wide association studies (GWAS) have identified a single nucleotide polymorphism (SNP), rs10771399, at 12p11 that is associated with breast cancer risk.

    METHOD: We performed a fine-scale mapping study of a 700 kb region including 441 genotyped and more than 1300 imputed genetic variants in 48,155 cases and 43,612 controls of European descent, 6269 cases and 6624 controls of East Asian descent and 1116 cases and 932 controls of African descent in the Breast Cancer Association Consortium (BCAC; http://bcac.ccge.medschl.cam.ac.uk/ ), and in 15,252 BRCA1 mutation carriers in the Consortium of Investigators of Modifiers of BRCA1/2 (CIMBA). Stepwise regression analyses were performed to identify independent association signals. Data from the Encyclopedia of DNA Elements project (ENCODE) and the Cancer Genome Atlas (TCGA) were used for functional annotation.

    RESULTS: Analysis of data from European descendants found evidence for four independent association signals at 12p11, represented by rs7297051 (odds ratio (OR) = 1.09, 95 % confidence interval (CI) = 1.06-1.12; P = 3 × 10(-9)), rs805510 (OR = 1.08, 95 % CI = 1.04-1.12, P = 2 × 10(-5)), and rs1871152 (OR = 1.04, 95 % CI = 1.02-1.06; P = 2 × 10(-4)) identified in the general populations, and rs113824616 (P = 7 × 10(-5)) identified in the meta-analysis of BCAC ER-negative cases and BRCA1 mutation carriers. SNPs rs7297051, rs805510 and rs113824616 were also associated with breast cancer risk at P 

    Matched MeSH terms: Databases, Genetic
  7. Yahya P, Sulong S, Harun A, Wangkumhang P, Wilantho A, Ngamphiw C, et al.
    Int J Legal Med, 2020 Jan;134(1):123-134.
    PMID: 31760471 DOI: 10.1007/s00414-019-02184-0
    Ancestry-informative markers (AIMs) can be used to infer the ancestry of an individual to minimize the inaccuracy of self-reported ethnicity in biomedical research. In this study, we describe three methods for selecting AIM SNPs for the Malay population (Malay AIM panel) using different approaches based on pairwise FST, informativeness for assignment (In), and PCA-correlated SNPs (PCAIMs). These Malay AIM panels were extracted from genotype data stored in SNP arrays hosted by the Malaysian node of the Human Variome Project (MyHVP) and the Singapore Genome Variation Project (SGVP). In particular, genotype data from a total of 165 Malay individuals were analyzed, comprising data on 117 individual genotypes from the Affymetrix SNP-6 SNP array platform and data on 48 individual genotypes from the OMNI 2.5 Illumina SNP array platform. The HapMap phase 3 database (1397 individuals from 11 populations) was used as a reference for comparison with the Malay genotype data. The accuracy of each resulting Malay AIM panel was evaluated using a machine learning "ancestry-predictive model" constructed by using WEKA, a comprehensive machine learning platform written in Java. A total of 1250 SNPs were finally selected, which successfully identified Malay individuals from other world populations with an accuracy of 90%, but the accuracy decreased to 80% using 157 SNPs according to the pairwise FST method, while a panel of 200 SNPs selected using In and PCAIMs could be used to identify Malay individuals with an accuracy of approximately 80%.
    Matched MeSH terms: Databases, Genetic*
  8. Wilson JJ, Sing KW, Sofian-Azirun M
    PLoS One, 2013;8(11):e79969.
    PMID: 24282514 DOI: 10.1371/journal.pone.0079969
    The objective of this study was to build a DNA barcode reference library for the true butterflies of Peninsular Malaysia and assess the value of attaching subspecies names to DNA barcode records. A new DNA barcode library was constructed with butterflies from the Museum of Zoology, University of Malaya collection. The library was analysed in conjunction with publicly available DNA barcodes from other Asia-Pacific localities to test the ability of the DNA barcodes to discriminate species and subspecies. Analyses confirmed the capacity of the new DNA barcode reference library to distinguish the vast majority of species (92%) and revealed that most subspecies possessed unique DNA barcodes (84%). In some cases conspecific subspecies exhibited genetic distances between their DNA barcodes that are typically seen between species, and these were often taxa that have previously been regarded as full species. Subspecies designations as shorthand for geographically and morphologically differentiated groups provide a useful heuristic for assessing how such groups correlate with clustering patterns of DNA barcodes, especially as the number of DNA barcodes per species in reference libraries increases. Our study demonstrates the value in attaching subspecies names to DNA barcode records as they can reveal a history of taxonomic concepts and expose important units of biodiversity.
    Matched MeSH terms: Databases, Genetic*
  9. Wei K, Sutherland H, Camilleri E, Haupt LM, Griffiths LR, Gan SH
    Mol Biol Rep, 2014 Dec;41(12):8285-92.
    PMID: 25213548 DOI: 10.1007/s11033-014-3729-x
    Computational epigenetics is a new area of research focused on exploring how DNA methylation patterns affect transcription factor binding, which in turn affects gene expression patterns. The aim of this study was to produce a new protocol for the detection of DNA methylation patterns using computational analysis which can be further confirmed by bisulfite PCR with serial pyrosequencing. The upstream regulatory element and pre-initiation complex relative to CpG islets within the methylenetetrahydrofolate reductase gene were determined via computational analysis and online databases. The 1,104 bp long CpG island located near to or at the alternative promoter site of the methylenetetrahydrofolate reductase gene was identified. The CpG plot indicated that CpG islets A and B, within the island, contained 62 and 75 % GC content and CpG ratios of 0.70 and 0.80-0.95, respectively. Further exploration of the CpG islets A and B indicates that the transcription start sites were GGC which were absent from the TATA boxes. In addition, although six PROSITE motifs were identified in CpG B, no motifs were detected in CpG A. A number of cis-regulatory elements were found in different regions within the CpGs A and B. Transcription factors were predicted to bind to CpGs A and B with varying affinities depending on the DNA methylation status. In addition, transcription factor binding may influence the expression patterns of the methylenetetrahydrofolate reductase gene by recruiting chromatin condensation inducing factors. These results have significant implications for the understanding of the architecture of transcription factor binding at CpG islets as well as DNA methylation patterns that affect chromatin structure.
    Matched MeSH terms: Databases, Genetic
  10. W. Wilonita, R. Nurliyana, D.D. Asma, M. Noorazizah, M.Y. Hirzun
    ASM Science Journal, 2013;7(2):105-112.
    MyJurnal
    Molecular markers have been intensively used in assisting breeding to reduce the time taken by conventional breeding as well as helping introgression of specific traits. Baseline analysis of known markers is crucial in developing a genetic database on disease and pest resistance for local rice germplasm, which does not yet exist. In this study seven local rice varieties, including the popular MR219 and MRQ 74 and MRQ 76 (newly developed aromatic rice varieties), together with a foreign variety, Intani-2, were screened for genetic markers related to pest and disease resistance. One hundred and twenty-two type-related markers (SSR, STS, InDel and Allele-specific) for genes resistant to bacterial leaf blight, blast and brown planthopper were screened using PCR amplification and validated by sequencing. It was found that each variety had its own pattern of resistance. Using the allele-specific markers pBPH9, pTA248 and Pisbdom was found to be the most efficient way to screen for the targeted genes. Of the seven varieties, MR219 and MR232 were found to have the highest distribution of markers for resistance genes against the pests and diseases studied.
    Matched MeSH terms: Databases, Genetic
  11. Vockerodt M, Vrzalikova K, Ibrahim M, Nagy E, Margielewska S, Hollows R, et al.
    J Pathol, 2019 06;248(2):142-154.
    PMID: 30666658 DOI: 10.1002/path.5237
    The Epstein-Barr virus (EBV) is found almost exclusively in the activated B-cell (ABC) subtype of diffuse large B-cell lymphoma (DLBCL), yet its contribution to this tumour remains poorly understood. We have focused on the EBV-encoded latent membrane protein-1 (LMP1), a constitutively activated CD40 homologue expressed in almost all EBV-positive DLBCLs and which can disrupt germinal centre (GC) formation and drive lymphomagenesis in mice. Comparison of the transcriptional changes that follow LMP1 expression with those that follow transient CD40 signalling in human GC B cells enabled us to define pathogenic targets of LMP1 aberrantly expressed in ABC-DLBCL. These included the down-regulation of S1PR2, a sphingosine-1-phosphate (S1P) receptor that is transcriptionally down-regulated in ABC-DLBCL, and when genetically ablated leads to DLBCL in mice. Consistent with this, we found that LMP1-expressing primary ABC-DLBCLs were significantly more likely to lack S1PR2 expression than were LMP1-negative tumours. Furthermore, we showed that the down-regulation of S1PR2 by LMP1 drives a signalling loop leading to constitutive activation of the phosphatidylinositol-3-kinase (PI3-K) pathway. Finally, core LMP1-PI3-K targets were enriched for lymphoma-related transcription factors and genes associated with shorter overall survival in patients with ABC-DLBCL. Our data identify a novel function for LMP1 in aggressive DLBCL. Copyright © 2019 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.
    Matched MeSH terms: Databases, Genetic
  12. Ummu Atiqah Mohd Roslan
    MATEMATIKA, 2018;34(1):13-21.
    MyJurnal
    A Markov map is an interval map that is piecewise expanding and obeys the Markov property. One well-known example of a Markov map is the doubling map, a map which has two subintervals with equal partitions. In this paper, we investigate another type of Markov map, the so-called skewed doubling map. This map is a generalization of the doubling map. Thus, the aims of this paper are to find the fixed points as well as the periodic points of the skewed doubling map and to investigate the sensitive dependence on initial conditions of this map. The method considered here is the cobweb diagram. Numerical results suggest that there exists a dense set of periodic orbits for this map. The sensitivity of this map to initial conditions is also verified, where small differences in initial conditions give different behaviour of the orbits in the map.
    Matched MeSH terms: Databases, Genetic
  13. Tätte K, Pagani L, Pathak AK, Kõks S, Ho Duy B, Ho XD, et al.
    Sci Rep, 2019 03 07;9(1):3818.
    PMID: 30846778 DOI: 10.1038/s41598-019-40399-8
    Surrounded by speakers of Indo-European, Dravidian and Tibeto-Burman languages, around 11 million Munda (a branch of the Austroasiatic language family) speakers live in the densely populated and genetically diverse South Asia. Their genetic makeup holds components characteristic of South Asians as well as Southeast Asians. The admixture time between these components has been previously estimated on the basis of archaeology, linguistics and uniparental markers. Using genome-wide genotype data of 102 Munda speakers and contextual data from South and Southeast Asia, we retrieved admixture dates between 2000-3800 years ago for different populations of Munda. The best modern proxies for the source populations for the admixture with proportions 0.29/0.71 are Lao people from Laos and Dravidian speakers from Kerala in India. The South Asian population(s), with whom the incoming Southeast Asians intermixed, had a smaller proportion of West Eurasian genetic component than contemporary proxies. Somewhat surprisingly, Malaysian Peninsular tribes rather than the geographically closer Austroasiatic language speakers like Vietnamese and Cambodians show the highest sharing of IBD segments with the Munda. In addition, we affirmed that the grouping of the Munda speakers into North and South Munda based on linguistics is in concordance with genome-wide data.
    Matched MeSH terms: Databases, Genetic
  14. Tungekar A, Mandarthi S, Mandaviya PR, Gadekar VP, Tantry A, Kotian S, et al.
    Sci Rep, 2018 08 24;8(1):12715.
    PMID: 30143675 DOI: 10.1038/s41598-018-30579-3
    Esophageal cancer (EC) is the eighth most aggressive malignancy and its treatment remains a challenge due to the lack of biomarkers that can facilitate early detection. EC is identified in two major histological forms namely - Adenocarcinoma (EAC) and Squamous cell carcinoma (ESCC), each showing differences in the incidence among populations that are geographically separated. Hence the detection of potential drug target and biomarkers demands a population-centric understanding of the molecular and cellular mechanisms of EC. To provide an adequate impetus to the biomarker discovery for ESCC, which is the most prevalent esophageal cancer worldwide, here we have developed ESCC ATLAS, a manually curated database that integrates genetic, epigenetic, transcriptomic, and proteomic ESCC-related genes from the published literature. It consists of 3475 genes associated to molecular signatures such as, altered transcription (2600), altered translation (560), contain copy number variation/structural variations (233), SNPs (102), altered DNA methylation (82), Histone modifications (16) and miRNA based regulation (261). We provide a user-friendly web interface ( http://www.esccatlas.org , freely accessible for academic, non-profit users) that facilitates the exploration and the analysis of genes among different populations. We anticipate it to be a valuable resource for the population specific investigation and biomarker discovery for ESCC.
    Matched MeSH terms: Databases, Genetic*
  15. Teo YY, Sim X, Ong RT, Tan AK, Chen J, Tantoso E, et al.
    Genome Res, 2009 Nov;19(11):2154-62.
    PMID: 19700652 DOI: 10.1101/gr.095000.109
    The Singapore Genome Variation Project (SGVP) provides a publicly available resource of 1.6 million single nucleotide polymorphisms (SNPs) genotyped in 268 individuals from the Chinese, Malay, and Indian population groups in Southeast Asia. This online database catalogs information and summaries on genotype and phased haplotype data, including allele frequencies, assessment of linkage disequilibrium (LD), and recombination rates in a format similar to the International HapMap Project. Here, we introduce this resource and describe the analysis of human genomic variation upon agglomerating data from the HapMap and the Human Genome Diversity Project, providing useful insights into the population structure of the three major population groups in Asia. In addition, this resource also surveyed across the genome for variation in regional patterns of LD between the HapMap and SGVP populations, and for signatures of positive natural selection using two well-established metrics: iHS and XP-EHH. The raw and processed genetic data, together with all population genetic summaries, are publicly available for download and browsing through a web browser modeled with the Generic Genome Browser.
    Matched MeSH terms: Databases, Genetic*
  16. Teh SL, Chan WS, Abdullah JO, Namasivayam P
    Mol Biol Rep, 2011 Aug;38(6):3903-9.
    PMID: 21116862 DOI: 10.1007/s11033-010-0506-3
    Vanda Mimi Palmer (VMP) is a highly sought-after fragrant orchid hybrid in Malaysia. It is economically important in cosmetic and beauty industries and also a famous potted ornamental plant. To date, no work on fragrance-related genes of vandaceous orchids has been reported from other research groups although floral fragrance or volatiles have been extensively analysed. An expressed sequence tag (EST) resource was developed for VMP principally to mine any potential fragrance-related expressed sequence tag-simple sequence repeat (EST-SSR) for future development as markers in the identification of fragrant vandaceous orchids endemic to Malaysia. Clustering, annotation and assembling of the ESTs identified 1,196 unigenes which defined 966 singletons and 230 contigs. The VMP dbEST was functionally classified by gene ontology (GO) into three groups: molecular functions (51.2%), cellular components (16.4%) and biological processes (24.6%) while the remaining 7.8% showed no hits with GO identifier. A total of 112 EST-SSRs (9.4%) were mined, in which at least five units of di-, tri-, tetra-, penta-, or hexa-nucleotide repeats were predicted. The di-nucleotide motif repeats appeared to be the most frequent repeats among the detected SSRs with the AT/TA types as the most abundant among the dimerics, while AAG/TTC, AGA/TCT-type were the most frequent trimerics. The mined EST-SSRs are believed to be useful in the development of EST-SSR markers that are applicable in the screening and characterization of fragrance-related transcripts in closely related species.
    Matched MeSH terms: Databases, Genetic*
  17. Tan TK, Tan KY, Hari R, Mohamed Yusoff A, Wong GJ, Siow CC, et al.
    Database (Oxford), 2016;2016.
    PMID: 27616775 DOI: 10.1093/database/baw063
    Pangolins (order Pholidota) are the only mammals covered by scales. We have recently sequenced and analyzed the genomes of two critically endangered Asian pangolin species, namely the Malayan pangolin (Manis javanica) and the Chinese pangolin (Manis pentadactyla). These complete genome sequences will serve as reference sequences for future research to address issues of species conservation and to advance knowledge in mammalian biology and evolution. To further facilitate the global research effort in pangolin biology, we developed the Pangolin Genome Database (PGD), as a future hub for hosting pangolin genomic and transcriptomic data and annotations, and with useful analysis tools for the research community. Currently, the PGD provides the reference pangolin genome and transcriptome data, gene sequences and functional information, expressed transcripts, pseudogenes, genomic variations, organ-specific expression data and other useful annotations. We anticipate that the PGD will be an invaluable platform for researchers who are interested in pangolin and mammalian research. We will continue updating this hub by including more data, annotation and analysis tools particularly from our research consortium. Database URL: http://pangolin-genome.um.edu.my.
    Matched MeSH terms: Databases, Genetic*
  18. Tan SY, Dutta A, Jakubovics NS, Ang MY, Siow CC, Mutha NV, et al.
    BMC Bioinformatics, 2015;16:9.
    PMID: 25591325 DOI: 10.1186/s12859-014-0422-y
    Yersinia is a genus of Gram-negative bacteria that includes serious pathogens such as Yersinia pestis, which causes plague, Yersinia pseudotuberculosis, and Yersinia enterocolitica. The remaining species are generally considered non-pathogenic to humans, although there is evidence that at least some of these species can cause occasional infections using distinct mechanisms from the more pathogenic species. With the advances in sequencing technologies, many genomes of Yersinia have been sequenced. However, there is currently no specialized platform to hold the rapidly-growing Yersinia genomic data and to provide analysis tools particularly for comparative analyses, which are required to provide improved insights into their biology, evolution and pathogenicity.
    Matched MeSH terms: Databases, Genetic*
  19. Tan SH, Normi YM, Leow AT, Salleh AB, Karjiban RA, Murad AM, et al.
    BMC Struct Biol, 2014 Mar 19;14:11.
    PMID: 24641837 DOI: 10.1186/1472-6807-14-11
    BACKGROUND: At least a quarter of any complete genome encodes for hypothetical proteins (HPs) which are largely non-similar to other known, well-characterized proteins. Predicting and solving their structures and functions is imperative to aid understanding of any given organism as a complete biological system. The present study highlights the primary effort to classify and cluster 1202 HPs of Bacillus lehensis G1 alkaliphile to serve as a platform to mine and select specific HP(s) to be studied further in greater detail.

    RESULTS: All HPs of B. lehensis G1 were grouped according to their predicted functions based on the presence of functional domains in their sequences. From the metal-binding group of HPs of the cluster, an HP termed Bleg1_2507 was discovered to contain a thioredoxin (Trx) domain and highly-conserved metal-binding ligands represented by Cys69, Cys73 and His159, similar to all prokaryotic and eukaryotic Sco proteins. The built 3D structure of Bleg1_2507 showed that it shared the βαβαββ core structure of Trx-like proteins as well as three flanking β-sheets, a 3₁₀-helix at the N-terminus and a hairpin structure unique to Sco proteins. Docking simulations provided an interesting view of Bleg1_2507 in association with its putative cytochrome c oxidase subunit II (COXII) redox partner, Bleg1_2337, where the latter can be seen to hold its partner in an embrace, facilitated by hydrophobic and ionic interactions between the proteins. Although Bleg1_2507 shares relatively low sequence identity (47%) to BsSco, interestingly, the predicted metal-binding residues of Bleg1_2507 i.e. Cys-69, Cys-73 and His-159 were located at flexible active loops similar to other Sco proteins across biological taxa. This highlights structural conservation of Sco despite their various functions in prokaryotes and eukaryotes.

    CONCLUSIONS: We propose that HP Bleg1_2507 is a Sco protein which is able to interact with COXII, its redox partner and therefore, may possess metallochaperone and redox functions similar to other documented bacterial Sco proteins. It is hoped that this scientific effort will help to spur the search for other physiologically relevant proteins among the so-called "orphan" proteins of any given organism.

    Matched MeSH terms: Databases, Genetic
  20. Tan EC, Loh M, Chuon D, Lim YP
    Hum Mutat, 2006 Mar;27(3):232-5.
    PMID: 16429432
    There is a need for country/population-specific databases because the existence of population-specific mutations for single gene disorders is well documented, and there is also good evidence for ethnic differences in the frequencies of genetic variations involved in complex disorders. Thus the Singapore Human Mutation/Polymorphism Database (SHMPD) was created to provide clinicians and scientists access to a central genetic database for the Singapore population. The data catalogued in the database include mutations identified in Singapore for Mendelian diseases, and frequencies of polymorphisms that have been investigated in either healthy controls or samples associated with specific phenotypes. Data from journal articles identified by searches in PubMed and other online resources, and via personal communications with researchers were compiled and assembled into a single database. Genes are categorized alphabetically and are also searchable by name and disease. The information provided for each variant of the gene includes the protein encoded, phenotype association, gender, size, and ethnic origin of the sample, as well as the reported genotype and allele frequencies, and direct links to the corresponding abstracts on PubMed. Our database will facilitate molecular diagnosis of Mendelian disorders and improve study designs for complex traits. It will be useful not only for researchers in Singapore, but also for those in countries with similar ethnic backgrounds, such as China, Taiwan, Hong Kong, Indonesia, and Malaysia.
    Matched MeSH terms: Databases, Genetic*