Displaying publications 1 - 20 of 233 in total

Abstract:
Sort:
  1. Salahshourifar I, Halim AS, Wan Sulaiman WA, Zilfalil BA
    J Hum Genet, 2011 Nov;56(11):755-8.
    PMID: 21866112 DOI: 10.1038/jhg.2011.95
    Oral clefts are clinically and genetically heterogeneous disorders that are influenced by both genetic and environmental factors. The present family-based association study investigated the role of the MSX1 and TGFB3 genes in the etiology of non-syndromic oral cleft in a Malay population. No transmission distortion was found in the transmission disequilibrium analysis for either MSX1-CA or TGFB3-CA intragenic markers, whereas TGFB3-CA exhibited a trend to excess maternal transmission. In sequencing the MSX1 coding regions in 124 patients with oral cleft, five variants were found, including three known variants (A34G, G110G and P147Q) and two novel variants (M37L and G267A). The P147Q and M37L variants were not observed in 200 control chromosomes, whereas G267A was found in one control sample, indicating a very rare polymorphic variant. Furthermore, the G110G variant displayed a significant association between patients with non-syndromic cleft lip, with or without cleft palate, and normal controls (P=0.001, odds ratio=2.241, 95% confidence interval, 1.357-3.700). Therefore, these genetic variants may contribute, along with other genetic and environmental factors, to this condition.
    Matched MeSH terms: Sequence Alignment
  2. Zhang T, Wu Q, Zhang Z
    Curr Biol, 2020 04 06;30(7):1346-1351.e2.
    PMID: 32197085 DOI: 10.1016/j.cub.2020.03.022
    An outbreak of coronavirus disease 2019 (COVID-19) caused by the 2019 novel coronavirus (SARS-CoV-2) began in the city of Wuhan in China and has widely spread worldwide. Currently, it is vital to explore potential intermediate hosts of SARS-CoV-2 to control COVID-19 spread. Therefore, we reinvestigated published data from pangolin lung samples from which SARS-CoV-like CoVs were detected by Liu et al. [1]. We found genomic and evolutionary evidence of the occurrence of a SARS-CoV-2-like CoV (named Pangolin-CoV) in dead Malayan pangolins. Pangolin-CoV is 91.02% and 90.55% identical to SARS-CoV-2 and BatCoV RaTG13, respectively, at the whole-genome level. Aside from RaTG13, Pangolin-CoV is the most closely related CoV to SARS-CoV-2. The S1 protein of Pangolin-CoV is much more closely related to SARS-CoV-2 than to RaTG13. Five key amino acid residues involved in the interaction with human ACE2 are completely consistent between Pangolin-CoV and SARS-CoV-2, but four amino acid mutations are present in RaTG13. Both Pangolin-CoV and RaTG13 lost the putative furin recognition sequence motif at S1/S2 cleavage site that can be observed in the SARS-CoV-2. Conclusively, this study suggests that pangolin species are a natural reservoir of SARS-CoV-2-like CoVs.
    Matched MeSH terms: Sequence Alignment
  3. Kalsum HU, Shah ZA, Othman RM, Hassan R, Rahim SM, Asmuni H, et al.
    Comput Biol Med, 2009 Nov;39(11):1013-9.
    PMID: 19720371 DOI: 10.1016/j.compbiomed.2009.08.002
    Protein domains contain information about the prediction of protein structure, function, evolution and design since the protein sequence may contain several domains with different or the same copies of the protein domain. In this study, we proposed an algorithm named SplitSSI-SVM that works with the following steps. First, the training and testing datasets are generated to test the SplitSSI-SVM. Second, the protein sequence is split into subsequence based on order and disorder regions. The protein sequence that is more than 600 residues is split into subsequences to investigate the effectiveness of the protein domain prediction based on subsequence. Third, multiple sequence alignment is performed to predict the secondary structure using bidirectional recurrent neural networks (BRNN) where BRNN considers the interaction between amino acids. The information of about protein secondary structure is used to increase the protein domain boundaries signal. Lastly, support vector machines (SVM) are used to classify the protein domain into single-domain, two-domain and multiple-domain. The SplitSSI-SVM is developed to reduce misleading signal, lower protein domain signal caused by primary structure of protein sequence and to provide accurate classification of the protein domain. The performance of SplitSSI-SVM is evaluated using sensitivity and specificity on single-domain, two-domain and multiple-domain. The evaluation shows that the SplitSSI-SVM achieved better results compared with other protein domain predictors such as DOMpro, GlobPlot, Dompred-DPS, Mateo, Biozon, Armadillo, KemaDom, SBASE, HMMPfam and HMMSMART especially in two-domain and multiple-domain.
    Matched MeSH terms: Sequence Alignment
  4. Ee SF, Mohamed-Hussein ZA, Othman R, Shaharuddin NA, Ismail I, Zainal Z
    ScientificWorldJournal, 2014;2014:840592.
    PMID: 24678279 DOI: 10.1155/2014/840592
    Polygonum minus is an aromatic plant, which contains high abundance of terpenoids, especially the sesquiterpenes C15H24. Sesquiterpenes were believed to contribute to the many useful biological properties in plants. This study aimed to functionally characterize a full length sesquiterpene synthase gene from P. minus. P. minus sesquiterpene synthase (PmSTS) has a complete open reading frame (ORF) of 1689 base pairs encoding a 562 amino acid protein. Similar to other sesquiterpene synthases, PmSTS has two large domains: the N-terminal domain and the C-terminal metal-binding domain. It also consists of three conserved motifs: the DDXXD, NSE/DTE, and RXR. A three-dimensional protein model for PmSTS built clearly distinguished the two main domains, where conserved motifs were highlighted. We also constructed a phylogenetic tree, which showed that PmSTS belongs to the angiosperm sesquiterpene synthase subfamily Tps-a. To examine the function of PmSTS, we expressed this gene in Arabidopsis thaliana. Two transgenic lines, designated as OE3 and OE7, were further characterized, both molecularly and functionally. The transgenic plants demonstrated smaller basal rosette leaves, shorter and fewer flowering stems, and fewer seeds compared to wild type plants. Gas chromatography-mass spectrometry analysis of the transgenic plants showed that PmSTS was responsible for the production of β -sesquiphellandrene.
    Matched MeSH terms: Sequence Alignment
  5. Kusumaningtyas E, Tan WS, Zamrod Z, Eshaghi M, Yusoff K
    Arch Virol, 2004 Sep;149(9):1859-65.
    PMID: 15593426
    Nucleotide sequence comparison of the L gene of the Malaysian neurotropic-viscerotropic velogenic NDV strain AF2240 with other NDV strains revealed a single nucleotide insertion at position 3870. This mutation is compensated by a nucleotide deletion downstream at position 3958 which results in two forms of the L proteins containing a 30-amino acid substitution in Domain V. This compensatory mutation does not correlate with the pathogenicity of the viral strains but it may affect the viral replication as Domain V is believed to play an important role in the replication of paramyxoviruses.
    Matched MeSH terms: Sequence Alignment
  6. Tai HF, Foo HL, Abdul Rahim R, Loh TC, Abdullah MP, Yoshinobu K
    Microb Cell Fact, 2015;14:89.
    PMID: 26077560 DOI: 10.1186/s12934-015-0280-y
    Bacteriocin-producing Lactic acid bacteria (LAB) have vast applications in human and animal health, as well as in food industry. The structural, immunity, regulatory, export and modification genes are required for effective bacteriocin biosynthesis. Variations in gene sequence, composition and organisation will affect the antimicrobial spectrum of bacteriocin greatly. Lactobacillus plantarum I-UL4 is a novel multiple bacteriocin producer that harbours both plw and plnEF structural genes simultaneous which has not been reported elsewhere. Therefore, molecular characterisation of bacteriocin genes that harboured in L. plantarum I-UL4 was conducted in this study.
    Matched MeSH terms: Sequence Alignment
  7. Lim PE, Tan J, Suana IW, Eamsobhana P, Yong HS
    PLoS One, 2012;7(5):e37276.
    PMID: 22615962 DOI: 10.1371/journal.pone.0037276
    The fruit fly Bactrocera caudata is a pest species of economic importance in Asia. Its larvae feed on the flowers of Cucurbitaceae such as Cucurbita moschata. To-date it is distinguished from related species based on morphological characters. Specimens of B. caudata from Peninsular Malaysia and Indonesia (Bali and Lombok) were analysed using the partial DNA sequences of cytochrome c oxidase subunit I (COI) and 16S rRNA genes. Both gene sequences revealed that B. caudata from Peninsular Malaysia was distinctly different from B. caudata of Bali and Lombok, without common haplotype between them. Phylogenetic analysis revealed two distinct clades, indicating distinct genetic lineage. The uncorrected 'p' distance for COI sequences between B. caudata of Malaysia-Thailand-China and B. caudata of Bali-Lombok was 5.65%, for 16S sequences from 2.76 to 2.99%, and for combined COI and 16S sequences 4.45 to 4.46%. The 'p' values are distinctly different from intraspecific 'p' distance (0-0.23%). Both the B. caudata lineages are distinctly separated from related species in the subgenus Zeugodacus - B. ascita, B. scutellata, B. ishigakiensis, B. diaphora, B. tau, B. cucurbitae, and B. depressa. Molecular phylogenetic analysis indicates that the B. caudata lineages are closely related to B. ascita sp. B, and form a clade with B. scutellata, B. ishigakiensis, B. diaphora and B. ascita sp. A. This study provides additional baseline for the phylogenetic relationships of Bactrocera fruit flies of the subgenus Zeugodacus. Both the COI and 16S genes could be useful markers for the molecular differentiation and phylogenetic analysis of tephritid fruit flies.
    Matched MeSH terms: Sequence Alignment
  8. Chow KS, Wan KL, Isa MN, Bahari A, Tan SH, Harikrishna K, et al.
    J Exp Bot, 2007;58(10):2429-40.
    PMID: 17545224
    Hevea brasiliensis is the most widely cultivated species for commercial production of natural rubber (cis-polyisoprene). In this study, 10,040 expressed sequence tags (ESTs) were generated from the latex of the rubber tree, which represents the cytoplasmic content of a single cell type, in order to analyse the latex transcription profile with emphasis on rubber biosynthesis-related genes. A total of 3,441 unique transcripts (UTs) were obtained after quality editing and assembly of EST sequences. Functional classification of UTs according to the Gene Ontology convention showed that 73.8% were related to genes of unknown function. Among highly expressed ESTs, a significant proportion encoded proteins related to rubber biosynthesis and stress or defence responses. Sequences encoding rubber particle membrane proteins (RPMPs) belonging to three protein families accounted for 12% of the ESTs. Characterization of these ESTs revealed nine RPMP variants (7.9-27 kDa) including the 14 kDa REF (rubber elongation factor) and 22 kDa SRPP (small rubber particle protein). The expression of multiple RPMP isoforms in latex was shown using antibodies against REF and SRPP. Both EST and quantitative reverse transcription-PCR (QRT-PCR) analyses demonstrated REF and SRPP to be the most abundant transcripts in latex. Besides rubber biosynthesis, comparative sequence analysis showed that the RPMPs are highly similar to sequences in the plant kingdom having stress-related functions. Implications of the RPMP function in cis-polyisoprene biosynthesis in the context of transcript abundance and differential gene expression are discussed.
    Matched MeSH terms: Sequence Alignment
  9. Meng SL, Yan JX, Xu GL, Nadin-Davis SA, Ming PG, Liu SY, et al.
    Virus Res, 2007 Mar;124(1-2):125-38.
    PMID: 17129631
    A group of 31 rabies viruses (RABVs), recovered primarily from dogs, one deer and one human case, were collected from various areas in China between 1989 and 2006. Complete G gene sequences determined for these isolates indicated identities of nucleotide and amino acid sequences of >or=87% and 93.8%, respectively. Phylogenetic analysis of these and some additional Chinese isolates clearly supported the placement of all Chinese viruses in Lyssavirus genotype 1 and divided all Chinese isolates between four distinct groups (I-IV). Several variants identified within the most commonly encountered group I were distributed according to their geographical origins. A comparison of representative Chinese viruses with other isolates retrieved world-wide indicated a close evolutionary relationship between China group I and II viruses and those of Indonesia while China group III viruses formed an outlying branch to variants from Malaysia and Thailand. China group IV viruses were closely related to several vaccine strains. The predicted glycoprotein sequences of these RABVs variants are presented and discussed with respect to the utility of the anti-rabies biologicals currently employed in China.
    Matched MeSH terms: Sequence Alignment
  10. Cho L, Kaur A, Cereb N, Lin PY, Yang KL
    HLA, 2020 08;96(2):217-218.
    PMID: 32227685 DOI: 10.1111/tan.13873
    One nucleotide substitution in codon 89 of HLA-B*38:02:01:01 results in a novel allele, HLA-B*38:64.
    Matched MeSH terms: Sequence Alignment
  11. Choo SW, Wee WY, Ngeow YF, Mitchell W, Tan JL, Wong GJ, et al.
    Sci Rep, 2014;4:4061.
    PMID: 24515248 DOI: 10.1038/srep04061
    Mycobacterium abscessus (Ma) is an emerging human pathogen that causes both soft tissue infections and systemic disease. We present the first comparative whole-genome study of Ma strains isolated from patients of wide geographical origin. We found a high proportion of accessory strain-specific genes indicating an open, non-conservative pan-genome structure, and clear evidence of rapid phage-mediated evolution. Although we found fewer virulence factors in Ma compared to M. tuberculosis, our data indicated that Ma evolves rapidly and therefore should be monitored closely for the acquisition of more pathogenic traits. This comparative study provides a better understanding of Ma and forms the basis for future functional work on this important pathogen.
    Matched MeSH terms: Sequence Alignment
  12. Xiao H
    Neural Netw, 2020 Nov;131:172-184.
    PMID: 32801109 DOI: 10.1016/j.neunet.2020.07.024
    Paraphrase identification serves as an important topic in natural language processing while sequence alignment and matching underlie the principle of this task. Traditional alignment methods take advantage of attention mechanism. Attention mechanism, i.e. weighting technique, could pick out the most similar/dissimilar parts, but is weak in modeling the aligned unmatched parts, which are the crucial evidence to identify paraphrases. In this paper, we empower neural architecture with Hungarian algorithm to extract the aligned unmatched parts. Specifically, first, our model applies BiLSTM/BERT to encode the input sentences into hidden representations. Then, Hungarian layer leverages the hidden representations to extract the aligned unmatched parts. Last, we apply cosine similarity to metric the aligned unmatched parts for a final discrimination. Extensive experiments show that our model outperforms other baselines, substantially and significantly.
    Matched MeSH terms: Sequence Alignment
  13. Tan JR, Tan KS, Yong FL, Armugam A, Wang CW, Jeyaseelan K, et al.
    PLoS One, 2017;12(2):e0172131.
    PMID: 28199366 DOI: 10.1371/journal.pone.0172131
    Ischemic stroke is a major cause of mortality and morbidity globally. Among the ischemic stroke subtypes, cardioembolic stroke is with poor functional outcome (Modified Rankin score ≥ 2). Early diagnosis of cardioembolic stroke will prove beneficial. This study examined the microRNAs targeting cluster of differentiation 46 (CD46), a potential biomarker for cardioembolic stroke. CD46 mRNA level was shown to be differentially expressed (p < 0.001) between cardioembolic stroke (median = 1.32) and non-cardioembolic stroke subtypes (large artery stroke median = 5.05; small vessel stroke median = 6.45). Bioinformatic search showed that miR-19a, -20a, -185 and -374b were found to target CD46 mRNA and further verified by luciferase reporter assay. The levels of miRNAs targeting CD46 were significantly reduced (p < 0.05) in non-cardioembolic stroke patients (large artery stroke median: miR-19a = 0.63, miR-20a = 0.42, miR-185 = 0.32, miR-374b = 0.27; small artery stroke median: miR-19a = 0.07, miR-20a = 0.06, miR-185 = 0.07, miR-374b = 0.05) as compared to cardioembolic stroke patients (median: miR-19a = 2.69, miR-20a = 1.36, miR-185 = 1.05, miR-374b = 1.23). ROC curve showed that the miRNAs could distinguish cardioembolic stroke from non-cardioembolic stroke with better AUC value as compared to CD46. Endogenous expression of CD46 in Human Umbilical Vein Endothelial Cells (HUVECs) were found to be regulated by miR-19a and miR-20a. Thus implicating that miR-19a and -20a may play a role in pathogenesis of cardioembolic stroke, possibly via the endothelial cells.
    Matched MeSH terms: Sequence Alignment
  14. Kwan YM, Meon S, Ho CL, Wong MY
    J Plant Physiol, 2015 Feb 01;174:131-6.
    PMID: 25462975 DOI: 10.1016/j.jplph.2014.10.003
    Nitric oxide associated 1 (NOA1) protein is implicated in plant disease resistance and nitric oxide (NO) biosynthesis. A full-length cDNA encoding of NOA1 protein from oil palm (Elaeis guineensis) was isolated and designated as EgNOA1. Sequence analysis suggested that EgNOA1 was a circular permutated GTPase with high similarity to the bacterial YqeH protein of the YawG/YlqF family. The gene expression of EgNOA1 and NO production in oil palm root tissues treated with Ganoderma boninense, the causal agent of basal stem rot (BSR) disease were profiled to investigate the involvement of EgNOA1 during fungal infection and association with NO biosynthesis. Real-time PCR (qPCR) analysis revealed that the transcript abundance of EgNOA1 in root tissues was increased by G. boninense treatment. NO burst in Ganoderma-treated root tissue was detected using Griess reagent, in advance of the up-regulation of the EgNOA1 transcript. This indicates that NO production was independent of EgNOA1. However, the induced expression of EgNOA1 in Ganoderma-treated root tissues implies that it might be involved in plant defense responses against pathogen infection.
    Matched MeSH terms: Sequence Alignment
  15. Chua EG, Debowski AW, Webberley KM, Peters F, Lamichhane B, Loke MF, et al.
    Gastroenterol Rep (Oxf), 2019 Feb;7(1):42-49.
    PMID: 30792865 DOI: 10.1093/gastro/goy048
    Background: Metronidazole is one of the first-line drugs of choice in the standard triple therapy used to eradicate Helicobacter pylori infection. Hence, the global emergence of metronidazole resistance in Hp poses a major challenge to health professionals. Inactivation of RdxA is known to be a major mechanism of conferring metronidazole resistance in H. pylori. However, metronidazole resistance can also arise in H. pylori strains expressing functional RdxA protein, suggesting that there are other mechanisms that may confer resistance to this drug.

    Methods: We performed whole-genome sequencing on 121 H. pylori clinical strains, among which 73 were metronidazole-resistant. Sequence-alignment analysis of core protein clusters derived from clinical strains containing full-length RdxA was performed. Variable sites in each alignment were statistically compared between the resistant and susceptible groups to determine candidate genes along with their respective amino-acid changes that may account for the development of metronidazole resistance in H. pylori.

    Results: Resistance due to RdxA truncation was identified in 34% of metronidazole-resistant strains. Analysis of core protein clusters derived from the remaining 48 metronidazole-resistant strains and 48 metronidazole-susceptible identified four variable sites significantly associated with metronidazole resistance. These sites included R16H/C in RdxA, D85N in the inner-membrane protein RclC (HP0565), V265I in a biotin carboxylase protein (HP0370) and A51V/T in a putative threonylcarbamoyl-AMP synthase (HP0918).

    Conclusions: Our approach identified new potential mechanisms for metronidazole resistance in H. pylori that merit further investigation.

    Matched MeSH terms: Sequence Alignment
  16. Pang SL, Ong SS, Lee HH, Zamri Z, Kandasamy KI, Choong CY, et al.
    Genet. Mol. Res., 2014;13(3):7217-38.
    PMID: 25222227 DOI: 10.4238/2014.September.5.7
    This study was directed at the understanding of the function of CCoAOMT isolated from Acacia auriculiformis x Acacia mangium. Full length cDNA of the Acacia hybrid CCoAOMT (AhCCoAOMT) was 1024-bp long, containing 750-bp coding regions, with one major open reading frame of 249 amino acids. On the other hand, full length genomic sequence of the CCoAOMT (AhgflCCoAOMT) was 2548 bp long, containing three introns and four exons with a 5' untranslated region (5'UTR) of 391 bp in length. The 5'UTR of the characterized CCoAOMT gene contains various regulatory elements. Southern analysis revealed that the Acacia hybrid has more than three copies of the CCoAOMT gene. Real-time PCR showed that this gene was expressed in root, inner bark, leaf, flower and seed pod of the Acacia hybrid. Downregulation of the homologous CCoAOMT gene in tobacco by antisense (AS) and intron-containing hairpin (IHP) constructs containing partial AhCCoAOMT led to reduction in lignin content. Expression of the CCoAOMT in AS line (pART-HAS78-03) and IHP line (pART-HIHP78-06) was reduced respectively by 37 and 75% compared to the control, resulting in a decrease in the estimated lignin content by 24 and 56%, respectively. AhCCoAOMT was found to have altered not only S and G units but also total lignin content, which is of economic value to the pulp industry. Subsequent polymorphism analysis of this gene across eight different genetic backgrounds each of A. mangium and A. auriculiformis revealed 47 single nucleotide polymorphisms (SNPs) in A. auriculiformis CCoAOMT and 30 SNPs in A. mangium CCoAOMT.
    Matched MeSH terms: Sequence Alignment
  17. French-Monar RD, Patton AF, Douglas JM, Abad JA, Schuster G, Wallace RW, et al.
    Plant Dis, 2010 Apr;94(4):481.
    PMID: 30754480 DOI: 10.1094/PDIS-94-4-0481A
    In August 2008, 30% of tomato (Solanum lycopersicum) plants in plots in Lubbock County, Texas showed yellowing, lateral stem dieback, upward leaf curling, enlargement of stems, adventitious roots, and swollen nodes. Yellowing in leaves was similar to that seen with zebra chip disease (ZC) of potato that was confirmed in a potato field 112 km away in July 2008 and was associated with a 'Candidatus Liberibacter' species (1), similar to findings earlier in 2008 in New Zealand and California (2,3). Tissue from four symptomatic plants of cv. Spitfire and two of cv. Celebrity were collected and DNA was extracted from midribs and petioles with a FastDNA Spin Kit (Qbiogene, Inc., Carlsbad, CA,). PCR amplification was done with 16S rRNA gene primers OA2 and OI2c, which are specific for "Ca. Liberibacter solanacearum" from potato and tomato and amplify a 1.1-kb fragment of the 16S rRNA gene of this new species (1,3). Amplicons of 1.1 kb were obtained from all samples and these were sequenced in both orientations (McLab, San Francisco, CA). Sequences of the 16S rRNA gene were identical for both Spitfire and Celebrity and were submitted to the NCBI as GenBank Accession Nos. FJ939136 and FJ939137, respectively. On the basis of a BLAST search, sequence alignments revealed 99.9% identity with a new species of 'Ca. Liberibacter' from potato (EU884128 and EU884129) in Texas (1); 99.7% identity with the new species "Ca. Liberibacter solanacearum" described from potato and tomato (3) in New Zealand (EU849020 and EU834130, respectively) and from the potato psyllid Bactericera cockerelli in California (2) (EU812559, EU812556); 97% identity with 'Ca L. asiaticus' from citrus in Malaysia (EU224393) and 94% identity with both 'Ca. L. africanus' and 'Ca. L. americanus' from citrus (EU921620 and AY742824, respectively). A neighbor-joining cladogram constructed using the 16S rRNA gene fragments delineated four clusters corresponding to each species, and these sequences clustered with "Ca. L. solanacearum". A second PCR analysis was conducted with the CL514F/CL514R primer pair, which amplifies a sequence from the rplJ and rplL ribosomal protein genes of "Ca. L. solanacearum". The resulting 669-bp products were 100% identical to a sequence reported from tomato in Mexico (FJ498807). This sequence was submitted to NCBI (GU169328). ZC, a disease causing losses to the potato industry, is associated with a 'Candidatus Liberibacter' species (1-3) and was reported in Central America and Mexico in the 1990s, in Texas in 2000, and more recently in other states in the United States (4). In 2008, a "Ca. Liberibacter solanacearum" was detected on Capsicum annuum, S. betaceum, and Physalis peruviana in New Zealand (3). Several studies have shown that the potato psyllid, B. cockerelli, is a potential vector for this pathogen (2,4). To our knowledge, this is the first report of "Ca. Liberibacter solanacearum" in field tomatoes showing ZC-like foliar disease symptoms in the United States. References: (1). J. A. Abad et al. Plant Dis. 93:108, 2009 (2) A. K. Hansen et al. Appl. Environ. Microbiol. 74:5862, 2008. (3) L. W. Liefting et al. Plant Dis. 93:208, 2009. (4) G. A. Secor et al. Plant Dis. 93:574, 2009.
    Matched MeSH terms: Sequence Alignment
  18. Amiruddin N, Lee XW, Blake DP, Suzuki Y, Tay YL, Lim LS, et al.
    BMC Genomics, 2012 Jan 13;13:21.
    PMID: 22244352 DOI: 10.1186/1471-2164-13-21
    BACKGROUND: Eimeria tenella is an apicomplexan parasite that causes coccidiosis in the domestic fowl. Infection with this parasite is diagnosed frequently in intensively reared poultry and its control is usually accorded a high priority, especially in chickens raised for meat. Prophylactic chemotherapy has been the primary method used for the control of coccidiosis. However, drug efficacy can be compromised by drug-resistant parasites and the lack of new drugs highlights demands for alternative control strategies including vaccination. In the long term, sustainable control of coccidiosis will most likely be achieved through integrated drug and vaccination programmes. Characterisation of the E. tenella transcriptome may provide a better understanding of the biology of the parasite and aid in the development of a more effective control for coccidiosis.

    RESULTS: More than 15,000 partial sequences were generated from the 5' and 3' ends of clones randomly selected from an E. tenella second generation merozoite full-length cDNA library. Clustering of these sequences produced 1,529 unique transcripts (UTs). Based on the transcript assembly and subsequently primer walking, 433 full-length cDNA sequences were successfully generated. These sequences varied in length, ranging from 441 bp to 3,083 bp, with an average size of 1,647 bp. Simple sequence repeat (SSR) analysis identified CAG as the most abundant trinucleotide motif, while codon usage analysis revealed that the ten most infrequently used codons in E. tenella are UAU, UGU, GUA, CAU, AUA, CGA, UUA, CUA, CGU and AGU. Subsequent analysis of the E. tenella complete coding sequences identified 25 putative secretory and 60 putative surface proteins, all of which are now rational candidates for development as recombinant vaccines or drug targets in the effort to control avian coccidiosis.

    CONCLUSIONS: This paper describes the generation and characterisation of full-length cDNA sequences from E. tenella second generation merozoites and provides new insights into the E. tenella transcriptome. The data generated will be useful for the development and validation of diagnostic and control strategies for coccidiosis and will be of value in annotation of the E. tenella genome sequence.

    Matched MeSH terms: Sequence Alignment
  19. Tan SL, Mohd-Adnan A, Mohd-Yusof NY, Forstner MR, Wan KL
    Gene, 2008 Mar 31;411(1-2):77-86.
    PMID: 18280674 DOI: 10.1016/j.gene.2008.01.008
    Using a novel library of 5637 expressed sequence tags (ESTs) from the brain tissue of the Asian seabass (Lates calcarifer), we first characterized the brain transcriptome for this economically important species. The ESTs generated from the brain of L. calcarifer yielded 2410 unique transcripts (UTs) which comprise of 982 consensi and 1428 singletons. Based on database similarity, 1005 UTs (41.7%) can be assigned putative functions and were grouped into 12 functional categories related to the brain function. Amongst others, we have identified genes that are putatively involved in energy metabolism, ion pumps and channels, synapse related genes, neurotransmitter and its receptors, stress induced genes and hormone related genes. Subsequently we selected a putative preprocGnRH-II precursor for further characterization. The complete cDNA sequence of the gene obtained was found to code for an 85-amino acid polypeptide that significantly matched preprocGnRH-II precursor sequences from other vertebrates, and possesses structural characteristics that are similar to that of other species, consisting of a signal peptide (23 residues), a GnRH decapeptide (10 residues), an amidation/proteolytic-processing signal (glycine-lysine-argine) and a GnRH associated peptide (GAP) (49 residues). Phylogenetic analysis showed that this putative L. calcarifer preprocGnRH-II sequence is a member of the subcohort Euteleostei and divergent from the sequences of the subcohort Otocephalan. These findings provide compelling evidence that the putative L. calcarifer preprocGnRH-II precursor obtained in this study is orthologous to that of other vertebrates. The functional prediction of this preprocGnRH-II precursor sequence through in silico analyses emphasizes the effectiveness of the EST approach in gene identification in L. calcarifer.
    Matched MeSH terms: Sequence Alignment
  20. Choi SB, Normi YM, Wahab HA
    Protein J, 2009 Dec;28(9-10):415-27.
    PMID: 19859792 DOI: 10.1007/s10930-009-9209-9
    Twenty percent of genes that encode for hypothetical proteins from Klebsiella pneumoniae MGH78578 were identified, leading to KPN00728 and KPN00729 after bioinformatics analysis. Both open reading frames showed high sequence homology to Succinate dehydrogenase Chain C (SdhC) and D (SdhD) from Escherichia coli. Recently, KPN00729 was assigned as SdhD. KPN00728 thus remains of particular interest as no annotated genes from the complete genome sequence encode for SdhC. We discovered KPN00728 has a missing region with conserved residues important for ubiquinone (UQ) and heme group binding. Structure and function prediction of KPN00728 coupled with secondary structure analysis and transmembrane topology showed KPN00728 adopts SDH-(subunit C)-like structure. To further probe its functionality, UQ was docked on the built model (consisting KPN00728 and KPN00729) and formation of hydrogen bonds between UQ and Ser27, Arg31 (KPN00728) and Tyr84 (KPN00729) further reinforces and supports that KPN00728 is indeed SDH. This is the first report on the structural and function prediction of KPN00728 of K. pneumoniae MGH78578 as SdhC.
    Matched MeSH terms: Sequence Alignment
Filters
Contact Us

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links