MyMedR

Displaying publications 1 - 20 of 180 in total

Abstract:

Sort:

Short homologies efficiently generate detectable homologous recombination events

Osahor AN, Tan CY, Sim EU, Lee CW, Narayanan K

Anal Biochem, 2014 Oct 1;462:26-8.
PMID: 24929088 DOI: 10.1016/j.ab.2014.05.030

When recombineering bacterial artificial chromosomes (BACs), it is common practice to design the ends of the donor molecule with 50 bp of homology specifying its insertion site. We demonstrate that desired recombinants can be produced using intermolecular homologies as short as 15 bp. Although the use of shorter donor end regions decreases total recombinants by several fold, the frequency of recombinants with correctly inserted donor molecules was high enough for easy detection by simple polymerase chain reaction (PCR) screening. This observation may have important implications for the design of oligonucleotides for recombineering, including significant cost savings, especially for high-throughput projects that use large quantities of primers.

Matched MeSH terms: Sequence Homology, Nucleic Acid*
A pattern matching approach for the estimation of alignment between any two given DNA sequences

Basu K, Sriraam N, Richard RJ

J Med Syst, 2007 Aug;31(4):247-53.
PMID: 17685148

For a given DNA sequence, it is well known that pair wise alignment schemes are used to determine the similarity with the DNA sequences available in the databanks. The efficiency of the alignment decides the type of amino acids and its corresponding proteins. In order to evaluate the given DNA sequence for its proteomic identity, a pattern matching approach is proposed in this paper. A block based semi-global alignment scheme is introduced to determine the similarity between the DNA sequences (known and given). The two DNA sequences are divided into blocks of equal length and alignment is performed which minimizes the computational complexity. The efficiency of the alignment scheme is evaluated using the parameter, percentage of similarity (POS). Four essential DNA version of the amino acids that emphasize the importance of proteomic functionalities are chosen as patterns and matching is performed with the known and given DNA sequences to determine the similarity between them. The ratio of amino acid counts between the two sequences is estimated and the results are compared with that of the POS value. It is found from the experimental results that higher the POS value and the pattern matching higher are the similarity between the two DNA sequences. The optimal block is also identified based on the POS value and amino acids count.

Matched MeSH terms: Sequence Homology, Nucleic Acid*
Fulltext Isolation and Characterisation of PRSV-P Resistance Genes in Carica and Vasconcellea

Razean Haireen MR, Drew RA

Int J Genomics, 2014;2014:145403.
PMID: 25184131 DOI: 10.1155/2014/145403

Papaya (Carica papaya L.) is one of the major tropical fruit crops worldwide, but it is limited throughout its range by papaya ringspot virus type P (PRSV-P). Previous genetic studies identified a functional PRSV-P resistance marker in a mapping population of F2 plants of Vasconcellea pubescens (resistant to PRSV-P) × Vasconcellea parviflora (susceptible to PRSV-P) and showed that the marker exhibited homology to a serine threonine protein kinase (STK) gene. Full length cDNAs of putative PRSV-P resistance genes designated CP_STK from C. papaya and VP_STK1 and VP_STK2 from V. pubescens were cloned by rapid amplification of cDNA ends (RACE). Due to a frame-shift mutation, the two homologous sequences are transcribed and edited differently such that the gene product in V. pubescens is two separate transcripts, whereas in C. papaya they are fused into a single message. A peroxisomal targeting signal (PTS2) present in VP_STK2 but absent in the other transcripts may be the functional source of PRSV resistance in V. pubescens. The STK gene from V. pubescens may have been derived from an alternative splicing to confer resistance. The putative resistance gene, VP_STK2, that was identified in this study is a potential new source of PRSV-P resistance for papaya genotypes.

Matched MeSH terms: Sequence Homology
Fulltext StaphyloBase: a specialized genomic resource for the staphylococcal research community

Heydari H, Mutha NV, Mahmud MI, Siow CC, Wee WY, Wong GJ, et al.

Database (Oxford), 2014;2014:bau010.
PMID: 24578355 DOI: 10.1093/database/bau010

With the advent of high-throughput sequencing technologies, many staphylococcal genomes have been sequenced. Comparative analysis of these strains will provide better understanding of their biology, phylogeny, virulence and taxonomy, which may contribute to better management of diseases caused by staphylococcal pathogens. We developed StaphyloBase with the goal of having a one-stop genomic resource platform for the scientific community to access, retrieve, download, browse, search, visualize and analyse the staphylococcal genomic data and annotations. We anticipate this resource platform will facilitate the analysis of staphylococcal genomic data, particularly in comparative analyses. StaphyloBase currently has a collection of 754 032 protein-coding sequences (CDSs), 19 258 rRNAs and 15 965 tRNAs from 292 genomes of different staphylococcal species. Information about these features is also included, such as putative functions, subcellular localizations and gene/protein sequences. Our web implementation supports diverse query types and the exploration of CDS- and RNA-type information in detail using an AJAX-based real-time search system. JBrowse has also been incorporated to allow rapid and seamless browsing of staphylococcal genomes. The Pairwise Genome Comparison tool is designed for comparative genomic analysis, for example, to reveal the relationships between two user-defined staphylococcal genomes. A newly designed Pathogenomics Profiling Tool (PathoProT) is also included in this platform to facilitate comparative pathogenomics analysis of staphylococcal strains. In conclusion, StaphyloBase offers access to a range of staphylococcal genomic resources as well as analysis tools for comparative analyses. Database URL: http://staphylococcus.um.edu.my/.

Matched MeSH terms: Sequence Homology, Nucleic Acid
Fulltext Distinct Biological Potential of Streptococcus gordonii and Streptococcus sanguinis Revealed by Comparative Genome Analysis

Zheng W, Tan MF, Old LA, Paterson IC, Jakubovics NS, Choo SW

Sci Rep, 2017 06 07;7(1):2949.
PMID: 28592797 DOI: 10.1038/s41598-017-02399-4

Streptococcus gordonii and Streptococcus sanguinis are pioneer colonizers of dental plaque and important agents of bacterial infective endocarditis (IE). To gain a greater understanding of these two closely related species, we performed comparative analyses on 14 new S. gordonii and 5 S. sanguinis strains using various bioinformatics approaches. We revealed S. gordonii and S. sanguinis harbor open pan-genomes and share generally high sequence homology and number of core genes including virulence genes. However, we observed subtle differences in genomic islands and prophages between the species. Comparative pathogenomics analysis identified S. sanguinis strains have genes encoding IgA proteases, mitogenic factor deoxyribonucleases, nickel/cobalt uptake and cobalamin biosynthesis. On the contrary, genomic islands of S. gordonii strains contain additional copies of comCDE quorum-sensing system components involved in genetic competence. Two distinct polysaccharide locus architectures were identified, one of which was exclusively present in S. gordonii strains. The first evidence of genes encoding the CylA and CylB system by the α-haemolytic S. gordonii is presented. This study provides new insights into the genetic distinctions between S. gordonii and S. sanguinis, which yields understanding of tooth surfaces colonization and contributions to dental plaque formation, as well as their potential roles in the pathogenesis of IE.

Matched MeSH terms: Sequence Homology
Unraveling the nuclear and chloroplast genomes of an agar producing red macroalga, Gracilaria changii (Rhodophyta, Gracilariales)

Ho CL, Lee WK, Lim EL

Genomics, 2018 03;110(2):124-133.
PMID: 28890206 DOI: 10.1016/j.ygeno.2017.09.003

Agar and agarose have wide applications in food and pharmaceutical industries. Knowledge on the genome of red seaweeds that produce them is still lacking. To fill the gap in genome analyses of these red algae, we have sequenced the nuclear and organellar genomes of an agarophyte, Gracilaria changii. The partial nuclear genome sequence of G. changii has a total length of 35.8Mb with 10,912 predicted protein coding sequences. Only 39.4% predicted proteins were found to have significant matches to protein sequences in SwissProt. The chloroplast genome of G. changii is 183,855bp with a total of 201 open reading frames (ORFs), 29 tRNAs and 3 rRNAs predicted. Five genes: ssrA, leuC and leuD CP76_p173 (orf139) and pbsA were absent in the chloroplast genome of G. changii. The genome information is valuable in accelerating functional studies of individual genes and resolving evolutionary relationship of red seaweeds.

Matched MeSH terms: Sequence Homology
Fulltext Detection of Naegleria species in environmental samples from Peninsular Malaysia

Ithoi I, Ahmad AF, Nissapatorn V, Lau YL, Mahmud R, Mak JW

PLoS One, 2011;6(9):e24327.
PMID: 21915311 DOI: 10.1371/journal.pone.0024327

BACKGROUND: In Malaysia, researchers and medical practitioners are unfamiliar with Naegleria infections. Thus little is known about the existence of pathogenic Naegleria fowleri, and the resultant primary amoebic meningoencephalitis (PAM) is seldom included in the differential diagnosis of central nervous system infections. This study was conducted to detect the presence of Naegleria species in various environmental samples.
METHODS/FINDINGS: A total of 41 Naegleria-like isolates were isolated from water and dust samples. All these isolates were subjected to PCR using two primer sets designed from the ITS1-ITS2 regions. The N. fowleri species-specific primer set failed to produce the expected amplicon. The Naegleria genus-specific primers produced amplicons of 408 bp (35), 450 bp (2), 457 bp (2) or 381 bp (2) from all 41 isolates isolated from aquatic (33) and dust (8) samples. Analysis of the sequences from 10 representative isolates revealed that amplicons with fragments 408, 450 and 457 bp showed homology with non-pathogenic Naegleria species, and 381 bp showed homology with Vahlkampfia species. These results concurred with the morphological observation that all 39 isolates which exhibited flagella were Naegleria, while 2 isolates (AC7, JN034055 and AC8, JN034056) that did not exhibit flagella were Vahlkampfia species.
CONCLUSION: To date, pathogenic species of N. fowleri have not been isolated from Malaysia. All 39 isolates that produced amplicons (408, 450 and 457 bp) from the genus-specific primers were identified as being similar to nonpathogenic Naegleria. Amplicon 408 bp from 5 representative isolates showed 100% and 99.7% identity to Naegleria philippinensis isolate RJTM (AM167890) and is thus believed to be the most common species in our environment. Amplicons 450 bp and 457 bp were respectively believed to be from 2 new species of Naegleria, since representative isolates showed lower homology and had a longer base pair length when compared to the reference species in the Genbank, Naegleria schusteri (AJ566626) and Naegleria laresi (AJ566630), respectively.

Matched MeSH terms: Sequence Homology, Nucleic Acid
Molecular characterization of two distinct begomoviruses from Papaya in China

Wang X, Xie Y, Zhou X

Virus Genes, 2004 Dec;29(3):303-9.
PMID: 15550769

Six papaya samples showing downward leaf curling were collected in Guangdong and Guangxi provinces, China. The result of TAS-ELISA showed they were all infected by geminiviruses. Comparison of partial DNA-A sequences reveals that these virus isolates can be classified into two groups. Group I includes isolates G2, G4, G5, G28 and G29 from Guangxi province, while isolate GD2 from Guangdong province belongs to Group II. The complete DNA-A sequence of G2 and GD2 were characterized. Sequence comparisons showed that the DNA-A of G2 and GD2 were most closely related to that of Ageratum yellow vein China virus- [Hn2] and Ageratum yellow vein virus , respectively, with 83.4 and 75.2% nucleotide sequence identity, while DNA-A sequence between G2 and GD2 had only 73.4% sequence identity. The molecular data suggests that G2 and GD2 are two distinct begomoviruses, for which the name Papaya leaf curl China virus (PaLCuCNV) for G2 and Papaya leaf curl Guangdong virus (PaLCuGDV) for GD2 are proposed. Comparison of individual encoded proteins showed the coat protein of G2 and GD2 shared highest amino acid sequence identity (97.7 and 94.2%, respectively) with that of Pepper leaf curl virus -[Malaysia] (PepLCV-[MY]), suggesting the CP of these viruses may have identical ancestor.

Matched MeSH terms: Sequence Homology, Nucleic Acid; Sequence Homology, Amino Acid
Genetic study of Japanese encephalitis viruses isolated in Malaysia

Tsuchie H, Oda K, Vythilingam I, Thayan R, Vijayamalar B, Sinniah M, et al.

Jpn. J. Med. Sci. Biol., 1994 Apr;47(2):101-7.
PMID: 7853748

Two hundred and forty nucleotides from the pre-M gene region of 10 Japanese encephalitis (JE) virus strains isolated in Malaysia in 1992 were sequenced and compared with the other JE virus strains from different geographic areas in Asia. Our JE virus strains belong to the largest genotypic group that includes strains isolated in temperate regions such as Japan, China, and Taiwan. Our Malaysian JE virus strains differed in 32 nucleotides (13.3%) from WTP/70/22 strain isolated from Malaysia in 1970, which belonged to another distinct genotypic group.

Matched MeSH terms: Sequence Homology, Nucleic Acid; Sequence Homology, Amino Acid
Fulltext The Plasmodium falciparum male gametocyte protein P230p, a paralog of P230, is vital for ookinete formation and mosquito transmission

Marin-Mogollon C, van de Vegte-Bolmer M, van Gemert GJ, van Pul FJA, Ramesar J, Othman AS, et al.

Sci Rep, 2018 10 08;8(1):14902.
PMID: 30297725 DOI: 10.1038/s41598-018-33236-x

Two members of 6-cysteine (6-cys) protein family, P48/45 and P230, are important for gamete fertility in rodent and human malaria parasites and are leading transmission blocking vaccine antigens. Rodent and human parasites encode a paralog of P230, called P230p. While P230 is expressed in male and female parasites, P230p is expressed only in male gametocytes and gametes. In rodent malaria parasites this protein is dispensable throughout the complete life-cycle; however, its function in P. falciparum is unknown. Using CRISPR/Cas9 methodology we disrupted the gene encoding Pfp230p resulting in P. falciparum mutants (PfΔp230p) lacking P230p expression. The PfΔp230p mutants produced normal numbers of male and female gametocytes, which retained expression of P48/45 and P230. Upon activation male PfΔp230p gametocytes undergo exflagellation and form male gametes. However, male gametes are unable to attach to red blood cells resulting in the absence of characteristic exflagellation centres in vitro. In the absence of P230p, zygote formation as well as oocyst and sporozoite development were strongly reduced (>98%) in mosquitoes. These observations demonstrate that P230p, like P230 and P48/45, has a vital role in P. falciparum male fertility and zygote formation and warrants further investigation as a potential transmission blocking vaccine candidate.

Matched MeSH terms: Sequence Homology, Amino Acid*
Phylogeny and characterization of three nifH-homologous genes from Paenibacillus azotofixans

Choo QC, Samian MR, Najimudin N

Appl Environ Microbiol, 2003 Jun;69(6):3658-62.
PMID: 12788777

In this paper, we report the cloning and characterization of three Paenibacillus azotofixans DNA regions containing genes involved in nitrogen fixation. Sequencing analysis revealed the presence of nifB1H1D1K1 gene organization in the 4,607-bp SacI DNA fragment. This is the first report of linkage of a nifB open reading frame upstream of the structural nif genes. The second (nifB2H2) and third (nifH3) nif homologues are confined within the 6,350-bp HindIII and 2,840-bp EcoRI DNA fragments, respectively. Phylogenetic analysis demonstrated that NifH1 and NifH2 form a monophyletic group among cyanobacterial NifH proteins. NifH3, on the other hand, clusters among NifH proteins of the highly divergent methanogenic archaea.

Matched MeSH terms: Sequence Homology, Amino Acid*
Molecular epidemiology and evolution of enterovirus 71 strains isolated from 1970 to 1998

Brown BA, Oberste MS, Alexander JP, Kennett ML, Pallansch MA

J Virol, 1999 Dec;73(12):9969-75.
PMID: 10559310

Enterovirus 71 (EV71) (genus Enterovirus, family Picornaviridae), a common cause of hand, foot, and mouth disease (HFMD), may also cause severe neurological diseases, such as encephalitis and poliomyelitis-like paralysis. To examine the genetic diversity and rate of evolution of EV71, we have determined and analyzed complete VP1 sequences (891 nucleotides) for 113 EV71 strains isolated in the United States and five other countries from 1970 to 1998. Nucleotide sequence comparisons demonstrated three distinct EV71 genotypes, designated A, B, and C. The genetic variation within genotypes (12% or fewer nucleotide differences) was less than the variation between genotypes (16.5 to 19.7%). Strains of all three genotypes were at least 94% identical to one another in deduced amino acid sequence. The EV71 prototype strain, BrCr-CA-70, isolated in California in 1970, is the sole member of genotype A. Strains isolated in the United States and Australia during the period from 1972 to 1988, a 1994 Colombian isolate, and isolates from a large HFMD outbreak in Malaysia in 1997 are all members of genotype B. Although strains of genotype B continue to circulate in other parts of the world, none have been isolated in the United States since 1988. Genotype C contains strains isolated in 1985 or later in the United States, Canada, Australia, and the Republic of China. The annual rate of evolution within both the B and C genotypes was estimated to be approximately 1.35 x 10(-2) substitutions per nucleotide and is similar to the rate observed for poliovirus. The results indicate that EV71 is a genetically diverse, rapidly evolving virus. Its worldwide circulation and potential to cause severe disease underscore the need for additional surveillance and improved methods to identify EV71 in human disease.

Matched MeSH terms: Sequence Homology, Nucleic Acid; Sequence Homology, Amino Acid
Type-3 dengue viruses responsible for the dengue epidemic in Malaysia during 1993-1994

Kobayashi N, Thayan R, Sugimoto C, Oda K, Saat Z, Vijayamalar B, et al.

Am J Trop Med Hyg, 1999 Jun;60(6):904-9.
PMID: 10403318

To characterize the dengue epidemic that recently occurred in Malaysia, we sequenced cDNAs from nine 1993-1994 dengue virus type-3 (DEN-3) isolates in Malaysia (DEN-3 was the most common type in Malaysia during this period). Nucleic acid sequences (720 nucleotides in length) from the nine isolates, encompassing the precursor of membrane protein (preM) and membrane (M) protein genes and part of the envelope (E) protein gene were aligned with various reference DEN-3 sequences to generate a neighbor-joining phylogenetic tree. According to the constructed tree, the nine Malaysian isolates were grouped into subtype II, which comprises Thai isolates from 1962 to 1987. Five earlier DEN-3 virus Malaysian isolates from 1974 to 1981 belonged to subtype I. The present data indicate that the recent dengue epidemic in Malaysia was due to the introduction of DEN-3 viruses previously endemic to Thailand.

Matched MeSH terms: Sequence Homology, Nucleic Acid; Sequence Homology, Amino Acid
Six isoforms of cardiotoxin in malayan spitting cobra (Naja naja sputatrix) venom: cloning and characterization of cDNAs

Jeyaseelan K, Armugam A, Lachumanan R, Tan CH, Tan NH

Biochim. Biophys. Acta, 1998 Apr 10;1380(2):209-22.
PMID: 9565688

Cardiotoxins are the most abundant toxin components of cobra venom. Although many cardiotoxins have been purified and characterized by amino acid sequencing and other pharmacological and biochemical studies, to date only five cardiotoxin cDNAs from Taiwan cobra (Naja naja atra), three cDNAs from Chinese cobra (Naja atra) and two more of uncertain origin (either Chinese or Taiwan cobra) have been reported. In this paper we show the existence of four isoforms of cardiotoxin by protein analysis and nine cDNA sequences encoding six isoforms of cardiotoxins (CTX 1-3, 4a, 4b and 5) from N. n. sputatrix by cDNA cloning. This forms the first report on the cloning and characterization of several cardiotoxin genes from a single species of a spitting cobra. The cDNAs encoding these isoforms, obtained by reverse transcription-polymerase chain reaction (RT-PCR), were subsequently expressed in Escherichia coli. The native and recombinant cardiotoxins were first characterized by Western blotting and N-terminal protein sequencing. These proteins were also found to have different levels of cytolytic activity on cultured baby hamster kidney cells. Four of the isoforms (CTX 1, 2, 4 and 5) are unique to N. n. sputatrix, with CTX 2 being the most abundant species constituting about 50% of the total cardiotoxins. The isoform CTX 3 (20% constitution) is highly homologous to the cardiotoxins of N. n. atra and N. n. naja, indicating that it may be universally present in all Naja naja subspecies. Our studies suggest that the most hydrophilic isoform (CTX 5) could have evolved first followed by the hydrophobic isoforms (CTX 1, 2, 3 and 4). We also speculate that Asiatic cobras could be the modern descendants of the African and Egyptian counterparts.

Matched MeSH terms: Sequence Homology, Nucleic Acid; Sequence Homology, Amino Acid
A molecular epidemiological study targeting the glycoprotein gene of rabies virus isolates from China

Meng SL, Yan JX, Xu GL, Nadin-Davis SA, Ming PG, Liu SY, et al.

Virus Res, 2007 Mar;124(1-2):125-38.
PMID: 17129631

A group of 31 rabies viruses (RABVs), recovered primarily from dogs, one deer and one human case, were collected from various areas in China between 1989 and 2006. Complete G gene sequences determined for these isolates indicated identities of nucleotide and amino acid sequences of >or=87% and 93.8%, respectively. Phylogenetic analysis of these and some additional Chinese isolates clearly supported the placement of all Chinese viruses in Lyssavirus genotype 1 and divided all Chinese isolates between four distinct groups (I-IV). Several variants identified within the most commonly encountered group I were distributed according to their geographical origins. A comparison of representative Chinese viruses with other isolates retrieved world-wide indicated a close evolutionary relationship between China group I and II viruses and those of Indonesia while China group III viruses formed an outlying branch to variants from Malaysia and Thailand. China group IV viruses were closely related to several vaccine strains. The predicted glycoprotein sequences of these RABVs variants are presented and discussed with respect to the utility of the anti-rabies biologicals currently employed in China.

Matched MeSH terms: Sequence Homology, Nucleic Acid; Sequence Homology, Amino Acid
Identifying SARS-CoV-2-related coronaviruses in Malayan pangolins

Lam TT, Jia N, Zhang YW, Shum MH, Jiang JF, Zhu HC, et al.

Nature, 2020 07;583(7815):282-285.
PMID: 32218527 DOI: 10.1038/s41586-020-2169-0

The ongoing outbreak of viral pneumonia in China and across the world is associated with a new coronavirus, SARS-CoV-21. This outbreak has been tentatively associated with a seafood market in Wuhan, China, where the sale of wild animals may be the source of zoonotic infection2. Although bats are probable reservoir hosts for SARS-CoV-2, the identity of any intermediate host that may have facilitated transfer to humans is unknown. Here we report the identification of SARS-CoV-2-related coronaviruses in Malayan pangolins (Manis javanica) seized in anti-smuggling operations in southern China. Metagenomic sequencing identified pangolin-associated coronaviruses that belong to two sub-lineages of SARS-CoV-2-related coronaviruses, including one that exhibits strong similarity in the receptor-binding domain to SARS-CoV-2. The discovery of multiple lineages of pangolin coronavirus and their similarity to SARS-CoV-2 suggests that pangolins should be considered as possible hosts in the emergence of new coronaviruses and should be removed from wet markets to prevent zoonotic transmission.

Matched MeSH terms: Sequence Homology, Nucleic Acid*
Fulltext Genetic assemblage of Sarcocystis spp. in Malaysian snakes

Lau YL, Chang PY, Subramaniam V, Ng YH, Mahmud R, Ahmad AF, et al.

Parasit Vectors, 2013 Sep 09;6(1):257.
PMID: 24010903 DOI: 10.1186/1756-3305-6-257

BACKGROUND: Sarcocystis species are protozoan parasites with a wide host range including snakes. Although there were several reports of Sarcocytis species in snakes, their distribution and prevalence are still not fully explored.
METHODS: In this study, fecal specimens of several snake species in Malaysia were examined for the presence of Sarcocystis by PCR of 18S rDNA sequence. Microscopy examination of the fecal specimens for sporocysts was not carried as it was difficult to determine the species of the infecting Sarcocystis.
RESULTS: Of the 28 snake fecal specimens, 7 were positive by PCR. BLASTn and phylogenetic analyses of the amplified 18S rDNA sequences revealed the snakes were infected with either S. nesbitti, S. singaporensis, S. zuoi or undefined Sarcocystis species.
CONCLUSION: This study is the first to report Sarcocystis infection in a cobra, and S. nesbitti in a reticulated python.

Matched MeSH terms: Sequence Homology
Novel complex re-arrangement of ARG1 commonly shared by unrelated patients with hyperargininemia

Mohseni J, Boon Hock C, Abdul Razak C, Othman SN, Hayati F, Peitee WO, et al.

Gene, 2014 Jan 1;533(1):240-5.
PMID: 24103480 DOI: 10.1016/j.gene.2013.09.081

Hyperargininemia is a very rare progressive neurometabolic disorder caused by deficiency of hepatic cytosolic arginase I, resulting from mutations in the ARG1 gene. Until now, some mutations were reported worldwide and none of them were of Southeast Asian origins. Furthermore, most reported mutations were point mutations and a few others deletions or insertions.

Matched MeSH terms: Sequence Homology, Nucleic Acid
[Analysis of genetic features of influenza B virus in Hunan province from 2007 to 2010]

Liu YZ, Zhao X, Huang YW, Chen Z, Li FC, Gao LD, et al.

Zhonghua Yu Fang Yi Xue Za Zhi, 2012 Mar;46(3):258-63.
PMID: 22800599

To investigate the gene variations of influenza B virus isolated in Hunan province from 2007 to 2010.

Matched MeSH terms: Sequence Homology
Remote protein homology detection and fold recognition using two-layer support vector machine classifiers

Muda HM, Saad P, Othman RM

Comput Biol Med, 2011 Aug;41(8):687-99.
PMID: 21704312 DOI: 10.1016/j.compbiomed.2011.06.004

Remote protein homology detection and fold recognition refer to detection of structural homology in proteins where there are small or no similarities in the sequence. To detect protein structural classes from protein primary sequence information, homology-based methods have been developed, which can be divided to three types: discriminative classifiers, generative models for protein families and pairwise sequence comparisons. Support Vector Machines (SVM) and Neural Networks (NN) are two popular discriminative methods. Recent studies have shown that SVM has fast speed during training, more accurate and efficient compared to NN. We present a comprehensive method based on two-layer classifiers. The 1st layer is used to detect up to superfamily and family in SCOP hierarchy using optimized binary SVM classification rules. It used the kernel function known as the Bio-kernel, which incorporates the biological information in the classification process. The 2nd layer uses discriminative SVM algorithm with string kernel that will detect up to protein fold level in SCOP hierarchy. The results obtained were evaluated using mean ROC and mean MRFP and the significance of the result produced with pairwise t-test was tested. Experimental results show that our approaches significantly improve the performance of remote protein homology detection and fold recognition for all three different version SCOP datasets (1.53, 1.67 and 1.73). We achieved 4.19% improvements in term of mean ROC in SCOP 1.53, 4.75% in SCOP 1.67 and 4.03% in SCOP 1.73 datasets when compared to the result produced by well-known methods. The combination of first layer and second layer of BioSVM-2L performs well in remote homology detection and fold recognition even in three different versions of datasets.

Matched MeSH terms: Sequence Homology, Amino Acid

Filters

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links