Displaying publications 1 - 20 of 1099 in total

Abstract:
Sort:
  1. Haigh AL, Gibernau M, Maurin O, Bailey P, Carlsen MM, Hay A, et al.
    Am J Bot, 2023 Feb;110(2):e16117.
    PMID: 36480380 DOI: 10.1002/ajb2.16117
    PREMISE: Recent phylogenetic studies of the Araceae have confirmed the position of the duckweeds nested within the aroids, and the monophyly of a clade containing all the unisexual flowered aroids plus the bisexual-flowered Calla palustris. The main objective of the present study was to better resolve the deep phylogenetic relationships among the main lineages within the family, particularly the relationships between the eight currently recognized subfamilies. We also aimed to confirm the phylogenetic position of the enigmatic genus Calla in relation to the long-debated evolutionary transition between bisexual and unisexual flowers in the family.

    METHODS: Nuclear DNA sequence data were generated for 128 species across 111 genera (78%) of Araceae using target sequence capture and the Angiosperms 353 universal probe set.

    RESULTS: The phylogenomic data confirmed the monophyly of the eight Araceae subfamilies, but the phylogenetic position of subfamily Lasioideae remains uncertain. The genus Calla is included in subfamily Aroideae, which has also been expanded to include Zamioculcadoideae. The tribe Aglaonemateae is newly defined to include the genera Aglaonema and Boycea.

    CONCLUSIONS: Our results strongly suggest that new research on African genera (Callopsis, Nephthytis, and Anubias) and Calla will be important for understanding the early evolution of the Aroideae. Also of particular interest are the phylogenetic positions of the isolated genera Montrichardia, Zantedeschia, and Anchomanes, which remain only moderately supported here.

    Matched MeSH terms: Sequence Analysis, DNA
  2. Li H, Khang TF
    PeerJ, 2023;11:e16126.
    PMID: 37790621 DOI: 10.7717/peerj.16126
    BACKGROUND: Pathological conditions may result in certain genes having expression variance that differs markedly from that of the control. Finding such genes from gene expression data can provide invaluable candidates for therapeutic intervention. Under the dominant paradigm for modeling RNA-Seq gene counts using the negative binomial model, tests of differential variability are challenging to develop, owing to dependence of the variance on the mean.

    METHODS: Here, we describe clrDV, a statistical method for detecting genes that show differential variability between two populations. We present the skew-normal distribution for modeling gene-wise null distribution of centered log-ratio transformation of compositional RNA-seq data.

    RESULTS: Simulation results show that clrDV has false discovery rate and probability of Type II error that are on par with or superior to existing methodologies. In addition, its run time is faster than its closest competitors, and remains relatively constant for increasing sample size per group. Analysis of a large neurodegenerative disease RNA-Seq dataset using clrDV successfully recovers multiple gene candidates that have been reported to be associated with Alzheimer's disease.

    Matched MeSH terms: Sequence Analysis, RNA/methods
  3. Choi JR, Tang R, Wang S, Wan Abas WA, Pingguan-Murphy B, Xu F
    Biosens Bioelectron, 2015 Dec 15;74:427-39.
    PMID: 26164488 DOI: 10.1016/j.bios.2015.06.065
    Nucleic acid testing (NAT), as a molecular diagnostic technique, including nucleic acid extraction, amplification and detection, plays a fundamental role in medical diagnosis for timely medical treatment. However, current NAT technologies require relatively high-end instrumentation, skilled personnel, and are time-consuming. These drawbacks mean conventional NAT becomes impractical in many resource-limited disease-endemic settings, leading to an urgent need to develop a fast and portable NAT diagnostic tool. Paper-based devices are typically robust, cost-effective and user-friendly, holding a great potential for NAT at the point of care. In view of the escalating demand for the low cost diagnostic devices, we highlight the beneficial use of paper as a platform for NAT, the current state of its development, and the existing challenges preventing its widespread use. We suggest a strategy involving integrating all three steps of NAT into one single paper-based sample-to-answer diagnostic device for rapid medical diagnostics in the near future.
    Matched MeSH terms: Sequence Analysis, DNA/instrumentation*; Sequence Analysis, RNA/instrumentation*; Oligonucleotide Array Sequence Analysis/instrumentation*
  4. Shardiwal RK, Sohrab SS
    Int J Bioinform Res Appl, 2010;6(3):223-9.
    PMID: 20615831
    Relative Synonymous Codon Usage (RSCU) and Relative Adaptiveness of a Codon (RAC) table bias importance in gene expression are well documented in the literature. However, to improve the gene expression we need to figure out which codons are optimal for the expression in order to synthesise an appropriate DNA sequence. An alternative to the manual approach, which is obviously a tedious task, is to set up software on your computer to perform this. Though such kinds of programs are available on the internet, none of them are open-source libraries. Here, one can use our Perl program to do his or her task more easily and efficiently. It is free for everyone.
    Matched MeSH terms: Sequence Analysis, DNA/methods
  5. Kurazono H, Yamasaki S, Ratchtrachenchai O, Nair GB, Takeda Y
    Microbiol. Immunol., 1996;40(4):303-5.
    PMID: 8709866
    Vibrio cholerae O139 isolated from different countries, as well as from different locations within a country, were examined using macrorestriction DNA analysis to determine the clonality of the O139 strains. NotI digests of genomic DNA of representative strains from Nepal, India, Bangladesh, China, Thailand, and Malaysia revealed very similar but not identical patterns. Examinations of the banding patterns generated by pulsed-field gel electrophoresis of strains isolated within countries revealed complete homogeneity. These results further reiterate the spread of an identical clone of V. cholerae O139 although it appears that genetic polymorphism among the O139 strains is becoming apparent.
    Matched MeSH terms: Sequence Analysis/methods
  6. Abramov AV, Bannikova AA, Lebedev VS, Rozhnov VV
    Zootaxa, 2017 Feb 15;4232(2):zootaxa.4232.2.5.
    PMID: 28264392 DOI: 10.11646/zootaxa.4232.2.5
    We analyzed the complete mitochondrial cytochrome b (cytb) gene and fragments of four nuclear loci: ApoB, RAG2, IRBP1 and BRCA1. These data allowed us to provide new insights into the diversity of the Asiatic water shrews of Indochina. A new, highly divergent genetic lineage of Chimarrogale was found in southern Vietnam, and this lineage included specimens from the provinces of Kon Tum, Dak Lak, and Lam Dong. Such finding represents the newest and southernmost records of Chimarrogale in Indochina. Morphological analysis classified the specimens from southern Vietnam as C. varennei proper, which is restricted to that region, whereas the polymorphic C. himalayica, which contained at least four cytochrome b haplogroups, occurred in central and northern Vietnam and southern China. This distinct C. varennei lineage closely related to the C. platycephalus + C. leander clade suggests the existence of an unknown glacial refuge in Tay Nguyen Plateau, southern Vietnam. Because the Bornean C. phaeura (i) was sister-group of the rest of Chimarrogale sensu lato and (ii) had a high genetic divergence (~15% for cytochrome b) and geographical isolation, we suggest that C. phaeura be placed into a separate genus, Crossogale Thomas, 1921. This genus should also include C. sumatrana (Sumatra) and C. hantu (Peninsular Malaysia). On those grounds, we propose a new classification system for Asiatic water shrews.
    Matched MeSH terms: Sequence Analysis, DNA*
  7. Huang Z, Wang J, Lu X, Mohd Zain A, Yu G
    Brief Bioinform, 2023 Mar 19;24(2).
    PMID: 36733262 DOI: 10.1093/bib/bbad040
    Single-cell RNA sequencing (scRNA-seq) data are typically with a large number of missing values, which often results in the loss of critical gene signaling information and seriously limit the downstream analysis. Deep learning-based imputation methods often can better handle scRNA-seq data than shallow ones, but most of them do not consider the inherent relations between genes, and the expression of a gene is often regulated by other genes. Therefore, it is essential to impute scRNA-seq data by considering the regional gene-to-gene relations. We propose a novel model (named scGGAN) to impute scRNA-seq data that learns the gene-to-gene relations by Graph Convolutional Networks (GCN) and global scRNA-seq data distribution by Generative Adversarial Networks (GAN). scGGAN first leverages single-cell and bulk genomics data to explore inherent relations between genes and builds a more compact gene relation network to jointly capture the homogeneous and heterogeneous information. Then, it constructs a GCN-based GAN model to integrate the scRNA-seq, gene sequencing data and gene relation network for generating scRNA-seq data, and trains the model through adversarial learning. Finally, it utilizes data generated by the trained GCN-based GAN model to impute scRNA-seq data. Experiments on simulated and real scRNA-seq datasets show that scGGAN can effectively identify dropout events, recover the biologically meaningful expressions, determine subcellular states and types, improve the differential expression analysis and temporal dynamics analysis. Ablation experiments confirm that both the gene relation network and gene sequence data help the imputation of scRNA-seq data.
    Matched MeSH terms: Sequence Analysis, RNA/methods
  8. Najam M, Rasool RU, Ahmad HF, Ashraf U, Malik AW
    Biomed Res Int, 2019;2019:7074387.
    PMID: 31111064 DOI: 10.1155/2019/7074387
    Storing and processing of large DNA sequences has always been a major problem due to increasing volume of DNA sequence data. However, a number of solutions have been proposed but they require significant computation and memory. Therefore, an efficient storage and pattern matching solution is required for DNA sequencing data. Bloom filters (BFs) represent an efficient data structure, which is mostly used in the domain of bioinformatics for classification of DNA sequences. In this paper, we explore more dimensions where BFs can be used other than classification. A proposed solution is based on Multiple Bloom Filters (MBFs) that finds all the locations and number of repetitions of the specified pattern inside a DNA sequence. Both of these factors are extremely important in determining the type and intensity of any disease. This paper serves as a first effort towards optimizing the search for location and frequency of substrings in DNA sequences using MBFs. We expect that further optimizations in the proposed solution can bring remarkable results as this paper presents a proof of concept implementation for a given set of data using proposed MBFs technique. Performance evaluation shows improved accuracy and time efficiency of the proposed approach.
    Matched MeSH terms: Sequence Analysis, DNA/methods*
  9. Sharko F, Rbbani G, Siriyappagouder P, Raeymaekers JAM, Galindo-Villegas J, Nedoluzhko A, et al.
    BMC Bioinformatics, 2023 May 19;24(1):205.
    PMID: 37208611 DOI: 10.1186/s12859-023-05331-y
    BACKGROUND: Circular RNAs (circRNAs) are covalently closed-loop RNAs with critical regulatory roles in cells. Tens of thousands of circRNAs have been unveiled due to the recent advances in high throughput RNA sequencing technologies and bioinformatic tools development. At the same time, polymerase chain reaction (PCR) cross-validation for circRNAs predicted by bioinformatic tools remains an essential part of any circRNA study before publication.

    RESULTS: Here, we present the CircPrime web-based platform, providing a user-friendly solution for DNA primer design and thermocycling conditions for circRNA identification with routine PCR methods.

    CONCLUSIONS: User-friendly CircPrime web platform ( http://circprime.elgene.net/ ) works with outputs of the most popular bioinformatic predictors of circRNAs to design specific circular RNA primers. CircPrime works with circRNA coordinates and any reference genome from the National Center for Biotechnology Information database).

    Matched MeSH terms: Sequence Analysis, RNA/methods
  10. Mohamed Yusoff A, Tan TK, Hari R, Koepfli KP, Wee WY, Antunes A, et al.
    Sci Rep, 2016 09 13;6:28199.
    PMID: 27618997 DOI: 10.1038/srep28199
    Pangolins are scale-covered mammals, containing eight endangered species. Maintaining pangolins in captivity is a significant challenge, in part because little is known about their genetics. Here we provide the first large-scale sequencing of the critically endangered Manis javanica transcriptomes from eight different organs using Illumina HiSeq technology, yielding ~75 Giga bases and 89,754 unigenes. We found some unigenes involved in the insect hormone biosynthesis pathway and also 747 lipids metabolism-related unigenes that may be insightful to understand the lipid metabolism system in pangolins. Comparative analysis between M. javanica and other mammals revealed many pangolin-specific genes significantly over-represented in stress-related processes, cell proliferation and external stimulus, probably reflecting the traits and adaptations of the analyzed pregnant female M. javanica. Our study provides an invaluable resource for future functional works that may be highly relevant for the conservation of pangolins.
    Matched MeSH terms: Sequence Analysis, DNA/methods*; Sequence Analysis, RNA/methods*
  11. MOHAMAD, O., HO, W. S.
    MyJurnal
    Sanger sequencing has been the major method in directly sequencing DNA, and has dominated the DNA sequencing market for nearly past 30 years (Varshney et al., 2009). Along with PCR, we cannot underestimate how important this technology has been to research in various elds of molecular biology. It has revolutionized genetics by allowing us to gain unprecedented insights into the
    workings of different organisms.
    Matched MeSH terms: Sequence Analysis, DNA
  12. Tan KY, Deng S, Tan TK, Hari R, Sitam FT, Othman RY, et al.
    PeerJ, 2023;11:e16002.
    PMID: 37810781 DOI: 10.7717/peerj.16002
    BACKGROUND: The Malayan pangolin (Manis javanica) is a placental mammal and is listed as Critically Endangered on the IUCN Red List of Threatened Species. Most previous attempts to breed pangolins in captivity have met with little success because of dietary issues, infections, and other complications, although a previous study reported breeding pangolins in captivity to the third generation. In our previous pangolin genome sequencing data analysis, we obtained a considerable amount of bacterial DNA from a pregnant female Malayan pangolin (named "UM3"), which was likely infected by Paraburkholderia fungorum-an agent of biodegradation and bioremediation in agriculture.

    METHODOLOGY: Here, we further confirmed and characterized this bacterial species using PCR, histological staining, whole-genome sequencing, and bioinformatics approaches. PCR assays with in-house designed primer sets and 16S universal primers showed clear positive bands in the cerebrum, cerebellum, lung, and blood of UM3 suggesting that UM3 might have developed septicaemia. Histological staining showed the presence of Gram-negative rod-shaped bacteria in the pangolin brain and lungs, indicating the colonization of the bacteria in these two organs. In addition, PCR screening of UM3's fetal tissues revealed the presence of P. fungorum in the gastrocnemius muscle, but not in other tissues that we examined. We also sequenced and reconstructed the genome of pangolin P. fungorum, which has a genome size of 7.7 Mbps.

    CONCLUSION: Our study is the first to present detailed evidence of the presence of P. fungorum in a pangolin and her fetus (although preliminary results were presented in our previous article). Here, we raise the concern that P. fungorum may potentially infect humans, especially YOPI (young, old, pregnant, and immunocompromised) people. Therefore, caution should be exercised when using this bacterial species as biodegradation or bioremediation agents in agriculture.

    Matched MeSH terms: Sequence Analysis
  13. Esmaeilpour M, Naderifar V, Shukur Z
    PLoS One, 2014;9(9):e106313.
    PMID: 25243670 DOI: 10.1371/journal.pone.0106313
    Over the last decade, design patterns have been used extensively to generate reusable solutions to frequently encountered problems in software engineering and object oriented programming. A design pattern is a repeatable software design solution that provides a template for solving various instances of a general problem.
    Matched MeSH terms: Sequence Analysis, DNA/methods*
  14. Tan MH, Gan HM, Lee YP, Austin CM
    PMID: 25103440 DOI: 10.3109/19401736.2014.945554
    The mitochondrial genome sequence of the Morton Bay bug, Thenus orientalis, is documented, which makes it the second mitogenome for species of the family Scyllaridae and the ninth for members of the superfamily Palinuroidae. Thenus orientalis has a mitogenome of 16,826 base pairs consisting of 13 protein-coding genes, 2 ribosomal subunit genes, 23 transfer RNAs, and a non-coding AT-rich region. The base composition of the T. orientalis mitogenome is 31.31% for T, 23.77% for C, 31.05% for A, and 13.87% for G, with an AT bias of 62.36%. In addition to a duplicated trnS1 and several other tRNA gene rearrangements, the mitogenome gene order has novel protein coding gene order with the nad6 and cob genes translocated as a block to a location downstream of the nad3 gene.
    Matched MeSH terms: Sequence Analysis, DNA/methods*
  15. Tan MH, Gan HM, Lee YP, Austin CM
    PMID: 25090400 DOI: 10.3109/19401736.2014.945553
    The complete mitochondrial genome of the swimming crab Thalamita crenata was obtained from a partial genome scan using the MiSeq sequencing system. The Thalamita crenata mitogenome has 15,787 base pairs (70% A+T content) made up of 13 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs, and a putative 897 bp non-coding AT-rich region. This Thalamita mitogenome sequence is the first for the genus and the eighth for the family Portunidae.
    Matched MeSH terms: Sequence Analysis, DNA/methods*
  16. Jaber KM, Abdullah R, Rashid NA
    Int J Bioinform Res Appl, 2014;10(3):321-40.
    PMID: 24794073 DOI: 10.1504/IJBRA.2014.060765
    In recent times, the size of biological databases has increased significantly, with the continuous growth in the number of users and rate of queries; such that some databases have reached the terabyte size. There is therefore, the increasing need to access databases at the fastest rates possible. In this paper, the decision tree indexing model (PDTIM) was parallelised, using a hybrid of distributed and shared memory on resident database; with horizontal and vertical growth through Message Passing Interface (MPI) and POSIX Thread (PThread), to accelerate the index building time. The PDTIM was implemented using 1, 2, 4 and 5 processors on 1, 2, 3 and 4 threads respectively. The results show that the hybrid technique improved the speedup, compared to a sequential version. It could be concluded from results that the proposed PDTIM is appropriate for large data sets, in terms of index building time.
    Matched MeSH terms: Sequence Analysis/methods*
  17. Zhao H, Kong X, Zhou C
    Mitochondrial DNA, 2014 Oct;25(5):342-4.
    PMID: 23795847 DOI: 10.3109/19401736.2013.800492
    The Pangasius sutchi is an important ornamental and economic fish in Southeast Asia e.g. Thailand, Malaysia and China. The complete mitochondrial genome sequence of P. sutchi has been sequenced, which contains 22 tRNA genes, 13 protein-coding genes, 2 rRNA genes and a non-coding control region with the total length of 16,522 bp. The gene order and composition are similar to most of other vertebrates. Just like most other vertebrates, the bias of G and C was found in different region/genes statistics results. Most of the genes are encoded on heavy strand, except for eight tRNA and ND6 genes. The mitogenome sequence of P. sutchi would contribute to better understand population genetics, evolution of this lineage.
    Matched MeSH terms: Sequence Analysis, DNA/methods*
  18. Singham GV, Vargo EL, Booth W, Othman AS, Lee CY
    Environ Entomol, 2012 Apr;41(2):426-31.
    PMID: 22507019 DOI: 10.1603/EN11228
    The fungus-growing termite, Macrotermes gilvus (Hagen), an indigenous species from Southeast Asia distributed from Myanmar to Indonesia and the Philippines, offers great potential as an ecological model system to elucidate the effects of geography on gene flow within this region. We used next generation sequencing (Roche 454 pyrosequencing) to identify microsatellite markers from the genomic DNA of M. gilvus. A modest sequencing volume generated 34,122 reads, with 1,212 (3.6%) reads contains microsatellites with di-, tri-, tetra-, penta-, and hexa-nucleotide repeat motifs. Thirty-seven loci were selected for primer development and tested for polymorphism across 22 colonies of M. gilvus. Eleven loci were found to be polymorphic with 2-4 alleles per locus. Observed and expected heterozygosities ranged between 0.091-0.727 and 0.090-0.540, respectively. Cross taxa amplification was successful across a panel of four related termite species and four multiplex groups were designed for future population genetic studies. These markers will open new avenues for the study of phylogeography and population genetics of this fungus-growing termite. This study also has effectively demonstrated the use of 454 pyrosequencing for the rapid development of informative microsatellite markers from a termite genome.
    Matched MeSH terms: Sequence Analysis, DNA/methods
  19. Thanh T, Chi VT, Abdullah MP, Omar H, Noroozi M, Ky H, et al.
    Mol Biol Rep, 2011 Jan;38(1):177-82.
    PMID: 20354903 DOI: 10.1007/s11033-010-0092-4
    Green microalga Ankistrodesmus convolutus Corda is a fast growing alga which produces appreciable amount of carotenoids and polyunsaturated fatty acids. To our knowledge, this is the first report on the construction of cDNA library and preliminary analysis of ESTs for this species. The titers of the primary and amplified cDNA libraries were 1.1×10(6) and 6.0×10(9) pfu/ml respectively. The percentage of recombinants was 97% in the primary library and a total of 337 out of 415 original cDNA clones selected randomly contained inserts ranging from 600 to 1,500 bps. A total of 201 individual ESTs with sizes ranging from 390 to 1,038 bps were then analyzed and the BLASTX score revealed that 35.8% of the sequences were classified as strong match, 38.3% as nominal and 25.9% as weak match. Among the ESTs with known putative function, 21.4% of them were found to be related to gene expression, 14.4% ESTs to photosynthesis, 10.9% ESTs to metabolism, 5.5% ESTs to miscellaneous, 2.0% to stress response, and the remaining 45.8% were classified as novel genes. Analysis of ESTs described in this paper can be an effective approach to isolate and characterize new genes from A. convolutus and thus the sequences obtained represented a significant contribution to the extensive database of sequences from green microalgae.
    Matched MeSH terms: Sequence Analysis, DNA/methods*
  20. Tan MH, Austin CM, Hammer MP, Lee YP, Croft LJ, Gan HM
    Gigascience, 2018 03 01;7(3):1-6.
    PMID: 29342277 DOI: 10.1093/gigascience/gix137
    Background: Some of the most widely recognized coral reef fishes are clownfish or anemonefish, members of the family Pomacentridae (subfamily: Amphiprioninae). They are popular aquarium species due to their bright colours, adaptability to captivity, and fascinating behavior. Their breeding biology (sequential hermaphrodites) and symbiotic mutualism with sea anemones have attracted much scientific interest. Moreover, there are some curious geographic-based phenotypes that warrant investigation. Leveraging on the advancement in Nanopore long read technology, we report the first hybrid assembly of the clown anemonefish (Amphiprion ocellaris) genome utilizing Illumina and Nanopore reads, further demonstrating the substantial impact of modest long read sequencing data sets on improving genome assembly statistics.

    Results: We generated 43 Gb of short Illumina reads and 9 Gb of long Nanopore reads, representing approximate genome coverage of 54× and 11×, respectively, based on the range of estimated k-mer-predicted genome sizes of between 791 and 967 Mbp. The final assembled genome is contained in 6404 scaffolds with an accumulated length of 880 Mb (96.3% BUSCO-calculated genome completeness). Compared with the Illumina-only assembly, the hybrid approach generated 94% fewer scaffolds with an 18-fold increase in N50 length (401 kb) and increased the genome completeness by an additional 16%. A total of 27 240 high-quality protein-coding genes were predicted from the clown anemonefish, 26 211 (96%) of which were annotated functionally with information from either sequence homology or protein signature searches.

    Conclusions: We present the first genome of any anemonefish and demonstrate the value of low coverage (∼11×) long Nanopore read sequencing in improving both genome assembly contiguity and completeness. The near-complete assembly of the A. ocellaris genome will be an invaluable molecular resource for supporting a range of genetic, genomic, and phylogenetic studies specifically for clownfish and more generally for other related fish species of the family Pomacentridae.

    Matched MeSH terms: Sequence Analysis, DNA/methods*
Filters
Contact Us

Please provide feedback to Administrator (afdal@afpm.org.my)

External Links