Affiliations 

  • 1 Faculty of Science and Information Technology, Al-Zaytoonah University of Jordan, Amman, Jordan
  • 2 School of Computer Sciences, Universiti Sains Malaysia, Penang, Malaysia
Int J Bioinform Res Appl, 2014;10(3):321-40.
PMID: 24794073 DOI: 10.1504/IJBRA.2014.060765

Abstract

In recent times, the size of biological databases has increased significantly, with the continuous growth in the number of users and rate of queries; such that some databases have reached the terabyte size. There is therefore, the increasing need to access databases at the fastest rates possible. In this paper, the decision tree indexing model (PDTIM) was parallelised, using a hybrid of distributed and shared memory on resident database; with horizontal and vertical growth through Message Passing Interface (MPI) and POSIX Thread (PThread), to accelerate the index building time. The PDTIM was implemented using 1, 2, 4 and 5 processors on 1, 2, 3 and 4 threads respectively. The results show that the hybrid technique improved the speedup, compared to a sequential version. It could be concluded from results that the proposed PDTIM is appropriate for large data sets, in terms of index building time.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.