Affiliations 

  • 1 Faculty of Computing, Universiti Teknologi Malaysia, Malaysia. alsamet.faisal@gmail.com
  • 2 Faculty of Computing, Universiti Teknologi Malaysia, Malaysia
  • 3 Computer Science Department, Hodeidah University, Hodeidah, Yemen
Mol Inform, 2013 Jul;32(7):591-8.
PMID: 27481767 DOI: 10.1002/minf.201300004

Abstract

Many consensus clustering methods have been applied in different areas such as pattern recognition, machine learning, information theory and bioinformatics. However, few methods have been used for chemical compounds clustering. In this paper, an information theory and voting based algorithm (Adaptive Cumulative Voting-based Aggregation Algorithm A-CVAA) was examined for combining multiple clusterings of chemical structures. The effectiveness of clusterings was evaluated based on the ability of the clustering method to separate active from inactive molecules in each cluster, and the results were compared with Ward's method. The chemical dataset MDL Drug Data Report (MDDR) and the Maximum Unbiased Validation (MUV) dataset were used. Experiments suggest that the adaptive cumulative voting-based consensus method can improve the effectiveness of combining multiple clusterings of chemical structures.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.