Adapting Document Similarity Measures for Ligand-Based Virtual Screening

Himmat M; Salim N; Al-Dabbagh MM; Saeed F; Ahmed A

doi:10.3390/molecules21040476

Himmat M ¹ , Salim N ² , Al-Dabbagh MM ³ , Saeed F ⁴ , Ahmed A ⁵

Affiliations

¹ Faculty of Computing, Universiti Teknologi Malaysia, Skudai, Johor 81310, Malaysia. barakamub@yahoo.com
² Faculty of Computing, Universiti Teknologi Malaysia, Skudai, Johor 81310, Malaysia. naomie@utm.my
³ Faculty of Computing, Universiti Teknologi Malaysia, Skudai, Johor 81310, Malaysia. mohamad.aldabbagh@protonmail.com
⁴ Faculty of Computing, Universiti Teknologi Malaysia, Skudai, Johor 81310, Malaysia. faisalsaeed@utm.my
⁵ Faculty of Computing, Universiti Teknologi Malaysia, Skudai, Johor 81310, Malaysia. alikarary@gmail.com

Molecules, 2016 Apr 13;21(4):476.

PMID: 27089312 DOI: 10.3390/molecules21040476

Abstract

Quantifying the similarity of molecules is one of the major tasks in virtual screening. Many similarity measures have been proposed for this purpose, some of them derived from document and text retrieval, since measures that give good results in document retrieval can often achieve good results in virtual screening as well. In this work, we propose a similarity measure for ligand-based virtual screening that is derived from a text processing similarity measure and adapted to be suitable for virtual screening; we call it the Adapted Similarity Measure of Text Processing (ASMTP). To evaluate ASMTP, we conducted several experiments on two benchmark datasets: the Maximum Unbiased Validation (MUV) and the MDL Drug Data Report (MDDR). In each experiment, 10 reference structures were chosen at random from each class as queries, and performance was evaluated by recall at the 1% and 5% cut-offs. The results are compared with those of other similarity methods, including the Tanimoto coefficient, which is considered the conventional, standard similarity coefficient for fingerprint-based similarity calculations. The results show that ligand-based virtual screening with ASMTP performs better than the Tanimoto coefficient and the other methods.
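The baseline and the evaluation metric named in the abstract can be illustrated with a short sketch. It does not reproduce ASMTP, whose formula is not given in the abstract; it only shows the standard Tanimoto coefficient for binary fingerprints, c / (a + b - c), where a and b are the numbers of on-bits in each fingerprint and c is the number they share, and a simple recall at a top-fraction cut-off. The function names, the set-of-bit-indices fingerprint representation, the toy data, and the handling of empty fingerprints are all assumptions made for illustration.

```python
from typing import Sequence, Set


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto coefficient of two binary fingerprints given as sets of on-bit indices."""
    if not a and not b:
        return 0.0  # assumed convention for two empty fingerprints
    common = len(a & b)
    return common / (len(a) + len(b) - common)


def recall_at_cutoff(
    scores: Sequence[float], is_active: Sequence[bool], fraction: float
) -> float:
    """Share of all actives that appear in the top `fraction` of the ranked list."""
    total_actives = sum(is_active)
    if total_actives == 0:
        return 0.0
    n_top = max(1, int(len(scores) * fraction))  # at least one molecule is inspected
    ranked = sorted(zip(scores, is_active), key=lambda pair: pair[0], reverse=True)
    found = sum(active for _, active in ranked[:n_top])
    return found / total_actives


# Hypothetical toy data: one query fingerprint and a five-molecule "database".
query = {1, 4, 7, 9}
database = [{1, 4, 7, 9}, {1, 4, 8}, {2, 3, 5}, {4, 7, 9, 10}, {6}]
labels = [True, False, False, True, False]  # which database molecules are active

scores = [tanimoto(query, fp) for fp in database]
print(scores)
print(recall_at_cutoff(scores, labels, 0.5))
```

In a screening experiment of the kind described above, the same ranking step would be run once per reference structure with the cut-off fraction set to 0.01 or 0.05, and the recall values would then be averaged per activity class.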
