Computational methods coupled with experimental validation play a critical role in the identification of novel inhibitory peptides that interact with viral antigenic determinants. The interaction between the receptor binding domain (RBD) of SARS-CoV-2 spike protein and the helical peptide of human angiotensin-converting enzyme-2 (ACE2) is a necessity for the initiation of viral infection. Herein, natural orthologs of human ACE2 helical peptide were evaluated for competitive inhibitory binding to the viral RBD by use of a computational approach, which was experimentally validated. A total of 624 natural ACE2 orthologous 32-amino acid long peptides were identified through a similarity search. Molecular docking was used to virtually screen and rank the peptides based on binding affinity metrics, benchmarked against human ACE2 peptide docked to the RBD. Molecular dynamics (MD) simulations were done for the human reference and the Nipponia nippon peptide as it exhibited the highest binding affinity (Gibbs free energy; -14 kcal/mol) predicted from the docking results. The MD simulation confirmed the stability of the assessed peptide in the complex (-12.3 kcal/mol). The top three docked-peptides (from Chitinophaga sancti, Nipponia nippon, and Mus musculus) and the human reference were experimentally validated by use of surface plasmon resonance technology. The human reference exhibited the weakest binding affinity (Kd of 318-441 pM) among the peptides tested, in agreement with the docking prediction, while the peptide from Nipponia nippon was the best, with 267-538-fold higher affinity than the reference. The validated peptides merit further investigation. This work showcases that the approach herein can aid in the identification of inhibitory biosimilar peptides for other viruses.
Sequence diversity is one of the major challenges in the design of diagnostic, prophylactic, and therapeutic interventions against viruses. DiMA is a novel tool that is big data-ready and designed to facilitate the dissection of sequence diversity dynamics for viruses. DiMA stands out from other diversity analysis tools by offering various unique features. DiMA provides a quantitative overview of sequence (DNA/RNA/protein) diversity by use of Shannon's entropy corrected for size bias, applied via a user-defined k-mer sliding window to an input alignment file, and each k-mer position is dissected to various diversity motifs. The motifs are defined based on the probability of distinct sequences at a given k-mer alignment position, whereby an index is the predominant sequence, while all the others are (total) variants to the index. The total variants are sub-classified into the major (most common) variant, minor variants (occurring more than once and of incidence lower than the major), and the unique (singleton) variants. DiMA allows user-defined, sequence metadata enrichment for analyses of the motifs. The application of DiMA was demonstrated for the alignment data of the relatively conserved Spike protein (2,106,985 sequences) of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the relatively highly diverse pol gene (2637) of the human immunodeficiency virus-1 (HIV-1). The tool is publicly available as a web server (https://dima.bezmialem.edu.tr), as a Python library (via PyPi) and as a command line client (via GitHub).