Make DrugFinder Application Submit an application to DrugFinder

Joint collaboration between the NFCR and InhibOx Go to InhibOx website Go to NFCR website

free hit counters

Software DOx

DOx

Introduction

Protein-ligand docking methods attempt to identify optimal positions and orientations of a ligand or small molecule with respect to a given protein receptor or enzyme.  Docking programs typically are composed of two main components:

  1. A search component that explores position and orientation of the ligand with respect to the protein.
  2. A scoring component which evaluates each generated molecular configuration.

 

The Search Module

Search modules come in two main types, exhaustive and stochastic.
Exhaustive or systematic search methods move and rotate the ligand into every possible position and orientation within the search space using a given “granularity” of search. The success of such programs is often limited by efficiency considerations due to the complexity and scale associated with large proteins and receptors. Virtual high-throughput screening can therefore be hampered by large processing times. Such methods also encounter problems when presented with flexible molecules due to the exponential increase in the search space size.
A variety of stochastic and systematic search techniques are used by docking programs. For example, AutoDock [8] and GOLD [1] use variations of the Genetic Algorithm (GA) [4] method for this purpose. DOCK [1] uses an incremental construction and random conformation search-based method to search for optimal poses. FlexX [611] incrementally constructs the molecule to sample the conformation space and iteratively places it within the active site. Earlier versions of AutoDock used the simulated annealing [5] method to perform the search. All such methods attempt to achieve the correct balance between the efficiency and accuracy of the search.
In principle, a docking search is ideally suited for implementation within a GA as the search space can be represented using 6 or more real numbers which describe the position and orientation of the ligand with respect to the protein. In principle, a GA should be able identify the optimal configuration for a given optimisation problem. This is however limited by the design of the evaluation system and the parameters used during the running of the GA. Efficiency concerns normally do not allow for large execution times and small population sizes and limited execution times can force the algorithm to settle within local minima. Modern GAs usually operate by performing large variations (and therefore large configuration changes) at the early stages of execution, performing the final optimizations by incrementally changing the mutation rate, cross-over rate and the level of elitism. The use of an optimal GA configuration for the problem space is very important and was a main consideration for the design of the DOx search component.
DOx uses a GA-based search method with a gradient-based optimization module. AutoDock [8] also uses a similar implementation of a hybrid GA by incorporating Lamarckian rules to the operation of the algorithm.  DOx also uses a novel chromosome design which fragments the translational and rotational coordinates of a new configuration into several values of varying magnitude to allow the GA to perform its search using different step-sizes.

 

The Scoring Module

The scoring module is used to evaluate the favourability of a generated molecular configuration.  A variety of scoring functions have been developed over the past decade [13]. Several recent studies have also evaluated many collections of these scoring functions for accuracy. These studies [1314] have indicated the effectiveness of the XScore, DrugScore, PLP and G-Score scoring functions.  The PLP and XScore functions have been implemented in DOx.
However, many of the scoring functions available have been developed, tested and evaluated against distinct classes of proteins and may therefore return different results for generalized cases. The best scoring function to use for a particular class of target protein can be difficult to predict. Therefore, a more recent approach to the construction of the scoring module involves the use of consensus scoring. Consensus scoring simply involves the use of two or more scoring functions for the prediction of the binding affinity. The construction of the final score could be done in many ways. The simplest would involve the normalisation of two scores (e.g. PLP and XScore) and using the largest or smallest. A more refined approach could involve the scoring of the ligand against a collection of scoring functions and constructing the final score by the parameterized addition of the different scoring function scores. The values of each parameter could be dependent on the class of the protein being used.

References

 

[1]   Todd J. Ewing, Shingo Makino, Geoffrey A. Skillman, and Irwin D. Kuntz. Dock 4.0: Search strategies for automated molecular docking of flexible molecule databases. Journal of Computer-Aided Molecular Design, 15(5):411–428, May 2001.

[2]   Thierry O. Fischmann, Alan Hruza, José S. Duca, Lata Ramanathan, Todd Mayhood, William T. Windsor, Hung V. Le, Timothy J. Guzi, Michael P. Dwyer, Kamil Paruch, Ronald J. Doll, Emma Lees, David Parry, Wolfgang Seghezzi, and Vincent Madison. Structure-guided discovery of cyclin-dependent kinase inhibitors. Biopolymers, 89(5):372–379, 2008.

[3]   M. Gangloff, M. Ruff, S. Eiler, S. Duclaud, J.M. Wurtz, and D. Moras. Crystal structure of a mutant heralpha ligand-binding domain reveals key structural features for the mechanism of partial agonism. J. Biol. Chem., (276):15059–15065, 2001.

[4]   D.E. Goldberg. Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley Longman Publishing Co., Inc., 1989.

[5]   S. Kirkpatrick, C. D. Gelatt, and M. P. Vecchi. Optimization by simulated annealing. Science, Number 4598, 13 May 1983, 220, 4598:671–680, 1983.

[6]   B. Kramer, M. Rarey, and T. Lengauer. Evaluation of the flexx incremental construction algorithm for protein-ligand docking. Proteins, 37(2):228–241, November 1999.

[7]   Soss M. Rotamer exploration and prediction. CCG Internal Report, page http://www.chemcomp.com/journal/rotexpl.htm, 2002.

[8]   Garrett M. Morris, David S. Goodsell, Robert S. Halliday, Ruth Huey, William E. Hart, Richard K. Belew, and Arthur J. Olson. Automated docking using a lamarckian genetic algorithm and an empirical binding free energy function. Journal of Computational Chemistry, 19(14):1639–1662, January 1999.

[9]   Huang N., Shoichet B.K., and Irwin J.J. Benchmarking sets for molecular docking. J. Med. Chem., 49(23):6789–6801, October 2006.

[10]   Karine Pereira de Jesus-Tran, Pierre-Luc Cote, Line Cantin, Jonathan Blanchet, Fernand Labrie, and Rock Breton. Comparison of crystal structures of human androgen receptor ligand-binding domain complexed with various agonists reveals molecular determinants responsible for binding affinity. Protein Sci, 15(5):987–999, 2006.

[11]   I. Schellhammer and M. Rarey. Flexx-scan: fast, structure-based virtual screening. Proteins, 57(3):504–517, November 2004.

[12]   Schulz-Gasch T. and Stahl M. Binding site characteristics in structure-based virtual screening: evaluation of current docking tools. J. Mol. Mod, 9(1).

[13]   R. Wang, Y. Lu, X. Fang, and S. Wang. An extensive test of 14 scoring functions using the pdbbind refined set of 800 protein-ligand complexes. J Chem Inf Comput Sci, 44(6):2114–2125, 2004.

[14]   Renxiao Wang, Yipin Lu, and Shaomeng Wang. Comparative evaluation of 11 scoring functions for molecular docking. J. Med. Chem., 46(12):pp 2287 – 2303, 2003.

[15]   S. P. Williams and P. B. Sigler. Atomic structure of progesterone complexed with its receptor. Nature, 393(6683):392–6, 1998.