Abstract
Abstract
Conformational changes frequently occur when proteins interact with other proteins. How to detect such changes in silico is a major problem. Existing methods for docking with conformational changes remain time-consuming, and they solve only a small portion of protein complexes accurately. This work presents a more accurate method (FlexDoBi) for docking with conformational changes. FlexDoBi generates the possible conformational changes of the interface residues that transform the proteins from their unbound states to bound states. Based on the generated conformational changes, multidimensional scaling is performed to construct candidates for the bound structure. We develop a new energy item for determining the orientation of docking subunits and selecting of plausible conformational changes. Experimental results illustrate that FlexDoBi achieves better results. On 20 complexes, we obtained an average iRMSD of 1.55Å, which compares favorably with the average iRMSD of 1.94Å for FiberDock. Compared to ZDOCK, our results are of 0.27Å less in average iRMSD of the medium difficulty group.
1. Introduction
Protein docking is the task of calculating the three-dimensional structure of a complex starting from the individual structures (subunits) of proteins. There are many techniques for predicting protein–protein docking configurations. Broadly, they can be grouped into two categories. The first we call rigid molecule docking methods. They work by sampling the effective positions and orientations of a rigid-body protein around another one. Among these, some induced fit approaches based on fast Fourier transformation (Chen et al., 2003; Heifetz et al., 2002), geometric surface matching (Schneidman-Duhovny et al., 2005), as well as intermolecular energy (Fernández-Recio et al., 2004; Dominguez et al., 2003; Alcaro et al., 2007) have been proposed. In addition, other existing methods follow the population selection. They identify that the interface residues are based on analyzing the differences between interface residues and noninterface residues in known complexes, often through the use of statistical techniques (Neuvirth et al., 2004; Bradford and Westhead, 2005) and 3D structural algorithms (Shulman-Peleg et al., 2005; Konc and Janežič, 2010).
The second category of docking techniques is the flexible molecule docking methods. These methods work by changing the backbone and/or side-chain conformations to refine flexible structures of complexes. The flexible docking methods can be divided into three groups according to their treatment of structural flexibility. The first group, including FiberDock and RosettaDock, searches for energetically favored conformations in a wide conformational search space. FiberDock (Mashiach et al., 2009) combines a novel normal mode analysis (NMA)–based backbone refinement with side-chain optimization and rigid-body minimization. It minimizes the backbone conformation along a few degrees of freedom, which are carefully picked by NMA. The side-chain flexibility of interface residues is modeled by a rotamer library. After refining all docking solutions, the predicted structures are ranked according to an energy function. RosettaDock (Lyskov and Gray, 2008) is a Monte Carlo–based docking method. It optimizes both rigid-body orientation and side-chain conformation via rotamer packing. RosettaDock refines the flexible backbone by minimizing the energy functions via varying the backbone torsional angles. The second group deals with hinge-bending motions in the docked molecules, such as FlexDock (Schneidman-Duhovny et al., 2007). It first detects hinge regions, rigid parts, and motion directions in the flexible structure. Then, each rigid part of the flexible molecule is docked with the rigid molecule, and the directions generate more conformations of the flexible molecule. Finally, all the partial docking solutions are assembled with good shape complementarity, and they are selected according to scoring. The last one, HADDOCK (Dominguez et al., 2003), is an experimental data-driven method by using the biochemical and biophysical interaction data, such as chemical shift perturbation data resulting from NMR titration experiments, mutagenesis data, or bioinformatics predictions. This information is introduced as ambiguous interaction restraints (AIRs) to drive the docking process. An AIR is defined as an ambiguous distance between all residues shown to be involved in the interaction. The method uses simulated annealing in torsion angle space to refine the structure, allowing for both backbone and side-chain flexibility on the interface. The final structures are clustered and ranked according to their average interaction energies.
In this article, we present a more accurate method, FlexDoBi, for docking with conformational changes. We develop an approach to detect the conformational changes from unbound states to bound states. Our approach examines a set of scaled structures as candidates for the bound structure (possibly with conformational changes), and uses a new energy function to select the best solutions.
To obtain the set of scaled structures, we maintain a database of structures, from which raw candidates for the conformationally changed residues can be rapidly selected. These candidates are then refined through an efficient method based on multidimensional scaling. This allows accurate near-native structures to be constructed with a minimal number of sampling steps. One advantage in this approach is that, whereas the large search space of existing methods requires intensive computational power and produces a large portion of conformations different from the native complex, in our method the geometrical constraints—imposed by the distance between two residues respectively at both ends of an interface fragment—eliminate a substantial number of unlikely candidate structures. One caveat is that for our method to work, the regions far from the interface should be almost unchanged in the protein complex.
The energy function used in FlexDoBi for structural evaluation is carefully constructed, since the effectiveness of the function is a crucial factor in determining the resultant structure. In this work, we develop a new statistical energy item, which is combined linearly with four other energy items to rank the poses and to direct the search of the plausible conformations.
Experimental results show that FlexDoBi achieves better results than other methods for the same purpose. On 20 complexes, we obtained an average iRMSD of 1.55Å, which compares favorably with the average iRMSD of 1.94Å in the predictions from FiberDock. Compared with ZDOCK, our results are of 0.27Å less in average iRMSD on the medium difficulty group.
2. Method Overview
Our method for the flexible docking problem contains two steps. In the first step, we find the relative orientation and position between two subunits. That is, we determine where the two subunits bind. Each relative orientation and position combination is referred to as a configuration or pose. Once a pose is given, we can determine the interface region between two subunits and fix the orientation as well as position of the regions far from the interface. In the second step, we use an efficient way to compute the possibly changed conformation of the interface. Here our method examines only thousands of structure candidates for the bound conformation of the interface, which is significantly less than existing methods.
To perform the first step, we modify P-Binder (Guo et al., 2012), a tool we have developed recently. P-Binder utilizes an enumeration method to identify the docking configurations of two subunits. It first performs a large number of rigid transformations to enumerate the poses. For each configuration, the side-chain conformation on the interface is built for energy evaluation. The problem of modeling side-chain is well studied (Xu and Berger, 2006; Brown et al., 2006; Krivov et al., 2009), and we use SCWRL4 (Krivov et al., 2009) in this work. Side-chains are unchanged on the structures in this step and are repacked in the second step. The poses are evaluated through a linear combination of five energy items, one of which is newly developed in this article. The top-ranking poses are selected for the second step processing.
In the second step, we assume that only the interface region in a given configuration of the unbound structures will experience conformational changes. Hence, to obtain a near-native structure of a complex, one only needs to modify the residues in the interface region. Our strategy is to replace each fragment formed by the consecutive residues in the interface region with some similar fragments. A residue is to be replaced if any of its atoms are within 10Å to any atoms in the partner subunits. In each subunit, four or more consecutive residues to be replaced form a replaceable fragment. A database of known structure fragments is maintained to search for suitable replacement candidate structures. Referring to the pair of residues respectively at both ends of a fragment as stems, we use the following two measures in our selection of candidate structures: (1) the root mean square deviation (RMSD) of the heavy backbone atoms in the stems and (2) the sequence similarity between the replaceable fragment and the candidate.
Some processing is required in replacing the fragments, since selected fragment candidates may result in unreasonable bond lengths, bond angles, and even collisions in the protein structure. Hence, in our structural modification, we scale all fragment candidates to reduce these inconsistencies. This is formulated as a weighted multidimensional scaling (WMDS) (de Leeuw, 1977) problem and solved by using a heuristic method, which aims to reduce the unreasonable bond length on the interface as well as remove most of the clashes between pairs of subunits in a complex.
Each docking orientation and position is to be evaluated by a new energy function. This energy function is a combination of the following energy items: side-chain energy (Krivov et al., 2009), dDFIRE energy function (Yang and Zhou, 2008), atomic contact energy (Zhang et al., 1997; Zhang, 1998), secondary structure energy (our newly developed energy item), and the Gromacs force field (Lindahl et al., 2001). We use a trained SVM model to rank the docking solutions and report the best ones with the lowest energy values.
Throughout this article, a complex may contain several subunits and multiple binding interfaces. Each binding interface in a complex occurs in a pair of subunits. Two residues in a pair of subunits are called interface residues if any two atoms, one from each residue, interact. By interact, we mean the distance between two atoms is less than 6Å.
Figure 1 depicts an example of our result. In panel (A), we present a case where many fragment candidates are obtained for the replaceable fragments on the interface of each subunit. The value of Cα RMSD between the unbound and bound states of interface structure is 3.57Å; FlexDoBi gives a candidate of RMSD 2.57Å. In panel (B), multidimensional scaling refines dihedral angles and bond lengths, allowing for more accurate energy calculation. We select the highest ranked structure, which has the iRMSD (the RMSD between the Cα atoms of interface residues) of 2.21Å between the choosing structure and the bound complex.

The refinement of the case 2z0e(A:B).
3. Results
To evaluate our method, we conduct three groups of experiments. Recall that our method replaces the fragment, which is formed by the consecutive residues, in each interface region with candidate fragments in a database. To test the feasibility of the method, we show that for each native fragment of the bound subunit, there are some similar fragments in the database. The second group of experiments is to examine the performance of our method, that is, the ability of identifying the conformational changes from unbound state to bound state. We use native-bound complexes to fix the pose and compute the conformational changes of interface (see Section 3.2). In Section 3.3, we compare our method (FlexDoBi) with FiberDock (Mashiach et al., 2009), which also assumes that the poses are given. Finally, to test our program for unspecified poses, we compare our method with ZDOCK (Chen et al., 2003).
3.1. Similarity between native interface and selected candidates
Observations of protein complexes show that for many complexes, the major structural changes between the bound and unbound states occur on the interface regions. Our sample data set is extracted from the medium difficulty group (29 complexes) and the regular difficulty group (24 complexes) in protein–protein docking Benchmark 4.0 (Hwang et al., 2010). We calculate the Cα RMSD values on the whole structures and on the interface residues. The average Cα RMSD value between the complex structures and the unbound proteins in native-binding orientation is 1.92Å. However, the average RMSD between the interface residues of these two states is 3.78Å. These details are shown in Figure 2. Clearly, the interfaces are more flexible than the rest of the structures. This justifies our method for transforming an unbound structure into its bound state by substituting only the fragments on the interface.

The Cα RMSD between the complex structures and the unbound proteins in native binding orientation: interface RMSD (blue) and RMSD for the whole structure (red).
Suitable replacement fragment candidates are selected from a database. We use a database comprising 13,255 protein chains, selected by using PISCES (Wang and Dunbrack, 2003) with cutoff values being 90 percent identity, resolution 2.0Å, and R-value 0.25. Fragment candidates are selected from this database without the homologous proteins. We find that the homologous candidates appear in the fragment candidates for 42 complexes and filter out those fragments to make a fair assessment. Among 53 complexes, 326 replaceable fragments are extracted from the interfaces of bound states. We search the candidates for the bound state of the replaceable fragment. As shown in Figure 3, for all the fragments, the best candidates are found within 2.25Å.

The Cα RMSD between the interface fragments on bound conformations and unbound structures (blue) or best candidates selected by FlexDoBi (red).
3.2. Conformational changes of native poses
In this experiment, we verify that suitable fragment candidates can be identified from the database and reshaped properly for interface fragments. We assume that the native poses are given and two subunits are unbound. Now the task is to transform the unbound subunits onto bound states. To obtain the native pose, the unbound structure is superimposed on the native bound complex by the orientation of lowest Cα RMSD for the whole structure. The value of iRMSD is to denote the RMSD between the Cα atoms of interface in the predicted structure and in the native complex after superimposing the interfaces.
The medium difficulty group in Benchmark 4.0 is used for this study. Details are in Table 1. Among the 29 instances, our program obtains better conformations for 22; that is, FlexDoBi discovers better conformations than simply putting two unbound subunits together for 22 instances. The iRMSD values become worse for three instances and are similar in four instances; by similar, we mean the difference between the iRMSD of the prediction structures and that of the unbound ones is less than 0.05Å. The average Cα iRMSD value between the interface predicted by FlexDoBi and the corresponding portion of the native complex is 2.29Å. Yet, the average iRMSD value between the interface of unbound structures and that of the native complex is 2.51Å.
Unbound structure of receptor or ligand in the complex.
Unbound structure is superimposed on the bound conformation by the orientation of lowest Cα RMSD for the whole structure.
Cα RMSD between the interface of the predicted structure and the native complex.
Energy value of the predicted complex.
The best instances, predicted by FlexDoBi, are 1m10(A:B), 1r6q(A:C), and 2z0e(A:B), where the values of Cα iRMSD are reduced by 0.7Å, 0.6Å, and 1.3Å, respectively. Figure 4 displays the conformation discovered by FlexDoBi for 1r6q(A:C). FlexDoBi predicts the interface conformation with 1.70Å iRMSD; however, the value of iRMSD for the unbound structures on the native orientation is 2.32Å. The energy of the conformation predicted by FlexDoBi, −256.47, is lower than the initial energy of the unbound structure, −186.50. We should notice that lower energy does not always imply better conformation in terms of iRMSD.

The refinement of the case 1r6q(A:C): The unbound structure of interface is colored yellow and the bound structure is blue. The refined structure, created by FlexDoBi, is in red.
3.3. Comparison with FiberDock
In this subsection, we compare the results of FlexDoBi with FiberDock (Mashiach et al., 2009). FiberDock is a novel NMA-based backbone flexibility treatment, which refines the structure of complex from a given docking configuration. We evaluate the performance of two methods by using the unbound native pose. The data set is extracted from FiberDock's paper. We obtain much better results than that of FiberDock. The comparison result is detailed in Table 2. Among 20 instances, FlexDoBi produces better results for 14 instances. By better, we mean that the iRMSD value is at least 0.05Å smaller than the iRMSD of the FiberDock method. Only for four instances, FiberDock produces better results. The average values of Cα iRMSD between the predicted structures and the native complexes are 1.55Å (FlexDoBi) and 1.94Å (FiberDock), respectively.
Unbound structure of receptor or ligand in the complex.
Unbound structure is superimposed on the bound conformation by the orientation of lowest Cα RMSD for the whole structure.
Cα RMSD between the interface in the predicted structure and the native complex.
Cα RMSD between the interface in the predicted structure of receptor and in its bound conformation.
Rec-iRMSD is to denote the iRMSD value of the receptor, which is the subunit of more residues. The average values of Rec-iRMSD between the predictions and the bound conformations are 1.71Å (FlexDoBi) and 2.01Å (FiberDock), respectively. In case of 1got(A:B), FlexDoBi predicts new interface conformation in complex with 0.92Å Cα iRMSD, however, the value of iRMSD for the unbound structures on the native orientation is 3.62Å. Figure 5 displays the docking configuration discovered by FlexDoBi for 1got(A:B). The comparisons indicate that FlexDoBi produces better interface conformations while changing the unbound states into bound states.

The refinement of the case 1got(A:B): The unbound structure of interface is colored yellow and the bound structure is blue. The refined structure, created by FlexDoBi, is in red.
3.4. Evaluation on Benchmark v4.0
In this study, we assume that the native pose is unknown. We perform a search that finds both the poses and identifies the structural changes. For each complex, we adopt a similar procedure as in P-Binder to predict the poses. The top 100 poses according to our new energy function are chosen and are fed into our method for modeling conformational changes. The top ten results from the method according to energy value are reported. These are finally compared with the docking results from ZDOCK (Chen et al., 2003), and the flexible docking solutions from FiberDock, which refines the poses predicted by ZDOCK. In order to test the refining ability of our method, we use FlexDoBi to model the conformational changes of the poses predicted by ZDOCK. The docking results are shown in Table 3. In general, the iRMSD values decrease as the magnitude of conformational change increases.
Cα iRMSD between the predicted configuration by each method and the native complex.
Cα iRMSD between the changed conformation predicted by each method and the native complex, refining the best pose of ZDOCK.
We compare our method with ZDOCK on the rigid-body group in Benchmark v4.0. The values of Cα iRMSD between the unbound structures in the native poses and the native complexes range from 0.24Å to 2.02Å. The results are presented in Table 4. For 123 complexes in the rigid-body group, the average Cα iRMSD values between the predictions and the native complexes are 3.96Å (FlexDoBi) and 4.15Å (ZDOCK), respectively. FlexDoBi produces better results for 82 instances than ZDOCK.
Cα iRMSD between the predicted configuration of each method and the native complex.
The first complex is 1qfw(HL:AB), and the second complex is 1qfw(IM:AB).
The first complex is 1oyv(B:I), and the second complex is 1oyv(A:I).
We calculate the medium difficulty group and the regular difficulty group in Benchmark v4.0. The values of Cα iRMSD between the unbound structures in the native poses and the native complexes range from 1.48Å to 16.76Å. Several proteins in the regular difficulty group undergo significant conformational changes upon binding. The results are presented in Tables 5 and 6.
Cα iRMSD between the predicted configuration of each method and the native complex.
Cα iRMSD between the changed conformation predicted by each method and the native complex, refining the best pose of ZDOCK.
Cα iRMSD between the predicted configuration of each method and the native complex.
Cα iRMSD between the changed conformation predicted by each method and the native complex, refining the best pose of ZDOCK.
For 29 complexes in the medium difficulty group, the average Cα iRMSD values between the poses predicted by FlexDoBi and the native complexes is 4.61Å. ZDOCK predicts the best poses with 4.96Å iRMSD; FlexDoBi and FiberDock refine the best poses with 4.79Åand 4.88Å iRMSD, respectively. FlexDoBi computes better changed conformations for 16 instances than FiberDock, for refining the best poses predicted by ZDOCK. For 24 complexes in the regular difficulty group, the average Cα iRMSD values between the poses predicted by FlexDoBi and the native complexes is 7.78Å. ZDOCK predicts the best poses with 8.59Å iRMSD, FlexDoBi and FiberDock refine the best poses with 7.85Åand 7.84Å iRMSD, respectively. FlexDoBi computes better changed conformations than FiberDock for 12 instances.
In several unbound subunits, the coordinates of some backbone atoms are missing. We add the coordinates of the missing residues using MODELLER (Eswar et al., 2006). In two groups, the missing residues appear in the unbound structures of four complexes: residues 36–43 in 1fq1B, residues 206–215 in 1grnB, residues 72–94 in 1jmoA, and residues 46–58 in 3cphA. After the gaps are filled in, the accuracy of the predictions is improved. The docking configuration discovered by FlexDoBi for 3cph(A:G), after the gap is filled, is displayed in Figure 6. The complex predicted by FlexDoBi has an iRMSD of 3.27Å Cα, which is better than the iRMSD of 3.91Å from ZDOCK.

The refinement of case 3cph(A:G).
3.5. Assessment of the energy items
To assess the effectiveness of the energy items, we analyze the performance with Benchmark v4.0. For evaluating the effectiveness of energy items, we reoptimize the coefficients in each case with only four out of five items and reevaluate the predicted structures by leaving one energy item out. The iRMSD without side-chain energy is 4.72Å, without dDFIRE energy function is 4.77Å, without atomic contact energy is 4.66Å, without secondary structure energy is 4.78Å, and without the Gromacs force field is 4.71Å. The Cα iRMSD of all the above five experiments are worse than with all energy items, 4.63Å. It is clear that dDFIRE energy function and secondary structure energy have more impact.
4. Method Details
4.1. Selecting candidates from the database
We exam the known protein structures and identify suitable candidates to replace each fragment on the interface. We use a database of 13,255 protein chains, selected by using PISCES (Wang and Dunbrack, 2003), with cutoff values of 90 percent identity, 2.0Å resolution, and 0.25 R-value (Sept. 2012). Fragment candidates are selected from this database without the homologous proteins. We look for the fragment candidates whose stems are within 3Å to the stems of replaceable fragments. Once fragment candidates are obtained, we take the top 100 fragments according to the sequence similarity as the matching candidates. The sequence similarity is computed according to BLOSUM62 matrix. As the replaceable fragment and the candidates have the same number of residues, the sequence similarity is the sum of the respective residue pair values in the matrix.
4.2. Fitting candidates on replaceable fragment
We cannot replace the fragments by the candidates directly, as it will result in unrealistic atomic distances and clashes. We scale the candidates to resolve those issues.
We formulate this structure problem as an instance of weighted multidimensional scaling (WMDS). For a given d dimension and n points of data, we have a distance matrix D and a weighted matrix W, both symmetric n × n matrixes, and wish to find
For our problem instances, d = 3 and n is the total number of backbone atoms in all replaceable fragments and the stems on the interface of two subunits A and B. We define the distance matrix D as
where
The distances are unequally important, so we adopt a weighted version. First, the stem atoms are fixed, so the pairwise weights between each pair of stem atoms must be large. Moreover, neighboring atoms should be given larger weights, because they represent bond lengths. If two atoms are within a small distance, then they should remain close to this distance regardless of other changes, especially if they are adjacent backbone atoms. On the contrary, distant atoms are free to move around. The weighted matrix is defined as
where 0 ≤ i < j ≤ n, T is a 4 × 4 lookup table as defined in LoopWeaver (Holtby et al., 2012), r is the largest pairwise distance between any two atoms in the corresponding matching candidate, and Φ is the golden ratio conjugate. In addition,
We use the SMACOF algorithm (de Leeuw, 1977) for solving the WMDS problem. This algorithm works by minimizing the stress function and yielding a fast, deterministic heuristic. By performing the iterative generation, the quality of interface refinement often gets better, and the unrealistic atomic distances are eliminated in the candidate structure.
4.2.1. Searching best conformations
Given a pair of subunits, we extract a candidate set for each replaceable fragment. For each replaceable fragment, at most 100 candidates are chosen. Then we replace the fragments by the candidates from the respective candidate set randomly. If a better conformation according to the energy function is found, we keep it. Otherwise, we try to replace a fragment by other candidates. This process is repeated until convergence. We restart the above procedure to generate multiple structure candidates. We use SCWRL4 to build the side-chain conformation of these structure candidates and evaluate them by the dDFIRE energy function.
4.3. Energy items
Our method generates a large number of structure candidates. Here we develop a new energy function to select the best structures. Our energy function contains the following energy items:
(1) The side-chain atoms of interface residues are packed by SCWRL4 (Krivov et al., 2009), and the corresponding energy item is extracted. (2) The dDFIRE energy is an all-atom statistical function (Yang and Zhou, 2008) based on the atomic distance and three orientation angles involved in dipole–dipole interactions. (3) The item of atomic contact energy is produced by an atomic energy measure (Zhang et al., 1997; Zhang, 1998). The free energy for a pair of interacting atoms has been calculated on atom-pairing frequencies in known complexes. (4) We use DSSP (Kabsch and Sander, 1983) to determine the type of secondary structure for each residue and construct the item of secondary structure energy by using the statistical method in Guo et al. (2012). Here, we incorporate three types of secondary structures, 20 types of amino acids, and one solvent contacting the residues in protein surfaces. The secondary structure energy item takes 60 × 60 possible residue pairs, obtained from the statistical analysis of residue-pairing frequencies in a complex database. We select 6,323 complexes from PDB database, and these complexes are made up of two or more protein subunits. Their structures are determined by X-rays with cutoff values of resolution 2.2Å and sequence identity 30% (Sept. 2012). We calculate the free energy for all pairs of interacting residues in candidate structures. (5) The Gromacs force field is built up from two distinct subunits to describe the interaction between their atoms (Lindahl et al., 2001). Gromacs calculates electrostatic interactions in the standard coulomb potential as
where
We use a linear combination of these energy items to rank the poses and to direct the search for finding the plausible conformations. The coefficient of each item is optimized by using a training set. Details are described as following. For a pair of subunits, we generate 1,000 candidate poses. The top 200 poses according to the iRMSD values are selected. The energy values of these 200 poses are computed. Then we choose the top 10 poses according to the combined energy value. The objective here is to minimize the sum of the iRMSD values of the top 10 candidates for the training set. The grid search method is used (Al-Khayyal, 1990). We identify the best possible combination of coefficients from 0.1 to 10. For the values of range 0.1 to 1, we use a step size of 0.1. For the values ranging from 1 to 10, we use a step size of 1.0. After we obtain the best combination of coefficients, we refine them further by allowing higher resolution. We change the coefficient by ±0.5 with a step size of 0.1. The optimal combination of coefficients is used for prediction on the testing set.
Finally, we use a trained SVM regression model to rank the docking solutions and report the best ones with the lowest predicted values. For the training set, we use iRMSD as the response values for all configurations of each protein pair, and the above energy items can be regarded as five features for each one. The configurations with the lowest predicted response values can be reported as results on the testing set. To obtain the parameters, we use 36 unbound–unbound complexes from Dockground (Liu et al., 2008) as the training set, which are not included in the testing set.
5. Conclusion and Discussion
In this article, we present a new method for flexible refinement of docking solutions. We formulate the backbone flexibility problem on the interface as an instance of the weighted multidimensional scaling problem, which is able to model the local conformational changes. The results show that FlexDoBi models the backbone motions on the protein–protein interface. The backbone refinement procedure improves the accuracy of near-native docking solution candidates.
Our method can eliminate a larger number of inaccurate candidate structures due to the geometrical constraints imposed by the distance between two residues respectively at both ends of each interface fragment. However, we only deal with the cases in which the regions far from the interface should be almost unchanged in complex.
We notice that large conformation changes can occur and result in a whole structure of interacting proteins. On the regular difficulty group, the large changes appear in the unbound structures of three complexes: 1y64(A:B), 1f6m(A:C), and 1ira(Y:X). In the case of 1y64B, the conformational change occurs in the loop region (residues 1396–1416). First, we replace this loop region with all loop candidates in the protein database, regardless of the stem RMSD value. Then, we also refine the interface conformation of complex by using our flexible docking method in this article. The best discovered configuration is displayed in Figure 7. We predict a new configuration of complexes with 6.42Å Cα iRMSD, whereas the value of iRMSD for the predicted complex without the replaced loop is 11.77Å. Those issues will be our further investigations in the near future.

The refinement of case 1y64(A:B):
Footnotes
Acknowledgments
This work is supported by the grants from the Research Grants Council of the Hong Kong Special Administrative Region, China [Project No. CityU 124512], and the startup fund [Project No. 7200276] and grant [Project No. 9610025] from the City University of Hong Kong.
Author Disclosure Statement
No competing financial interests exist.
