Induced-Fit Posing (Confined) Tutorial

The primary purpose of the Induced-Fit Posing (Confined) Floe is to generate favorable ligand binding pose(s) in the context of limited (i.e., confined) local reorganization of the reference receptor (Fig. 1). Protein reorganization is focused on (but not necessarily limited to) sidechains.

Schematic image showing the reference receptor and a ligand hit feeding into an arrow representing the IFP method, which points to the output induced-fit pose.

Figure 1: Schematic depiction of the IFP Floe.

Note

A single instance of the Induced-Fit Posing (IFP) Floe generates predictions for a single ligand binding to a single receptor. To assess multiple ligands, the Floe must be run multiple times.

General progression of the IFP Floe

  1. Specify inputs. Minimally, the required inputs comprise:

    • the Ligand Input Dataset, which contains the ligand of interest, hereafter called the hit ligand. In this dataset, the hit ligand must occupy a primary molecule field and have reasonable 3D coordinates.

      Note

      the specific 3D coordinates of the input hit ligand do not affect the output of this floe.

    • the Reference Receptor Input Dataset, which is the protein to which hit ligand binding will be predicted. This protein must be fully prepared for simulation, for example by running a SPRUCE protein preparation Floe. Additionally, this input dataset must contain a bound ligand, which we refer to as the reference ligand. Although the chemical identity and bound pose of the reference ligand can be freely varied at the discretion of the user, these properties will affect the resulting IFP predictions.

    • the inputs listed above are sufficient to run this Floe. The remaining entries outline the progression of steps that the Floe executes.

  2. Define the collection of protein residues in the reference receptor that constitute its ligand binding pocket. This identification is accomplished using metrics based on the distance to the reference ligand and quantitative estimates of protein residue backbone solvent accessible surface area.

  3. Prune binding pocket residues in the reference receptor to alanine. This creates a larger pocket for subsequent docking.

  4. Dock the hit ligand into both pruned and unpruned binding pockets of the reference receptor. To increase the probability that the docked ensemble contains accurate poses, this docking can employ multiple methods: by default, it employs both ligand-guided (Hybrid) and ligand-free (FRED) approaches.

    Note

    IFP aims (without guarantee) to generate a list of highly-ranked poses that includes at least one accurate pose (target root mean square deviation (RMSD) < 2 Å). In some cases, the most accurate poses will originate from pruned docking, whereas other cases may favor unpruned docking. Likewise, either Hybrid or FRED may provide superior initial models for a given system of interest. The default IFP procedure leads to a top-12 ranked list of predicted binding modes because 2 docking methods * 2 treatments of binding pocket residues in the reference receptor * 3 final models per approach = 12 (which may then be moderately reduced due to conformational similarity).

  5. Revert the reference receptor back to its original sequence and run a series of energy minimizations to resolve any clashes that docking might have introduced.

  6. A modeled series of equilibrations to permit limited (i.e., confined) induced fit as the reference receptor relaxes around the hit ligand.

  7. Finally, pose/receptor conformations are scored and ranked. The IFP Floe report and success dataset provide quantification and 3-dimensional coordinates of up to 12 of the top-ranked pose/receptor conformations, respectively.

Worked example #1: cyclin-dependent kinase 2 (CDK2)

In this tutorial, we use two IFP Floes:

  1. For the induced-fit posing (step 1-5): Induced-Fit Posing (Confined) Floe

  2. For the quantitative comparison to the known binding pose (step 6): Compare IFP Results to Target Receptor Floe

The estimated cost of the IFP Floe run for the example #1 is around $53.

Step 1: Prepare IFP inputs

The IFP Floe requires correctly prepared proteins up to “MD ready” standards which require chain capping, full atomic definition (including hydrogens), the resolution (or capping) of missing protein loops, and sidechain protonation state selection. Cofactors and structured waters may be included. We strongly recommend using SPRUCE protein preparation Floes.

Two-dimensional image of the hit ligand used in worked example #1.

Figure 2: The hit ligand, PNU-230032

This tutorial component utilizes the CDK2 receptor.

  • As the reference receptor, we select PDB ID: 2C6I 1, which has a bound triazolopyrimidine inhibitor 2. As the hit ligand, we select the 2-aminothiazole inhibitor PNU-230032, 4-[(5-Isopropyl-1,3-thiazol-2-YL)amino]benzenesulfonamide 3 (Fig. 2). The SMILES of the ligand is CC(C)C1=CN=C(S1)NC2=CC=C(C=C2)S(=O)(=O)N. For convenience, we collect this ligand from its CDK2-bound pose in PDB ID: 2BTS 4, 5.

The Reference Receptor Input Dataset used in this tutorial is:

The input has been prepared by running SPRUCE - Protein Preparation Floe with the parameters: PDB code(s) to download: 2C6I, Ligand name(s): DT1. (The dataset has been aligned by 1JSV, with the parameter Reference Structure Inputs/Optional PDB Code for reference DU: 1JSV.)

Additional guidance on how to run the Spruce Floe is available in the SPRUCE tutorial.

Note

If you are planning a series of IFP runs using similar reference receptors, it can be convenient to provide the SPRUCE - Protein Preparation Floe with a PDB code for the Optional PDB Code for reference DU input that is consistent across SPRUCE runs. This will facilitate three-dimensional visualization by providing a common reference receptor alignment. Although we have set this optional input to 1JSV while generating the input datasets, this SPRUCE field can also be left blank in preparation for IFP.

The Ligand Input Dataset used in this tutorial is:

The ligand input has been taken from the Spruced 2BTS DU dataset. (The 2BTS DU dataset has been aligned by 1JSV, with the parameter Reference Structure Inputs/Optional PDB Code for reference DU: 1JSV.)

After you upload the downloaded files to Orion®, it will automatically convert them to datasets (Fig. 3). All future work in Orion will utilize these datasets rather than the uploaded files. Additional instructions for uploading files and managing directories in the Orion user interface (UI) can be found in the OpenEye documentation as text or video.

Image of datasets after automatic conversion from uploaded .oeb.gx files.

Figure 3: Orion automatically converts uploaded files into datasets.

Warning

All IFP input datasets must have a single record. That condition can be assessed directly in the appropriate Orion Data directory. In this example, both the datasets generated by Spruce protein preparation do have a single record (Fig. 4). Solutions for situations in which this is not the case are discussed at the end of this worked example (Step #7).

Image of spruce datasets clarifying that they have a single record each.

Figure 4: Input datasets for IFP must have a single record only.

Step 2: Launch the IFP Floe

Find and launch the Induced-Fit Posing (Confined) Floe in the Orion UI (Fig. 5). One can find the Floe by searching “IFP” or “Induced-Fit Posing”.

Image of the Orion user interface for launching the IFP Floe.

Figure 5: Launching the IFP Floe.

After selecting “launch Floe”, you will be presented with a job form on which inputs, outputs, and parameters may be defined (Fig. 6). Although many options can be specified or varied, only two are required:

  • Ligand Input Dataset: PNU-230032

  • Reference Receptor Input Dataset: spruce_dataset_2c6i_ref_1jsv

When you are satisfied with the IFP Floe setup, press the green ‘Start Job’ button at the lower right (Fig. 6).

Image of IFP options selection page in the Orion user interface.

Figure 6: Defining IFP Floe options.

Step 3: Verify that the IFP Floe completed successfully

If the Floe was configured to do so, you will receive an email when the job finishes. This email will indicate the status of the job. If the job was successful, you should receive an email that reads: Job “Induced-Fit Posing (Confined) [IFP]”, has completed successfully.

Warning

If the Floe failed, your email will instead read: Job “Induced-Fit Posing [Test] [IFP]”, has completed with errors: followed by a brief listing of the errors that were encountered. In this case, you will need to re-run the Floe after diagnosing the source(s) of the failure.

A successful IFP job will produce a number of output files, all of which can be found by selecting the Data icon on the vertical menu that occupies the left hand side of the Orion UI and navigating to the appropriate directory. An alternative approach to access the outputs associated with the Floe is to select the Floes icon on the vertical menu that occupies the left hand side of the Orion UI, then select Jobs in the top horizontal menu, then left click on the name of the job submitted as part of this tutorial. Ensuring that the JOBS tab is selected on the left side of the screen will lead it to be occupied by three sections. The Results section will contain links to most outputs, whereas the Reports section will contain a link to the Floe report. Key IFP outputs are listed below and shown in Fig. 7.

Image of two important IFP output files.

Figure 7: Key IFP output files.

  • The IFP Summary collection. This collection represents a Floe report that can be viewed by left clicking on the light blue image of a page that sits beside it in the data listing. Hovering over this light blue image of a page reveals the text Open Floe Report.

  • The Output for Induced-Fit Posing (Confined) [IFP] dataset. This is the Output Dataset of the Floe. Left clicking on the four blue squares that sit beside it in the data listing opens the output dataset in a tile view. Alternatively, this dataset can be added to active data using the associated icon of a white plus sign inside a light blue circle, followed by left clicking on either the Analyze or 3D icons on the vertical menu that occupies the left hand side of the Orion UI.

Step 4: Inspect the Floe report

Open the Floe report, either by navigating to the appropriate Data directory and left-clicking on the light blue image of a page that sits beside the IFP Summary collection in the data listing, or by using the appropriate link in the Job summary (both methods described in more detail above).

The Floe report will open as a pop-up window. It is often convenient to view the Floe report in full-screen mode by clicking on the open in a new tab icon near the IFP Summary index text at the top of the pop-up. This icon is a square with an arrow pointing out of its top right corner.

The top portion of the IFP Floe report for this CDK2 example is shown in Fig. 8. It displays a variety of information about ligand-receptor interactions for the 5 top-ranked induced-fit poses. The IFP score is shown in the ranking table on the left. The IFP score is a weighted sum of normalized MMPBSA, BintScore, iBintScore, and Chemgauss4 (terms described in more detail after Fig. 9). Larger IFP scores represent greater confidence that the predicted induced-fit conformation is accurate.

Image showing the top half of the IFP Floe report for this CDK2 example.

Figure 8: Top portion of the IFP Floe report for this CDK2 example.

Note

Additional guidance on the interpretation of the interactive images available in the IFP Floe report is available in the short trajectory molecular dynamics (STMD) tutorial , the associated Floe report shares many common features.

The bottom portion of the IFP Floe report for this CDK2 example is shown in Fig. 9. It comprises a table that quantifies the characteristics of the top-ranked 12 poses. (Note that the number of poses included in this table may be fewer than 12 if some results were sufficiently similar to cluster together).

Image showing the bottom half of the IFP Floe report for this CDK2 example.

Figure 9: Bottom portion of the IFP Floe report for this CDK2 example.

The table in Fig. 9 shows:

  • The IFP score and its component metrics:

    • Molecular mechanics Poisson-Boltzmann surface area (MMPBSA) energy (in kcal/mol)

    • BintScore, which is OpenEye’s knowledge-based scoring function for scoring protein-ligand interactions

    • iBintScore, which is the BintScore relative to the BintScore interactions of the initial pose

    • Chemgauss4 score

  • The final pose RMSD to the initial pose (in Å), thereby quantifying the conformational change in the ligand that occurred during the induced-fit components of the procedure

  • Active site sidechain RMSD to the input reference receptor (in Å)

  • Active site sidechain torsion fingerprint deviation (TFD) to the input reference receptor (in Å)

Note

The main IFP Floe report shown in Figs. 8 and 9 can be viewed directly as an interactive HTML file here. However, the per-ligand Floe reports usually linked in the lower table’s first column, titled “Rank (Link)”, will not work in this version as those files are not provided with this tutorial.

In addition to the main Floe report (Figs. 8 and 9), each pose listed in the main Floe report table (Fig. 9) is associated with its own pose-specific Floe report. These reports can be accessed by left-clicking on the first column in the IFP full summary table (Fig. 9). A sample pose-specific Floe report is depicted in Figs. 10-12.

Image showing the top section of a pose-specific IFP Floe report for this CDK2 example.

Figure 10: Top portion of a pose-specific IFP Floe report for this CDK2 example.

Image showing the middle section of a pose-specific IFP Floe report for this CDK2 example.

Figure 11: Middle portion of a pose-specific IFP Floe report for this CDK2 example.

Image showing the bottom section of a pose-specific IFP Floe report for this CDK2 example.

Figure 12: Bottom portion of a pose-specific IFP Floe report for this CDK2 example.

Note

The per-pose Floe report shown in Figs. 10-12 can be viewed directly as an interactive HTML file here.

Step 5: Inspect and visually assess / re-rank the poses

One of IFP’s design goals is to increase the probability that an accurate pose is included in the top-ranked 12 poses. In order to realize this goal, IFP uses a variety of docking methods despite the likelihood that some of these methods will return poses of poor accuracy for some protein-ligand combinations. Because the IFP score is imperfect, we recommend that users visually inspect and conduct a human-guided re-ranking of the top 5 ligands based on the IFP score. To some extent, this visual inspection can be conducted in the Floe report. However, in this tutorial, we will use the 3D viewer of Orion’s UI.

Image showing the Orion button to hide active datasets.

Figure 13: Orion’s dataset management button.

To simplify your 3D viewing experience, you may want to hide or deselect any other active datasets via the tool to add, hide, & view currently loaded datasets that is available at the top left of the Orion UI (Fig. 13).

Add the Output for Induced-Fit Posing (Confined) [IFP] dataset (the Output Dataset of the IFP Floe) to the active data by left clicking on its associated icon of a white plus sign inside a light blue circle. Before we proceed to the 3D viewer, we will also select the Reference Receptor Input Dataset you downloaded in Step 1. That selection is done analogously using the icon of a white plus sign inside a light blue circle to activate the data from spruce_dataset_2c6i_ref_1jsv.

Additionally, we will upload and activate the target receptor dataset:

The dataset has been prepared by running SPRUCE - Protein Preparation Floe with the parameters: PDB code(s) to download: 2BTS, Ligand name(s): U32. (The dataset has been aligned by 1JSV, with the parameter Reference Structure Inputs/Optional PDB Code for reference DU: 1JSV.)

Once these three datasets have been selected (and ideally your Orion Active Datasets menu [Fig. 13] indicates that 3 datasets are active), left click on the 3D icon on the vertical menu that occupies the left hand side of the Orion UI. This will take you to Orion’s 3D viewer, preloaded with the top 12 (or slightly fewer) IFP output poses, the input pose of the template ligand (from spruce_dataset_2c6i_ref_1jsv), and the crystallographic pose of the hit ligand (spruce_dataset_2bts_ref_1jsv).

Note

Additional guidance on using Orion’s 3D viewer is available in the 3D modeling documentation. Additional information is available via a set of videos.

IFP predictions of PNU-230032 binding to CDK2 are shown in Fig. 14. In this figure:

  • Part A depicts the reference receptor in the crystallographic conformation it adopts when bound to the reference ligand (the triazolopyrimidine inhibitor 4-{[5-(cyclohexylmethoxy)[1,2,4]triazolo[1,5-a]pyrimidin-7-yl]amino}benzenesulfonamide 6).

  • Part B depicts the target receptor bound to the hit ligand (PNU-230032). This represents the crystallographic pose that we take as the correct answer for PNU-230032 binding to CDK2. Note that the reference and target receptors are both CDK2; we use this terminology to distinguish conformations of the receptor and the identity and conformation of its bound ligand.

  • Parts C-E depict IFP output poses ranked #1-#3, respectively. Note the similarity between poses ranked #1 and #2, and their divergence from the pose ranked #3.

  • Part F overlays part B (the correct answer) and part C (the top-ranked IFP pose).

Three-dimensional conformations of IFP results for PNU-230032 binding to CDK2.

Figure 14: 3D IFP predictions of PNU-230032 binding to CDK2.

Warning

In this example, IFP-generated poses ranked #1 and #2 represent accurate predictions. However, the appearance of the most accurate poses at the top of the IFP-score ranking is, unfortunately, not always the outcome of an IFP Floe. Therefore, we encourage the user to visually inspect at least the top 5 poses, and to consider re-ranking them via expert human intuition or other computational tools such as additional STMD or conformation-specific nonequilibrium switching (NES). Please note that we are still in the process of quantifying the comparative value of such re-ranking approaches.

This CDK2 example provides insight into some of the benefits of IFP over rigid docking (Fig. 15). Fig. 15A shows the Reference receptor input dataset, in which protein residue K33 adopts a position near the bound ligand. Conversely, Fig. 15B shows that K33 rotates away from the ligand in the crystal structure with PNU-230032. The IFP result of the top-ranked pose is shown in Fig. 15C. Here, protein residue K33 has rearranged from its starting position (Fig. 15A) to permit PNU-230032 binding (Fig. 15C), though this rearrangement is less extensive than observed crystallographically (Fig. 15B). Figs. 15D-F are provided to add more perspective, but are based on the same coordinates used to generate Figs. 15A-C.

6-panel image showing the ligand dataset, target dataset, and IFP pose 1. Images highlight the relative position of of lysine 33 from two different viewpoints for each case.

Figure 15: Movement of lysine 33 permits PNU-230032 binding to CDK2.

Step 6: Quantitative comparison to the known binding pose (when available)

After running the Induced-Fit Posing (Confined) Floe, it is possible to regenerate the Floe report while augmenting it with quantitative comparisons to a known correct answer. For this purpose, we will use the Compare IFP Results to Target Receptor Floe.

Find and launch the Compare IFP Results to Target Receptor Floe in the Orion UI (Fig. 16).

Image of the Orion user interface for launching the Compare IFP Results to Target Receptor Floe.

Figure 16: Launching the Compare IFP Results to Target Receptor Floe.

After selecting “launch Floe”, you will be presented with a job form on which inputs, outputs, and parameters may be defined (Fig. 17). Although many options can be specified or varied, only two are required:

  • The IFP Output Dataset, which is the output of the Induced-Fit Posing (Confined) Floe we just ran. The default output dataset name is Output for Induced-Fit Posing (Confined) [IFP]

  • The Target Receptor Input Dataset for comparison, which describes the protein bound to the hit ligand and defines what will be interpreted to be the correct result. This will usually be a dataset generated by SPRUCE based on a crystal structure of the receptor-hit ligand complex. For this example, we will use the spruce_dataset_2bts_ref_1jsv dataset, which you uploaded in a previous step.

Warning

When making comparisons to a known correct answer via an IFP Floe, it is imperative that the order in which ligand atoms are defined is conserved between the Ligand Input Dataset and the dataset that will be used as the correct answer as defined by the Target Receptor Input Dataset for comparison. If care is not taken to ensure conserved atomic ordering between these two datasets, quantitative comparisons may be incorrect.

When you are satisfied with the Compare IFP Results to Target Receptor Floe setup, press the green ‘Start Job’ button at the lower right.

Image of Compare IFP Results to Target Receptor options selection page in the Orion user interface.

Figure 17: Defining Compare IFP Results to Target Receptor Floe options.

When completed, this Floe will generate another Floe Report. Unless you renamed it in the Floe inputs (non-promoted parameter Floe report title in the Generate Ligand Floe Report Cube), this collection will have the same default name as the original Floe report (IFP Summary), except that there will be a ” 1” added to the name by Orion to avoid duplication. This Floe report is broadly similar to the original Floe report (Figs. 8-12), except that the table at the lower portion of the main summary report will have 3 additional columns:

  • Pose RMSD to target pose, which quantifies the accuracy of the ligand pose, given the defined (possibly crystallographic) “correct answer”.

  • AS SC RMSD to target receptor, which quantifies the accuracy of the reorganization of the protein’s active site (AS) sidechains (SC) upon ligand binding, given the defined (possibly crystallographic) “correct answer”.

  • AS SC TFD to target receptor, which is similar to the previous metric, but uses a torsion angle fingerprint deviation (TFD) in place of a SC RMSD.

This table, which includes comparisons to the target pose and receptor, is depicted below in Fig. 18.

Image showing the bottom half of the IFP Floe report for this CDK2 example; generated after running the Compare IFP Results to Target Receptor Floe.

Figure 18: Bottom portion of the IFP Floe report for this CDK2 example; generated after running the Compare IFP Results to Target Receptor Floe.

Consistent with the analysis up to this point, the third-last column in Fig. 18 (‘Pose RMSD to target pose’) reveals that poses ranked #1 and #2 are highly consistent with the provided ‘correct answer’, whereas pose rank #3 is not. In this example, poses ranked #1 and #2 are accurate (Figs. 14 and 18) and poses ranked #3 through #12 are inaccurate (Fig. 18). However, the success of IFP for this pose/receptor pair across multiple testing runs is the reason it was selected as the first worked example for this tutorial and IFP does not always work this well.

Warning

It is possible/expected for some high-accuracy poses to receive lower IFP scores than some low-accuracy poses. It is also possible for the full top-12 pose list to lack any high-accuracy poses.

A more challenging application of IFP, in which the most accurate poses receive lower IFP scores than some low-accuracy poses is provided as worked example #2 below.

Note

In this tutorial, we used two separate Floes. First, we used the Induced-Fit Posing (Confined) Floe to generate poses, followed by the Compare IFP Results to Target Receptor Floe, which compared those poses to the ‘correct answer’ as defined by a crystal structure of the hit ligand bound to the CDK2 receptor. However, it is possible to combine these steps. If a Target Receptor Input Dataset for retrospective study is defined in the Induced-Fit Posing (Confined) Floe (i.e., by using the field shown in Fig. 19), then the table at the bottom portion of the initial Floe report will contain the additional 3 columns comparing each pose to the target receptor (as in Fig. 18).

Image showing optional inputs to the Induced-Fit Posing (Confined) Floe.

Figure 19: Optional inputs to the Induced-Fit Posing (Confined) Floe can define the Target Receptor Input Dataset for retrospective study.

Step 7: Check that your Floe inputs meet these important requirements

  • All IFP input datasets must have a single record. Any datasets with multiple records must be converted to single-record datasets prior to their use in an IFP Floe. To do this:

    • Open the dataset in a tile view by navigating to the dataset in the Orion data viewer and left-clicking on the four blue squares that sit beside it.

    • (Optional) If desired, sort the records. For example, if the input dataset is from a SPRUCE protein preparation Floe, click on the header “Columns” and select the “ird_single”, which is the Iridium score. Then click on the blue ‘Apply’ button. If it is available, this will label each record as either highly trustworthy (HT), mildly trustworthy (MT), not trustworthy (NT), or not applicable (NA). For more information on the Iridium score, see the OpenEye documentation or the associated publication 7

    • Left click on the record you want to use, then left click on the blue button reading ‘Save Selected Records’. Select ‘New Dataset’ as the destination, enter a name for the new dataset, and define its directory path.

    • Verify that the new dataset exists and that it has only one record.

    • The new dataset can now be used as an input to an IFP Floe.

  • The Reference Receptor Input Dataset and the Target Receptor Input Dataset for retrospective study must be aligned. If they are not aligned, then RMSD scores related to pose accuracy (e.g., in Fig. 18) will not be correct. One convenient way to ensure similar alignment is to provide the same reference DU dataset to all SPRUCE Floes. For example, when using the SPRUCE - Protein Preparation Floe, it may be convenient to specify the same Optional PDB Code for reference DU across all SPRUCE runs as you prepare input datasets for IFP.

  • The hit ligand and the target ligand (bound ligand of the target receptor) must have atoms defined in the same order. They must also represent the same tautomeric state.

Worked example #2: phosphodiesterase type 5 (PDE5)

Steps 1-7 below go through setup, Orion submission, and data analysis for a PDE5 system. This mirrors the general procedures outlined for CDK2 in example #1, above. The information provided below is substantially less verbose than the analogous descriptions in the first example.

The estimated cost of the IFP Floe run for the example #2 is around $52.

Step 1: Prepare IFP inputs

This example utilizes PDE5. As the reference receptor, we select PDB ID: 2CHM 8, 9, which has a bound N2 substituted pyrazolo pyrimidinone 10 (Fig. 20A). As the hit ligand, we select the PDE inhibitor Vardenafil (Fig. 20B). For convenience, we collect this ligand from its PDE5-bound pose in PDB ID: 1XP0 11, 12.

Two-dimensional image of the reference and hit ligands used in worked example #2.

Figure 20: (A) Reference ligand pyrazolo pyrimidinone and (B) hit ligand Vardenafil.

The Reference Receptor Input Dataset used in this tutorial is:

The input has been prepared by running SPRUCE - Protein Preparation Floe with the parameters: PDB code(s) to download: 2CHM, Ligand name(s): 3P4. (The dataset has been aligned by 1XP0, with the parameter Reference Structure Inputs/Optional PDB Code for reference DU: 1XP0.)

The Ligand Input Dataset used in this tutorial is:

The ligand input has been taken from the Spruced 1XP0 DU dataset. After automatic conversion by Orion, both datasets should already have only one record.

Warning

Proper protomer and tautomer states of the hit must be provided. The SMILES of the hit ligand, Vardenafil, as provided by the RCSB Protein Data Bank is CCCc1nc(c2n1NC(=NC2=O)c3cc(ccc3OCC)S(=O)(=O)N4CCN(CC4)CC)C 13. The protomer and tautomer state of the hit ligand was taken account during Spruce preparation of PDB ID:1XP0, and the SMILES of the prepared hit ligand for this tutorial is CCCc1nc(c2n1nc([nH]c2=O)c3cc(ccc3OCC)S(=O)(=O)N4CC[NH+](CC4)CC)C (Fig. 20B).

Step 2: Launch the IFP Floe

Find and launch the Induced-Fit Posing (Confined) Floe in the Orion UI (Fig. 5). After selecting “launch Floe”, you will be presented with a job form on which inputs, outputs, and parameters may be defined (Fig. 6). As before, we will use the datasets that Orion automatically generated upon uploading the .oedu and .oeb files in Step 1. Although many options can be specified or varied, only two are required:

  • Ligand Input Dataset: Vardenafil

  • Reference Receptor Input Dataset: spruce_dataset_2chm_ref_1xp0

When you are satisfied with the IFP Floe setup, press the green ‘Start Job’ button at the lower right.

Step 3: Verify that the IFP Floe completed successfully

If the Floe was configured to do so, you will receive an email when the job finishes. This email will indicate the status of the job. If the job was successful, you should receive an email that reads: Job “Induced-Fit Posing (Confined) [IFP]”, has completed successfully. Additional details on verifying that the IFP Floe completed successfully can be found in Example #1 Step 3.

Note

As you submit more jobs, you may find it easier to manage these jobs if you change the name of the Orion job via the Name field at the top of the job form depicted in Fig. 6.

Step 4: Inspect the Floe report

Open the Floe report, either by navigating to the appropriate Data directory and left-clicking on the light blue image of a page that sits beside the IFP Summary collection in the data listing, or by using the appropriate link in the Job summary.

Note

The main IFP Floe report for this PDE5 IFP run can be viewed directly as an interactive HTML file here. As in Example #1, the per-ligand Floe reports usually linked in the lower table’s first column, titled “Rank (Link)”, will not work in this version as those files are not provided with this tutorial.

The associated per-pose Floe report for pose rank #1 can be viewed directly as an interactive HTML file here.

Step 5: Inspect and visually assess / re-rank the poses

To facilitate analyses, we will upload the target receptor dataset:

The dataset has been prepared by running SPRUCE - Protein Preparation Floe with the parameters: PDB code(s) to download: 1XP0, Ligand name(s): VDN.

As before, to simplify your 3D viewing experience, you may want to hide or deselect any other active datasets via the tool to add, hide, & view currently loaded datasets that is available at the top left of the Orion UI (Fig. 13).

Follow the instructions outlined in Example #1 Step 5 to use the Orion 3D viewer to display the Output for Induced-Fit Posing (Confined) [IFP] dataset (the Output Dataset of the IFP Floe), the Reference Receptor Input Dataset you downloaded in Step 1, and the target receptor dataset that was downloaded at the outset of this step.

IFP predictions of Vardenafil binding to PDE5 are shown in Fig. 21. In this figure:

  • Part A depicts the reference receptor in the crystallographic conformation it adopts when bound to the reference ligand.

  • Part B depicts the target receptor bound to the hit ligand (Vardenafil). This represents the crystallographic pose that we take as the correct answer for Vardenafil binding to PDE5.

  • Parts C-F depict IFP output poses ranked #1-#4, respectively. Note the dissimilarity among these 4 top-ranked poses.

Three-dimensional conformations of IFP results for Vardenafil binding to PDE5.

Figure 21: 3D IFP predictions of Vardenafil binding to PDE5.

Fig. 21 parts C-F display the top 4 ranked poses from IFP. They are all dissimilar to one another and only the pose ranked #4 is accurate. Importantly, poses ranked #1-#3 all look reasonable and have seemingly favorable interactions with PDE5. Moreover, pose #1 (Fig. 21C) has a common substructure that is remarkably similar to the pose of the reference ligand (Fig. 21A). It is currently unclear how likely it would be for an expert to select pose #4 instead of a top-3 pose.

Note

The flipped binding mode of the reference ligand compared to other similar ligands, and the effect of this alternative binding mode on the sidechain conformation of a proximal leucine (residue 804), is discussed by Allerton et al. (2006). Moreover, ligand binding mode flipping for Vardenafil and its analog Sildenafil among PDE isoforms 5A and 4B is discussed in Card et al. (2004). These features may be relevant to the aforementioned differences in the binding modes of the common substructure that can be observed when comparing Fig. 21A to Figs. 21B and 21F.

Step 6: Quantitative comparison to the known binding pose (when available)

As in the first example, we will now regenerate the Floe report while augmenting it with quantitative comparisons to a known correct answer via the Compare IFP Results to Target Receptor Floe.

Find and launch the Compare IFP Results to Target Receptor Floe in the Orion UI (Fig. 16).

After selecting “launch Floe”, you will be presented with a job form on which inputs, outputs, and parameters may be defined (Fig. 17). Although many options can be specified or varied, only two are required:

  • The IFP Output Dataset, which is the output of the Induced-Fit Posing (Confined) Floe we just ran. The default output dataset name is Output for Induced-Fit Posing (Confined) [IFP]

  • The Target Receptor Input Dataset for comparison. For this example, we will use the spruce_dataset_1xp0_ref_1xp0 dataset, which you uploaded in a previous step.

When you are satisfied with the Compare IFP Results to Target Receptor Floe setup, press the green ‘Start Job’ button at the lower right.

The table at the bottom of the main Floe report for this comparison is shown below in Fig. 22.

Image showing the bottom half of the IFP Floe report for this PDE5 example; generated after running the Compare IFP Results to Target Receptor Floe.

Figure 22: Bottom portion of the IFP Floe report for this PDE5 example; generated after running the Compare IFP Results to Target Receptor Floe.

The third-last column in Fig. 22 (‘Pose RMSD to target pose’) reveals that poses ranked #1, #2, and #3 are inconsistent with the crystallographic pose of this ligand bound to PDE5. However, poses ranked #4, #5, and #8 are consistent with this ‘correct answer’, having RMSD’s < 2 Å. This type of outcome is not uncommon for this implementation of IFP. Accurate poses are frequently included in the top-ranked 12 poses (frequently even in the top-ranked 5 poses), but these accurate poses are not always assigned the highest IFP scores.

Note

An exercise left to the reader is to more deeply assess the poses via Orion’s 3D viewer and estimate whether you would be capable in this case of discriminating the accurate poses from the decoy poses (using Fig. 22 for reference).

Step 7: Check that your Floe inputs meet these important requirements

See Example #1 Step 7.

Footnotes

1

https://www.rcsb.org/structure/2c6i

2

Richardson et al. (2006) “Triazolo[1,5-a]pyrimidines as novel CDK2 inhibitors: Protein structure-guided design and SAR”, Bioorganic & Medicinal Chemistry Letters, 16(5): 1353-1357

3

https://www.rcsb.org/structure/2bts

4

Vulpetti et al. (2006) “Structure-Based Drug Design to the Discovery of New 2-Aminothiazole Cdk2 Inhibitors”, Journal of Molecular Graphics and Modelling 24(5): 341

5

https://www.rcsb.org/ligand/U32

6

https://www.rcsb.org/ligand/DT1

7

Warren et al. (2012) “Essential considerations for using protein–ligand structures in drug discovery”, Drug Discovery Today, 17: 1270–1281

8

https://www.rcsb.org/structure/2chm

9

Allerton et al. (2006) “A Novel Series of Potent and Selective PDE5 Inhibitors with Potential for High and Dose-Independent Oral Bioavailability”, Journal of Medicinal Chemistry, 49(12): 3581–3594

10

https://www.rcsb.org/ligand/3P4

11

https://www.rcsb.org/structure/1xp0

12

Card et al. (2004) “Structural Basis for the Activity of Drugs that Inhibit Phosphodiesterases”, Structure, 12(12): 2233-2247

13

https://www.rcsb.org/ligand/VDN