Schrödinger Notes—Ligand-based Pharmacophore Modeling

Declaration

This note is based on the article, “Ligand-Based Virtual Screening Using Phase”1, and created with the Schrödinger Software Release 2023-4.

This note contains only minimal annotations to the original text, along with corrections to formatting errors. It is intended for educational and communicative purposes only, and all rights remain with the original author.

Introduction

This tutorial demonstrates the creation of pharmacophore hypotheses from both congeneric and diverse ligands sets. You will learn how to create a pharmacophore hypothesis from a congeneric set of ligands with known experimental binding affinity. Additionally, you will learn how to create a Phase Database from a set of ligands, and use it to both prepare and filter a ligand library for future ligand screens or docking. Lastly, you will screen a Phase Database against a set of hypotheses, generate a pharmacophore hypothesis from a diverse ligand set, and visualize the binding modes.

1 Pharmacophore Modeling Prerequisites

Ligand files can be sourced from numerous places, such as vendors or databases, often in the form of 1D or 2D structures with unstandardized chemistry. LigPrep can convert ligand files to 3D structures, with the chemistry properly standardized and extrapolated, ready for use in virtual screening.

In this tutorial, the ligands have already been prepared in order to save time. Please see the Introduction to Structure Preparation and Visualization tutorial for instructions on using LigPrep.

2 Creating Projects and Importing Structures

At the start of the session, change the file path to your chosen Working Directory in Maestro to make file navigation easier. Each session in Maestro begins with a default Scratch Project, which is not saved. A Maestro project stores all your data and has a .prj extension. A project may contain numerous entries corresponding to imported structures, as well as the output of modeling-related tasks. Once a project is created, the project is automatically saved each time a change is made.

Structures can be imported from the PDB directly, or from your Working Directory using File > Import Structures, and are automatically added to the Entry List and Project Table. The Entry List is located to the left of the Workspace. The Project Table can be accessed by Ctrl+T (Cmd+T) or Window > Project Table to see an expanded view of your project data.

  1. Double-click the Maestro icon

Figure 2-1. Change Working Directory option.

  1. Go to File > Change Working Directory

  2. Find your directory, and click Choose

  3. Pre-generated input and results files are included for running jobs or examining output. Download the zip file here: https://www.schrodinger.com/sites/default/files/s3/release/current/Tutorials/zip/lbvs_phase_ligand.zip

  4. After downloading the zip file, unzip the contents in your Working Directorythe location that files are saved for ease of access throughout the tutorial

Please note that calculations may have been run with an earlier version of the software and results may not exactly match with those you produce in this tutorial.

Figure 2-2. Save Project panel.

  1. Go to File > Open Project > Phase_tutorial.prjzip > Open

  2. In Save Scratch Project, click OK

  3. Go to File > Save Project As

  4. Change the File name to Phase_ligand_tutorial

  5. Click Save

3 Developing Hypotheses from a Congeneric Ligand Series

In this section, we will generate a pharmacophore hypothesis from a congeneric (a series of ligands that are structurally related, such as sharing a common core) series of ligands with experimental binding affinities.

3.1 Creating hypotheses

Figure 3-1. Include at1 entries.

  1. Select and expand the group at1

  2. Right-click at1 and choose Include

  3. In the Warning panel, click Continue

  • Ligands are displayed in the Workspace

  • Ligands are not aligned

Ligand Alignment is very important for the generation of pharmacophores from a ligand series. While Phase can (and will as a default) automatically find the best alignment and common features, it is at times beneficial to manually pre-align your ligands using the Flexible Ligand Alignment tool before importing the ligands into Phase. That can be useful when trying to align to a known bioactive conformation.

Figure 3-2. Defining actives and inactives for multiple ligands (selected entries).

  1. Go to Tasks > Browse > Ligand-Based Virtual Screening > Develop Pharmacophore Hypotheses

  2. Under Create pharmacophore model using, choose Multiple ligands (selected entries)

  3. For Actives/Inactives Split (selected entries), click Define

Figure 3-3. Define Actives and Inactives panel.

  1. For Available properties, choose pIC50-Exp

Note: This property contains experimental binding affinities converted to a free energy scale by pIC50 = -log(IC50). IC50 ≤ 50 nM affinity corresponds to pIC50 ≥ 7.3. The threshold for recognizing an inactive will be 10 µM or pIC50 ≤ 5.0.

  1. Set Active if activity is to >= 7.30

  2. Set Inactive if activity is to <= 5.00

  3. Click Apply and OK

You can also specify ligands that must match a hypothesis for the hypothesis to be returned by checking the Required Match column for the desired ligand

Figure 3-4. Features tab in the Hypothesis Settings panel.

  1. In the Develop Pharmacophore Model panel, under Pharmacophore method, click Hypothesis Settings

  2. In the Features tab, set Number of features in the hypothesis to 5 to 6

  • Specify the range in the number of features in the hypothesis. A larger range produces more hypotheses. Providing a range allows the optimum number to be determined in the search by the scoring. A setting of 4 to 6 is usually appropriate.
  1. Check Preferred minimum number of features and set to 5

  2. In the Minimum column, set (D) Donor to 1 and (N) Negative to 1

  3. For Feature presets, choose Make acceptor and negative equivalent

  4. The remaining settings can be left at their default values

  • Hypothesis should match at least N of M actives text box: Specify the percentage of the actives that should match the hypothesis. The number of hypotheses found decreases as you increase the percentage. As it is possible that actives bind in different modes, a specific hypothesis might not cover all of the actives. For example, a receptor might have a donor with two hydrogens, and one active binds through one hydrogen, another binds through the other hydrogen.

  • Hypothesis difference criterion text box: Specify the cutoff for determining when two hypotheses are considered to be the same. To determine whether two hypotheses of the same composition (e.g. ADHRR) are the same, they are aligned in every possible configuration. The maximum distance between any two corresponding sites is determined for each configuration. If the smallest value of the maximum distance in any configuration is less than the value specified in this text box, the hypotheses are considered to be the same (redundant). The hypothesis with the best survival score is kept and the other is discarded. If they are not redundant, the new hypothesis is inserted in the ranked list of hypotheses.

The Exclude features from hypothesis option allows you to mask out any features that you do not want to see in the Phase hypothesis. This can be used to mask features in a conserved core to force the common pharmacophore perception to focus on more variable regions of a series.

Figure 3-5. Excluded Volumes tab in the Hypothesis Settings panel.

  1. Click the Excluded Volumes tab

  2. Check Create excluded volume shell

  3. Under Create shell from, choose Actives and Inactives

Note: See Phase Help for more information on the creation of excluded volumes from inactives and actives

  1. Click Save

Figure 3-6. Change Job name.

  1. In Develop Pharmacophore Model, change the Job name to at1_dcph

  2. Click Run

  3. Close the Develop Pharmacophore Model panel

Note: To save time, we will look at pregenerated results

Figure 3-7. Project Table of Hypothesis DHNRRR_1 Actives expanded.

  1. Expand the group at1_dcph

Note: Hypotheses are named according to their features, e.g. APRR has an acceptor, a positive ionic, and two ring features

  1. Expand group DHNRRR_1

  2. Double-click the DHNRRR_1 In circle to fix the hypothesis in the Workspace

  3. Select the Active group

Figure 3-8. Active group and DHNRRR_1 Hypothesis.

  1. Use shift-click to include all ligands in the Active group
  • Active ligands have good feature alignment

  • Active ligands avoid excluded volumes

  1. Right-click Active group and choose Exclude

Figure 3-9. Inactive ligands and hypothesis DHNRRR_1.

  1. Select the Inactive Partial Match group

  2. Right-click the Inactive Partial Match group and choose include

  • Inactive ligands have poor feature alignment (e.g. D5)

  • Inactive ligands clash with excluded volumes

  1. Right-click Inactive Partial Match group and choose Exclude

  2. Double-click the DHNRRR_1 In circle to un-fix the hypothesis

3.2 Measuring the distance between features

Figure 3-10. Measure distance between pharmacophore features

  1. Include the DHNRRR_2 hypothesis in the Workspace

  2. Go to Tasks > Browse > Workspace Operations > Measure

Note: Measure is added to the Favorites Toolbar by default and can be easily accessed from there as well

  1. For Measure, choose Distance

  2. Select N8 and D5 in the Workspace

  • A line appears connecting the two features with a label noting the distance between them
  1. Click Ok to close the Measurement banner

  2. Right-click the background and choose Clear Workspace from the menu

Measurements can be tracked by hovering over the measurement icon in the Workspace Configuration Toolbar and clicking the …

4 Creating a Phase Database

In this section, we will create a Phase database for a collection of ligands. Screening compounds from a prepared Phase database is ~2-3 times faster than screening compounds from files or the Entry List. By default, compounds in Phase databases include low-energy ionization/tautomeric states and an ensemble of low-energy conformers. These compounds are ready for use in docking with Glide, shape screening with Shape, and pharmacophore querying with Phase. Compounds can be screened against one or more pharmacophore hypotheses from one or more Phase databases in one calculation. 

Figure 4-1. Input tab in the Create Phase Database panel.

  1. In the Entry List, select DUD-E_act_3chp_trainc

  2. Navigate to Tasks > Browse > Ligand-Based Virtual Screening > Create Database

  • The Create Phase Database panel opens
  1. In the Input tab, next to Choose task, pick Create new database

  2. Specify the database location on the job host by writing a file path

  • Please note that the file path must begin with a backslash “ / ”
  1. For Use ligands from, choose Project Table (selected entries)

  2. Under Conformation Generation, check Generate ligand conformers

DUD-E is the ‘Database of Useful Decoys: Enhanced.’ For each active in the database, there are 50 decoys with similar 1-D physio-chemical properties, but different 2-D topology. DUD-E ligand set can thus be used in pharmacophore screening to avoid artificial enrichment

Figure 4-2. Ligand Filtering tab in the Create Phase Database panel.

  1. Click the Ligand Filtering tab

  2. Under Filter Prepared Ligands, check the first three options

  3. Change the Job name to tutorial_database

  4. Click Run

Optional: Go to Tasks > Browse > Ligand-Based Virtual Screening > Manage Database to view ligands generated in a Phase Database

Note:  No licenses are checked out during the creation or modification of a Phase database, enabling the calculation to be distributed over as many CPUs as are available without running out of licenses

5 Screening a Phase Database against Pharmacophore Hypotheses

In this section, we will cover options for screening ligands from a Phase Database against hypotheses. We will then visualize the results and examine the PhaseScreenScore to evaluate how well the ligands align to the pharmacophore features of the hypothesis it was screened against.

5.1 Screen ligands from a Phase Database

Figure 5-1. Entry List inclusion.

  1. In the Entry List, Ctrl-click (Cmd-click) to Include the top hypotheses from the at1_dcph group, ADHNRR_1, DHNRRR_2, and DHNRRR_1

Figure 5-2. Open Phase Database panel.

  1. Go to Tasks > Browse > Ligand-Based Virtual Screening > Ligand Screening

  2. For Ligands to screen, choose File

  3. Click Browse

  4. Select the file tutorial_database.maegz

Figure 5-3. Add hypotheses to Phase Ligand Screen.

  1. Click Add Hypothesis

  2. Choose Workspace

  3. Right click on the Workspace > Clear Workspace

  4. Click Screening Settings

Figure 5-4. Add Constraints to Phase Ligand Screen

  1. Go to the Constraints tab

  2. Check Pick Features  

  3. Include only DHNRRR_1  

  4. Select Features R10 and R11 in the Workspace using ctrl/cmd-click

  5. Click Add Constraint

Note: The constraint value and tolerance can be edited manually

  1. Repeat steps 12-14 for DHNRRR_2 and AHNRRR_1

  2. Click Save

Figure 5-5. Change Job name.

  1. Change the Job name to at1_dcph_screen

  2. Click Run

  3. Close the Phase Ligand Screening panel

Note: Although this job is fast, the results are already present in your project if you choose to use them instead of running the job yourself.

5.2 Visualize pharmacophore screening results 

Figure 5-6. Phase Database screening results visualized with the corresponding hypothesis.

  1. Expand the at1_dcph_screen group and the AHNRRR_1 group

  2. Double-click the In circle to fix the AHNRRR_1 hypothesis in the Workspace

  3. Select the AHNRRR_1 group by clicking on the header

Figure 5-7. Ligand screening results visualized with the corresponding hypothesis.

  1. Use right and left arrow keys to step through the ligands

Optional: Include the AHNRRR_1 group and double-click the Presets button to view the ligands with ball-and-stick styling  and unique coloring.

Figure 5-8. Add properties to the Entry List.

  1. At the top right corner of the Entry List, click the Settings button (cog)

  2. Choose Show Property

  3. Click Choose

  4. Type Phase

  5. Select PhaseScreenScore

  6. Click Choose

  7. Type Matched

  8. Select Matched Ligand Sites

  9. Click OK

Figure 5-9. Updated Entry List with Matched Ligand Sites and Phase Screen Score.

Note: The above properitis can be found at the Project Table panel by pressing Ctrl+T (⌘T).

Note: Hover over the Matched Ligand Sites circles to see what features are present

Note: Both the quantity and quality of feature matching is taken into account in the PhaseScreenScore. A ligand can thus both match fewer sites and have a higher PhaseScreenScore than another ligand in the screening set

6 Developing Pharmacophore Hypotheses from a Diverse Ligand Set

In this section, we will cover the process of developing a pharmacophore hypothesis. This will also show how to evaluate the quality of the generated hypotheses with PhaseHypoScore, and visualize binding modes.

6.1 Develop common pharmacophore hypotheses from a diverse ligand set

Figure 6-1. Group selection with the first entry included.

  1. Clear the Workspace

  2. Select the group DUD-E_act_3chp_trainc and drop down the list

  3. Include the first entry, CHEMBL349189

Note: Use Window > 2D Viewer to see a 2D tiled view of all selected ligands

Figure 6-2. Creating a pharmacophore model from Multiple ligands (selected entries).

  1. Go to Tasks> Ligand Based Virtual Screening> Develop Pharmacophore Hypothesis

  2. Under Create pharmacophore model using, choose Multiple ligands (selected entries)

Figure 6-3.  Find best alignment option.

  1. Under Pharmacophore method, choose Find best alignment and common features

  2. Click Hypothesis Settings

Figure 6-4. Hypothesis Settings Panel.

  1. In the Features tab, set Hypothesis should match at least 25% of actives
  • In this tutorial, we assume that ligands without IC50 values were obtained from experiments. While they are all active, they may bind to different targets. Therefore, we decreased the percentage to accommodate a wider range of targets.
  1. Set Number of features in the hypothesis to 4 to 7
  • Increase the range appropriately
  1. Check Preferred minimum number of features and set to 5
  • Set the value between 4 and 7
  1. Set Hypothesis difference criterion to 1.00
  • Increase the value to merge the similar hypotheses
  1. Click Save

Figure 6-5. Generate Conformer Options panel.

  1. Check Generate conformers

  2. Click Options

  3. Set Target number of conformers to 50

  4. Click Save

Figure 6-6. Change Job name.

  1. Change Job name to lkha4_common

  2. Click Run

Note: To save time, we will look at pregenerated results

6.2 View results from common pharmacophore hypotheses detection

Figure 6-7. Expand the lkha4_common group.

  1. Select lkha4_common and expand the group

  2. Double-click the blue H next to the entries to view each hypothesis

Figure 6-8. Show property option in the Entry List.

  1. At the top right corner of the Entry List, click the Settings button (cog)

  2. Choose Show Property

Figure 6-9. Show Property panel.

  1. Click Choose

  2. Type Hypo

  3. Select PhaseHypo Score

  4. Click OK

Note:  The PhaseHypoScore is a linear sum of how well active ligands match the hypothesis and the hypothesis performance in a small-scale virtual screen using the known actives and 1000 compound Glide decoy set.  Larger values of PhaseHypoScore indicate the hypothesis is more likely to perform well in virtual screening

Figure 6-10. Include the active ligands of top-ranked hypothesis (fixed) in the Workspace.

  1. Expand APRR_2

  2. Double-click the APRR_2 In circle to fix the hypothesis in the Workspace

  3. Select and expand the Active (36) group

  4. Include the first entry

  5. Use right and left arrow keys to step through the entries

6.3 Examine multiple binding modes in common pharmacophore results

Figure 6-11. Detect binding modes option for lkha4_common hypotheses.

  1. Collapse APRR_2 and select the lkha4_common group

  2. Next to lkha4_common, click the symbol

  3. Choose Analysis> Detect binding modes

  • The Detect Binding Modes panel opens

Figure 6-12. Detecting Binding Modes table with three proposed binding modes.

  1. Set Proposed number of binding modes to 3
  • Ligands and hypotheses are clustered into binding modes

7 Conclusions

In this tutorial, we learned how to create pharmacophore hypotheses from a set of congeneric ligands with known experimental binding affinities. Additionally, we created a new Phase database from a set of ligands. We then screened that Phase Database against three of the generated pharmacophore hypotheses, and examined their PhaseScreenScores and Matched Ligand Sites. Finally, we generated pharmacophore hypotheses from a diverse ligand set, evaluated them with PhaseHypoScore and visualized the binding modes.

References

[1] Ligand-Based Virtual Screening Using Phase (schrodinger.com)