HINT 2.30 Manual: Chapter 6S

LESSON 6: Using HINT with SYBYL 3D QSAR


This lesson demonstrates the use of HINT with the SYBYL QSAR module. It would be best for this lesson if there were no molecules or backgrounds from previous lessons currently active in SYBYL. If you are entering the HINT Tutorial at this point, follow the instructions in Step 1 of Lesson 1.

  1. Open a SYBYL Molecular Spreadsheet and Database

    From the File pulldown on the menubar select Molecular Spreadsheet and New.... The rows will represent Molecules. In the DATABASE_FILE dialog box, enter $TA_DEMO in the "Database containing molecules" text field and press Search Directory. This will allow us to select one of the already prepared molecular databases in the SYBYL Demo directory. Choose fisons.mdb for this lesson.

  2. Fill a Spreadsheet column with HINT LogP values

    After the spreadsheet is initialized and appears, select the AutoFill button on the speadsheet menubar. We are creating a new Column. From the list of New column types pick HINTLOGP. The Add Column (HintLogP) dialog box allows you to tailor the method for calculating LogP. For this set of small molecules, the Partition Method should be Calculate, the Hydrogen Treatment should be All, and the Polar Proximity should be Via Bond. Press OK and accept logP1 as the Column name. This operation will take a few minutes as Column 1 of the spreadsheet is AutoFilled with parameters.

    Molecular Spreadsheet filled with HINTLOGP values in column 1

  3. Fill a Spreadsheet column with the HINT hydropathic field

    Again from the spreadsheet menubar select the AutoFill button (and choose a new Column). This time select HINTCOMFA as the New column type. The Add Column (HintCoMFA) dialog box contains options to tailor the HINT field that will be entered into the QSAR table. For this first run, we will choose mostly the default settings: (Map Type = Molecule, Smoothing = None, Information = Hydrophobic/Polar, Partition Method = Calculate, Hydrogen Treatment = All, Polar Proximity = Via Bond, Distance Function Hydropathic Term = exp(-nr), Distance Function Steric Term = off, Inside Mol Cut Off = off, Van der Waals Limit = 1.0). The Region will be from Calculate Automatically... using the Calculate CoMFA Region Automatically dialog box, where all Spacings should be 2 Angstroms and all Margins should be 4 Angstroms. Use fisons.rgn as the CoMFA Region File name. Press OK to calculate the region and then press OK to the Add Column (HintCoMFA) dialog box and accept hint2 as the Column name. This AutoFill operation will take about 5-10 minutes.

  4. Run a PLS analysis on LogP as a function of the HINT field.

    Choose the columns for the PLS study: Use Select Cols, enter 1, 2 in the Expression text field, press Add and Done. From the QSAR pulldown, select Partial Least Squares... to call the Partial Least Squares Analysis dialog box. The Dependent Column is 1. Select Leave-1-Out Validation, 5 Components, CoMFA Std Scaling, and 0.01 kcal/mol Columns Filtering. This run will take about 10 minutes, so you may run it either interactively or in batch. If you run in batch, get a report on the results with QSAR, Report QSAR...; enter fisons1.lis as the File name to receive the QSAR report. To review the report, spawn or create a unix window and edit or list the report. If you run it interactively, be sure to choose Yes for Keep this analysis? In this run the optimum number of components is 2 and the cross-validated r^2 is 0.795.

  5. Review some of the HINT field optimization options

    The Add Column (HintCoMFA) dialog box provides a large number of options for optimizing the HINT field, much as the analogous CoMFA field dialog box does. Many of these options are only appropriate for certain data sets, e.g., it may be advisable to partition with the Dictionary method if the data set consists of peptides. If the HINT field is being combined with other fields, such as the CoMFA steric and/or electrostatic fields, Information = Hydrophobic only may yield better cross-validation statistics. Setting Smoothing to Box often improves a CoMFA model. Changing the grid spacing and other region definition parameters may improve a model, but usually at a significant cost in terms of speed. The other major form of field tuning in the Add Column (HintCoMFA) dialog box is associated with changing some of the field Cutoffs. The standard CoMFA practice is to set steric and electrostatic field values for grid points that are "inside" the molecular van der Waals surface to constant values. HINT simulates this technique with the Inside Mol Cut Off option and its associated parameters Hydrophobic and Polar. If the Inside Mol Cut Off is turned on and Polar is set at -2 and Hydrophobic is set at 1 for this data set, a model with a cross-validated r2 of 0.850 with 4 components can be derived. In order to repeat this result, however, it may be necessary to Save the Spreadsheet and restart SYBYL. There apparently is a SYBYL bug that prevents multiple External Field columns from being properly stored in a QSAR table in the same SYBYL session.

  6. Using the HINT field with the standard CoMFA fields

    The HINT field can be used in combination with the SYBYL steric and/or electrostatic fields for 2 or 3 field CoMFA studies. Note that the region must be the same, and that the SYBYL methodology for generating the region is much faster than the HINT algorithms because SYBYL does the calculation internally, while HINT must use an SPL script to collect the region information. Thus, add the SYBYL CoMFA column(s) to the table first, before the HINT column, and use as the HINT region definition the Preexisting region file generated by SYBYL for the CoMFA column(s).

    There is a graphical command in the HINT software to aid in graphing multifield CoMFA results. From the eslc pulldown on the main SYBYL menubar select the Hint, HintQSAR, Graph HintQSAR... command. This brings up the Retrieve HintQSAR dialog box that guides you through retrieving and graphing the CoMFA field contours. Choose which field types you wish to graph and their Columns. Important: This dialog does not work when there is only one CoMFA type column in the analysis.