Molecule Alignment

The following three code examples demonstrate how to align molecules either based on:

Molecule Alignment Based on MCS

The Figure: Example of depiction without alignment is generated by highlighting the maximum common substructure of two molecules (MCS), but keeping the original orientation of the compounds.

../_images/MCSNoAlignment.png

Example of depiction without alignment

The Listing 1 code example shows how to align these two molecules by their maximum common substructures (i.e. by the highlighted atoms and bonds in Figure: Example of depiction without alignment).

  • First, two molecules are initialized from SMILES string and prepared for depiction.

  • A OEMCSSearch object is initialized with the reference 2D molecule onto which the fit molecule will be aligned.

  • An OEImageGrid object is constructed that allows to render the two molecules in a grid next to each other.

  • The fit molecule is then aligned to the reference by calling the OEPrepareAlignedDepiction function that performs MCS search and then aligns the fit molecule to the reference based on the detected common substructure(s).

  • An OE2DMolDisplayOptions object is initialized that stores the properties determine how the molecules are rendered. In order to render the molecules in equal size the smallest depiction scaling factor is calculated.

  • If the molecule alignment was successful (i.e OEAlignmentResult.IsValid is true), then the correspondence between the two molecules stored in the OEAlignmentResult returned by the OEPrepareAlignedDepiction function. This OEAlignmentResult object can be used to highlight the matched common substructure by invoking the OEAddHighlighting function.

  • After both molecules are rendered into the separate cells of the grid layout, the image is written into a png file.

After initializing the two molecules and the OEMCSSearch object, the OEMCSSearch.Match method is called to perform the search and return an iterator over the identified maximum common substructures. The first match is then utilized to call the OEPrepareAlignedDepiction function that aligns one molecule to the other based on the given match. After the alignment, the common atoms and bonds are highlighted by invoking the OEAddHighlighting function. The image created by Listing 1 is shown in Figure: Example of depiction with alignment based on MCS.

Listing 1: Example of molecule alignment based on MCS

refmol = oechem.OEGraphMol()
oechem.OESmilesToMol(refmol, "c1cc(c2cc(cnc2c1)CCCO)C(=O)CCO")
oedepict.OEPrepareDepiction(refmol)

fitmol = oechem.OEGraphMol()
oechem.OESmilesToMol(fitmol, "c1cc2ccc(cc2c(c1)C(=O)O)CCO")
oedepict.OEPrepareDepiction(fitmol)

mcss = oechem.OEMCSSearch(oechem.OEMCSType_Approximate)
atomexpr = oechem.OEExprOpts_DefaultAtoms
bondexpr = oechem.OEExprOpts_DefaultBonds
mcss.Init(refmol, atomexpr, bondexpr)
mcss.SetMCSFunc(oechem.OEMCSMaxBondsCompleteCycles())

alignres = oedepict.OEPrepareAlignedDepiction(fitmol, mcss)

image = oedepict.OEImage(400, 200)

rows, cols = 1, 2
grid = oedepict.OEImageGrid(image, rows, cols)

opts = oedepict.OE2DMolDisplayOptions(grid.GetCellWidth(), grid.GetCellHeight(),
                                      oedepict.OEScale_AutoScale)
opts.SetTitleLocation(oedepict.OETitleLocation_Hidden)

refscale = oedepict.OEGetMoleculeScale(refmol, opts)
fitscale = oedepict.OEGetMoleculeScale(fitmol, opts)
opts.SetScale(min(refscale, fitscale))

refdisp = oedepict.OE2DMolDisplay(mcss.GetPattern(), opts)
fitdisp = oedepict.OE2DMolDisplay(fitmol, opts)

if alignres.IsValid():
    refabset = oechem.OEAtomBondSet(alignres.GetPatternAtoms(), alignres.GetPatternBonds())
    oedepict.OEAddHighlighting(refdisp, oechem.OEBlueTint,
                               oedepict.OEHighlightStyle_BallAndStick, refabset)

    fitabset = oechem.OEAtomBondSet(alignres.GetTargetAtoms(), alignres.GetTargetBonds())
    oedepict.OEAddHighlighting(fitdisp, oechem.OEBlueTint,
                               oedepict.OEHighlightStyle_BallAndStick, fitabset)

refcell = grid.GetCell(1, 1)
oedepict.OERenderMolecule(refcell, refdisp)

fitcell = grid.GetCell(1, 2)
oedepict.OERenderMolecule(fitcell, fitdisp)

oedepict.OEWriteImage("MCSAlign.png", image)
../_images/MCSAlign.png

Example of depiction with alignment based on maximum common substructure

See also

Molecule Alignment Based on Molecular Similarity

The Figure: Example of depiction without alignment is generated by keeping the original orientation of the compounds.

../_images/FPNoAlignment.png

Example of depiction without alignment

The Listing 3 code example shows how to align these molecules based on their molecular similarity.

  • First, two molecules are initialized from SMILES string and prepared for depiction.

  • An OEImageGrid object is constructed that allows to render the two molecules in a grid next to each other.

  • The fit molecule is then aligned to the reference. The OEGetFPOverlap function is utilized to return all common fragments found between two molecules based on a given fingerprint type. These common fragments reveal the similar parts of the two molecules being compared that are used by the OEPrepareMultiAlignedDepiction function to find the best alignment between the molecules. See more fingerprint types in the Fingerprint Types chapter of the GraphSim TK manual.

  • An OE2DMolDisplayOptions object is initialized that stores the properties determine how the molecules are rendered. In order to render the molecules in equal size the smallest depiction scaling factor is calculated.

  • Then both molecules are rendered into the separate cells of the grid layout, the image is written into a png file.

The image created by Listing 3 is shown in Figure: Example of depiction with alignment based on molecular similarity.

Note

GraphSim TK license in not required to run the Listing 3 example.

Listing 3: Example of molecule alignment based on molecular similarity

refmol = oechem.OEGraphMol()
oechem.OESmilesToMol(refmol, "C[C@H](C(=O)N1CCC[C@H]1C(=O)O)OC(=O)[C@@H](Cc2ccccc2)S")
oedepict.OEPrepareDepiction(refmol)

fitmol = oechem.OEGraphMol()
oechem.OESmilesToMol(fitmol, "C[C@H](C(=O)N1CCC[C@H]1C(=O)O)NC(=O)[C@@H](Cc2ccccn2)S")
oedepict.OEPrepareDepiction(fitmol)

image = oedepict.OEImage(500, 300)

rows, cols = 1, 2
grid = oedepict.OEImageGrid(image, rows, cols)

overlaps = oegraphsim.OEGetFPOverlap(refmol, fitmol,
                                     oegraphsim.OEGetFPType(oegraphsim.OEFPType_Tree))
oedepict.OEPrepareMultiAlignedDepiction(fitmol, refmol, overlaps)

opts = oedepict.OE2DMolDisplayOptions(grid.GetCellWidth(), grid.GetCellHeight(),
                                      oedepict.OEScale_AutoScale)
opts.SetTitleLocation(oedepict.OETitleLocation_Hidden)

refscale = oedepict.OEGetMoleculeScale(refmol, opts)
fitscale = oedepict.OEGetMoleculeScale(fitmol, opts)
opts.SetScale(min(refscale, fitscale))

refdisp = oedepict.OE2DMolDisplay(refmol, opts)
fitdisp = oedepict.OE2DMolDisplay(fitmol, opts)

refcell = grid.GetCell(1, 1)
oedepict.OERenderMolecule(refcell, refdisp)

fitcell = grid.GetCell(1, 2)
oedepict.OERenderMolecule(fitcell, fitdisp)

oedepict.OEWriteImage("FPAlign.png", image)
../_images/FPAlign.png

Example of depiction with alignment based on molecule similarity

See also

See also

../_images/OEPrepareMultiAlignedDepictionVisualize.png

The Python script that visualizes molecule similarity based on fingerprints can be downloaded from the OpenEye Python Cookbook