Gaussian QM Conformer Ensemble Tutorial
Running this Floe with the provided input molecule and the parameters specified in this tutorial will cost around $1.50. Navigate through the categories Product-based / Quantum Mechanics / Gaussian to find this Floe. Then, locate or search for “Gaussian QM Conformer Ensemble.”
This Floe generates conformers, performs a geometry optimization, and then a single point energy calculation. If the input molecule has coordinates, that conformer will be compared to the generated conformers.
The input for this Floe (Hydroxymethyl-benzoic-acid) can be found in the “QM Gaussian Floes” folder of the Tutorial Data on Orion.
Set the RMSD Threshold for Conformer Generation to 0.75 to make a less dense conformer ensemble for this tutorial. All other parameters are left at their defaults. This means the conformers will be optimized at HF/6-31G with all torsions constrained; for these constrained optimizations, a lower level of theory is sufficient. The energy will then be re-evaluated at B3LYP-GD3BJ/6-31G*. Finally, the conformers will be filtered based on the specified energy window (10 kcal/mol by default).
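The energy window filter described above can be sketched in a few lines. This is only an illustration of the idea, not the Floe's implementation: the function names and the energies are hypothetical, and it assumes the single point energies are available in Hartree.

```python
# Standard conversion factor from Hartree to kcal/mol.
HARTREE_TO_KCAL_PER_MOL = 627.5095


def relative_energies_kcal(energies_hartree):
    """Return each energy relative to the lowest one, in kcal/mol."""
    lowest = min(energies_hartree)
    return [(e - lowest) * HARTREE_TO_KCAL_PER_MOL for e in energies_hartree]


def filter_by_energy_window(energies_hartree, window_kcal=10.0):
    """Return the indices of conformers within the window of the minimum."""
    rel = relative_energies_kcal(energies_hartree)
    return [i for i, e in enumerate(rel) if e <= window_kcal]


# Hypothetical single point energies (Hartree) for four conformers.
energies = [-535.1200, -535.1150, -535.1000, -535.1190]
kept = filter_by_energy_window(energies)
print(kept)  # conformer 2 lies about 12.6 kcal/mol above the minimum and is dropped
```

With the default 10 kcal/mol window, only the conformer about 12.6 kcal/mol above the minimum is removed; widening the window to 15 kcal/mol would keep all four.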
Changing the RMSD Threshold will significantly affect the number of conformers generated (and therefore the number of QM calculations to be performed). The parameter Maximum Conformers for Geometry Optimization will stop the Floe if too many conformers are generated. A good way to test the RMSD Threshold is to set this limit to 1 and run the Floe to see how many conformers are generated. Whenever the number of generated conformers exceeds this cutoff, one optimization is still performed, which serves as a benchmark calculation for the Floe.
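To see why the RMSD Threshold changes the size of the ensemble, consider a simple greedy pruning over a pairwise RMSD matrix. This is a sketch under stated assumptions: the conformer generator used by the Floe may prune differently, and the function name and the RMSD values below are hypothetical.

```python
def prune_by_rmsd(rmsd, threshold):
    """Greedy pruning: keep a conformer only if its RMSD to every
    conformer already kept is larger than `threshold`."""
    kept = []
    for i in range(len(rmsd)):
        if all(rmsd[i][j] > threshold for j in kept):
            kept.append(i)
    return kept


# Hypothetical symmetric pairwise RMSD matrix for four conformers.
rmsd = [
    [0.0, 0.6, 0.9, 1.4],
    [0.6, 0.0, 0.7, 1.2],
    [0.9, 0.7, 0.0, 0.8],
    [1.4, 1.2, 0.8, 0.0],
]

print(len(prune_by_rmsd(rmsd, 0.5)))   # 4 conformers survive
print(len(prune_by_rmsd(rmsd, 0.75)))  # 3 conformers survive
```

Raising the threshold from 0.5 to 0.75 removes conformer 1, which is within 0.6 of conformer 0. In a real ensemble the same effect can change the number of QM calculations substantially.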
Navigate to the Jobs tab and select this “Gaussian QM Conformer Ensemble” job. When the Floe is complete, this should take you to a page like the one below:
In all Gaussian Floes there are many serial cubes. See this discussion in the FAQ for more details.
This Floe generates 3 output datasets and 2 Collections (Floe Reports):
gau_confs_gopt_spe: This is the final result and contains the conformer ensemble after energy filtering.
gau_all_conf_gopt: This is an intermediate result with all completed geometry optimizations, but no single point energies or filtering performed.
gau_conf_ens_failures: This dataset saves any failed records.
Failure Report - Gaussian Conformer Ensemble: This Floe Report summarizes why a record failed (see below).
Gaussian Conformer Report: This Floe Report summarizes the conformer ensemble results.
Navigate to the Floe Report tab and open the “Failure Report - Gaussian Conformer Ensemble” to see why each record failed.
Conformer ensembles always produce some records in the failure dataset, so this is not a cause for alarm. There are two main sources of failures: (1) conformers are deduplicated before and after optimization, and the duplicates are sent to the failures; (2) conformers higher in energy than the specified cutoff are also sent to the failures.
Next, navigate to the “Gaussian Conformer Report” to see a summary of the results.
This report summarizes the results of the conformer ensemble calculation. First is a table showing the relative energy of the input conformer. In this case, the input conformer has a relative energy around 8 kcal/mol at both levels of theory (the one used for the optimization and the one used for the single point energy calculations).
Next are histograms showing the number of conformers at each relative energy. These energy windows are binned automatically. In this example, each bin is 2 kcal/mol wide.
In this example, there was only one input molecule. Multiple molecules can be used as input to this Floe, but they will all be included in the same relative energy calculations. This means it is usually best to limit multiple molecule inputs to different tautomers or stereoisomers of the same molecule. When there are multiple inputs the histograms will be colored by molecule.
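The effect of sharing one reference energy across several inputs can be illustrated as follows. This is a hypothetical sketch, not the Floe's code: the record layout, the tautomer names, and the energies are invented for illustration. Absolute QM energies are only comparable between species with the same atoms, which is why tautomers or stereoisomers are sensible inputs.

```python
# Standard conversion factor from Hartree to kcal/mol.
HARTREE_TO_KCAL_PER_MOL = 627.5095


def shared_relative_energies(records):
    """records: list of (molecule_name, energy_hartree) tuples.
    All energies are taken relative to the single lowest energy overall."""
    lowest = min(energy for _, energy in records)
    return [
        (name, (energy - lowest) * HARTREE_TO_KCAL_PER_MOL)
        for name, energy in records
    ]


# Hypothetical conformer energies for two tautomers of the same molecule.
records = [
    ("tautomer_A", -535.120),
    ("tautomer_A", -535.118),
    ("tautomer_B", -535.110),
    ("tautomer_B", -535.109),
]

for name, rel in shared_relative_energies(records):
    print(f"{name}: {rel:.2f} kcal/mol")
```

In this made-up data the best conformer of tautomer_B sits about 6.3 kcal/mol above the global minimum rather than at zero, so a narrow energy window could remove every conformer of the less stable input.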
Histograms are not useful for small datasets; therefore, if the conformer ensemble is very small (fewer than 10 conformers), a table of all energies will be shown instead.
The data from this Floe can be further analyzed using the Analyze page or 3D page in Orion. Navigate to the Data tab and find the output from this Floe. The “Show in Project Data” button on the Jobs page is a convenient shortcut.
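The choice between a histogram and a table can be sketched as below. This is an illustration only: the report chooses its bins automatically, whereas this sketch assumes a fixed 2 kcal/mol bin width, and the function name and return layout are hypothetical.

```python
from collections import Counter


def summarize_energies(rel_energies, bin_width=2.0, min_for_histogram=10):
    """Return a table for small ensembles, otherwise histogram bin counts.
    Bins are half-open intervals [low, high) in kcal/mol."""
    if len(rel_energies) < min_for_histogram:
        return {"kind": "table", "rows": sorted(rel_energies)}
    counts = Counter(int(e // bin_width) for e in rel_energies)
    bins = {
        (k * bin_width, (k + 1) * bin_width): n
        for k, n in sorted(counts.items())
    }
    return {"kind": "histogram", "bins": bins}


# Hypothetical relative energies in kcal/mol.
small = [8.0, 0.0, 1.6]
large = [0.0, 0.5, 1.6, 2.1, 3.9, 4.0, 5.5, 6.2, 7.9, 8.0, 9.9]

print(summarize_energies(small)["kind"])  # table
print(summarize_energies(large)["bins"])
```

For the eleven energies in `large`, the first bin (0 to 2 kcal/mol) holds three conformers and each of the remaining four bins holds two.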
Activate the “gau_confs_gopt_spe” output from this Floe by clicking on the plus sign next to it. Also activate the input dataset. Navigate to the Tutorial Data on Orion and select the plus sign next to the Hydroxymethyl-benzoic-acid dataset.
Then, navigate to the Analyze page. On the graph, for the x-axis select “RMSD” and on the y-axis select “Gaussian Strain Energy (kcal/mol).”
This plot can be helpful for seeing how the input conformer compares to the full ensemble. In this case, the input conformer is fairly arbitrary, simply taken from a conformer generation run. However, in a real application, the input conformer could be bioactive or have shown promise in other simulations. One interesting point in this plot is that there are conformers around 2 kcal/mol strain with very low RMSD. It might be interesting to understand why they are so much lower in energy than the input conformer, which is at 8 kcal/mol.
To understand more about these results, select the “Analyze with 3D” layout (as shown above).
Selecting a point on the plot will make that conformer appear in the 3D Viewer on the right. First, consider the point at RMSD 0.11 and Gaussian Strain Energy of 8.0 kcal/mol (the input molecule).
Then, try selecting the point around RMSD 0.098 with Gaussian Strain Energy of about 1.6 kcal/mol. In this case, there is an intramolecular hydrogen bond, which explains why a conformer that is very similar has such a low relative energy.
If you wanted to get single point energies at an even higher level of theory for all, or some, of these conformers, this dataset could be sent to the Gaussian QM Single Point Energy Floe, which has a tutorial available here. In that Floe, an implicit solvent could also be included in the calculation. It is possible the preferred conformers would change in solution.
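The same selection can be made on exported data instead of by clicking points. The sketch below is hypothetical: the dictionary keys, the conformer names, and the 0.25 RMSD cutoff are assumptions for illustration, and only the first two data points correspond to values quoted in this tutorial.

```python
def similar_but_lower_energy(conformers, input_strain, max_rmsd=0.25):
    """Return conformers close to the input geometry (RMSD <= max_rmsd)
    whose strain energy is below that of the input, lowest strain first."""
    hits = [
        c for c in conformers
        if c["rmsd"] <= max_rmsd and c["strain"] < input_strain
    ]
    return sorted(hits, key=lambda c: c["strain"])


# RMSD to the input and strain energy (kcal/mol) for each conformer.
# conf_b and conf_c are made-up points added for contrast.
conformers = [
    {"name": "input", "rmsd": 0.11, "strain": 8.0},
    {"name": "conf_a", "rmsd": 0.098, "strain": 1.6},
    {"name": "conf_b", "rmsd": 0.9, "strain": 0.0},
    {"name": "conf_c", "rmsd": 0.2, "strain": 9.1},
]

hits = similar_but_lower_energy(conformers, input_strain=8.0)
print([c["name"] for c in hits])  # ['conf_a']
```

Conformers found this way are good candidates for inspection in the 3D Viewer, since a large energy drop at small RMSD usually points to a local change such as the hydrogen bond described above.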