WriteDict

Overview

WriteDict is a comprehensive refinement dictionary generator that properly handles covalent bond restraints between the ligand and the protein.

Example Commands

The basic usage of writedict is to take an input structure and generate the corresponding .pdb and .cif files for input into COOT or REFMAC5.

> writedict -in 1nhu.ism -out 1nhu_out

This creates the files 1nhu_out.cif and 1nhu_out.pdb. The pdb is always created since the atom names may be remapped from the input. Since the input contains no residue information, the ligand is placed in residue “UNL”.

To add your own residue names for unmapped residues:

> writedict -in 1nhu.ism -out 1nhu_out -residues 153

Note that if the given residue name (153) does not map to the known residue 153 the next residue name in the unmapped residue list will be used. Since none are specified it will go to the standard defaults: “UNL,UN1,UN2…”

Note

When doing whole complexes, renaming residues gets quite complicated since you may not know which residues will map before you run WRITEDICT.

> writedict -in 1nhu.ism -out 1nhu_out -residues LIG -nolookup

Force the residue to be named LIG. (It will also indicate an invalid dictionary)

Warning: LIG   1: Known Residue, but does not map to previous residue: LIG from: RCSB
Warning: LIG   1: previous smiles c1ccc2c(c1)CC3=C(NN=C23)c4ccncc4
Warning: LIG   1: current  smiles c1ccc(cc1)CC(C(=O)[O-])N(Cc2cccc(c2)C(F)(F)F)C(=O)c3ccc(cc3Cl)Cl
Warning: LIG: Keeping residue name (dictionary may be invalid)
Warning: LIG: Adding new residue dictionary
Warning: LIG: already in dictionary from: RCSB but with different structure.
Warning: LIG: c1ccc2c(c1)CC3=C(NN=C23)c4ccncc4
Warning: LIG: c1ccc(cc1)CC(C(=O)[O-])N(Cc2cccc(c2)C(F)(F)F)C(=O)c3ccc(cc3Cl)Cl
Warning: LIG: Overwriting existing residue (dictionary may be invalid)

To ensure planarAniline constraints consistent with the MMFF94s forcefield, use the -planarAniline flag.

> writedict -in 1nhu.ism -out 1nhu_out -planarAniline

Command Line Help

A description of the command line interface can be obtained by executing WRITEDICT with the –help option.

> writedict --help

will generate the following output:

Help functions:
  writedict --help simple      : Get a list of simple parameters
  writedict --help all         : Get a complete list of parameters
  writedict --help <parameter> : Get detailed help on a parameter
  writedict --help html        : Create an html help file for this program

Required Parameters

-in <filename>

File containing a molecule to be fit to density.

File type

Extension

SMILES

.smi .ism .smi.gz .ism.gz

OEBinary

.oeb .oeb.gz

SDF

.sdf .mol .sdf.gz .mol.gz

MOL2

.mol2 .mol2.gz

PDB

.pdb .ent .pdb.gz .ent.gz

MacroModel

.mmod .mmod.gz

-out <filename prefix>

This specifies a prefix for outputting the refinement dictionary .cif or .xplor and the pdb file. The pdb file ( and pdb.oeb) is always output and should be used for purposes of refinement since WRITEDICT sometimes corrects faulty hydrogen atom names and will include LINK records in the pdb if covalent bonds are detected.

The log file will also be written using this prefix as <filename prefix>.log.

-prefix <filename prefix>

Alias for -out

Optional Parameters

-assignCCP4

If true then CCP4 atom types will be assigned to the output pdb file.

[default=true]

-includeRotors

Set to true if the output should include rotor torsion terms. If this is set to false, refinement programs will not be able to modify any rotors.

[default=true]

-suppressH

If true, no hydrogens are output.

[default=false]

-type

set this to 0 for REFMAC5 refinement dictionaries or 1 for CNS/CNX (XPLOR). By default REFMAC5 dictionaries are generated.

[default=0]

-planarAniline

Use the MMFF94s variant of the MMFF94 forcefield. This forcefield emulates time-averages structures observed in crystallographic and other structure determination methods.

The MMFF94s forcefield is a variant of MMFF94 that emulates time-averages structures typically observed during crystallographic structure determination, mainly planar geometries at unstrained delocalized trigonal nitrogen centers. However, there are many theoretical studies that show puckering at nitrogen centers. (see MMFF VI. in JCICS)

That being said, due to the prevalence of crystallographic studies, many chemist erroneously consider the time-averaged structure to be correct, hence this variant is available for use.

[default = false]

-strict

Change the known residue matching mode.

Options available for residue matching is as follows:

Option

Meaning

exact

Known residue and input residue graphs must match exactly.

substructure

Input residue may be a substructure of the known residue.

Cannot be used with the exact flag.

fuzzybonds

Bond orders do not need to match

atomname

Atom names must match

Options are separated by colons. The default settings are set to limit the number of REFMAC5 issues. -strict exact:substructure will find more matches, but increases the risk of REFMAC5 rejecting the dictionary.

[default = exact:fuzzybonds]

-lookup

Attempts to lookup residues from the internal dictionary store using the current -strict status. This is useful to find out if a residue has already been deposited in the RCSB.

If -lookup is not set, mismatching residues will be given new names.

-forceExistingResidueNames

Force WRITEDICT to use the given residues, even if they do not match the deposited residues from the RCSB. Also aliased to -force

This flag used to be called -nolookup (which still works) but this was found to be confusing in conjunction with the -lookup flag which does attempts to find the correct residue when there is no match.

-writeFullDict

If true then all refinement dictionaries are created for all residues. If false only residues that are unknown or are in the set of replaceable residues or covalently bound to a ligand are output. Setting this to true can confuse REFMAC5.

{default=false]

-verbose

Output copious information of how writedict operates.