Optional Parameters¶

Input Options¶

-ligand_names <names>¶: If multiple ligands are in the structure but only a specific ones are desired, they can be specified here. Examples are either the three letter code “LIG”, or in case of a peptide “VAL-GLU-TYS-PHE-ALA”. Multiple different ligands, should be separated by commas, “LIG,GIL,INH”.

-map <filename>¶: Input electron density map from X-ray crystallography, to be used for Iridium score calculation.

-metadata <filename>¶: Metadata json file containing OEStructureMetadata. Using this can be a way to indicate desired ligands, tautomers to use for ligands, structure sequence etc., as well as structure title for output file names.

-ref <filename>¶: Reference OEDesignUnit indicating the proper biological unit and relevant binding site.

-site_residue <residue identifier>¶: Input option to specify a binding site using a single residue specification if apo (or holo). The format is “name:num:insert code:chainid”, e.g. “ASP:25: :A” indicating Aspartic acid 25 in chain A. Note: A blank/whitespace character is used for the insert code, which is a typical use case.

-add_receptors¶

Option to add OEReceptors onto the design units

[default = true]

Output Options¶

-out <out>¶: Write to a user specified file. Enforces *.oedu format. Can be used with -warts to write multiple files, one per design unit.

-warts¶: Option to add warts “_#” to the user supplied filename, when writing multiple design units.

-prefix <prefix>¶: Prefix used to name output files, default is blank

-log <logfile>¶: The argument for this flag specifies the name of the log file. This overrides any specified prefix. The default will be spruce_output.log, if no prefix is specified.

-settings <settingsfile>¶: The argument for this flag specifies the name of the settings file. This overrides any specified prefix. The default will be spruce_settings.param, if no prefix is specified.

-verbose : Triggers copious logging output¶

-write_biounits¶

Option to write intermediate Bio-DesignUnits, which can be used with enumsites to site to potential allosteric sites later.

[default = false]

Split Parameters¶

-altloc <method>¶

Parameter determining how alternate locations are handled. The primary option, collapses alternate locations, whereas the enumerate option attempts to set detected alternate locations, A, B, etc.

[default = enumerate]

-cofactor_codes <codes>¶: Mechanism to define 3 letter codes that should be recognized as co-factors if not done automatically, e.g. “ATP”, or “ATP,NAD”

-excipient_codes¶: Mechanism to define 3 letter codes that should be recognized as excipients if not done automatically, e.g. “GOL”, or “GOL,DMS”

-lipid_codes <codes>¶: Mechanism to define 3 letter codes that should be recognized as lipids if not done automatically, e.g. “PGR”, or “PGR,CHL”

-make_packing_residues¶

Option to generate packing residues, both for visualization, but also for Iridium classification

[default = true]

-min_lig_atoms¶

Parameter determining the min number of atoms a ligand molecule can have. A reason to lower this number would be for small fragments that need to be classified as ligands

[default = 8]

-max_lig_atoms¶

Parameter determining the max number of atoms a ligand molecule can have. A reasons to increase this number could be for peptidic ligands

[default = 100]

-max_lig_residues¶

Parameter determining the max number of residues a ligand molecule can have. A reasons to increase this number could be for peptidic ligands

[default = 5]

-max_sys_atoms¶

Parameter limiting the max number of atoms in the entire system that spruce will allow for processing. If this limit is reached e.g. due to a large systems or symmetry expansion resulting in a larger than expected system, spruce will stop processing and return false.

[default = 50,000]

-target¶

Parameter telling the system what the “target” is. This is particularly helpful for systems containing both protein and nucleic acids, where the automated system does not correctly identify the nucleic acid as the target of interest. Allowed values are protein or nucleic.

[default = protein]

Enumerate Sites Parameters¶

-add_interactions¶

Option to add OEInteractionHints to the design unit(s)

[default = true]

-add_style¶

Option to add visualization style to the design unit(s)

[default = true]

-collapse_nonsite_alts¶

Option to deduplicate structures with different alternate locations if those alternate locations are far from the binding site

[default = true]

-duplicate_removal¶

Option to deduplicate identical structures resulting from symmetry operations

[default = true]

-enum_cofactors_sites¶

Option to generate design units with sites based on components classified as co-factors

[default = false]

-restrict_to_refsite¶

Option to skip generating design units for sites identified, that do not match a provided reference design unit

[default = true]

-site_size: <value>¶

Distance from the ligand used to determine the size of the site

[default = 5.0 (angstroms)]

-superpose¶

Option to superpose generated design units, if multiple. If a reference is provided, the first generated design unit will be superposed onto the reference structure, and subsequent structures onto that one.

[default = true]

-superpose_method : <method>¶

The method to use for superposition

Method	Description
global	Global Sequence Alignment to identify CA pairs
site	Global Sequence Alignment to identity CA pairs - focusing on the subset in active site
ddm	Superposition using the Distance Difference Matrix method (DDM)
sse	Superposition using an overlap of Secondary Structure Elements (SSE)
sitehopper	Superposition based on binding site similarities (sitehopper)

Build Parameters¶

-build_cterm_caps¶

Option to cap broken c-termini in protein chains

[default = true]

-build_nterm_caps¶

Option to cap broken n-termini in protein chains

[default = true]

-build_loops¶

Option to build loops for gaps in the protein structure

[default = true]

-build_sidechains¶

Option to build missing or partial protein sidechains

[default = true]

-enum_pockets¶

Option to detect pockets and generate design units from them

[default = false]

Sidechain Build Parameters¶

-sc_delete_clashing_solvent¶

Option to allow build steps to remove clashing solvent

[default = true]

-rot_coverage <value>¶

Coverage of rotamer libraries to use, a lower number can be used to speed up side chain re-building skipping lower probability side-chain rotamers.

[default = 100.0]

-rot_library <value>¶

Rotamer library used for building sidechains and loops. Allowed values are ‘richardson2016’, ‘dunbrack’, ‘richardson’.

[default = richardson2016]

Loop Build Parameters¶

-build_with_crystalpacking¶

Include packing residues when building loops

[default = false]

-loop_db_filename <filename>¶: Database containing loop templates

-crop_length <value>¶

Anchor residues on the protein to crop back for a better fit, results in longer loops being built

[default = 1]

-transform_threshold <value>¶

Number of loops to allow through the sidechain clash checker. No matter this number, will process all with an identical sequence to target.

[default = 25]

-strict_proline_match¶

Fuzzy matches in the loop database have to have proline in exact locations of sequence.

[default = true]

-loop_distance_buffer <value>¶

Fuzzy matches in the loop database has to have distance between anchor atoms correct, +/- buffer distance

[default = 1.0]

-bb_clash_threshold <value>¶

Loops from the database where more than the threshold fraction of the backbone atoms clash, are rejected

[default = 0.25]

-loop_clash_threshold <value>¶

Loops from the database where more than the threshold fraction of the loops atoms in addition to the backbone clashing ones clash, are rejected

[default = 0.2]

-opt_shell <value>¶

Include atoms within this distance in the loop optimization, larger distance results in slower optimization

[default = 15.0]

-opt_tolerance <value>¶

Tolerance for the loop optimization, smaller numbers result in slower optimization

[default = 0.001]

-opt_stage1_iter_multiplier <value>¶

Number of steps per number of residues in the loop for the first stage optimizer

[default = 5]

-opt_stage2_iter_multiplier <value>¶

Number of steps per number of residues in the loop for the second stage optimizer

[default = 10]

-incl_solvation <value>¶

Include simple solvation model when building loops

[default = true]

-incl_SA_term <value>¶

Include solvent accessible surface area term when ranking the loops

[default = true]

-max_eval_loops <value>¶

Maximum number of loops to connect and minimize

[default = 5]

-build_disulfidebridges <value>¶

Allow the loop builder to build disulfide brides during loop modeling (if possible)

[default = true]

Cap Build Parameters¶

-cap_delete_clashing_solvent¶

Option to allow build steps to remove clashing solvent

[default = true]

-allow_truncate¶

Option to allow terminal residue to converted to cap, if cap will otherwise clash

[default = true]

Prep Parameters¶

-charge_radii¶

Option to assign partial charge and radii

[default = true]

-protonate¶

Option add and optimize protons in the system

[default = true]

Protonation Parameters¶

-generate_tautomers¶

Option to generate and use tautomers in the hydrogen network optimization

[default = true]

-het_group_nbr_dist <value>¶

Distance between heterogens used to determine optimization clusters for protonation

[default = 3.5 (angstroms)]

-opt_expt_protons¶

Option to optimize hydrogens assigned in the experiment.

[default = false]

-flip_bias_scale¶

Bias scale for flippable groups (e.g. HIS/ASN/GLN) during hydrogen placement

[default = 1.0]

Biological Unit Extraction Parameters¶

-bu_superpose¶

Option to superpose the biological units

[default = false]

-max_bu_atoms¶

Option to limit the size of BUs processed based on number of atoms

[default = 50,000]

-max_bu_parts¶

Option to limit the size of BUs processed based on number of parts

[default = 24]

-min_align_score¶

Option to specify minimum sequence alignment score

[default = 200]

-pref_author_record¶

Option where the author BIOMT record is preferred over the software generated one

[default = true]

Enumerate Pocket Parameters¶

-pocketMask¶

Option to set the subset of design unit components to be used for pocket finding. Multiple components can be combined as a comma separated string to create the input.

[default = true]

-min_pocket_surf_area¶

Option to set minimum surface area of detected pocket.

[default = false]

-max_pocket_surf_area¶

Option to set maximum surface area of detected pocket.

[default = false]

-pocket_burial_factor¶

Option to set the factor used to distinguish pockets from the general surface.

[default = 1.4]

OEReceptor Parameters¶

-targetMask¶

Subset of design unit components to be used as target for the receptor. Multiple components can be combined as a comma separated string to create the input

[default = protein,nucleic,cofactors,metals,lipids,other_proteins,other_nucleics,other_ligands,other_cofactors]

Please see more details at SPRUCE and ReceptorInDU.

Special Parameters¶

-no_prep¶

Option to skip all preparation steps. This will supersede any other optional flags set.

[default = false]

Optional Parameters¶

Input Options¶

Output Options¶

Split Parameters¶

Enumerate Sites Parameters¶

Build Parameters¶

Sidechain Build Parameters¶

Loop Build Parameters¶

Cap Build Parameters¶

Prep Parameters¶

Protonation Parameters¶

Biological Unit Extraction Parameters¶

Enumerate Pocket Parameters¶

OEReceptor Parameters¶

Special Parameters¶

Filter Options Parameters¶