Optional Parameters
Input Options
- -ligand_names <names>
If multiple ligands are in the structure but only a specific ones are desired, they can be specified here. Examples are either the three letter code “LIG”, or in case of a peptide “VAL-GLU-TYS-PHE-ALA”. Multiple different ligands, should be separated by commas, “LIG,GIL,INH”.
- -map <filename>
Input electron density map from X-ray crystallography, to be used for Iridium score calculation.
- -metadata <filename>
Metadata json file containing OEStructureMetadata. Using this can be a way to indicate desired ligands, tautomers to use for ligands, structure sequence etc., as well as structure title for output file names.
- -ref <filename>
Reference OEDesignUnit indicating the proper biological unit and relevant binding site.
- -site_residue <residue identifier>
Input option to specify a binding site using a single residue specification if apo (or holo). The format is “name:num:insert code:chainid”, e.g. “ASP:25: :A” indicating Aspartic acid 25 in chain A. Note: A blank/whitespace character is used for the insert code, which is a typical use case.
- -add_receptors
Option to add OEReceptors onto the design units
[default = true]
Output Options
- -out <out>
Write to a user specified file. Enforces *.oedu format. Can be used with
-wartsto write multiple files, one per design unit.
- -warts
Option to add warts “_#” to the user supplied filename, when writing multiple design units.
[default = false]
- -prefix <prefix>
Prefix used to name output files, default is blank
- -log <logfile>
The argument for this flag specifies the name of the log file. This overrides any specified prefix. The default will be spruce_output.log, if no prefix is specified.
- -settings <settingsfile>
The argument for this flag specifies the name of the settings file. This overrides any specified prefix. The default will be spruce_settings.param, if no prefix is specified.
- -verbose
Triggers copious logging output.
[default = false]
- -write_biounits
Option to write intermediate Bio-DesignUnits, which can be used with enumsites to site to potential allosteric sites later.
[default = false]
Split Parameters
- -altloc <method>
Parameter determining how alternate locations are handled. The primary option, collapses alternate locations, whereas the enumerate option attempts to set detected alternate locations, A, B, etc.
[default = enumerate]
- -cofactor_codes <codes>
Mechanism to define 3 letter codes that should be recognized as co-factors if not done automatically, e.g. “ATP”, or “ATP,NAD”
- -excipient_codes
Mechanism to define 3 letter codes that should be recognized as excipients if not done automatically, e.g. “GOL”, or “GOL,DMS”
- -lipid_codes <codes>
Mechanism to define 3 letter codes that should be recognized as lipids if not done automatically, e.g. “PGR”, or “PGR,CHL”
- -make_packing_residues
Option to generate packing residues, both for visualization, but also for Iridium classification
[default = true]
- -min_lig_atoms
Parameter determining the min number of atoms a ligand molecule can have. A reason to lower this number would be for small fragments that need to be classified as ligands
[default = 8]
- -max_lig_atoms
Parameter determining the max number of atoms a ligand molecule can have. A reasons to increase this number could be for peptidic ligands
[default = 100]
- -max_lig_residues
Parameter determining the max number of residues a ligand molecule can have. A reasons to increase this number could be for peptidic ligands
[default = 5]
- -max_sys_atoms
Parameter limiting the max number of atoms in the entire system that spruce will allow for processing. If this limit is reached e.g. due to a large systems or symmetry expansion resulting in a larger than expected system, spruce will stop processing and return false.
[default = 50,000]
- -target
Parameter telling the system what the “target” is. This is particularly helpful for systems containing both protein and nucleic acids, where the automated system does not correctly identify the nucleic acid as the target of interest. Allowed values are protein or nucleic.
[default = protein]
Enumerate Sites Parameters
- -add_interactions
Option to add OEInteractionHints to the design unit(s)
[default = true]
- -add_style
Option to add visualization style to the design unit(s)
[default = true]
- -collapse_nonsite_alts
Option to deduplicate structures with different alternate locations if those alternate locations are far from the binding site
[default = true]
- -duplicate_removal
Option to deduplicate identical structures resulting from symmetry operations
[default = true]
- -enum_cofactors_sites
Option to generate design units with sites based on components classified as co-factors
[default = false]
- -restrict_to_refsite
Option to skip generating design units for sites identified, that do not match a provided reference design unit
[default = true]
- -site_size: <value>
Distance from the ligand used to determine the size of the site
[default = 5.0 (angstroms)]
- -superpose
Option to superpose generated design units, if multiple. If a reference is provided, the first generated design unit will be superposed onto the reference structure, and subsequent structures onto that one.
[default = true]
- -superpose_method : <method>
The method to use for superposition
Method
Description
global
Global Sequence Alignment to identify CA pairs
site
Global Sequence Alignment to identity CA pairs - focusing on the subset in active site
ddm
Superposition using the Distance Difference Matrix method (DDM)
sse
Superposition using an overlap of Secondary Structure Elements (SSE)
sitehopper
Superposition based on binding site similarities (sitehopper)
Build Parameters
- -build_cterm_caps
Option to cap broken c-termini in protein chains
[default = true]
- -build_nterm_caps
Option to cap broken n-termini in protein chains
[default = true]
- -build_loops
Option to build loops for gaps in the protein structure
[default = true]
- -build_sidechains
Option to build missing or partial protein sidechains
[default = true]
- -enum_pockets
Option to detect pockets and generate design units from them
[default = false]
Sidechain Build Parameters
- -sc_delete_clashing_solvent
Option to allow build steps to remove clashing solvent
[default = true]
- -rot_coverage <value>
Coverage of rotamer libraries to use, a lower number can be used to speed up side chain re-building skipping lower probability side-chain rotamers.
[default = 100.0]
- -rot_library <value>
Rotamer library used for building sidechains and loops. Allowed values are ‘richardson2016’, ‘dunbrack’, ‘richardson’.
[default = richardson2016]
Loop Build Parameters
- -build_with_crystalpacking
Include packing residues when building loops
[default = false]
- -loop_db_filename <filename>
Database containing loop templates
- -crop_length <value>
Anchor residues on the protein to crop back for a better fit, results in longer loops being built
[default = 1]
- -transform_threshold <value>
Number of loops to allow through the sidechain clash checker. No matter this number, will process all with an identical sequence to target.
[default = 25]
- -strict_proline_match
Fuzzy matches in the loop database have to have proline in exact locations of sequence.
[default = true]
- -loop_distance_buffer <value>
Fuzzy matches in the loop database has to have distance between anchor atoms correct, +/- buffer distance
[default = 1.0]
- -bb_clash_threshold <value>
Loops from the database where more than the threshold fraction of the backbone atoms clash, are rejected
[default = 0.25]
- -loop_clash_threshold <value>
Loops from the database where more than the threshold fraction of the loops atoms in addition to the backbone clashing ones clash, are rejected
[default = 0.2]
- -opt_shell <value>
Include atoms within this distance in the loop optimization, larger distance results in slower optimization
[default = 15.0]
- -opt_tolerance <value>
Tolerance for the loop optimization, smaller numbers result in slower optimization
[default = 0.001]
- -opt_stage1_iter_multiplier <value>
Number of steps per number of residues in the loop for the first stage optimizer
[default = 5]
- -opt_stage2_iter_multiplier <value>
Number of steps per number of residues in the loop for the second stage optimizer
[default = 10]
- -incl_solvation <value>
Include simple solvation model when building loops
[default = true]
- -incl_SA_term <value>
Include solvent accessible surface area term when ranking the loops
[default = true]
- -max_eval_loops <value>
Maximum number of loops to connect and minimize
[default = 5]
- -build_disulfidebridges <value>
Allow the loop builder to build disulfide brides during loop modeling (if possible)
[default = true]
Cap Build Parameters
- -cap_delete_clashing_solvent
Option to allow build steps to remove clashing solvent
[default = true]
- -allow_truncate
Option to allow terminal residue to converted to cap, if cap will otherwise clash
[default = true]
Prep Parameters
- -charge_radii
Option to assign partial charge and radii
[default = true]
- -protonate
Option add and optimize protons in the system
[default = true]
Protonation Parameters
- -generate_tautomers
Option to generate and use tautomers in the hydrogen network optimization
[default = true]
- -het_group_nbr_dist <value>
Distance between heterogens used to determine optimization clusters for protonation
[default = 3.5 (angstroms)]
- -opt_expt_protons
Option to optimize hydrogens assigned in the experiment.
[default = false]
- -flip_bias_scale
Bias scale for flippable groups (e.g. HIS/ASN/GLN) during hydrogen placement
[default = 1.0]
Biological Unit Extraction Parameters
- -bu_superpose
Option to superpose the biological units
[default = false]
- -max_bu_atoms
Option to limit the size of BUs processed based on number of atoms
[default = 50,000]
- -max_bu_parts
Option to limit the size of BUs processed based on number of parts
[default = 24]
- -min_align_score
Option to specify minimum sequence alignment score
[default = 200]
- -pref_author_record
Option where the author BIOMT record is preferred over the software generated one
[default = true]
Enumerate Pocket Parameters
- -pocketMask
Option to set the subset of design unit components to be used for pocket finding. Multiple components can be combined as a comma separated string to create the input.
[default = true]
- -min_pocket_surf_area
Option to set minimum surface area of detected pocket.
[default = false]
- -max_pocket_surf_area
Option to set maximum surface area of detected pocket.
[default = false]
- -pocket_burial_factor
Option to set the factor used to distinguish pockets from the general surface.
[default = 1.4]
OEReceptor Parameters
- -targetMask
Subset of design unit components to be used as target for the receptor. Multiple components can be combined as a comma separated string to create the input
[default = protein,nucleic,cofactors,metals,lipids,other_proteins,other_nucleics,other_ligands,other_cofactors]
Please see more details at SPRUCE and ReceptorInDU.
Special Parameters
- -no_prep
Option to skip all preparation steps. This will supersede any other optional flags set.
[default = false]
Filter Options Parameters
- -allow_filter_error
Option to allow running spruce prep even when structure fails spruce filter.
[default = false]
- -fix_res_names
Option to fix incorrect residue names.
[default = true]
- -fix_res_states
Option to fix incorrect residue states.
[default = true]
- -fix_bonds_to_metals
Option to fix incorrect covalent bonds to metals.
[default = true]
- -fix_bb_atoms
BRIEF Option to fix protein backbone atom states.
[default = true]
- -fix_float_res
BRIEF Option to delete floating residues
[default = true]
- -fix_chainID
BRIEF Option to fix missing chain ID
[default = true]
- -fix_covalent_bond
BRIEF Option to fix incorrect covalent bonds to halogens
[default = true]
- -allow_validation_error
Option to allow writing OEDesignUnits even if they fail validation
[default = true]