ROCS for EON

This cube performs overlay optimization between a query molecule and a database (fit) molecule, using ROCS.

The query molecule is read from the init port, and the database (fit) molecules are read from the intake port.

For each fit molecule, the top-scoring overlays of query and fit conformers are added to the intake port record, together with a set of calculated 3D similarity measures, and the record is emitted through the success port. The output fit molecule conformers are transformed to their corresponding best overlaid positions.

Calculation Parameters

  • Color type (color_type) type: string: Type of function to be used for color score evaluation
    Default: Exact
    Choices: Grid, Analytic, Exact
  • CPUs (cpu_count) type: integer: The number of CPUs to run this cube with
    Default: 1, Min: 1, Max: 128
  • Cube Metrics (cube_metrics) type: string: Set of metrics to be collected
    Choices: cpu, disk, memory, network
  • Temporary Disk Space (MiB) (disk_space) type: decimal: The minimum amount of disk space in MiB (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.
    Default: 5120.0, Min: 128.0, Max: 8589934592
  • GPUs (gpu_count) type: integer: The number of GPUs to run this cube with
    Default: 0, Max: 16
  • Instance Tags (instance_tags) type: string: Only run on machines with matching tags (comma separated)
    Default: “”
  • Instance Type (instance_type) type: string: The type of instance that this cube needs to be run on
  • Max Backlog Wait (max_backlog_wait) type: integer: The max time (in seconds) that a cube will be backlogged on a group before being re-evaluated
    Default: 600, Min: 300
  • Max Out Confs (max_out_confs) type: integer: Maximum number of conformers to return
    Default: 5, Min: 1, Max: 100
  • Max Rotors (max_rotors) type: integer: Maximum number of rotatable bonds. The cube skips molecules with more rotatable bonds than this cutoff.
    Default: 40, Min: 1, Max: 9999
  • Memory (MiB) (memory_mb) type: decimal: The minimum amount of memory in MiB (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.
    Default: 1800, Min: 256.0, Max: 8589934592
  • Metric Period (metric_period) type: decimal: How often to sample metrics, in seconds
    Default: 60
    Choices: 1, 5, 10, 30, 60, 120, 180, 240, 300, Min: 1, Max: 300
  • Output All Results (output_all_res) type: boolean: Output all possible overlay results
    Default: True
  • Thread limit per CPU (pids_per_cpu_limit) type: integer: The number of threads per CPU
    Default: 32
  • Shape type (shape_type) type: string: Type of function to be used for shape overlap evaluation
    Default: Grid
    Choices: Exact, Grid, Analytic
  • Shared Memory (MiB) (shared_memory_mb) type: decimal: The amount of shared memory to allow a container to address
    Default: 64
  • Sorter type (sorter_type) type: string: Type of predicate for sorting scores
    Default: HighestTanimotoCombo
    Choices: HighestTanimotoCombo, HighestFitTverskyCombo, HighestRefTverskyCombo
  • Spot policy (spot_policy) type: string: Control cube placement on spot market instances
    Default: Prohibited
    Choices: Allowed, Preferred, NotPreferred, Prohibited, Required
  • start type (start_type) type: string: Type of starts to be used
    Default: Inertial
    Choices: Inertial, Rocs, AsIs, AtAtom, Subrocs, Random
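
The interaction between sorter_type and max_out_confs can be pictured as "rank the overlay results by one score, then keep the best few". The sketch below is a minimal illustration under that assumption and is not the cube's implementation: the Hit class, the select_top_hits function, the attribute names, and the sample scores are all hypothetical. Only the sorter_type choices and the max_out_confs range come from the parameter list above.

```python
from dataclasses import dataclass

# Documented sorter_type choices mapped to a score attribute.
# The attribute names are assumptions made for this illustration.
SORT_KEYS = {
    "HighestTanimotoCombo": "tanimoto_combo",
    "HighestFitTverskyCombo": "fit_tversky_combo",
    "HighestRefTverskyCombo": "ref_tversky_combo",
}


@dataclass
class Hit:
    """Hypothetical container for the scores of one overlay result."""
    conf_id: int
    tanimoto_combo: float
    fit_tversky_combo: float
    ref_tversky_combo: float


def select_top_hits(hits, sorter_type="HighestTanimotoCombo", max_out_confs=5):
    """Return at most max_out_confs hits, highest selected score first."""
    if sorter_type not in SORT_KEYS:
        raise ValueError(f"unknown sorter_type: {sorter_type!r}")
    if not 1 <= max_out_confs <= 100:
        raise ValueError("max_out_confs must be between 1 and 100")
    key = SORT_KEYS[sorter_type]
    ranked = sorted(hits, key=lambda h: getattr(h, key), reverse=True)
    return ranked[:max_out_confs]


# Made-up scores for three fit conformers.
hits = [
    Hit(0, 1.10, 1.40, 1.20),
    Hit(1, 1.35, 1.30, 1.50),
    Hit(2, 0.90, 1.60, 1.00),
]
best = select_top_hits(hits, "HighestTanimotoCombo", max_out_confs=2)
print([h.conf_id for h in best])  # [1, 0]
```

Switching sorter_type to HighestFitTverskyCombo in this sketch would rank conformer 2 first, which shows why the choice of sorter changes which conformers are reported.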

Field parameters

  • Color Score field (color_score_field) type: Field Type: Float: Color Score between the query and the fit molecule
    Default: Color Score
  • Color Tanimoto field (color_tan_field) type: Field Type: Float: Color Tanimoto between the query and the fit molecule
    Default: Color Tanimoto
  • Extended Log Field (ext_log_field) type: Field Type: StringVec: Message extended log field
    Default: Extended Log Field
  • Fit Self Color field (fit_color_field) type: Field Type: Float: Self Color score of the fit molecule
    Default: Fit Self Color
  • Fit Color Tversky field (fit_color_tvr_field) type: Field Type: Float: Color Tversky calculated at beta of 0.95
    Default: Fit Color Tversky
  • Fit Tversky Combo field (fit_combo_tvr_field) type: Field Type: Float: Sum of shape and color Tversky calculated at beta of 0.95
    Default: Fit Tversky Combo
  • Fit Self Overlap field (fit_shape_field) type: Field Type: Float: Self overlap of the fit molecule
    Default: Fit Self Overlap
  • Fit Shape Tversky field (fit_shape_tvr_field) type: Field Type: Float: Shape Tversky calculated at beta of 0.95
    Default: Fit Shape Tversky
  • None (in_mol_field) type: Field Type: Chem.Mol:
  • None (init_mol_field) type: Field Type: Chem.Mol:
  • Log Field (log_field) type: Field Type: String: The field to store messages to floe report
    Default: Log Field
  • None (out_mol_field) type: Field Type: Chem.Mol:
  • Overlap field (overlap_field) type: Field Type: Float: Shape Overlap between the query and the fit molecule
    Default: Overlap
  • Query Conformer field (query_field) type: Field Type: Chem.Mol: Query Conformer corresponding to the reported scores
    Default: Query Conformer
  • Query Conformer index field (query_id_field) type: Field Type: Int: Query Conformer index from the original query molecule
    Default: Query Conformer Index
  • Ref Self Color field (ref_color_field) type: Field Type: Float: Self Color score of the query molecule
    Default: Ref Self Color
  • Ref Color Tversky field (ref_color_tvr_field) type: Field Type: Float: Color Tversky calculated at alpha of 0.95
    Default: Ref Color Tversky
  • Ref Tversky Combo field (ref_combo_tvr_field) type: Field Type: Float: Sum of shape and color Tversky calculated at alpha of 0.95
    Default: Ref Tversky Combo
  • Ref Self Overlap field (ref_shape_field) type: Field Type: Float: Self overlap of the query molecule
    Default: Ref Self Overlap
  • Ref Shape Tversky field (ref_shape_tvr_field) type: Field Type: Float: Shape Tversky calculated at alpha of 0.95
    Default: Ref Shape Tversky
  • Shape Tanimoto field (shape_tan_field) type: Field Type: Float: Shape Tanimoto between the query and the fit molecule
    Default: Shape Tanimoto
  • Scores to sort field (sort_field) type: Field Type: Float: Scores to sort on for hit list generation
    Default: Scores to sort
  • Tanimoto Combo field (tan_combo_field) type: Field Type: Float: Sum of shape and color Tanimoto between query and fit molecule
    Default: Tanimoto Combo
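
The similarity fields above can be related to one another with the standard Tanimoto and Tversky formulas. The sketch below is a minimal illustration, not the cube's implementation: the function names are hypothetical, the sample numbers are made up, and it assumes the two Tversky weights sum to 1 (so the ref Tversky with alpha of 0.95 uses beta of 0.05, and the fit Tversky with beta of 0.95 uses alpha of 0.05).

```python
def tanimoto(overlap, ref_self, fit_self):
    """Tanimoto from a mutual overlap and the two self overlaps."""
    return overlap / (ref_self + fit_self - overlap)


def tversky(overlap, ref_self, fit_self, alpha, beta):
    """Tversky with weight alpha on the ref self term and beta on the fit self term."""
    return overlap / (alpha * ref_self + beta * fit_self)


def ref_tversky(overlap, ref_self, fit_self, alpha=0.95):
    # Assumption: beta = 1 - alpha.
    return tversky(overlap, ref_self, fit_self, alpha, 1.0 - alpha)


def fit_tversky(overlap, ref_self, fit_self, beta=0.95):
    # Assumption: alpha = 1 - beta.
    return tversky(overlap, ref_self, fit_self, 1.0 - beta, beta)


# Made-up values standing in for the Overlap, Ref Self Overlap,
# and Fit Self Overlap fields.
shape_tanimoto = tanimoto(600.0, 1000.0, 800.0)
print(shape_tanimoto)  # 0.5

# Made-up values standing in for the Color Score, Ref Self Color,
# and Fit Self Color fields.
color_tanimoto = tanimoto(4.0, 10.0, 8.0)

# Tanimoto Combo is documented as the sum of shape and color Tanimoto.
tanimoto_combo = shape_tanimoto + color_tanimoto

# Tversky combos are documented as the sum of shape and color Tversky.
ref_tversky_combo = ref_tversky(600.0, 1000.0, 800.0) + ref_tversky(4.0, 10.0, 8.0)
fit_tversky_combo = fit_tversky(600.0, 1000.0, 800.0) + fit_tversky(4.0, 10.0, 8.0)
```

Because each combo score is a sum of a shape term and a color term, it ranges from 0 to 2 when both terms are Tanimoto values.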

Hardware Parameters

Machine hardware requirements
  • Memory (MiB) (memory_mb) type: decimal: The minimum amount of memory in MiB (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.
    Default: 1800, Min: 256.0, Max: 8589934592
  • Shared Memory (MiB) (shared_memory_mb) type: decimal: The amount of shared memory to allow a container to address
    Default: 64
  • Thread limit per CPU (pids_per_cpu_limit) type: integer: The number of threads per CPU
    Default: 32
  • Max Backlog Wait (max_backlog_wait) type: integer: The max time (in seconds) that a cube will be backlogged on a group before being re-evaluated
    Default: 600, Min: 300
  • Temporary Disk Space (MiB) (disk_space) type: decimal: The minimum amount of disk space in MiB (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.
    Default: 5120.0, Min: 128.0, Max: 8589934592
  • GPUs (gpu_count) type: integer: The number of GPUs to run this cube with
    Default: 0, Max: 16
  • CPUs (cpu_count) type: integer: The number of CPUs to run this cube with
    Default: 1, Min: 1, Max: 128
  • Instance Type (instance_type) type: string: The type of instance that this cube needs to be run on
  • Spot policy (spot_policy) type: string: Control cube placement on spot market instances
    Default: Prohibited
    Choices: Allowed, Preferred, NotPreferred, Prohibited, Required
  • Instance Tags (instance_tags) type: string: Only run on machines with matching tags (comma separated)
    Default: “”
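
The advice to request a couple hundred MiB more than required amounts to a small unit conversion. The sketch below is only an illustration: the helper name and the 200 MiB headroom are assumptions, while the bytes-per-MiB constant and the Min and Max bounds are taken from the memory_mb entry above.

```python
MIB = 1048576  # bytes per MiB, as stated in the parameter descriptions


def memory_request_mib(required_bytes, headroom_mib=200.0,
                       minimum=256.0, maximum=8589934592.0):
    """Convert a byte estimate to a memory_mb value with headroom, clamped to the documented range."""
    requested = required_bytes / MIB + headroom_mib
    return min(max(requested, minimum), maximum)


# A job estimated to need 1600 MiB of memory.
print(memory_request_mib(1600 * MIB))  # 1800.0
```

The same arithmetic applies to disk_space, with its own Min of 128.0 passed as the minimum argument.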

Metrics Parameters

Cube Metric Parameters
  • Metric Period (metric_period) type: decimal: How often to sample metrics, in seconds
    Default: 60
    Choices: 1, 5, 10, 30, 60, 120, 180, 240, 300, Min: 1, Max: 300
  • Cube Metrics (cube_metrics) type: string: Set of metrics to be collected
    Choices: cpu, disk, memory, network

Parallel ROCS for EON

The parallel version adds these extra parameters.

  • Number of messages to distribute at a time (item_count) type: integer: The maximum number of messages to bundle together for a parallel cube.
    Default: 1, Min: 1, Max: 65535
  • Maximum Failures (max_failures) type: integer: The maximum number of times to attempt processing a work item
    Default: 10, Min: 1, Max: 100
  • Autoscale this Cube (autoscale) type: boolean: If True, let Orion manage the parallelism of this Cube
    Default: True
  • Maximum number of Cubes (max_parallel) type: integer: The maximum number of concurrently running copies of this Cube
    Default: 1000, Min: 1
  • Minimum number of Cubes (min_parallel) type: integer: The minimum number of concurrently running copies of this Cube
    Default: 0
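
The effect of item_count can be pictured as grouping incoming messages into bundles of at most that size before they are handed to a parallel worker. How Orion bundles work internally is not described here, so the sketch below is only an illustration of "at most item_count messages per bundle"; the bundle function is hypothetical.

```python
from itertools import islice


def bundle(messages, item_count=1):
    """Yield lists of at most item_count messages, preserving order."""
    if not 1 <= item_count <= 65535:
        raise ValueError("item_count must be between 1 and 65535")
    iterator = iter(messages)
    while True:
        chunk = list(islice(iterator, item_count))
        if not chunk:
            return
        yield chunk


# Five fit molecule records distributed two at a time.
print(list(bundle(["m0", "m1", "m2", "m3", "m4"], item_count=2)))
# [['m0', 'm1'], ['m2', 'm3'], ['m4']]
```

With the default item_count of 1, every message forms its own bundle. A larger value reduces per-message distribution overhead at the cost of coarser load balancing.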