Fingerprint Set Similarity Calculation
Calculation Parameters
CPUs (cpu_count) type: integer: The number of CPUs to run this cube withDefault: 1 , Min: 1, Max: 192
Cube Metrics (cube_metrics) type: string: Set of metrics to be collectedChoices: cpu, disk, memory, network
Temporary Disk Space (MiB) (disk_space) type: decimal: The minimum amount of disk space in MiB (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.Default: 5120.0 , Min: 128.0, Max: 8589934592
GPUs (gpu_count) type: decimal: The number of GPUs to run this cube withDefault: 0 , Max: 16
Instance Tags (instance_tags) type: string: Only run on machines with matching tags (comma separated)Default: “”
Instance Type (instance_type) type: string: The type of instance that this cube needs to be run on
Max Backlog Wait (max_backlog_wait) type: integer: The max time (in seconds) that a cube will be backlogged on a group before being re-evaluatedDefault: 600 , Min: 300
Memory (MiB) (memory_mb) type: decimal: The minimum amount of memory in MiBs (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.Default: 1800 , Min: 256.0, Max: 8589934592
Metric Period (metric_period) type: decimal: How often to sample metrics, in secondsDefault: 60Choices: 1, 5, 10, 30, 60, 120, 180, 240, 300, Min: 1, Max: 300
Thread limit per CPU (pids_per_cpu_limit) type: integer: The number of threads per CPUDefault: 32
Shared Memory (MiB) (shared_memory_mb) type: decimal: The amount of shared memory to allow a container to addressDefault: 64
Maximum Similarity Score Cutoff (similarity_max_cutoff) type: decimal: The cutoff score for similarity calculation.
Similarity Measure (similarity_measure_type) type: string: The similarity measure used to 2D similarity calculation.Default: TanimotoChoices: Cosine, Dice, Euclid, Manhattan, Tanimoto, Tversky
Minimum Similarity Score Cutoff (similarity_min_cutoff) type: decimal: The cutoff score for similarity calculation.
Spot policy (spot_policy) type: string: Control cube placement on spot market instancesDefault: ProhibitedChoices: Allowed, Preferred, NotPreferred, Prohibited, Required
Field parameters
Histogram Counts (None) type: Field Type: FloatVec: The field to store histogram counts of similarity calculation.Default: Histogram Counts
Histogram Bin Centers (None) type: Field Type: FloatVec: The field to store histogram bin centers of similarity calculation.and molecules.Default: Histogram Bin Centers
Fingerprint Field (fingerprint_field) type: Field Type: Chem.FingerPrint: Tag name for the field that stores fingerprints.
Fingerprint Set (fingerprint_set_field) type: Field Type: RecordVec: Fingerprint record sets
Histogram Bin Centers (histogram_bin_centers_field) type: Field Type: FloatVec: The field to store histogram bin centers of similarity calculation.and molecules.Default: Histogram Bin Centers
Histogram Counts (histogram_counts_field) type: Field Type: FloatVec: The field to store histogram counts of similarity calculation.Default: Histogram Counts
Similarity Score Field (score_field) type: Field Type: Float: Name for the field that stores fingerprint similarity scores.
UUID (uuid_field) type: Field Type: String: The field to store unique identifiers for fingerprints and molecules.
2D Similarity Parameters
- The parameters of the 2D fingerprint similarity calculation.
- Similarity Measure (None) type: string: The similarity measure used to 2D similarity calculation.Default: TanimotoChoices: Cosine, Dice, Euclid, Manhattan, Tanimoto, Tversky
- Minimum Similarity Score Cutoff (None) type: decimal: The cutoff score for similarity calculation.
- Maximum Similarity Score Cutoff (None) type: decimal: The cutoff score for similarity calculation.
 
Hardware Parameters
- Machine hardware requirements
- Memory (MiB) (memory_mb) type: decimal: The minimum amount of memory in MiBs (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.Default: 1800 , Min: 256.0, Max: 8589934592
- Shared Memory (MiB) (shared_memory_mb) type: decimal: The amount of shared memory to allow a container to addressDefault: 64
- Thread limit per CPU (pids_per_cpu_limit) type: integer: The number of threads per CPUDefault: 32
- Max Backlog Wait (max_backlog_wait) type: integer: The max time (in seconds) that a cube will be backlogged on a group before being re-evaluatedDefault: 600 , Min: 300
- Temporary Disk Space (MiB) (disk_space) type: decimal: The minimum amount of disk space in MiB (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.Default: 5120.0 , Min: 128.0, Max: 8589934592
- GPUs (gpu_count) type: decimal: The number of GPUs to run this cube withDefault: 0 , Max: 16
- CPUs (cpu_count) type: integer: The number of CPUs to run this cube withDefault: 1 , Min: 1, Max: 192
- Instance Type (instance_type) type: string: The type of instance that this cube needs to be run on
- Spot policy (spot_policy) type: string: Control cube placement on spot market instancesDefault: ProhibitedChoices: Allowed, Preferred, NotPreferred, Prohibited, Required
- Instance Tags (instance_tags) type: string: Only run on machines with matching tags (comma separated)Default: “”
 
Metrics Parameters
- Cube Metric Parameters
- Metric Period (None) type: decimal: How often to sample metrics, in secondsDefault: 60Choices: 1, 5, 10, 30, 60, 120, 180, 240, 300, Min: 1, Max: 300
- Cube Metrics (None) type: string: Set of metrics to be collectedChoices: cpu, disk, memory, network
 
Parallel Fingerprint Set Similarity Calculation
The parallel version adds these extra parameters.
Number of messages to distribute at a time (item_count) type: integer: The maximum number of messages to bundle together for a parallel cube.Default: 1 , Min: 1, Max: 65535
Maximum Failures (max_failures) type: integer: The maximum number of times to attempt processing a work itemDefault: 10 , Min: 1, Max: 100
Autoscale this Cube (autoscale) type: boolean: If True, let Orion manage the parallelism of this CubeDefault: True
Maximum number of Cubes (max_parallel) type: integer: The maximum number of concurrently running copies of this CubeDefault: 1000 , Min: 1
Minimum number of Cubes (min_parallel) type: integer: The minimum number of concurrently running copies of this CubeDefault: 0
Starting number of Cubes (scaleup_start) type: integer: The initial number of concurrently running copies of this CubeDefault: 2 , Min: 1