SMARTS Substructure Filter
This cube filters records based on SMARTS query matching.
Candidate molecules are read from the field specified by the Input Molecule Field parameter. The substructure is defined by the SMARTS Pattern parameter.
Upon successful substructure match, the record is sent to the success port. If no match is detected, the record is sent to the failure port.
Note
While this cube has to modify the input molecule (performing necessary perceptions and adding/suppressing hydrogens prior to performing the substructure search), these changes will not persist when the record is sent to the success port.
See also
- Substructure Search section in OEChem TK manual 
Calculation Parameters
CPUs (cpu_count) type: integer: The number of CPUs to run this cube withDefault: 1 , Min: 1, Max: 192
Cube Metrics (cube_metrics) type: string: Set of metrics to be collectedChoices: cpu, disk, memory, network
Temporary Disk Space (MiB) (disk_space) type: decimal: The minimum amount of disk space in MiB (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.Default: 5120.0 , Min: 128.0, Max: 8589934592
GPUs (gpu_count) type: decimal: The number of GPUs to run this cube withDefault: 0 , Max: 16
Hydrogen Handling (hydrogen_handling) type: string: This parameter determines whether to suppress or add explicit hydrogens to the target molecules prior to performing the substructure search.Choices: AddExplicitHydrogens, SuppressHydrogens
Instance Tags (instance_tags) type: string: Only run on machines with matching tags (comma separated)Default: “”
Instance Type (instance_type) type: string: The type of instance that this cube needs to be run on
Max Backlog Wait (max_backlog_wait) type: integer: The max time (in seconds) that a cube will be backlogged on a group before being re-evaluatedDefault: 600 , Min: 300
Memory (MiB) (memory_mb) type: decimal: The minimum amount of memory in MiBs (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.Default: 1800 , Min: 256.0, Max: 8589934592
Metric Period (metric_period) type: decimal: How often to sample metrics, in secondsDefault: 60Choices: 1, 5, 10, 30, 60, 120, 180, 240, 300, Min: 1, Max: 300
Thread limit per CPU (pids_per_cpu_limit) type: integer: The number of threads per CPUDefault: 32
Shared Memory (MiB) (shared_memory_mb) type: decimal: The amount of shared memory to allow a container to addressDefault: 64
SMARTS Pattern (smarts) type: string: The string that defines the substructure as a SMARTS.
Spot policy (spot_policy) type: string: Control cube placement on spot market instancesDefault: ProhibitedChoices: Allowed, Preferred, NotPreferred, Prohibited, Required
Field parameters
None (in_mol_field) type: Field Type: Chem.Mol:
None (out_mol_field) type: Field Type: Chem.Mol:
Substructure Search Parameters
- The parameters of substructure search.
- Hydrogen Handling (None) type: string: This parameter determines whether to suppress or add explicit hydrogens to the target molecules prior to performing the substructure search.Choices: AddExplicitHydrogens, SuppressHydrogens
- SMARTS Pattern (None) type: string: The string that defines the substructure as a SMARTS.
 
Hardware Parameters
- Machine hardware requirements
- Memory (MiB) (memory_mb) type: decimal: The minimum amount of memory in MiBs (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.Default: 1800 , Min: 256.0, Max: 8589934592
- Shared Memory (MiB) (shared_memory_mb) type: decimal: The amount of shared memory to allow a container to addressDefault: 64
- Thread limit per CPU (pids_per_cpu_limit) type: integer: The number of threads per CPUDefault: 32
- Max Backlog Wait (max_backlog_wait) type: integer: The max time (in seconds) that a cube will be backlogged on a group before being re-evaluatedDefault: 600 , Min: 300
- Temporary Disk Space (MiB) (disk_space) type: decimal: The minimum amount of disk space in MiB (1048576 B) this cube requires. Due to overhead, request a couple hundred MiB more than required.Default: 5120.0 , Min: 128.0, Max: 8589934592
- GPUs (gpu_count) type: decimal: The number of GPUs to run this cube withDefault: 0 , Max: 16
- CPUs (cpu_count) type: integer: The number of CPUs to run this cube withDefault: 1 , Min: 1, Max: 192
- Instance Type (instance_type) type: string: The type of instance that this cube needs to be run on
- Spot policy (spot_policy) type: string: Control cube placement on spot market instancesDefault: ProhibitedChoices: Allowed, Preferred, NotPreferred, Prohibited, Required
- Instance Tags (instance_tags) type: string: Only run on machines with matching tags (comma separated)Default: “”
 
Metrics Parameters
- Cube Metric Parameters
- Metric Period (None) type: decimal: How often to sample metrics, in secondsDefault: 60Choices: 1, 5, 10, 30, 60, 120, 180, 240, 300, Min: 1, Max: 300
- Cube Metrics (None) type: string: Set of metrics to be collectedChoices: cpu, disk, memory, network
 
Parallel SMARTS Substructure Filter
The parallel version adds these extra parameters.
Number of messages to distribute at a time (item_count) type: integer: The maximum number of messages to bundle together for a parallel cube.Default: 1 , Min: 1, Max: 65535
Maximum Failures (max_failures) type: integer: The maximum number of times to attempt processing a work itemDefault: 10 , Min: 1, Max: 100
Autoscale this Cube (autoscale) type: boolean: If True, let Orion manage the parallelism of this CubeDefault: True
Maximum number of Cubes (max_parallel) type: integer: The maximum number of concurrently running copies of this CubeDefault: 1000 , Min: 1
Minimum number of Cubes (min_parallel) type: integer: The minimum number of concurrently running copies of this CubeDefault: 0
Starting number of Cubes (scaleup_start) type: integer: The initial number of concurrently running copies of this CubeDefault: 2 , Min: 1