Launch Reaction EnumerationsΒΆ

This Floe launches individual reaction enumeration Floes for all the reaction ids selected for enumeration from the uploaded reagent collection.

WARNING For reaction libraries in the 100M size or larger, contact support@eyesopen.com for assistance.

General Usage

10^10 products have been enumerated successfully with this approach, but careful investigation using the specific reagent collection of interest is highly encouraged. It is recommended that the Reagent Archive Summary is run on the desired reagent collection to verify the number of potential products before embarking on a large enumeration activity. Preliminary runs with a subset of the potential reaction(s) should be performed to ensure the product dataset contains all the desired output fields.

Reaction ID - the reaction ids from the specified reagent collection (Reagent Collection ID) to be enumerated. All is an acceptable selection but should be used very cautiously as this can result in a large number of launched Floes and a very expensive overall cost for the enumeration.

Append Enumeration Products To Collection - name of the output collection for output of the product records. If the collection does not exist, it will be created, and if it exists, the product shards will be added to the collection. The append feature means that the specified output collection will never be closed to allow additional product enumerations to populate the collection. It is recommended that ocli be used to close the collection once all the enumerations have been completed. The collection may display with 0 size, but that is simply because the size is not computed until the collection is closed. Regardless of its open/closed state, the product collection can still be used directly in other Floes.

Product ID Style - the desired style of output product ids.

Product ID Delimiter - the delimiter to use for the Custom style of product ids.

Product Rec/Shard - desired records/shard for the final output collection shards.

Advanced Usage

Concurrent Enumeration Job Limit - Limits the number of running enumeration Floes to this limit (0: unlimited).

Enumeration Cost Budget (Dollars) - The launching Floe will not launch any more enumeration jobs once this (dollar)

cost limit (if available) is met from all the child jobs. The launching Floe will never kill running Floes, however.

Enumeration Time Budget (Minutes) - The launching Floe will not launch any more enumeration jobs once this time limit

(if available) is met from all the child jobs. The launching Floe will never kill running Floes, however.

Enumeration Failure Limit - Enumeration jobs are continued to be launched until this number of failures is reached (0:

no limit).

InFlight Job Priority Limit - This limit caps the number of running enumeration Floes to this product limit. For example, if there is a reaction with 1B products launched, this limit can be used to reduce stress on the stack by limiting the launch of additional enumeration jobs until this large reaction completes and required compute instances are freed for use. The limit is a total number of products for all enumeration Floes in flight, and as Floes complete, additional enumeration Floes will be launched so long as this in-flight limit is not exceeded.

Reaction Product Minimum, Reaction Product Maximum - Only enumerate reactions limited by this range of products

Skip Reaction Sizes - Ignore reactions that generate more than this number of products.

Reagent Enumeration Floe - designates the name or id of the child Floe that enumerates a single reaction id - this should be the single reaction enumeration Floe from this package.

Extra Required Parameters

  • Reagent Collection (collection_source) : Designate the input reagent collection by name or id
  • Reagent Enumeration Floe (string) : Designate the enumeration floe by title or id for product enumerations
    Default: Large Scale Reaction Enumeration v0.1.9
  • Product ID Style (string) : Which style of product id to generate
    Default: EnamineREALSpace
    Choices: None, EnamineREALSpace, Custom
  • Append Enumeration Products to Collection (string) : The name or id of the collection for appending enumeration products
  • Product rec/shard (integer) : Records per shard for the final emitted product shards. For post-processing with Omega directly, 2500 is optimum. For use in the Giga Docking Prep floe, 50K is optimum
    Default: 100000
  • Reaction IDs (string) : Either a comma-delimited list of reactions, or All to run all reactions from the input collection