Sample Collection

Category Paths

Follow one of these paths in the Orion user interface, to find the floe.

  • Product-based/FastROCS

  • Product-based/Gigadock

  • Role-based/Computational Chemist

  • Solution-based/Virtual-screening/DB Search

  • Task-based/Data Science/Sampling and Subsetting

Description

This floe randomly samples the contents of a FastROCS or GigaDocking collection and writes the sample to a dataset.

If the total size of the collection is less than the sample size, the entire collection will be written to the dataset.

Note: Some older externalized collections in Organization Data cannot be passed to this floe (an error will occur).

Promoted Parameters

Title in user interface (promoted name)

Inputs

Input Collection (input_collection): FastROCS or Giga Docking collection to sample. Requires exactly one collection.

  • Required

  • Type: collection_source

Outputs

Output Sample Dataset (output_sample_dataset): Output dataset to write to

  • Required

  • Type: dataset_out

  • Default: Sample Dataset

Options

Sample Size (sample_size): Number of records to sample from the collection. Max value 100,000.

  • Required

  • Type: integer

  • Default: 1000