Sample Collection
Description
Randomly samples the contents of a FastROCS or GigaDocking collections and writes the sample to a dataset.
If the total size of the collection is less than the sample size the entire collection will be written to the dataset
Note: Some older externalized collections in Organization Data cannot be passed this is floe (an error will occur).
See also
This floes is used in the Prepare Vendor Database for Giga Docking and FastROCS tutorial.
Details
Title : Sample CollectionTags : Munging Large Scale Floes UtilityPython Name : #10_sample_collection
Parameters
Inputs
Input Collection FastROCS or Giga Docking collection to sample. Requires exactly 1 collection.Type : collection_sourceRequired : TruePython Name : input_collection
Outputs
Output Sample Dataset Output dataset to write toType : dataset_outRequired : TrueDefault : Sample DatasetPython Name : output_sample_dataset
Options
Sample Size Number of records to sample from the collection. Max value 100,000.Type : integerRequired : TrueDefault : 1000Max Value : 1000000Python Name : sample_size