Dataset to Collection Export

Category Paths

Follow one of these paths in the Orion user interface to find the floe.

  • Task-based/Data Science/Conversion

Description

This floe converts an Orion dataset to an Orion collection. You can specify both the maximum total number of records to process and the number of records per shard. File type conversion is handled by orion-platform’s DRConvert capability. Output files that do not contain molecular data can be downloaded but cannot be analyzed in the Orion UI.
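
To illustrate how a total record limit and a per-shard record count interact, here is a minimal sketch in plain Python. It is not the floe's implementation and does not use the orion-platform API; the function name shard_records and the parameters records_per_shard and max_records are hypothetical names chosen for this example.

```python
from itertools import islice
from typing import Iterable, Iterator, List, Optional, TypeVar

T = TypeVar("T")


def shard_records(
    records: Iterable[T],
    records_per_shard: int,
    max_records: Optional[int] = None,
) -> Iterator[List[T]]:
    """Yield lists of records, each holding at most records_per_shard items.

    Hypothetical helper for illustration only; it does not call Orion.
    If max_records is given, reading stops after that many records in total.
    """
    if records_per_shard < 1:
        raise ValueError("records_per_shard must be at least 1")

    stream = iter(records)
    if max_records is not None:
        # Cap the total number of records read from the source.
        stream = islice(stream, max_records)

    while True:
        # Pull the next group of records; the last group may be smaller.
        shard = list(islice(stream, records_per_shard))
        if not shard:
            return
        yield shard


# Ten records, capped at nine in total, four records per shard.
shards = list(shard_records(range(10), records_per_shard=4, max_records=9))
```

In this sketch the total limit is applied first and the capped stream is then split into shards, so the final shard may contain fewer records than the others.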

Titles of required parameters (promoted names)

  • Number of records in a batch (batch_size) type: integer: Maximum number of records to read with this cube
    Default: 50000
  • Data to read from (data_in) type: data_source: The data to read from
  • Collection Name (collection_name) type: collection_sink: Name of the collection to create