Dataset to Collection Export

Transform a Dataset to a Collection

Extra Required Parameters

  • Collection Name (collection_sink) : Name of the collection to create
  • Number of records in a batch (integer) : Maximum number of records to read with this cube
    Default: 50000
  • Data to read from (data_source) : The data to read from
  • Output Shard Format (string) : The format of the data that shards will contain
    Default: oedb
    Choices: oeb.gz, oedb, oez, ism.gz, oeb
  • records_per_shard (integer) : The target number of records per shard. A value of 0 fills each shard up to the max_shard_bytes limit
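
The interaction between records_per_shard and max_shard_bytes can be sketched in plain Python. This is a minimal illustration, not the cube's actual implementation: the function name split_into_shards is hypothetical, records are modeled as already-serialized byte strings, and the sketch assumes the byte limit still applies when records_per_shard is greater than 0, which the parameter description does not state.

```python
from typing import Iterable, Iterator, List


def split_into_shards(
    records: Iterable[bytes],
    records_per_shard: int,
    max_shard_bytes: int,
) -> Iterator[List[bytes]]:
    """Group serialized records into shards (hypothetical helper, illustration only)."""
    shard: List[bytes] = []
    shard_bytes = 0
    for record in records:
        # Close the shard if adding this record would exceed the byte limit.
        over_bytes = bool(shard) and shard_bytes + len(record) > max_shard_bytes
        # A records_per_shard of 0 disables the record-count limit entirely.
        over_count = records_per_shard > 0 and len(shard) >= records_per_shard
        if over_bytes or over_count:
            yield shard
            shard, shard_bytes = [], 0
        shard.append(record)
        shard_bytes += len(record)
    if shard:
        yield shard


if __name__ == "__main__":
    data = [b"x" * 10] * 7
    # Count-limited: at most 3 records per shard.
    print([len(s) for s in split_into_shards(data, 3, 1000)])
    # records_per_shard = 0: shards are filled up to the byte limit only.
    print([len(s) for s in split_into_shards(data, 0, 25)])
```

With seven 10-byte records, the first call groups them by count and the second groups them by size alone, which is the behavior the 0 setting describes.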