Replicate Transform
    • Dark
      Light

    Replicate Transform

    • Dark
      Light

    Article summary

    The replicate transform replicates a Pandas DataFrame. It will replicate it once for each asset in the asset_list. It will also add an ‘asset’ column that contains the name of the asset.

    Example:

    Say we have a DataFrame that looks like this:

    some_field,stream_type,timestamp
    3.14      ,'a_stream' ,2018-01-01 00:00:00.000000
    2.71828   ,'b_stream' ,2018-01-02 00:00:00.000000
    0.0       ,'my_stream',2018-01-03 00:00:00.000000

    And we want to replicate it for three different assets:

    asset   ,some_field,stream_type,timestamp
    'asset1',3.14      ,'a_stream' ,2018-01-01 00:00:00.000000
    'asset1',2.71828   ,'b_stream' ,2018-01-02 00:00:00.000000
    'asset1',0.0       ,'my_stream',2018-01-03 00:00:00.000000
    'asset2',3.14      ,'a_stream' ,2018-01-01 00:00:00.000000
    'asset2',2.71828   ,'b_stream' ,2018-01-02 00:00:00.000000
    'asset2',0.0       ,'my_stream',2018-01-03 00:00:00.000000
    'asset3',3.14      ,'a_stream' ,2018-01-01 00:00:00.000000
    'asset3',2.71828   ,'b_stream' ,2018-01-02 00:00:00.000000
    'asset3',0.0       ,'my_stream',2018-01-03 00:00:00.000000

    Our configuration will look something like this:

    {
        "transform_name": "Replicate",
        "transform_type": "replicate",
        "filter_stream": [
            "*"
        ],
        "asset_list": ["asset1", "asset2", "asset3"],
    }

    Configuration:

    Required and optional properties that can be configured for a replicate transform.

    • asset_list: The list of assets that will be added to the DataFrame
    • transform_name: Unique name for the transform.
    • transform_type: Type of transform to apply. Should be replicate.
    • filter_stream: List of data streams to transform. Each stream can either be * (all) or asset:stream.

    What's Next