Replicate Transform
- Print
- DarkLight
Replicate Transform
- Print
- DarkLight
Article summary
Did you find this summary helpful?
Thank you for your feedback
The replicate transform replicates a Pandas DataFrame. It will replicate it once for each asset in the asset_list. It will also add an ‘asset’ column that contains the name of the asset.
Example:
Say we have a DataFrame that looks like this:
some_field,stream_type,timestamp
3.14 ,'a_stream' ,2018-01-01 00:00:00.000000
2.71828 ,'b_stream' ,2018-01-02 00:00:00.000000
0.0 ,'my_stream',2018-01-03 00:00:00.000000
And we want to replicate it for three different assets:
asset ,some_field,stream_type,timestamp
'asset1',3.14 ,'a_stream' ,2018-01-01 00:00:00.000000
'asset1',2.71828 ,'b_stream' ,2018-01-02 00:00:00.000000
'asset1',0.0 ,'my_stream',2018-01-03 00:00:00.000000
'asset2',3.14 ,'a_stream' ,2018-01-01 00:00:00.000000
'asset2',2.71828 ,'b_stream' ,2018-01-02 00:00:00.000000
'asset2',0.0 ,'my_stream',2018-01-03 00:00:00.000000
'asset3',3.14 ,'a_stream' ,2018-01-01 00:00:00.000000
'asset3',2.71828 ,'b_stream' ,2018-01-02 00:00:00.000000
'asset3',0.0 ,'my_stream',2018-01-03 00:00:00.000000
Our configuration will look something like this:
{
"transform_name": "Replicate",
"transform_type": "replicate",
"filter_stream": [
"*"
],
"asset_list": ["asset1", "asset2", "asset3"],
}
Configuration:
Required and optional properties that can be configured for a replicate transform.
- asset_list: The list of assets that will be added to the DataFrame
- transform_name: Unique name for the transform.
- transform_type: Type of transform to apply. Should be replicate.
- filter_stream: List of data streams to transform. Each stream can either be * (all) or asset:stream.