SupplyChain / Client / create_data_integration_flow
create_data_integration_flow#
- SupplyChain.Client.create_data_integration_flow(**kwargs)#
- Enables you to programmatically create a data pipeline to ingest data from source systems such as Amazon S3 buckets, to a predefined Amazon Web Services Supply Chain dataset (product, inbound_order) or a temporary dataset along with the data transformation query provided with the API. - See also: AWS API Documentation - Request Syntax- response = client.create_data_integration_flow( instanceId='string', name='string', sources=[ { 'sourceType': 'S3'|'DATASET', 'sourceName': 'string', 's3Source': { 'bucketName': 'string', 'prefix': 'string', 'options': { 'fileType': 'CSV'|'PARQUET'|'JSON' } }, 'datasetSource': { 'datasetIdentifier': 'string', 'options': { 'loadType': 'INCREMENTAL'|'REPLACE', 'dedupeRecords': True|False } } }, ], transformation={ 'transformationType': 'SQL'|'NONE', 'sqlTransformation': { 'query': 'string' } }, target={ 'targetType': 'S3'|'DATASET', 's3Target': { 'bucketName': 'string', 'prefix': 'string', 'options': { 'fileType': 'CSV'|'PARQUET'|'JSON' } }, 'datasetTarget': { 'datasetIdentifier': 'string', 'options': { 'loadType': 'INCREMENTAL'|'REPLACE', 'dedupeRecords': True|False } } }, tags={ 'string': 'string' } ) - Parameters:
- instanceId (string) – - [REQUIRED] - The Amazon Web Services Supply Chain instance identifier. 
- name (string) – - [REQUIRED] - Name of the DataIntegrationFlow. 
- sources (list) – - [REQUIRED] - The source configurations for DataIntegrationFlow. - (dict) – - The DataIntegrationFlow source parameters. - sourceType (string) – [REQUIRED] - The DataIntegrationFlow source type. 
- sourceName (string) – [REQUIRED] - The DataIntegrationFlow source name that can be used as table alias in SQL transformation query. 
- s3Source (dict) – - The S3 DataIntegrationFlow source. - bucketName (string) – [REQUIRED] - The bucketName of the S3 source objects. 
- prefix (string) – [REQUIRED] - The prefix of the S3 source objects. 
- options (dict) – - The other options of the S3 DataIntegrationFlow source. - fileType (string) – - The Amazon S3 file type in S3 options. 
 
 
- datasetSource (dict) – - The dataset DataIntegrationFlow source. - datasetIdentifier (string) – [REQUIRED] - The ARN of the dataset. 
- options (dict) – - The dataset DataIntegrationFlow source options. - loadType (string) – - The dataset data load type in dataset options. 
- dedupeRecords (boolean) – - The dataset load option to remove duplicates. 
 
 
 
 
- transformation (dict) – - [REQUIRED] - The transformation configurations for DataIntegrationFlow. - transformationType (string) – [REQUIRED] - The DataIntegrationFlow transformation type. 
- sqlTransformation (dict) – - The SQL DataIntegrationFlow transformation configuration. - query (string) – [REQUIRED] - The transformation SQL query body based on SparkSQL. 
 
 
- target (dict) – - [REQUIRED] - The target configurations for DataIntegrationFlow. - targetType (string) – [REQUIRED] - The DataIntegrationFlow target type. 
- s3Target (dict) – - The S3 DataIntegrationFlow target. - bucketName (string) – [REQUIRED] - The bucketName of the S3 target objects. 
- prefix (string) – [REQUIRED] - The prefix of the S3 target objects. 
- options (dict) – - The S3 DataIntegrationFlow target options. - fileType (string) – - The Amazon S3 file type in S3 options. 
 
 
- datasetTarget (dict) – - The dataset DataIntegrationFlow target. - datasetIdentifier (string) – [REQUIRED] - The dataset ARN. 
- options (dict) – - The dataset DataIntegrationFlow target options. - loadType (string) – - The dataset data load type in dataset options. 
- dedupeRecords (boolean) – - The dataset load option to remove duplicates. 
 
 
 
- tags (dict) – - The tags of the DataIntegrationFlow to be created - (string) – - (string) – 
 
 
 
- Return type:
- dict 
- Returns:
- Response Syntax- { 'instanceId': 'string', 'name': 'string' } - Response Structure- (dict) – - The response parameters for CreateDataIntegrationFlow. - instanceId (string) – - The Amazon Web Services Supply Chain instance identifier. 
- name (string) – - The name of the DataIntegrationFlow created. 
 
 
 - Exceptions- SupplyChain.Client.exceptions.ServiceQuotaExceededException
- SupplyChain.Client.exceptions.ThrottlingException
- SupplyChain.Client.exceptions.ResourceNotFoundException
- SupplyChain.Client.exceptions.AccessDeniedException
- SupplyChain.Client.exceptions.ValidationException
- SupplyChain.Client.exceptions.InternalServerException
- SupplyChain.Client.exceptions.ConflictException