EntityResolution / Client / update_matching_workflow

update_matching_workflow#

EntityResolution.Client.update_matching_workflow(**kwargs)#

Updates an existing MatchingWorkflow. This method is identical to CreateMatchingWorkflow, except it uses an HTTP PUT request instead of a POST request, and the MatchingWorkflow must already exist for the method to succeed.

See also: AWS API Documentation

Request Syntax

response = client.update_matching_workflow(
    description='string',
    incrementalRunConfig={
        'incrementalRunType': 'IMMEDIATE'
    },
    inputSourceConfig=[
        {
            'applyNormalization': True|False,
            'inputSourceARN': 'string',
            'schemaName': 'string'
        },
    ],
    outputSourceConfig=[
        {
            'KMSArn': 'string',
            'applyNormalization': True|False,
            'output': [
                {
                    'hashed': True|False,
                    'name': 'string'
                },
            ],
            'outputS3Path': 'string'
        },
    ],
    resolutionTechniques={
        'resolutionType': 'RULE_MATCHING'|'ML_MATCHING',
        'ruleBasedProperties': {
            'attributeMatchingModel': 'ONE_TO_ONE'|'MANY_TO_MANY',
            'rules': [
                {
                    'matchingKeys': [
                        'string',
                    ],
                    'ruleName': 'string'
                },
            ]
        }
    },
    roleArn='string',
    workflowName='string'
)
Parameters:
  • description (string) – A description of the workflow.

  • incrementalRunConfig (dict) –

    An object which defines an incremental run type and has only incrementalRunType as a field.

    • incrementalRunType (string) –

      The type of incremental run. It takes only one value: IMMEDIATE.

  • inputSourceConfig (list) –

    [REQUIRED]

    A list of InputSource objects, which have the fields InputSourceARN and SchemaName.

    • (dict) –

      An object containing InputSourceARN, SchemaName, and ApplyNormalization.

      • applyNormalization (boolean) –

        Normalizes the attributes defined in the schema in the input data. For example, if an attribute has an AttributeType of PHONE_NUMBER, and the data in the input table is in a format of 1234567890, Entity Resolution will normalize this field in the output to (123)-456-7890.

      • inputSourceARN (string) – [REQUIRED]

        An Glue table ARN for the input source table.

      • schemaName (string) – [REQUIRED]

        The name of the schema to be retrieved.

  • outputSourceConfig (list) –

    [REQUIRED]

    A list of OutputSource objects, each of which contains fields OutputS3Path, ApplyNormalization, and Output.

    • (dict) –

      A list of OutputAttribute objects, each of which have the fields Name and Hashed. Each of these objects selects a column to be included in the output table, and whether the values of the column should be hashed.

      • KMSArn (string) –

        Customer KMS ARN for encryption at rest. If not provided, system will use an Entity Resolution managed KMS key.

      • applyNormalization (boolean) –

        Normalizes the attributes defined in the schema in the input data. For example, if an attribute has an AttributeType of PHONE_NUMBER, and the data in the input table is in a format of 1234567890, Entity Resolution will normalize this field in the output to (123)-456-7890.

      • output (list) – [REQUIRED]

        A list of OutputAttribute objects, each of which have the fields Name and Hashed. Each of these objects selects a column to be included in the output table, and whether the values of the column should be hashed.

        • (dict) –

          A list of OutputAttribute objects, each of which have the fields Name and Hashed. Each of these objects selects a column to be included in the output table, and whether the values of the column should be hashed.

          • hashed (boolean) –

            Enables the ability to hash the column values in the output.

          • name (string) – [REQUIRED]

            A name of a column to be written to the output. This must be an InputField name in the schema mapping.

      • outputS3Path (string) – [REQUIRED]

        The S3 path to which Entity Resolution will write the output table.

  • resolutionTechniques (dict) –

    [REQUIRED]

    An object which defines the resolutionType and the ruleBasedProperties

    • resolutionType (string) –

      There are two types of matching, RULE_MATCHING and ML_MATCHING

    • ruleBasedProperties (dict) –

      An object which defines the list of matching rules to run and has a field Rules, which is a list of rule objects.

      • attributeMatchingModel (string) – [REQUIRED]

        You can either choose ONE_TO_ONE or MANY_TO_MANY as the AttributeMatchingModel. When choosing MANY_TO_MANY, the system can match attribute across the sub-types of an attribute type. For example, if the value of the Email field of Profile A and the value of BusinessEmail field of Profile B matches, the two profiles are matched on the Email type. When choosing ONE_TO_ONE the system can only match if the sub-types are exact matches. For example, only when the value of the Email field of Profile A and the value of the Email field of Profile B matches, the two profiles are matched on the Email type.

      • rules (list) – [REQUIRED]

        A list of Rule objects, each of which have fields RuleName and MatchingKeys.

        • (dict) –

          An object containing RuleName, and MatchingKeys.

          • matchingKeys (list) – [REQUIRED]

            A list of MatchingKeys. The MatchingKeys must have been defined in the SchemaMapping. Two records are considered to match according to this rule if all of the MatchingKeys match.

            • (string) –

          • ruleName (string) – [REQUIRED]

            A name for the matching rule.

  • roleArn (string) –

    [REQUIRED]

    The Amazon Resource Name (ARN) of the IAM role. AWS Entity Resolution assumes this role to create resources on your behalf as part of workflow execution.

  • workflowName (string) –

    [REQUIRED]

    The name of the workflow to be retrieved.

Return type:

dict

Returns:

Response Syntax

{
    'description': 'string',
    'incrementalRunConfig': {
        'incrementalRunType': 'IMMEDIATE'
    },
    'inputSourceConfig': [
        {
            'applyNormalization': True|False,
            'inputSourceARN': 'string',
            'schemaName': 'string'
        },
    ],
    'outputSourceConfig': [
        {
            'KMSArn': 'string',
            'applyNormalization': True|False,
            'output': [
                {
                    'hashed': True|False,
                    'name': 'string'
                },
            ],
            'outputS3Path': 'string'
        },
    ],
    'resolutionTechniques': {
        'resolutionType': 'RULE_MATCHING'|'ML_MATCHING',
        'ruleBasedProperties': {
            'attributeMatchingModel': 'ONE_TO_ONE'|'MANY_TO_MANY',
            'rules': [
                {
                    'matchingKeys': [
                        'string',
                    ],
                    'ruleName': 'string'
                },
            ]
        }
    },
    'roleArn': 'string',
    'workflowName': 'string'
}

Response Structure

  • (dict) –

    • description (string) –

      A description of the workflow.

    • incrementalRunConfig (dict) –

      An object which defines an incremental run type and has only incrementalRunType as a field.

      • incrementalRunType (string) –

        The type of incremental run. It takes only one value: IMMEDIATE.

    • inputSourceConfig (list) –

      A list of InputSource objects, which have the fields InputSourceARN and SchemaName.

      • (dict) –

        An object containing InputSourceARN, SchemaName, and ApplyNormalization.

        • applyNormalization (boolean) –

          Normalizes the attributes defined in the schema in the input data. For example, if an attribute has an AttributeType of PHONE_NUMBER, and the data in the input table is in a format of 1234567890, Entity Resolution will normalize this field in the output to (123)-456-7890.

        • inputSourceARN (string) –

          An Glue table ARN for the input source table.

        • schemaName (string) –

          The name of the schema to be retrieved.

    • outputSourceConfig (list) –

      A list of OutputSource objects, each of which contains fields OutputS3Path, ApplyNormalization, and Output.

      • (dict) –

        A list of OutputAttribute objects, each of which have the fields Name and Hashed. Each of these objects selects a column to be included in the output table, and whether the values of the column should be hashed.

        • KMSArn (string) –

          Customer KMS ARN for encryption at rest. If not provided, system will use an Entity Resolution managed KMS key.

        • applyNormalization (boolean) –

          Normalizes the attributes defined in the schema in the input data. For example, if an attribute has an AttributeType of PHONE_NUMBER, and the data in the input table is in a format of 1234567890, Entity Resolution will normalize this field in the output to (123)-456-7890.

        • output (list) –

          A list of OutputAttribute objects, each of which have the fields Name and Hashed. Each of these objects selects a column to be included in the output table, and whether the values of the column should be hashed.

          • (dict) –

            A list of OutputAttribute objects, each of which have the fields Name and Hashed. Each of these objects selects a column to be included in the output table, and whether the values of the column should be hashed.

            • hashed (boolean) –

              Enables the ability to hash the column values in the output.

            • name (string) –

              A name of a column to be written to the output. This must be an InputField name in the schema mapping.

        • outputS3Path (string) –

          The S3 path to which Entity Resolution will write the output table.

    • resolutionTechniques (dict) –

      An object which defines the resolutionType and the ruleBasedProperties

      • resolutionType (string) –

        There are two types of matching, RULE_MATCHING and ML_MATCHING

      • ruleBasedProperties (dict) –

        An object which defines the list of matching rules to run and has a field Rules, which is a list of rule objects.

        • attributeMatchingModel (string) –

          You can either choose ONE_TO_ONE or MANY_TO_MANY as the AttributeMatchingModel. When choosing MANY_TO_MANY, the system can match attribute across the sub-types of an attribute type. For example, if the value of the Email field of Profile A and the value of BusinessEmail field of Profile B matches, the two profiles are matched on the Email type. When choosing ONE_TO_ONE the system can only match if the sub-types are exact matches. For example, only when the value of the Email field of Profile A and the value of the Email field of Profile B matches, the two profiles are matched on the Email type.

        • rules (list) –

          A list of Rule objects, each of which have fields RuleName and MatchingKeys.

          • (dict) –

            An object containing RuleName, and MatchingKeys.

            • matchingKeys (list) –

              A list of MatchingKeys. The MatchingKeys must have been defined in the SchemaMapping. Two records are considered to match according to this rule if all of the MatchingKeys match.

              • (string) –

            • ruleName (string) –

              A name for the matching rule.

    • roleArn (string) –

      The Amazon Resource Name (ARN) of the IAM role. AWS Entity Resolution assumes this role to create resources on your behalf as part of workflow execution.

    • workflowName (string) –

      The name of the workflow.

Exceptions

  • EntityResolution.Client.exceptions.ThrottlingException

  • EntityResolution.Client.exceptions.InternalServerException

  • EntityResolution.Client.exceptions.ResourceNotFoundException

  • EntityResolution.Client.exceptions.AccessDeniedException

  • EntityResolution.Client.exceptions.ValidationException