Glue.Client.
get_data_quality_ruleset_evaluation_run
(**kwargs)¶Retrieves a specific run where a ruleset is evaluated against a data source.
See also: AWS API Documentation
Request Syntax
response = client.get_data_quality_ruleset_evaluation_run(
RunId='string'
)
[REQUIRED]
The unique run identifier associated with this run.
{
'RunId': 'string',
'DataSource': {
'GlueTable': {
'DatabaseName': 'string',
'TableName': 'string',
'CatalogId': 'string',
'ConnectionName': 'string',
'AdditionalOptions': {
'string': 'string'
}
}
},
'Role': 'string',
'NumberOfWorkers': 123,
'Timeout': 123,
'AdditionalRunOptions': {
'CloudWatchMetricsEnabled': True|False,
'ResultsS3Prefix': 'string'
},
'Status': 'STARTING'|'RUNNING'|'STOPPING'|'STOPPED'|'SUCCEEDED'|'FAILED'|'TIMEOUT',
'ErrorString': 'string',
'StartedOn': datetime(2015, 1, 1),
'LastModifiedOn': datetime(2015, 1, 1),
'CompletedOn': datetime(2015, 1, 1),
'ExecutionTime': 123,
'RulesetNames': [
'string',
],
'ResultIds': [
'string',
]
}
Response Structure
The unique run identifier associated with this run.
The data source (an Glue table) associated with this evaluation run.
An Glue table.
A database name in the Glue Data Catalog.
A table name in the Glue Data Catalog.
A unique identifier for the Glue Data Catalog.
The name of the connection to the Glue Data Catalog.
Additional options for the table. Currently there are two keys supported:
pushDownPredicate
: to filter on partitions without having to list and read all the files in your dataset.catalogPartitionPredicate
: to use server-side partition pruning using partition indexes in the Glue Data Catalog.An IAM role supplied to encrypt the results of the run.
The number of G.1X
workers to be used in the run. The default is 5.
The timeout for a run in minutes. This is the maximum time that a run can consume resources before it is terminated and enters TIMEOUT
status. The default is 2,880 minutes (48 hours).
Additional run options you can specify for an evaluation run.
Whether or not to enable CloudWatch metrics.
Prefix for Amazon S3 to store results.
The status for this run.
The error strings that are associated with the run.
The date and time when this run started.
A timestamp. The last point in time when this data quality rule recommendation run was modified.
The date and time when this run was completed.
The amount of time (in seconds) that the run consumed resources.
A list of ruleset names for the run.
A list of result IDs for the data quality results for the run.
Exceptions
Glue.Client.exceptions.EntityNotFoundException
Glue.Client.exceptions.InvalidInputException
Glue.Client.exceptions.OperationTimeoutException
Glue.Client.exceptions.InternalServiceException