describe_compilation_job
(**kwargs)¶Returns information about a model compilation job.
To create a model compilation job, use CreateCompilationJob. To get information about multiple model compilation jobs, use ListCompilationJobs.
See also: AWS API Documentation
Request Syntax
response = client.describe_compilation_job(
CompilationJobName='string'
)
[REQUIRED]
The name of the model compilation job that you want information about.
{
'CompilationJobName': 'string',
'CompilationJobArn': 'string',
'CompilationJobStatus': 'INPROGRESS'|'COMPLETED'|'FAILED'|'STARTING'|'STOPPING'|'STOPPED',
'CompilationStartTime': datetime(2015, 1, 1),
'CompilationEndTime': datetime(2015, 1, 1),
'StoppingCondition': {
'MaxRuntimeInSeconds': 123,
'MaxWaitTimeInSeconds': 123
},
'InferenceImage': 'string',
'ModelPackageVersionArn': 'string',
'CreationTime': datetime(2015, 1, 1),
'LastModifiedTime': datetime(2015, 1, 1),
'FailureReason': 'string',
'ModelArtifacts': {
'S3ModelArtifacts': 'string'
},
'ModelDigests': {
'ArtifactDigest': 'string'
},
'RoleArn': 'string',
'InputConfig': {
'S3Uri': 'string',
'DataInputConfig': 'string',
'Framework': 'TENSORFLOW'|'KERAS'|'MXNET'|'ONNX'|'PYTORCH'|'XGBOOST'|'TFLITE'|'DARKNET'|'SKLEARN',
'FrameworkVersion': 'string'
},
'OutputConfig': {
'S3OutputLocation': 'string',
'TargetDevice': 'lambda'|'ml_m4'|'ml_m5'|'ml_c4'|'ml_c5'|'ml_p2'|'ml_p3'|'ml_g4dn'|'ml_inf1'|'ml_eia2'|'jetson_tx1'|'jetson_tx2'|'jetson_nano'|'jetson_xavier'|'rasp3b'|'imx8qm'|'deeplens'|'rk3399'|'rk3288'|'aisage'|'sbe_c'|'qcs605'|'qcs603'|'sitara_am57x'|'amba_cv2'|'amba_cv22'|'amba_cv25'|'x86_win32'|'x86_win64'|'coreml'|'jacinto_tda4vm'|'imx8mplus',
'TargetPlatform': {
'Os': 'ANDROID'|'LINUX',
'Arch': 'X86_64'|'X86'|'ARM64'|'ARM_EABI'|'ARM_EABIHF',
'Accelerator': 'INTEL_GRAPHICS'|'MALI'|'NVIDIA'|'NNA'
},
'CompilerOptions': 'string',
'KmsKeyId': 'string'
},
'VpcConfig': {
'SecurityGroupIds': [
'string',
],
'Subnets': [
'string',
]
}
}
Response Structure
The name of the model compilation job.
The Amazon Resource Name (ARN) of the model compilation job.
The status of the model compilation job.
The time when the model compilation job started the CompilationJob
instances.
You are billed for the time between this timestamp and the timestamp in the DescribeCompilationJobResponse$CompilationEndTime field. In Amazon CloudWatch Logs, the start time might be later than this time. That's because it takes time to download the compilation job, which depends on the size of the compilation job container.
The time when the model compilation job on a compilation job instance ended. For a successful or stopped job, this is when the job's model artifacts have finished uploading. For a failed job, this is when Amazon SageMaker detected that the job failed.
Specifies a limit to how long a model compilation job can run. When the job reaches the time limit, Amazon SageMaker ends the compilation job. Use this API to cap model training costs.
The maximum length of time, in seconds, that a training or compilation job can run before it is stopped.
For compilation jobs, if the job does not complete during this time, a TimeOut
error is generated. We recommend starting with 900 seconds and increasing as necessary based on your model.
For all other jobs, if the job does not complete during this time, SageMaker ends the job. When RetryStrategy
is specified in the job request, MaxRuntimeInSeconds
specifies the maximum time for all of the attempts in total, not each individual attempt. The default value is 1 day. The maximum value is 28 days.
The maximum time that a TrainingJob
can run in total, including any time spent publishing metrics or archiving and uploading models after it has been stopped, is 30 days.
The maximum length of time, in seconds, that a managed Spot training job has to complete. It is the amount of time spent waiting for Spot capacity plus the amount of time the job can run. It must be equal to or greater than MaxRuntimeInSeconds
. If the job does not complete during this time, SageMaker ends the job.
When RetryStrategy
is specified in the job request, MaxWaitTimeInSeconds
specifies the maximum time for all of the attempts in total, not each individual attempt.
The inference image to use when compiling a model. Specify an image only if the target device is a cloud instance.
The Amazon Resource Name (ARN) of the versioned model package that was provided to SageMaker Neo when you initiated a compilation job.
The time that the model compilation job was created.
The time that the status of the model compilation job was last modified.
If a model compilation job failed, the reason it failed.
Information about the location in Amazon S3 that has been configured for storing the model artifacts used in the compilation job.
The path of the S3 object that contains the model artifacts. For example, s3://bucket-name/keynameprefix/model.tar.gz
.
Provides a BLAKE2 hash value that identifies the compiled model artifacts in Amazon S3.
Provides a hash value that uniquely identifies the stored model artifacts.
The Amazon Resource Name (ARN) of an IAM role that Amazon SageMaker assumes to perform the model compilation job.
Information about the location in Amazon S3 of the input model artifacts, the name and shape of the expected data inputs, and the framework in which the model was trained.
The S3 path where the model artifacts, which result from model training, are stored. This path must point to a single gzip compressed tar archive (.tar.gz suffix).
Specifies the name and shape of the expected data inputs for your trained model with a JSON dictionary form. The data inputs are InputConfig$Framework specific.
TensorFlow
: You must specify the name and shape (NHWC format) of the expected data inputs using a dictionary format for your trained model. The dictionary formats required for the console and CLI are different.{"input":[1,1024,1024,3]}
{\"input\":[1,1024,1024,3]}
{"data1": [1,28,28,1], "data2":[1,28,28,1]}
{\"data1\": [1,28,28,1], \"data2\":[1,28,28,1]}
KERAS
: You must specify the name and shape (NCHW format) of expected data inputs using a dictionary format for your trained model. Note that while Keras model artifacts should be uploaded in NHWC (channel-last) format, DataInputConfig
should be specified in NCHW (channel-first) format. The dictionary formats required for the console and CLI are different.{"input_1":[1,3,224,224]}
{\"input_1\":[1,3,224,224]}
{"input_1": [1,3,224,224], "input_2":[1,3,224,224]}
{\"input_1\": [1,3,224,224], \"input_2\":[1,3,224,224]}
MXNET/ONNX/DARKNET
: You must specify the name and shape (NCHW format) of the expected data inputs in order using a dictionary format for your trained model. The dictionary formats required for the console and CLI are different.{"data":[1,3,1024,1024]}
{\"data\":[1,3,1024,1024]}
{"var1": [1,1,28,28], "var2":[1,1,28,28]}
{\"var1\": [1,1,28,28], \"var2\":[1,1,28,28]}
PyTorch
: You can either specify the name and shape (NCHW format) of expected data inputs in order using a dictionary format for your trained model or you can specify the shape only using a list format. The dictionary formats required for the console and CLI are different. The list formats for the console and CLI are the same.{"input0":[1,3,224,224]}
{\"input0\":[1,3,224,224]}
[[1,3,224,224]]
{"input0":[1,3,224,224], "input1":[1,3,224,224]}
{\"input0\":[1,3,224,224], \"input1\":[1,3,224,224]}
[[1,3,224,224], [1,3,224,224]]
XGBOOST
: input data name and shape are not needed.DataInputConfig
supports the following parameters forCoreML
OutputConfig$TargetDevice (ML Model format):
shape
: Input shape, for example {"input_1": {"shape": [1,224,224,3]}}
. In addition to static input shapes, CoreML converter supports Flexible input shapes:{"input_1": {"shape": ["1..10", 224, 224, 3]}}
{"input_1": {"shape": [[1, 224, 224, 3], [1, 160, 160, 3]]}}
default_shape
: Default input shape. You can set a default shape during conversion for both Range Dimension and Enumerated Shapes. For example {"input_1": {"shape": ["1..10", 224, 224, 3], "default_shape": [1, 224, 224, 3]}}
type
: Input type. Allowed values: Image
and Tensor
. By default, the converter generates an ML Model with inputs of type Tensor (MultiArray). User can set input type to be Image. Image input type requires additional input parameters such as bias
and scale
.bias
: If the input type is an Image, you need to provide the bias vector.scale
: If the input type is an Image, you need to provide a scale factor.CoreML ClassifierConfig
parameters can be specified using OutputConfig$CompilerOptions. CoreML converter supports Tensorflow and PyTorch models. CoreML conversion examples:
"DataInputConfig": {"input_1": {"shape": [[1,224,224,3], [1,160,160,3]], "default_shape": [1,224,224,3]}}
"DataInputConfig": [{"shape": [[1,3,224,224], [1,3,160,160]], "default_shape": [1,3,224,224]}]
"DataInputConfig": {"input_1": {"shape": [[1,224,224,3], [1,160,160,3]], "default_shape": [1,224,224,3], "type": "Image", "bias": [-1,-1,-1], "scale": 0.007843137255}}
"CompilerOptions": {"class_labels": "imagenet_labels_1000.txt"}
"DataInputConfig": [{"shape": [[1,3,224,224], [1,3,160,160]], "default_shape": [1,3,224,224], "type": "Image", "bias": [-1,-1,-1], "scale": 0.007843137255}]
"CompilerOptions": {"class_labels": "imagenet_labels_1000.txt"}
Depending on the model format, DataInputConfig
requires the following parameters for ml_eia2
OutputConfig:TargetDevice.
signature_def_key
and the input model shapes for DataInputConfig
. Specify the signature_def_key
in OutputConfig:CompilerOptions if the model does not use TensorFlow's default signature def key. For example:"DataInputConfig": {"inputs": [1, 224, 224, 3]}
"CompilerOptions": {"signature_def_key": "serving_custom"}
DataInputConfig
and the output tensor names for output_names
in OutputConfig:CompilerOptions. For example:"DataInputConfig": {"input_tensor:0": [1, 224, 224, 3]}
"CompilerOptions": {"output_names": ["output_tensor:0"]}
Identifies the framework in which the model was trained. For example: TENSORFLOW.
Specifies the framework version to use. This API field is only supported for the MXNet, PyTorch, TensorFlow and TensorFlow Lite frameworks.
For information about framework versions supported for cloud targets and edge devices, see Cloud Supported Instance Types and Frameworks and Edge Supported Frameworks.
Information about the output location for the compiled model and the target device that the model runs on.
Identifies the S3 bucket where you want Amazon SageMaker to store the model artifacts. For example, s3://bucket-name/key-name-prefix
.
Identifies the target device or the machine learning instance that you want to run your model on after the compilation has completed. Alternatively, you can specify OS, architecture, and accelerator using TargetPlatform fields. It can be used instead of TargetPlatform
.
Contains information about a target platform that you want your model to run on, such as OS, architecture, and accelerators. It is an alternative of TargetDevice
.
The following examples show how to configure the TargetPlatform
and CompilerOptions
JSON strings for popular target platforms:
"TargetPlatform": {"Os": "LINUX", "Arch": "ARM_EABIHF"},
"CompilerOptions": {'mattr': ['+neon']}
"TargetPlatform": {"Os": "LINUX", "Arch": "ARM64", "Accelerator": "NVIDIA"},
"CompilerOptions": {'gpu-code': 'sm_62', 'trt-ver': '6.0.1', 'cuda-ver': '10.0'}
"TargetPlatform": {"Os": "LINUX", "Arch": "X86_64", "Accelerator": "NVIDIA"},
"CompilerOptions": {'mcpu': 'skylake-avx512'}
"TargetPlatform": {"Os": "LINUX", "Arch": "ARM64", "Accelerator": "MALI"}
"TargetPlatform": {"Os": "ANDROID", "Arch": "ARM_EABI"},
"CompilerOptions": {'ANDROID_PLATFORM': 25, 'mattr': ['+neon']}
"TargetPlatform": {"Os": "ANDROID", "Arch": "ARM64"},
"CompilerOptions": {'ANDROID_PLATFORM': 29}
Specifies a target platform OS.
LINUX
: Linux-based operating systems.ANDROID
: Android operating systems. Android API level can be specified using the ANDROID_PLATFORM
compiler option. For example, "CompilerOptions": {'ANDROID_PLATFORM': 28}
Specifies a target platform architecture.
X86_64
: 64-bit version of the x86 instruction set.X86
: 32-bit version of the x86 instruction set.ARM64
: ARMv8 64-bit CPU.ARM_EABIHF
: ARMv7 32-bit, Hard Float.ARM_EABI
: ARMv7 32-bit, Soft Float. Used by Android 32-bit ARM platform.Specifies a target platform accelerator (optional).
NVIDIA
: Nvidia graphics processing unit. It also requires gpu-code
, trt-ver
, cuda-ver
compiler optionsMALI
: ARM Mali graphics processorINTEL_GRAPHICS
: Integrated Intel graphicsSpecifies additional parameters for compiler options in JSON format. The compiler options are TargetPlatform
specific. It is required for NVIDIA accelerators and highly recommended for CPU compilations. For any other cases, it is optional to specify CompilerOptions.
DTYPE
: Specifies the data type for the input. When compiling for ml_*
(except for ml_inf
) instances using PyTorch framework, provide the data type (dtype) of the model's input. "float32"
is used if "DTYPE"
is not specified. Options for data type are:"float"
or "float32"
."int64"
or "long"
.For example, {"dtype" : "float32"}
.
CPU
: Compilation for CPU supports the following compiler options.mcpu
: CPU micro-architecture. For example, {'mcpu': 'skylake-avx512'}
mattr
: CPU flags. For example, {'mattr': ['+neon', '+vfpv4']}
ARM
: Details of ARM CPU compilations.NEON
: NEON is an implementation of the Advanced SIMD extension used in ARMv7 processors. For example, add {'mattr': ['+neon']}
to the compiler options if compiling for ARM 32-bit platform with the NEON support.NVIDIA
: Compilation for NVIDIA GPU supports the following compiler options.gpu_code
: Specifies the targeted architecture.trt-ver
: Specifies the TensorRT versions in x.y.z. format.cuda-ver
: Specifies the CUDA version in x.y format.For example, {'gpu-code': 'sm_72', 'trt-ver': '6.0.1', 'cuda-ver': '10.1'}
ANDROID
: Compilation for the Android OS supports the following compiler options:ANDROID_PLATFORM
: Specifies the Android API levels. Available levels range from 21 to 29. For example, {'ANDROID_PLATFORM': 28}
.mattr
: Add {'mattr': ['+neon']}
to compiler options if compiling for ARM 32-bit platform with NEON support.INFERENTIA
: Compilation for target ml_inf1 uses compiler options passed in as a JSON string. For example, "CompilerOptions": "\"--verbose 1 --num-neuroncores 2 -O2\""
. For information about supported compiler options, see Neuron Compiler CLI.CoreML
: Compilation for the CoreML OutputConfig$TargetDevice supports the following compiler options:class_labels
: Specifies the classification labels file name inside input tar.gz file. For example, {"class_labels": "imagenet_labels_1000.txt"}
. Labels inside the txt file should be separated by newlines.EIA
: Compilation for the Elastic Inference Accelerator supports the following compiler options:precision_mode
: Specifies the precision of compiled artifacts. Supported values are "FP16"
and "FP32"
. Default is "FP32"
.signature_def_key
: Specifies the signature to use for models in SavedModel format. Defaults is TensorFlow's default signature def key.output_names
: Specifies a list of output tensor names for models in FrozenGraph format. Set at most one API field, either: signature_def_key
or output_names
.For example: {"precision_mode": "FP32", "output_names": ["output:0"]}
The Amazon Web Services Key Management Service key (Amazon Web Services KMS) that Amazon SageMaker uses to encrypt your output models with Amazon S3 server-side encryption after compilation job. If you don't provide a KMS key ID, Amazon SageMaker uses the default KMS key for Amazon S3 for your role's account. For more information, see KMS-Managed Encryption Keys in the Amazon Simple Storage Service Developer Guide.
The KmsKeyId can be any of the following formats:
1234abcd-12ab-34cd-56ef-1234567890ab
arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
alias/ExampleAlias
arn:aws:kms:us-west-2:111122223333:alias/ExampleAlias
A VpcConfig object that specifies the VPC that you want your compilation job to connect to. Control access to your models by configuring the VPC. For more information, see Protect Compilation Jobs by Using an Amazon Virtual Private Cloud.
The VPC security group IDs. IDs have the form of sg-xxxxxxxx
. Specify the security groups for the VPC that is specified in the Subnets
field.
The ID of the subnets in the VPC that you want to connect the compilation job to for accessing the model in Amazon S3.
Exceptions
SageMaker.Client.exceptions.ResourceNotFound