SageMaker / Client / list_compute_quotas
list_compute_quotas#
- SageMaker.Client.list_compute_quotas(**kwargs)#
List the resource allocation definitions.
See also: AWS API Documentation
Request Syntax
response = client.list_compute_quotas( CreatedAfter=datetime(2015, 1, 1), CreatedBefore=datetime(2015, 1, 1), NameContains='string', Status='Creating'|'CreateFailed'|'CreateRollbackFailed'|'Created'|'Updating'|'UpdateFailed'|'UpdateRollbackFailed'|'Updated'|'Deleting'|'DeleteFailed'|'DeleteRollbackFailed'|'Deleted', ClusterArn='string', SortBy='Name'|'CreationTime'|'Status'|'ClusterArn', SortOrder='Ascending'|'Descending', NextToken='string', MaxResults=123 )
- Parameters:
CreatedAfter (datetime) – Filter for after this creation time. The input for this parameter is a Unix timestamp. To convert a date and time into a Unix timestamp, see EpochConverter.
CreatedBefore (datetime) – Filter for before this creation time. The input for this parameter is a Unix timestamp. To convert a date and time into a Unix timestamp, see EpochConverter.
NameContains (string) – Filter for name containing this string.
Status (string) – Filter for status.
ClusterArn (string) – Filter for ARN of the cluster.
SortBy (string) – Filter for sorting the list by a given value. For example, sort by name, creation time, or status.
SortOrder (string) – The order of the list. By default, listed in
Descending
order according to bySortBy
. To change the list order, you can specifySortOrder
to beAscending
.NextToken (string) – If the previous response was truncated, you will receive this token. Use it in your next request to receive the next set of results.
MaxResults (integer) – The maximum number of compute allocation definitions to list.
- Return type:
dict
- Returns:
Response Syntax
{ 'ComputeQuotaSummaries': [ { 'ComputeQuotaArn': 'string', 'ComputeQuotaId': 'string', 'Name': 'string', 'ComputeQuotaVersion': 123, 'Status': 'Creating'|'CreateFailed'|'CreateRollbackFailed'|'Created'|'Updating'|'UpdateFailed'|'UpdateRollbackFailed'|'Updated'|'Deleting'|'DeleteFailed'|'DeleteRollbackFailed'|'Deleted', 'ClusterArn': 'string', 'ComputeQuotaConfig': { 'ComputeQuotaResources': [ { 'InstanceType': 'ml.p4d.24xlarge'|'ml.p4de.24xlarge'|'ml.p5.48xlarge'|'ml.trn1.32xlarge'|'ml.trn1n.32xlarge'|'ml.g5.xlarge'|'ml.g5.2xlarge'|'ml.g5.4xlarge'|'ml.g5.8xlarge'|'ml.g5.12xlarge'|'ml.g5.16xlarge'|'ml.g5.24xlarge'|'ml.g5.48xlarge'|'ml.c5.large'|'ml.c5.xlarge'|'ml.c5.2xlarge'|'ml.c5.4xlarge'|'ml.c5.9xlarge'|'ml.c5.12xlarge'|'ml.c5.18xlarge'|'ml.c5.24xlarge'|'ml.c5n.large'|'ml.c5n.2xlarge'|'ml.c5n.4xlarge'|'ml.c5n.9xlarge'|'ml.c5n.18xlarge'|'ml.m5.large'|'ml.m5.xlarge'|'ml.m5.2xlarge'|'ml.m5.4xlarge'|'ml.m5.8xlarge'|'ml.m5.12xlarge'|'ml.m5.16xlarge'|'ml.m5.24xlarge'|'ml.t3.medium'|'ml.t3.large'|'ml.t3.xlarge'|'ml.t3.2xlarge'|'ml.g6.xlarge'|'ml.g6.2xlarge'|'ml.g6.4xlarge'|'ml.g6.8xlarge'|'ml.g6.16xlarge'|'ml.g6.12xlarge'|'ml.g6.24xlarge'|'ml.g6.48xlarge'|'ml.gr6.4xlarge'|'ml.gr6.8xlarge'|'ml.g6e.xlarge'|'ml.g6e.2xlarge'|'ml.g6e.4xlarge'|'ml.g6e.8xlarge'|'ml.g6e.16xlarge'|'ml.g6e.12xlarge'|'ml.g6e.24xlarge'|'ml.g6e.48xlarge'|'ml.p5e.48xlarge'|'ml.p5en.48xlarge'|'ml.trn2.48xlarge'|'ml.c6i.large'|'ml.c6i.xlarge'|'ml.c6i.2xlarge'|'ml.c6i.4xlarge'|'ml.c6i.8xlarge'|'ml.c6i.12xlarge'|'ml.c6i.16xlarge'|'ml.c6i.24xlarge'|'ml.c6i.32xlarge'|'ml.m6i.large'|'ml.m6i.xlarge'|'ml.m6i.2xlarge'|'ml.m6i.4xlarge'|'ml.m6i.8xlarge'|'ml.m6i.12xlarge'|'ml.m6i.16xlarge'|'ml.m6i.24xlarge'|'ml.m6i.32xlarge'|'ml.r6i.large'|'ml.r6i.xlarge'|'ml.r6i.2xlarge'|'ml.r6i.4xlarge'|'ml.r6i.8xlarge'|'ml.r6i.12xlarge'|'ml.r6i.16xlarge'|'ml.r6i.24xlarge'|'ml.r6i.32xlarge', 'Count': 123 }, ], 'ResourceSharingConfig': { 'Strategy': 'Lend'|'DontLend'|'LendAndBorrow', 'BorrowLimit': 123 }, 'PreemptTeamTasks': 'Never'|'LowerPriority' }, 'ComputeQuotaTarget': { 'TeamName': 'string', 'FairShareWeight': 123 }, 'ActivationState': 'Enabled'|'Disabled', 'CreationTime': datetime(2015, 1, 1), 'LastModifiedTime': datetime(2015, 1, 1) }, ], 'NextToken': 'string' }
Response Structure
(dict) –
ComputeQuotaSummaries (list) –
Summaries of the compute allocation definitions.
(dict) –
Summary of the compute allocation definition.
ComputeQuotaArn (string) –
ARN of the compute allocation definition.
ComputeQuotaId (string) –
ID of the compute allocation definition.
Name (string) –
Name of the compute allocation definition.
ComputeQuotaVersion (integer) –
Version of the compute allocation definition.
Status (string) –
Status of the compute allocation definition.
ClusterArn (string) –
ARN of the cluster.
ComputeQuotaConfig (dict) –
Configuration of the compute allocation definition. This includes the resource sharing option, and the setting to preempt low priority tasks.
ComputeQuotaResources (list) –
Allocate compute resources by instance types.
(dict) –
Configuration of the resources used for the compute allocation definition.
InstanceType (string) –
The instance type of the instance group for the cluster.
Count (integer) –
The number of instances to add to the instance group of a SageMaker HyperPod cluster.
ResourceSharingConfig (dict) –
Resource sharing configuration. This defines how an entity can lend and borrow idle compute with other entities within the cluster.
Strategy (string) –
The strategy of how idle compute is shared within the cluster. The following are the options of strategies.
DontLend
: entities do not lend idle compute.Lend
: entities can lend idle compute to entities that can borrow.LendandBorrow
: entities can lend idle compute and borrow idle compute from other entities.
Default is
LendandBorrow
.BorrowLimit (integer) –
The limit on how much idle compute can be borrowed.The values can be 1 - 500 percent of idle compute that the team is allowed to borrow.
Default is
50
.
PreemptTeamTasks (string) –
Allows workloads from within an entity to preempt same-team workloads. When set to
LowerPriority
, the entity’s lower priority tasks are preempted by their own higher priority tasks.Default is
LowerPriority
.
ComputeQuotaTarget (dict) –
The target entity to allocate compute resources to.
TeamName (string) –
Name of the team to allocate compute resources to.
FairShareWeight (integer) –
Assigned entity fair-share weight. Idle compute will be shared across entities based on these assigned weights. This weight is only used when
FairShare
is enabled.A weight of 0 is the lowest priority and 100 is the highest. Weight 0 is the default.
ActivationState (string) –
The state of the compute allocation being described. Use to enable or disable compute allocation.
Default is
Enabled
.CreationTime (datetime) –
Creation time of the compute allocation definition.
LastModifiedTime (datetime) –
Last modified time of the compute allocation definition.
NextToken (string) –
If the previous response was truncated, you will receive this token. Use it in your next request to receive the next set of results.