list_compute_quotas

SageMaker.Client.list_compute_quotas(**kwargs)

Lists the compute allocation definitions (compute quotas).

Request Syntax

response = client.list_compute_quotas(
    CreatedAfter=datetime(2015, 1, 1),
    CreatedBefore=datetime(2015, 1, 1),
    NameContains='string',
    Status='Creating'|'CreateFailed'|'CreateRollbackFailed'|'Created'|'Updating'|'UpdateFailed'|'UpdateRollbackFailed'|'Updated'|'Deleting'|'DeleteFailed'|'DeleteRollbackFailed'|'Deleted',
    ClusterArn='string',
    SortBy='Name'|'CreationTime'|'Status'|'ClusterArn',
    SortOrder='Ascending'|'Descending',
    NextToken='string',
    MaxResults=123
)

Parameters:

CreatedAfter (datetime) – Returns only compute allocation definitions created after this time. The underlying API takes a Unix timestamp; as the request syntax shows, boto3 accepts a datetime object. To convert a date and time into a Unix timestamp, see EpochConverter.
CreatedBefore (datetime) – Returns only compute allocation definitions created before this time. The underlying API takes a Unix timestamp; as the request syntax shows, boto3 accepts a datetime object. To convert a date and time into a Unix timestamp, see EpochConverter.
NameContains (string) – Filter for name containing this string.
Status (string) – Filter for status.
ClusterArn (string) – Filter for ARN of the cluster.
SortBy (string) – The field to sort the list by: Name, CreationTime, Status, or ClusterArn.
SortOrder (string) – The order of the list. By default, results are listed in Descending order according to SortBy. To change the order, set SortOrder to Ascending.
NextToken (string) – If the previous response was truncated, you will receive this token. Use it in your next request to receive the next set of results.
MaxResults (integer) – The maximum number of compute allocation definitions to list.
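
The filter and sort parameters above can be combined in a single call. The following is a minimal sketch, not an official example: the helper name `list_recent_quotas`, its arguments, and the chosen `MaxResults` value are illustrative assumptions, and `client` is assumed to be a boto3 SageMaker client.

```python
from datetime import datetime, timedelta, timezone


def list_recent_quotas(client, cluster_arn, days=7, name_contains=None):
    """Return the first page of compute quotas created in the last `days` days.

    Hypothetical helper for illustration. `client` is assumed to be a boto3
    SageMaker client, e.g. boto3.client("sagemaker").
    """
    kwargs = {
        "ClusterArn": cluster_arn,
        # A timezone-aware datetime; boto3 serializes it for the API.
        "CreatedAfter": datetime.now(timezone.utc) - timedelta(days=days),
        "SortBy": "CreationTime",
        "SortOrder": "Descending",
        "MaxResults": 10,  # arbitrary page size chosen for this sketch
    }
    # Leave optional filters out entirely instead of passing None,
    # because botocore parameter validation rejects None values.
    if name_contains:
        kwargs["NameContains"] = name_contains
    response = client.list_compute_quotas(**kwargs)
    return response["ComputeQuotaSummaries"]


# Example usage (requires boto3 and AWS credentials; the ARN is a placeholder):
#   import boto3
#   client = boto3.client("sagemaker")
#   for quota in list_recent_quotas(client, "arn:aws:sagemaker:...:cluster/..."):
#       print(quota["Name"], quota["Status"])
```

Only the first page is returned here; if the response carries a NextToken, more results are available.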

Return type:

dict

Returns:

Response Syntax

{
    'ComputeQuotaSummaries': [
        {
            'ComputeQuotaArn': 'string',
            'ComputeQuotaId': 'string',
            'Name': 'string',
            'ComputeQuotaVersion': 123,
            'Status': 'Creating'|'CreateFailed'|'CreateRollbackFailed'|'Created'|'Updating'|'UpdateFailed'|'UpdateRollbackFailed'|'Updated'|'Deleting'|'DeleteFailed'|'DeleteRollbackFailed'|'Deleted',
            'ClusterArn': 'string',
            'ComputeQuotaConfig': {
                'ComputeQuotaResources': [
                    {
                        'InstanceType': 'ml.p4d.24xlarge'|'ml.p4de.24xlarge'|'ml.p5.48xlarge'|'ml.trn1.32xlarge'|'ml.trn1n.32xlarge'|'ml.g5.xlarge'|'ml.g5.2xlarge'|'ml.g5.4xlarge'|'ml.g5.8xlarge'|'ml.g5.12xlarge'|'ml.g5.16xlarge'|'ml.g5.24xlarge'|'ml.g5.48xlarge'|'ml.c5.large'|'ml.c5.xlarge'|'ml.c5.2xlarge'|'ml.c5.4xlarge'|'ml.c5.9xlarge'|'ml.c5.12xlarge'|'ml.c5.18xlarge'|'ml.c5.24xlarge'|'ml.c5n.large'|'ml.c5n.2xlarge'|'ml.c5n.4xlarge'|'ml.c5n.9xlarge'|'ml.c5n.18xlarge'|'ml.m5.large'|'ml.m5.xlarge'|'ml.m5.2xlarge'|'ml.m5.4xlarge'|'ml.m5.8xlarge'|'ml.m5.12xlarge'|'ml.m5.16xlarge'|'ml.m5.24xlarge'|'ml.t3.medium'|'ml.t3.large'|'ml.t3.xlarge'|'ml.t3.2xlarge'|'ml.g6.xlarge'|'ml.g6.2xlarge'|'ml.g6.4xlarge'|'ml.g6.8xlarge'|'ml.g6.16xlarge'|'ml.g6.12xlarge'|'ml.g6.24xlarge'|'ml.g6.48xlarge'|'ml.gr6.4xlarge'|'ml.gr6.8xlarge'|'ml.g6e.xlarge'|'ml.g6e.2xlarge'|'ml.g6e.4xlarge'|'ml.g6e.8xlarge'|'ml.g6e.16xlarge'|'ml.g6e.12xlarge'|'ml.g6e.24xlarge'|'ml.g6e.48xlarge'|'ml.p5e.48xlarge'|'ml.p5en.48xlarge'|'ml.trn2.48xlarge'|'ml.c6i.large'|'ml.c6i.xlarge'|'ml.c6i.2xlarge'|'ml.c6i.4xlarge'|'ml.c6i.8xlarge'|'ml.c6i.12xlarge'|'ml.c6i.16xlarge'|'ml.c6i.24xlarge'|'ml.c6i.32xlarge'|'ml.m6i.large'|'ml.m6i.xlarge'|'ml.m6i.2xlarge'|'ml.m6i.4xlarge'|'ml.m6i.8xlarge'|'ml.m6i.12xlarge'|'ml.m6i.16xlarge'|'ml.m6i.24xlarge'|'ml.m6i.32xlarge'|'ml.r6i.large'|'ml.r6i.xlarge'|'ml.r6i.2xlarge'|'ml.r6i.4xlarge'|'ml.r6i.8xlarge'|'ml.r6i.12xlarge'|'ml.r6i.16xlarge'|'ml.r6i.24xlarge'|'ml.r6i.32xlarge',
                        'Count': 123
                    },
                ],
                'ResourceSharingConfig': {
                    'Strategy': 'Lend'|'DontLend'|'LendAndBorrow',
                    'BorrowLimit': 123
                },
                'PreemptTeamTasks': 'Never'|'LowerPriority'
            },
            'ComputeQuotaTarget': {
                'TeamName': 'string',
                'FairShareWeight': 123
            },
            'ActivationState': 'Enabled'|'Disabled',
            'CreationTime': datetime(2015, 1, 1),
            'LastModifiedTime': datetime(2015, 1, 1)
        },
    ],
    'NextToken': 'string'
}
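
To illustrate how the nested fields in this response shape fit together, the sketch below totals the allocated instance counts per team and instance type. The function name `instance_counts_by_team` and the `"<unknown>"` fallback are assumptions made for this example; the input is the `ComputeQuotaSummaries` list from a response.

```python
from collections import defaultdict


def instance_counts_by_team(summaries):
    """Sum Count per (TeamName, InstanceType) across compute quota summaries.

    Hypothetical helper for illustration. Missing optional sections are
    treated as empty rather than raising KeyError.
    """
    totals = defaultdict(int)
    for summary in summaries:
        team = summary.get("ComputeQuotaTarget", {}).get("TeamName", "<unknown>")
        config = summary.get("ComputeQuotaConfig", {})
        for resource in config.get("ComputeQuotaResources", []):
            totals[(team, resource["InstanceType"])] += resource["Count"]
    return dict(totals)
```

Keying on the (team, instance type) pair keeps separate quotas for the same team from overwriting each other.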

Response Structure

(dict) –
- ComputeQuotaSummaries (list) –
  
  Summaries of the compute allocation definitions.
  - (dict) –
    
    Summary of the compute allocation definition.
    - ComputeQuotaArn (string) –
      
      ARN of the compute allocation definition.
    - ComputeQuotaId (string) –
      
      ID of the compute allocation definition.
    - Name (string) –
      
      Name of the compute allocation definition.
    - ComputeQuotaVersion (integer) –
      
      Version of the compute allocation definition.
    - Status (string) –
      
      Status of the compute allocation definition.
    - ClusterArn (string) –
      
      ARN of the cluster.
    - ComputeQuotaConfig (dict) –
      
      Configuration of the compute allocation definition. This includes the resource sharing option and the setting to preempt low-priority tasks.
      - ComputeQuotaResources (list) –
        
        Allocate compute resources by instance types.
        - (dict) –
          
          Configuration of the resources used for the compute allocation definition.
          - InstanceType (string) –
            
            The instance type of the instance group for the cluster.
          - Count (integer) –
            
            The number of instances to add to the instance group of a SageMaker HyperPod cluster.
      - ResourceSharingConfig (dict) –
        
        Resource sharing configuration. This defines how an entity can lend and borrow idle compute with other entities within the cluster.
        - Strategy (string) –
          
          The strategy for sharing idle compute within the cluster. The options are:
          - DontLend: entities do not lend idle compute.
          - Lend: entities can lend idle compute to entities that can borrow.
          - LendAndBorrow: entities can lend idle compute and borrow idle compute from other entities.
          
          Default is LendAndBorrow.
        - BorrowLimit (integer) –
          
          The limit on how much idle compute can be borrowed. The value can be from 1 to 500 percent of the idle compute that the team is allowed to borrow.
          
          Default is 50.
      - PreemptTeamTasks (string) –
        
        Allows workloads from within an entity to preempt same-team workloads. When set to LowerPriority, the entity’s lower priority tasks are preempted by their own higher priority tasks.
        
        Default is LowerPriority.
    - ComputeQuotaTarget (dict) –
      
      The target entity to allocate compute resources to.
      - TeamName (string) –
        
        Name of the team to allocate compute resources to.
      - FairShareWeight (integer) –
        
        Assigned entity fair-share weight. Idle compute will be shared across entities based on these assigned weights. This weight is only used when FairShare is enabled.
        
        A weight of 0 is the lowest priority and 100 is the highest. Weight 0 is the default.
    - ActivationState (string) –
      
      The state of the compute allocation being described. Use it to enable or disable the compute allocation.
      
      Default is Enabled.
    - CreationTime (datetime) –
      
      Creation time of the compute allocation definition.
    - LastModifiedTime (datetime) –
      
      Last modified time of the compute allocation definition.
- NextToken (string) –
  
  If the previous response was truncated, you will receive this token. Use it in your next request to receive the next set of results.
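
The NextToken handshake described above can be sketched as a simple loop. This is a minimal illustration, assuming `client` is a boto3 SageMaker client; the generator name `iter_compute_quotas` is hypothetical and not part of boto3.

```python
def iter_compute_quotas(client, **filters):
    """Yield every compute quota summary, following NextToken across pages.

    Hypothetical helper for illustration. `filters` are passed through
    unchanged as list_compute_quotas keyword arguments.
    """
    kwargs = dict(filters)
    while True:
        response = client.list_compute_quotas(**kwargs)
        yield from response.get("ComputeQuotaSummaries", [])
        token = response.get("NextToken")
        if not token:
            # No token means the last page has been reached.
            break
        # Repeat the same request, adding the token from the previous page.
        kwargs["NextToken"] = token


# Example usage (requires boto3 and AWS credentials):
#   import boto3
#   client = boto3.client("sagemaker")
#   for quota in iter_compute_quotas(client, Status="Created"):
#       print(quota["ComputeQuotaId"], quota["Name"])
```

Using a generator means callers can stop early without fetching the remaining pages.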