Table / Action / scan
scan#
- DynamoDB.Table.scan(**kwargs)#
The
Scanoperation returns one or more items and item attributes by accessing every item in a table or a secondary index. To have DynamoDB return fewer items, you can provide aFilterExpressionoperation.If the total size of scanned items exceeds the maximum dataset size limit of 1 MB, the scan completes and results are returned to the user. The
LastEvaluatedKeyvalue is also returned and the requestor can use theLastEvaluatedKeyto continue the scan in a subsequent operation. Each scan response also includes number of items that were scanned (ScannedCount) as part of the request. If using aFilterExpression, a scan result can result in no items meeting the criteria and theCountwill result in zero. If you did not use aFilterExpressionin the scan request, thenCountis the same asScannedCount.Note
CountandScannedCountonly return the count of items specific to a single scan request and, unless the table is less than 1MB, do not represent the total number of items in the table.A single
Scanoperation first reads up to the maximum number of items set (if using theLimitparameter) or a maximum of 1 MB of data and then applies any filtering to the results if aFilterExpressionis provided. IfLastEvaluatedKeyis present in the response, pagination is required to complete the full table scan. For more information, see Paginating the Results in the Amazon DynamoDB Developer Guide.Scanoperations proceed sequentially; however, for faster performance on a large table or secondary index, applications can request a parallelScanoperation by providing theSegmentandTotalSegmentsparameters. For more information, see Parallel Scan in the Amazon DynamoDB Developer Guide.By default, a
Scanuses eventually consistent reads when accessing the items in a table. Therefore, the results from an eventually consistentScanmay not include the latest item changes at the time the scan iterates through each item in the table. If you require a strongly consistent read of each item as the scan iterates through the items in the table, you can set theConsistentReadparameter to true. Strong consistency only relates to the consistency of the read at the item level.Note
DynamoDB does not provide snapshot isolation for a scan operation when the
ConsistentReadparameter is set to true. Thus, a DynamoDB scan operation does not guarantee that all reads in a scan see a consistent snapshot of the table when the scan operation was requested.See also: AWS API Documentation
Request Syntax
response = table.scan( IndexName='string', AttributesToGet=[ 'string', ], Limit=123, Select='ALL_ATTRIBUTES'|'ALL_PROJECTED_ATTRIBUTES'|'SPECIFIC_ATTRIBUTES'|'COUNT', ScanFilter={ 'string': { 'AttributeValueList': [ 'string'|123|Binary(b'bytes')|True|None|set(['string'])|set([123])|set([Binary(b'bytes')])|[]|{}, ], 'ComparisonOperator': 'EQ'|'NE'|'IN'|'LE'|'LT'|'GE'|'GT'|'BETWEEN'|'NOT_NULL'|'NULL'|'CONTAINS'|'NOT_CONTAINS'|'BEGINS_WITH' } }, ConditionalOperator='AND'|'OR', ExclusiveStartKey={ 'string': 'string'|123|Binary(b'bytes')|True|None|set(['string'])|set([123])|set([Binary(b'bytes')])|[]|{} }, ReturnConsumedCapacity='INDEXES'|'TOTAL'|'NONE', TotalSegments=123, Segment=123, ProjectionExpression='string', FilterExpression=Attr('myattribute').eq('myvalue'), ExpressionAttributeNames={ 'string': 'string' }, ExpressionAttributeValues={ 'string': 'string'|123|Binary(b'bytes')|True|None|set(['string'])|set([123])|set([Binary(b'bytes')])|[]|{} }, ConsistentRead=True|False )
- Parameters:
IndexName (string) – The name of a secondary index to scan. This index can be any local secondary index or global secondary index. Note that if you use the
IndexNameparameter, you must also provideTableName.AttributesToGet (list) –
This is a legacy parameter. Use
ProjectionExpressioninstead. For more information, see AttributesToGet in the Amazon DynamoDB Developer Guide.(string) –
Limit (integer) – The maximum number of items to evaluate (not necessarily the number of matching items). If DynamoDB processes the number of items up to the limit while processing the results, it stops the operation and returns the matching values up to that point, and a key in
LastEvaluatedKeyto apply in a subsequent operation, so that you can pick up where you left off. Also, if the processed dataset size exceeds 1 MB before DynamoDB reaches this limit, it stops the operation and returns the matching values up to the limit, and a key inLastEvaluatedKeyto apply in a subsequent operation to continue the operation. For more information, see Working with Queries in the Amazon DynamoDB Developer Guide.Select (string) –
The attributes to be returned in the result. You can retrieve all item attributes, specific item attributes, the count of matching items, or in the case of an index, some or all of the attributes projected into the index.
ALL_ATTRIBUTES- Returns all of the item attributes from the specified table or index. If you query a local secondary index, then for each matching item in the index, DynamoDB fetches the entire item from the parent table. If the index is configured to project all item attributes, then all of the data can be obtained from the local secondary index, and no fetching is required.ALL_PROJECTED_ATTRIBUTES- Allowed only when querying an index. Retrieves all attributes that have been projected into the index. If the index is configured to project all attributes, this return value is equivalent to specifyingALL_ATTRIBUTES.COUNT- Returns the number of matching items, rather than the matching items themselves. Note that this uses the same quantity of read capacity units as getting the items, and is subject to the same item size calculations.SPECIFIC_ATTRIBUTES- Returns only the attributes listed inProjectionExpression. This return value is equivalent to specifyingProjectionExpressionwithout specifying any value forSelect. If you query or scan a local secondary index and request only attributes that are projected into that index, the operation reads only the index and not the table. If any of the requested attributes are not projected into the local secondary index, DynamoDB fetches each of these attributes from the parent table. This extra fetching incurs additional throughput cost and latency. If you query or scan a global secondary index, you can only request attributes that are projected into the index. Global secondary index queries cannot fetch attributes from the parent table.
If neither
SelectnorProjectionExpressionare specified, DynamoDB defaults toALL_ATTRIBUTESwhen accessing a table, andALL_PROJECTED_ATTRIBUTESwhen accessing an index. You cannot use bothSelectandProjectionExpressiontogether in a single request, unless the value forSelectisSPECIFIC_ATTRIBUTES. (This usage is equivalent to specifyingProjectionExpressionwithout any value forSelect.)Note
If you use the
ProjectionExpressionparameter, then the value forSelectcan only beSPECIFIC_ATTRIBUTES. Any other value forSelectwill return an error.ScanFilter (dict) –
This is a legacy parameter. Use
FilterExpressioninstead. For more information, see ScanFilter in the Amazon DynamoDB Developer Guide.(string) –
(dict) –
Represents the selection criteria for a
QueryorScanoperation:For a
Queryoperation,Conditionis used for specifying theKeyConditionsto use when querying a table or an index. ForKeyConditions, only the following comparison operators are supported:EQ | LE | LT | GE | GT | BEGINS_WITH | BETWEENConditionis also used in aQueryFilter, which evaluates the query results and returns only the desired values.For a
Scanoperation,Conditionis used in aScanFilter, which evaluates the scan results and returns only the desired values.
AttributeValueList (list) –
One or more values to evaluate against the supplied attribute. The number of values in the list depends on the
ComparisonOperatorbeing used.For type Number, value comparisons are numeric.
String value comparisons for greater than, equals, or less than are based on ASCII character code values. For example,
ais greater thanA, andais greater thanB. For a list of code values, see http://en.wikipedia.org/wiki/ASCII#ASCII_printable_characters.For Binary, DynamoDB treats each byte of the binary data as unsigned when it compares binary values.
*(valid DynamoDB type) – *- The value of the attribute. The valid value types are listed in the DynamoDB Reference Guide.
ComparisonOperator (string) – [REQUIRED]
A comparator for evaluating attributes. For example, equals, greater than, less than, etc.
The following comparison operators are available:
EQ | NE | LE | LT | GE | GT | NOT_NULL | NULL | CONTAINS | NOT_CONTAINS | BEGINS_WITH | IN | BETWEENThe following are descriptions of each comparison operator.
EQ: Equal.EQis supported for all data types, including lists and maps.AttributeValueListcan contain only oneAttributeValueelement of type String, Number, Binary, String Set, Number Set, or Binary Set. If an item contains anAttributeValueelement of a different type than the one provided in the request, the value does not match. For example,{"S":"6"}does not equal{"N":"6"}. Also,{"N":"6"}does not equal{"NS":["6", "2", "1"]}.NE: Not equal.NEis supported for all data types, including lists and maps.AttributeValueListcan contain only oneAttributeValueof type String, Number, Binary, String Set, Number Set, or Binary Set. If an item contains anAttributeValueof a different type than the one provided in the request, the value does not match. For example,{"S":"6"}does not equal{"N":"6"}. Also,{"N":"6"}does not equal{"NS":["6", "2", "1"]}.LE: Less than or equal.AttributeValueListcan contain only oneAttributeValueelement of type String, Number, or Binary (not a set type). If an item contains anAttributeValueelement of a different type than the one provided in the request, the value does not match. For example,{"S":"6"}does not equal{"N":"6"}. Also,{"N":"6"}does not compare to{"NS":["6", "2", "1"]}.LT: Less than.AttributeValueListcan contain only oneAttributeValueof type String, Number, or Binary (not a set type). If an item contains anAttributeValueelement of a different type than the one provided in the request, the value does not match. For example,{"S":"6"}does not equal{"N":"6"}. Also,{"N":"6"}does not compare to{"NS":["6", "2", "1"]}.GE: Greater than or equal.AttributeValueListcan contain only oneAttributeValueelement of type String, Number, or Binary (not a set type). If an item contains anAttributeValueelement of a different type than the one provided in the request, the value does not match. For example,{"S":"6"}does not equal{"N":"6"}. Also,{"N":"6"}does not compare to{"NS":["6", "2", "1"]}.GT: Greater than.AttributeValueListcan contain only oneAttributeValueelement of type String, Number, or Binary (not a set type). If an item contains anAttributeValueelement of a different type than the one provided in the request, the value does not match. For example,{"S":"6"}does not equal{"N":"6"}. Also,{"N":"6"}does not compare to{"NS":["6", "2", "1"]}.NOT_NULL: The attribute exists.NOT_NULLis supported for all data types, including lists and maps.
Note
This operator tests for the existence of an attribute, not its data type. If the data type of attribute “
a” is null, and you evaluate it usingNOT_NULL, the result is a Booleantrue. This result is because the attribute “a” exists; its data type is not relevant to theNOT_NULLcomparison operator.NULL: The attribute does not exist.NULLis supported for all data types, including lists and maps.
Note
This operator tests for the nonexistence of an attribute, not its data type. If the data type of attribute “
a” is null, and you evaluate it usingNULL, the result is a Booleanfalse. This is because the attribute “a” exists; its data type is not relevant to theNULLcomparison operator.CONTAINS: Checks for a subsequence, or value in a set.AttributeValueListcan contain only oneAttributeValueelement of type String, Number, or Binary (not a set type). If the target attribute of the comparison is of type String, then the operator checks for a substring match. If the target attribute of the comparison is of type Binary, then the operator looks for a subsequence of the target that matches the input. If the target attribute of the comparison is a set (”SS”, “NS”, or “BS”), then the operator evaluates to true if it finds an exact match with any member of the set. CONTAINS is supported for lists: When evaluating “a CONTAINS b”, “a” can be a list; however, “b” cannot be a set, a map, or a list.NOT_CONTAINS: Checks for absence of a subsequence, or absence of a value in a set.AttributeValueListcan contain only oneAttributeValueelement of type String, Number, or Binary (not a set type). If the target attribute of the comparison is a String, then the operator checks for the absence of a substring match. If the target attribute of the comparison is Binary, then the operator checks for the absence of a subsequence of the target that matches the input. If the target attribute of the comparison is a set (”SS”, “NS”, or “BS”), then the operator evaluates to true if it does not find an exact match with any member of the set. NOT_CONTAINS is supported for lists: When evaluating “a NOT CONTAINS b”, “a” can be a list; however, “b” cannot be a set, a map, or a list.BEGINS_WITH: Checks for a prefix.AttributeValueListcan contain only oneAttributeValueof type String or Binary (not a Number or a set type). The target attribute of the comparison must be of type String or Binary (not a Number or a set type).IN: Checks for matching elements in a list.AttributeValueListcan contain one or moreAttributeValueelements of type String, Number, or Binary. These attributes are compared against an existing attribute of an item. If any elements of the input are equal to the item attribute, the expression evaluates to true.BETWEEN: Greater than or equal to the first value, and less than or equal to the second value.AttributeValueListmust contain twoAttributeValueelements of the same type, either String, Number, or Binary (not a set type). A target attribute matches if the target value is greater than, or equal to, the first element and less than, or equal to, the second element. If an item contains anAttributeValueelement of a different type than the one provided in the request, the value does not match. For example,{"S":"6"}does not compare to{"N":"6"}. Also,{"N":"6"}does not compare to{"NS":["6", "2", "1"]}
For usage examples of
AttributeValueListandComparisonOperator, see Legacy Conditional Parameters in the Amazon DynamoDB Developer Guide.
ConditionalOperator (string) – This is a legacy parameter. Use
FilterExpressioninstead. For more information, see ConditionalOperator in the Amazon DynamoDB Developer Guide.ExclusiveStartKey (dict) –
The primary key of the first item that this operation will evaluate. Use the value that was returned for
LastEvaluatedKeyin the previous operation.The data type for
ExclusiveStartKeymust be String, Number or Binary. No set data types are allowed.In a parallel scan, a
Scanrequest that includesExclusiveStartKeymust specify the same segment whose previousScanreturned the corresponding value ofLastEvaluatedKey.(string) –
*(valid DynamoDB type) – *- The value of the attribute. The valid value types are listed in the DynamoDB Reference Guide.
ReturnConsumedCapacity (string) –
Determines the level of detail about either provisioned or on-demand throughput consumption that is returned in the response:
INDEXES- The response includes the aggregateConsumedCapacityfor the operation, together withConsumedCapacityfor each table and secondary index that was accessed. Note that some operations, such asGetItemandBatchGetItem, do not access any indexes at all. In these cases, specifyingINDEXESwill only returnConsumedCapacityinformation for table(s).TOTAL- The response includes only the aggregateConsumedCapacityfor the operation.NONE- NoConsumedCapacitydetails are included in the response.
TotalSegments (integer) –
For a parallel
Scanrequest,TotalSegmentsrepresents the total number of segments into which theScanoperation will be divided. The value ofTotalSegmentscorresponds to the number of application workers that will perform the parallel scan. For example, if you want to use four application threads to scan a table or an index, specify aTotalSegmentsvalue of 4.The value for
TotalSegmentsmust be greater than or equal to 1, and less than or equal to 1000000. If you specify aTotalSegmentsvalue of 1, theScanoperation will be sequential rather than parallel.If you specify
TotalSegments, you must also specifySegment.Segment (integer) –
For a parallel
Scanrequest,Segmentidentifies an individual segment to be scanned by an application worker.Segment IDs are zero-based, so the first segment is always 0. For example, if you want to use four application threads to scan a table or an index, then the first thread specifies a
Segmentvalue of 0, the second thread specifies 1, and so on.The value of
LastEvaluatedKeyreturned from a parallelScanrequest must be used asExclusiveStartKeywith the same segment ID in a subsequentScanoperation.The value for
Segmentmust be greater than or equal to 0, and less than the value provided forTotalSegments.If you provide
Segment, you must also provideTotalSegments.ProjectionExpression (string) –
A string that identifies one or more attributes to retrieve from the specified table or index. These attributes can include scalars, sets, or elements of a JSON document. The attributes in the expression must be separated by commas.
If no attribute names are specified, then all attributes will be returned. If any of the requested attributes are not found, they will not appear in the result.
For more information, see Specifying Item Attributes in the Amazon DynamoDB Developer Guide.
FilterExpression (condition from
boto3.dynamodb.conditions.Attrmethod) – The condition(s) an attribute(s) must meet. Valid conditions are listed in the DynamoDB Reference Guide.ExpressionAttributeNames (dict) –
One or more substitution tokens for attribute names in an expression. The following are some use cases for using
ExpressionAttributeNames:To access an attribute whose name conflicts with a DynamoDB reserved word.
To create a placeholder for repeating occurrences of an attribute name in an expression.
To prevent special characters in an attribute name from being misinterpreted in an expression.
Use the # character in an expression to dereference an attribute name. For example, consider the following attribute name:
Percentile
The name of this attribute conflicts with a reserved word, so it cannot be used directly in an expression. (For the complete list of reserved words, see Reserved Words in the Amazon DynamoDB Developer Guide). To work around this, you could specify the following for
ExpressionAttributeNames:{"#P":"Percentile"}
You could then use this substitution in an expression, as in this example:
#P = :val
Note
Tokens that begin with the : character are expression attribute values, which are placeholders for the actual value at runtime.
For more information on expression attribute names, see Specifying Item Attributes in the Amazon DynamoDB Developer Guide.
(string) –
(string) –
ExpressionAttributeValues (dict) –
One or more values that can be substituted in an expression.
Use the : (colon) character in an expression to dereference an attribute value. For example, suppose that you wanted to check whether the value of the
ProductStatusattribute was one of the following:Available | Backordered | DiscontinuedYou would first need to specify
ExpressionAttributeValuesas follows:{ ":avail":{"S":"Available"}, ":back":{"S":"Backordered"}, ":disc":{"S":"Discontinued"} }You could then use these values in an expression, such as this:
ProductStatus IN (:avail, :back, :disc)For more information on expression attribute values, see Condition Expressions in the Amazon DynamoDB Developer Guide.
(string) –
*(valid DynamoDB type) – *- The value of the attribute. The valid value types are listed in the DynamoDB Reference Guide.
ConsistentRead (boolean) –
A Boolean value that determines the read consistency model during the scan:
If
ConsistentReadisfalse, then the data returned fromScanmight not contain the results from other recently completed write operations (PutItem,UpdateItem, orDeleteItem).If
ConsistentReadistrue, then all of the write operations that completed before theScanbegan are guaranteed to be contained in theScanresponse.
The default setting for
ConsistentReadisfalse.The
ConsistentReadparameter is not supported on global secondary indexes. If you scan a global secondary index withConsistentReadset to true, you will receive aValidationException.
- Return type:
dict
- Returns:
Response Syntax
{ 'Items': [ { 'string': 'string'|123|Binary(b'bytes')|True|None|set(['string'])|set([123])|set([Binary(b'bytes')])|[]|{} }, ], 'Count': 123, 'ScannedCount': 123, 'LastEvaluatedKey': { 'string': 'string'|123|Binary(b'bytes')|True|None|set(['string'])|set([123])|set([Binary(b'bytes')])|[]|{} }, 'ConsumedCapacity': { 'TableName': 'string', 'CapacityUnits': 123.0, 'ReadCapacityUnits': 123.0, 'WriteCapacityUnits': 123.0, 'Table': { 'ReadCapacityUnits': 123.0, 'WriteCapacityUnits': 123.0, 'CapacityUnits': 123.0 }, 'LocalSecondaryIndexes': { 'string': { 'ReadCapacityUnits': 123.0, 'WriteCapacityUnits': 123.0, 'CapacityUnits': 123.0 } }, 'GlobalSecondaryIndexes': { 'string': { 'ReadCapacityUnits': 123.0, 'WriteCapacityUnits': 123.0, 'CapacityUnits': 123.0 } } } }
Response Structure
(dict) –
Represents the output of a
Scanoperation.Items (list) –
An array of item attributes that match the scan criteria. Each element in this array consists of an attribute name and the value for that attribute.
(dict) –
(string) –
*(valid DynamoDB type) – *- The value of the attribute. The valid value types are listed in the DynamoDB Reference Guide.
Count (integer) –
The number of items in the response.
If you set
ScanFilterin the request, thenCountis the number of items returned after the filter was applied, andScannedCountis the number of matching items before the filter was applied.If you did not use a filter in the request, then
Countis the same asScannedCount.ScannedCount (integer) –
The number of items evaluated, before any
ScanFilteris applied. A highScannedCountvalue with few, or no,Countresults indicates an inefficientScanoperation. For more information, see Count and ScannedCount in the Amazon DynamoDB Developer Guide.If you did not use a filter in the request, then
ScannedCountis the same asCount.LastEvaluatedKey (dict) –
The primary key of the item where the operation stopped, inclusive of the previous result set. Use this value to start a new operation, excluding this value in the new request.
If
LastEvaluatedKeyis empty, then the “last page” of results has been processed and there is no more data to be retrieved.If
LastEvaluatedKeyis not empty, it does not necessarily mean that there is more data in the result set. The only way to know when you have reached the end of the result set is whenLastEvaluatedKeyis empty.(string) –
*(valid DynamoDB type) – *- The value of the attribute. The valid value types are listed in the DynamoDB Reference Guide.
ConsumedCapacity (dict) –
The capacity units consumed by the
Scanoperation. The data returned includes the total provisioned throughput consumed, along with statistics for the table and any indexes involved in the operation.ConsumedCapacityis only returned if theReturnConsumedCapacityparameter was specified. For more information, see Capacity unit consumption for read operations in the Amazon DynamoDB Developer Guide.TableName (string) –
The name of the table that was affected by the operation. If you had specified the Amazon Resource Name (ARN) of a table in the input, you’ll see the table ARN in the response.
CapacityUnits (float) –
The total number of capacity units consumed by the operation.
ReadCapacityUnits (float) –
The total number of read capacity units consumed by the operation.
WriteCapacityUnits (float) –
The total number of write capacity units consumed by the operation.
Table (dict) –
The amount of throughput consumed on the table affected by the operation.
ReadCapacityUnits (float) –
The total number of read capacity units consumed on a table or an index.
WriteCapacityUnits (float) –
The total number of write capacity units consumed on a table or an index.
CapacityUnits (float) –
The total number of capacity units consumed on a table or an index.
LocalSecondaryIndexes (dict) –
The amount of throughput consumed on each local index affected by the operation.
(string) –
(dict) –
Represents the amount of provisioned throughput capacity consumed on a table or an index.
ReadCapacityUnits (float) –
The total number of read capacity units consumed on a table or an index.
WriteCapacityUnits (float) –
The total number of write capacity units consumed on a table or an index.
CapacityUnits (float) –
The total number of capacity units consumed on a table or an index.
GlobalSecondaryIndexes (dict) –
The amount of throughput consumed on each global index affected by the operation.
(string) –
(dict) –
Represents the amount of provisioned throughput capacity consumed on a table or an index.
ReadCapacityUnits (float) –
The total number of read capacity units consumed on a table or an index.
WriteCapacityUnits (float) –
The total number of write capacity units consumed on a table or an index.
CapacityUnits (float) –
The total number of capacity units consumed on a table or an index.