Table of Contents
A low-level client representing Amazon CloudSearch Domain:
import boto3
client = boto3.client('cloudsearchdomain')
These are the available methods:
Check if an operation can be paginated.
Generate a presigned url given a client, its method, and arguments
The presigned url
Create a paginator for an operation.
Returns an object that can wait for some condition.
Retrieves a list of documents that match the specified search criteria. How you specify the search criteria depends on which query parser you use. Amazon CloudSearch supports four query parsers:
For more information, see Searching Your Data in the Amazon CloudSearch Developer Guide .
The endpoint for submitting Search requests is domain-specific. You submit search requests to a domain's search endpoint. To get the search endpoint for your domain, use the Amazon CloudSearch configuration service DescribeDomains action. A domain's endpoints are also displayed on the domain dashboard in the Amazon CloudSearch console.
See also: AWS API Documentation
Request Syntax
response = client.search(
cursor='string',
expr='string',
facet='string',
filterQuery='string',
highlight='string',
partial=True|False,
query='string',
queryOptions='string',
queryParser='simple'|'structured'|'lucene'|'dismax',
returnFields='string',
size=123,
sort='string',
start=123,
stats='string'
)
Retrieves a cursor value you can use to page through large result sets. Use the size parameter to control the number of hits to include in each response. You can specify either the cursor or start parameter in a request; they are mutually exclusive. To get the first cursor, set the cursor value to initial . In subsequent requests, specify the cursor value returned in the hits section of the response.
For more information, see Paginating Results in the Amazon CloudSearch Developer Guide .
Defines one or more numeric expressions that can be used to sort results or specify search or filter criteria. You can also specify expressions as return fields.
You specify the expressions in JSON using the form {"EXPRESSIONNAME":"EXPRESSION"} . You can define and use multiple expressions in a search request. For example:
{"expression1":"_score*rating", "expression2":"(1/rank)*year"}
For information about the variables, operators, and functions you can use in expressions, see Writing Expressions in the Amazon CloudSearch Developer Guide .
Specifies one or more fields for which to get facet information, and options that control how the facet information is returned. Each specified field must be facet-enabled in the domain configuration. The fields and options are specified in JSON using the form {"FIELD":{"OPTION":VALUE,"OPTION:"STRING"},"FIELD":{"OPTION":VALUE,"OPTION":"STRING"}} .
You can specify the following faceting options:
If no facet options are specified, facet counts are computed for all field values, the facets are sorted by facet count, and the top 10 facets are returned in the results.
To count particular buckets of values, use the buckets option. For example, the following request uses the buckets option to calculate and return facet counts by decade.
{"year":{"buckets":["[1970,1979]","[1980,1989]","[1990,1999]","[2000,2009]","[2010,}"]}}
To sort facets by facet count, use the count option. For example, the following request sets the sort option to count to sort the facet values by facet count, with the facet values that have the most matching documents listed first. Setting the size option to 3 returns only the top three facet values.
{"year":{"sort":"count","size":3}}
To sort the facets by value, use the bucket option. For example, the following request sets the sort option to bucket to sort the facet values numerically by year, with earliest year listed first.
{"year":{"sort":"bucket"}}
For more information, see Getting and Using Facet Information in the Amazon CloudSearch Developer Guide .
Specifies a structured query that filters the results of a search without affecting how the results are scored and sorted. You use filterQuery in conjunction with the query parameter to filter the documents that match the constraints specified in the query parameter. Specifying a filter controls only which matching documents are included in the results, it has no effect on how they are scored and sorted. The filterQuery parameter supports the full structured query syntax.
For more information about using filters, see Filtering Matching Documents in the Amazon CloudSearch Developer Guide .
Retrieves highlights for matches in the specified text or text-array fields. Each specified field must be highlight enabled in the domain configuration. The fields and options are specified in JSON using the form {"FIELD":{"OPTION":VALUE,"OPTION:"STRING"},"FIELD":{"OPTION":VALUE,"OPTION":"STRING"}} .
You can specify the following highlight options:
If no highlight options are specified for a field, the returned field text is treated as HTML and the first match is highlighted with emphasis tags: <em>search-term</em> .
For example, the following request retrieves highlights for the actors and title fields.
{ "actors": {}, "title": {"format": "text","max_phrases": 2,"pre_tag": "**","post_tag": "** "} }
[REQUIRED]
Specifies the search criteria for the request. How you specify the search criteria depends on the query parser used for the request and the parser options specified in the queryOptions parameter. By default, the simple query parser is used to process requests. To use the structured , lucene , or dismax query parser, you must also specify the queryParser parameter.
For more information about specifying search criteria, see Searching Your Data in the Amazon CloudSearch Developer Guide .
Configures options for the query parser specified in the queryParser parameter. You specify the options in JSON using the following form {"OPTION1":"VALUE1","OPTION2":VALUE2"..."OPTIONN":"VALUEN"}.
The options you can configure vary according to which parser you use:
Specifies which query parser to use to process the request. If queryParser is not specified, Amazon CloudSearch uses the simple query parser.
Amazon CloudSearch supports four query parsers:
Specifies the fields or custom expressions to use to sort the search results. Multiple fields or expressions are specified as a comma-separated list. You must specify the sort direction (asc or desc ) for each field; for example, year desc,title asc . To use a field to sort results, the field must be sort-enabled in the domain configuration. Array type fields cannot be used for sorting. If no sort parameter is specified, results are sorted by their default relevance scores in descending order: _score desc . You can also sort by document ID (_id asc ) and version (_version desc ).
For more information, see Sorting Results in the Amazon CloudSearch Developer Guide .
Specifies the offset of the first search hit you want to return. Note that the result set is zero-based; the first result is at index 0. You can specify either the start or cursor parameter in a request, they are mutually exclusive.
For more information, see Paginating Results in the Amazon CloudSearch Developer Guide .
Specifies one or more fields for which to get statistics information. Each specified field must be facet-enabled in the domain configuration. The fields are specified in JSON using the form:
{"FIELD-A":{},"FIELD-B":{}}
There are currently no options supported for statistics.
dict
Response Syntax
{
'status': {
'timems': 123,
'rid': 'string'
},
'hits': {
'found': 123,
'start': 123,
'cursor': 'string',
'hit': [
{
'id': 'string',
'fields': {
'string': [
'string',
]
},
'exprs': {
'string': 'string'
},
'highlights': {
'string': 'string'
}
},
]
},
'facets': {
'string': {
'buckets': [
{
'value': 'string',
'count': 123
},
]
}
},
'stats': {
'string': {
'min': 'string',
'max': 'string',
'count': 123,
'missing': 123,
'sum': 123.0,
'sumOfSquares': 123.0,
'mean': 'string',
'stddev': 123.0
}
}
}
Response Structure
(dict) --
The result of a Search request. Contains the documents that match the specified search criteria and any requested fields, highlights, and facet information.
status (dict) --
The status information returned for the search request.
timems (integer) --
How long it took to process the request, in milliseconds.
rid (string) --
The encrypted resource ID for the request.
hits (dict) --
The documents that match the search criteria.
found (integer) --
The total number of documents that match the search request.
start (integer) --
The index of the first matching document.
cursor (string) --
A cursor that can be used to retrieve the next set of matching documents when you want to page through a large result set.
hit (list) --
A document that matches the search request.
(dict) --
Information about a document that matches the search request.
id (string) --
The document ID of a document that matches the search request.
fields (dict) --
The fields returned from a document that matches the search request.
exprs (dict) --
The expressions returned from a document that matches the search request.
highlights (dict) --
The highlights returned from a document that matches the search request.
facets (dict) --
The requested facet information.
(string) --
(dict) --
A container for the calculated facet values and counts.
buckets (list) --
A list of the calculated facet values and counts.
(dict) --
A container for facet information.
value (string) --
The facet value being counted.
count (integer) --
The number of hits that contain the facet value in the specified facet field.
stats (dict) --
The requested field statistics information.
(string) --
(dict) --
The statistics for a field calculated in the request.
min (string) --
The minimum value found in the specified field in the result set.
If the field is numeric (int , int-array , double , or double-array ), min is the string representation of a double-precision 64-bit floating point value. If the field is date or date-array , min is the string representation of a date with the format specified in IETF RFC3339 : yyyy-mm-ddTHH:mm:ss.SSSZ.
max (string) --
The maximum value found in the specified field in the result set.
If the field is numeric (int , int-array , double , or double-array ), max is the string representation of a double-precision 64-bit floating point value. If the field is date or date-array , max is the string representation of a date with the format specified in IETF RFC3339 : yyyy-mm-ddTHH:mm:ss.SSSZ.
count (integer) --
The number of documents that contain a value in the specified field in the result set.
missing (integer) --
The number of documents that do not contain a value in the specified field in the result set.
sum (float) --
The sum of the field values across the documents in the result set. null for date fields.
sumOfSquares (float) --
The sum of all field values in the result set squared.
mean (string) --
The average of the values found in the specified field in the result set.
If the field is numeric (int , int-array , double , or double-array ), mean is the string representation of a double-precision 64-bit floating point value. If the field is date or date-array , mean is the string representation of a date with the format specified in IETF RFC3339 : yyyy-mm-ddTHH:mm:ss.SSSZ.
stddev (float) --
The standard deviation of the values in the specified field in the result set.
Retrieves autocomplete suggestions for a partial query string. You can use suggestions enable you to display likely matches before users finish typing. In Amazon CloudSearch, suggestions are based on the contents of a particular text field. When you request suggestions, Amazon CloudSearch finds all of the documents whose values in the suggester field start with the specified query string. The beginning of the field must match the query string to be considered a match.
For more information about configuring suggesters and retrieving suggestions, see Getting Suggestions in the Amazon CloudSearch Developer Guide .
The endpoint for submitting Suggest requests is domain-specific. You submit suggest requests to a domain's search endpoint. To get the search endpoint for your domain, use the Amazon CloudSearch configuration service DescribeDomains action. A domain's endpoints are also displayed on the domain dashboard in the Amazon CloudSearch console.
See also: AWS API Documentation
Request Syntax
response = client.suggest(
query='string',
suggester='string',
size=123
)
[REQUIRED]
Specifies the string for which you want to get suggestions.
[REQUIRED]
Specifies the name of the suggester to use to find suggested matches.
dict
Response Syntax
{
'status': {
'timems': 123,
'rid': 'string'
},
'suggest': {
'query': 'string',
'found': 123,
'suggestions': [
{
'suggestion': 'string',
'score': 123,
'id': 'string'
},
]
}
}
Response Structure
(dict) --
Contains the response to a Suggest request.
status (dict) --
The status of a SuggestRequest . Contains the resource ID (rid ) and how long it took to process the request (timems ).
timems (integer) --
How long it took to process the request, in milliseconds.
rid (string) --
The encrypted resource ID for the request.
suggest (dict) --
Container for the matching search suggestion information.
query (string) --
The query string specified in the suggest request.
found (integer) --
The number of documents that were found to match the query string.
suggestions (list) --
The documents that match the query string.
(dict) --
An autocomplete suggestion that matches the query string specified in a SuggestRequest .
suggestion (string) --
The string that matches the query string specified in the SuggestRequest .
score (integer) --
The relevance score of a suggested match.
id (string) --
The document ID of the suggested document.
Posts a batch of documents to a search domain for indexing. A document batch is a collection of add and delete operations that represent the documents you want to add, update, or delete from your domain. Batches can be described in either JSON or XML. Each item that you want Amazon CloudSearch to return as a search result (such as a product) is represented as a document. Every document has a unique ID and one or more fields that contain the data that you want to search and return in results. Individual documents cannot contain more than 1 MB of data. The entire batch cannot exceed 5 MB. To get the best possible upload performance, group add and delete operations in batches that are close the 5 MB limit. Submitting a large volume of single-document batches can overload a domain's document service.
The endpoint for submitting UploadDocuments requests is domain-specific. To get the document endpoint for your domain, use the Amazon CloudSearch configuration service DescribeDomains action. A domain's endpoints are also displayed on the domain dashboard in the Amazon CloudSearch console.
For more information about formatting your data for Amazon CloudSearch, see Preparing Your Data in the Amazon CloudSearch Developer Guide . For more information about uploading data for indexing, see Uploading Data in the Amazon CloudSearch Developer Guide .
See also: AWS API Documentation
Request Syntax
response = client.upload_documents(
documents=b'bytes'|file,
contentType='application/json'|'application/xml'
)
[REQUIRED]
A batch of documents formatted in JSON or HTML.
[REQUIRED]
The format of the batch you are uploading. Amazon CloudSearch supports two document batch formats:
dict
Response Syntax
{
'status': 'string',
'adds': 123,
'deletes': 123,
'warnings': [
{
'message': 'string'
},
]
}
Response Structure
(dict) --
Contains the response to an UploadDocuments request.
status (string) --
The status of an UploadDocumentsRequest .
adds (integer) --
The number of documents that were added to the search domain.
deletes (integer) --
The number of documents that were deleted from the search domain.
warnings (list) --
Any warnings returned by the document service about the documents being uploaded.
(dict) --
A warning returned by the document service when an issue is discovered while processing an upload request.
message (string) --
The description for a warning returned by the document service.