kendra.Client.
query
(**kwargs)¶Searches an active index. Use this API to search your documents using query. The Query
API enables to do faceted search and to filter results based on document attributes.
It also enables you to provide user context that Amazon Kendra uses to enforce document access control in the search results.
Amazon Kendra searches your index for text content and question and answer (FAQ) content. By default the response contains three types of results.
You can specify that the query return only one type of result using the QueryResultTypeFilter
parameter.
Each query returns the 100 most relevant results.
See also: AWS API Documentation
Request Syntax
response = client.query(
IndexId='string',
QueryText='string',
AttributeFilter={
'AndAllFilters': [
{'... recursive ...'},
],
'OrAllFilters': [
{'... recursive ...'},
],
'NotFilter': {'... recursive ...'},
'EqualsTo': {
'Key': 'string',
'Value': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
}
},
'ContainsAll': {
'Key': 'string',
'Value': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
}
},
'ContainsAny': {
'Key': 'string',
'Value': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
}
},
'GreaterThan': {
'Key': 'string',
'Value': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
}
},
'GreaterThanOrEquals': {
'Key': 'string',
'Value': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
}
},
'LessThan': {
'Key': 'string',
'Value': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
}
},
'LessThanOrEquals': {
'Key': 'string',
'Value': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
}
}
},
Facets=[
{
'DocumentAttributeKey': 'string',
'Facets': {'... recursive ...'},
'MaxResults': 123
},
],
RequestedDocumentAttributes=[
'string',
],
QueryResultTypeFilter='DOCUMENT'|'QUESTION_ANSWER'|'ANSWER',
DocumentRelevanceOverrideConfigurations=[
{
'Name': 'string',
'Relevance': {
'Freshness': True|False,
'Importance': 123,
'Duration': 'string',
'RankOrder': 'ASCENDING'|'DESCENDING',
'ValueImportanceMap': {
'string': 123
}
}
},
],
PageNumber=123,
PageSize=123,
SortingConfiguration={
'DocumentAttributeKey': 'string',
'SortOrder': 'DESC'|'ASC'
},
UserContext={
'Token': 'string',
'UserId': 'string',
'Groups': [
'string',
],
'DataSourceGroups': [
{
'GroupId': 'string',
'DataSourceId': 'string'
},
]
},
VisitorId='string',
SpellCorrectionConfiguration={
'IncludeQuerySpellCheckSuggestions': True|False
}
)
[REQUIRED]
The identifier of the index to search. The identifier is returned in the response from the CreateIndex
API.
Enables filtered searches based on document attributes. You can only provide one attribute filter; however, the AndAllFilters
, NotFilter
, and OrAllFilters
parameters contain a list of other filters.
The AttributeFilter
parameter enables you to create a set of filtering rules that a document must satisfy to be included in the query results.
Performs a logical AND
operation on all supplied filters.
Provides filtering the query results based on document attributes or metadata fields.
When you use the AndAllFilters
or OrAllFilters
, filters you can use 2 layers under the first attribute filter. For example, you can use:
<AndAllFilters>
<OrAllFilters>
<EqualsTo>
If you use more than 2 layers, you receive a ValidationException
exception with the message " AttributeFilter
cannot have a depth of more than 2."
If you use more than 10 attribute filters in a given list for AndAllFilters
or OrAllFilters
, you receive a ValidationException
with the message " AttributeFilter
cannot have a length of more than 10".
Performs a logical OR
operation on all supplied filters.
Provides filtering the query results based on document attributes or metadata fields.
When you use the AndAllFilters
or OrAllFilters
, filters you can use 2 layers under the first attribute filter. For example, you can use:
<AndAllFilters>
<OrAllFilters>
<EqualsTo>
If you use more than 2 layers, you receive a ValidationException
exception with the message " AttributeFilter
cannot have a depth of more than 2."
If you use more than 10 attribute filters in a given list for AndAllFilters
or OrAllFilters
, you receive a ValidationException
with the message " AttributeFilter
cannot have a length of more than 10".
Performs a logical NOT
operation on all supplied filters.
Performs an equals operation on two document attributes or metadata fields.
The identifier for the attribute.
The value of the attribute.
A string, such as "department".
A list of strings. The default maximum length or number of strings is 10.
A long integer value.
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
Returns true when a document contains all of the specified document attributes or metadata fields. This filter is only applicable to StringListValue
metadata.
The identifier for the attribute.
The value of the attribute.
A string, such as "department".
A list of strings. The default maximum length or number of strings is 10.
A long integer value.
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
Returns true when a document contains any of the specified document attributes or metadata fields. This filter is only applicable to StringListValue
metadata.
The identifier for the attribute.
The value of the attribute.
A string, such as "department".
A list of strings. The default maximum length or number of strings is 10.
A long integer value.
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
Performs a greater than operation on two document attributes or metadata fields. Use with a document attribute of type Date
or Long
.
The identifier for the attribute.
The value of the attribute.
A string, such as "department".
A list of strings. The default maximum length or number of strings is 10.
A long integer value.
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
Performs a greater or equals than operation on two document attributes or metadata fields. Use with a document attribute of type Date
or Long
.
The identifier for the attribute.
The value of the attribute.
A string, such as "department".
A list of strings. The default maximum length or number of strings is 10.
A long integer value.
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
Performs a less than operation on two document attributes or metadata fields. Use with a document attribute of type Date
or Long
.
The identifier for the attribute.
The value of the attribute.
A string, such as "department".
A list of strings. The default maximum length or number of strings is 10.
A long integer value.
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
Performs a less than or equals operation on two document attributes or metadata fields. Use with a document attribute of type Date
or Long
.
The identifier for the attribute.
The value of the attribute.
A string, such as "department".
A list of strings. The default maximum length or number of strings is 10.
A long integer value.
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
An array of documents attributes. Amazon Kendra returns a count for each attribute key specified. This helps your users narrow their search.
Information about a document attribute. You can use document attributes as facets.
For example, the document attribute or facet "Department" includes the values "HR", "Engineering", and "Accounting". You can display these values in the search results so that documents can be searched by department.
You can display up to 10 facet values per facet for a query. If you want to increase this limit, contact Support.
The unique key for the document attribute.
An array of document attributes that are nested facets within a facet.
For example, the document attribute or facet "Department" includes a value called "Engineering". In addition, the document attribute or facet "SubDepartment" includes the values "Frontend" and "Backend" for documents assigned to "Engineering". You can display nested facets in the search results so that documents can be searched not only by department but also by a sub department within a department. This helps your users further narrow their search.
You can only have one nested facet within a facet. If you want to increase this limit, contact Support.
Maximum number of facet values per facet. The default is 10. You can use this to limit the number of facet values to less than 10. If you want to increase the default, contact Support.
An array of document attributes to include in the response. You can limit the response to include certain document attributes. By default all document attributes are included in the response.
Overrides relevance tuning configurations of fields or attributes set at the index level.
If you use this API to override the relevance tuning configured at the index level, but there is no relevance tuning configured at the index level, then Amazon Kendra does not apply any relevance tuning.
If there is relevance tuning configured at the index level, but you do not use this API to override any relevance tuning in the index, then Amazon Kendra uses the relevance tuning that is configured at the index level.
If there is relevance tuning configured for fields at the index level, but you use this API to override only some of these fields, then for the fields you did not override, the importance is set to 1.
Overrides the document relevance properties of a custom index field.
The name of the index field.
Provides information for tuning the relevance of a field in a search. When a query includes terms that match the field, the results are given a boost in the response based on these tuning parameters.
Indicates that this field determines how "fresh" a document is. For example, if document 1 was created on November 5, and document 2 was created on October 31, document 1 is "fresher" than document 2. You can only set the Freshness
field on one DATE
type field. Only applies to DATE
fields.
The relative importance of the field in the search. Larger numbers provide more of a boost than smaller numbers.
Specifies the time period that the boost applies to. For example, to make the boost apply to documents with the field value within the last month, you would use "2628000s". Once the field value is beyond the specified range, the effect of the boost drops off. The higher the importance, the faster the effect drops off. If you don't specify a value, the default is 3 months. The value of the field is a numeric string followed by the character "s", for example "86400s" for one day, or "604800s" for one week.
Only applies to DATE
fields.
Determines how values should be interpreted.
When the RankOrder
field is ASCENDING
, higher numbers are better. For example, a document with a rating score of 10 is higher ranking than a document with a rating score of 1.
When the RankOrder
field is DESCENDING
, lower numbers are better. For example, in a task tracking application, a priority 1 task is more important than a priority 5 task.
Only applies to LONG
and DOUBLE
fields.
A list of values that should be given a different boost when they appear in the result list. For example, if you are boosting a field called "department," query terms that match the department field are boosted in the result. However, you can add entries from the department field to boost documents with those values higher.
For example, you can add entries to the map with names of departments. If you add "HR",5 and "Legal",3 those departments are given special attention when they appear in the metadata of a document. When those terms appear they are given the specified importance instead of the regular importance for the boost.
PageSize
parameter. By default, Amazon Kendra returns the first page of results. Use this parameter to get result pages after the first one.Provides information that determines how the results of the query are sorted. You can set the field that Amazon Kendra should sort the results on, and specify whether the results should be sorted in ascending or descending order. In the case of ties in sorting the results, the results are sorted by relevance.
If you don't provide sorting configuration, the results are sorted by the relevance that Amazon Kendra determines for the result.
The name of the document attribute used to sort the response. You can use any field that has the Sortable
flag set to true.
You can also sort by any of the following built-in attributes:
The order that the results should be returned in. In case of ties, the relevance assigned to the result by Amazon Kendra is used as the tie-breaker.
The user context token or user and group information.
The user context token for filtering search results for a user. It must be a JWT or a JSON token.
The identifier of the user you want to filter search results based on their access to documents.
The list of groups you want to filter search results based on the groups' access to documents.
The list of data source groups you want to filter search results based on groups' access to documents in that data source.
Data source information for user context filtering.
The identifier of the group you want to add to your list of groups. This is for filtering search results based on the groups' access to documents.
The identifier of the data source group you want to add to your list of data source groups. This is for filtering search results based on the groups' access to documents in that data source.
VisitorId
should be a unique identifier, such as a GUID. Don't use personally identifiable information, such as the user's email address, as the VisitorId
.Enables suggested spell corrections for queries.
TRUE
to suggest spell corrections for queries.
dict
Response Syntax
{
'QueryId': 'string',
'ResultItems': [
{
'Id': 'string',
'Type': 'DOCUMENT'|'QUESTION_ANSWER'|'ANSWER',
'Format': 'TABLE'|'TEXT',
'AdditionalAttributes': [
{
'Key': 'string',
'ValueType': 'TEXT_WITH_HIGHLIGHTS_VALUE',
'Value': {
'TextWithHighlightsValue': {
'Text': 'string',
'Highlights': [
{
'BeginOffset': 123,
'EndOffset': 123,
'TopAnswer': True|False,
'Type': 'STANDARD'|'THESAURUS_SYNONYM'
},
]
}
}
},
],
'DocumentId': 'string',
'DocumentTitle': {
'Text': 'string',
'Highlights': [
{
'BeginOffset': 123,
'EndOffset': 123,
'TopAnswer': True|False,
'Type': 'STANDARD'|'THESAURUS_SYNONYM'
},
]
},
'DocumentExcerpt': {
'Text': 'string',
'Highlights': [
{
'BeginOffset': 123,
'EndOffset': 123,
'TopAnswer': True|False,
'Type': 'STANDARD'|'THESAURUS_SYNONYM'
},
]
},
'DocumentURI': 'string',
'DocumentAttributes': [
{
'Key': 'string',
'Value': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
}
},
],
'ScoreAttributes': {
'ScoreConfidence': 'VERY_HIGH'|'HIGH'|'MEDIUM'|'LOW'|'NOT_AVAILABLE'
},
'FeedbackToken': 'string',
'TableExcerpt': {
'Rows': [
{
'Cells': [
{
'Value': 'string',
'TopAnswer': True|False,
'Highlighted': True|False,
'Header': True|False
},
]
},
],
'TotalNumberOfRows': 123
}
},
],
'FacetResults': [
{
'DocumentAttributeKey': 'string',
'DocumentAttributeValueType': 'STRING_VALUE'|'STRING_LIST_VALUE'|'LONG_VALUE'|'DATE_VALUE',
'DocumentAttributeValueCountPairs': [
{
'DocumentAttributeValue': {
'StringValue': 'string',
'StringListValue': [
'string',
],
'LongValue': 123,
'DateValue': datetime(2015, 1, 1)
},
'Count': 123,
'FacetResults': {'... recursive ...'}
},
]
},
],
'TotalNumberOfResults': 123,
'Warnings': [
{
'Message': 'string',
'Code': 'QUERY_LANGUAGE_INVALID_SYNTAX'
},
],
'SpellCorrectedQueries': [
{
'SuggestedQueryText': 'string',
'Corrections': [
{
'BeginOffset': 123,
'EndOffset': 123,
'Term': 'string',
'CorrectedTerm': 'string'
},
]
},
]
}
Response Structure
(dict) --
QueryId (string) --
The identifier for the search. You use QueryId
to identify the search when using the feedback API.
ResultItems (list) --
The results of the search.
(dict) --
A single query result.
A query result contains information about a document returned by the query. This includes the original location of the document, a list of attributes assigned to the document, and relevant text from the document that satisfies the query.
Id (string) --
The identifier for the query result.
Type (string) --
The type of document within the response. For example, a response could include a question-answer that's relevant to the query.
Format (string) --
If the Type
of document within the response is ANSWER
, then it is either a TABLE
answer or TEXT
answer. If it's a table answer, a table excerpt is returned in TableExcerpt
. If it's a text answer, a text excerpt is returned in DocumentExcerpt
.
AdditionalAttributes (list) --
One or more additional attributes associated with the query result.
(dict) --
An attribute returned from an index query.
Key (string) --
The key that identifies the attribute.
ValueType (string) --
The data type of the Value
property.
Value (dict) --
An object that contains the attribute value.
TextWithHighlightsValue (dict) --
The text associated with the attribute and information about the highlight to apply to the text.
Text (string) --
The text to display to the user.
Highlights (list) --
The beginning and end of the text that should be highlighted.
(dict) --
Provides information that you can use to highlight a search result so that your users can quickly identify terms in the response.
BeginOffset (integer) --
The zero-based location in the response string where the highlight starts.
EndOffset (integer) --
The zero-based location in the response string where the highlight ends.
TopAnswer (boolean) --
Indicates whether the response is the best response. True if this is the best response; otherwise, false.
Type (string) --
The highlight type.
DocumentId (string) --
The identifier for the document.
DocumentTitle (dict) --
The title of the document. Contains the text of the title and information for highlighting the relevant terms in the title.
Text (string) --
The text to display to the user.
Highlights (list) --
The beginning and end of the text that should be highlighted.
(dict) --
Provides information that you can use to highlight a search result so that your users can quickly identify terms in the response.
BeginOffset (integer) --
The zero-based location in the response string where the highlight starts.
EndOffset (integer) --
The zero-based location in the response string where the highlight ends.
TopAnswer (boolean) --
Indicates whether the response is the best response. True if this is the best response; otherwise, false.
Type (string) --
The highlight type.
DocumentExcerpt (dict) --
An extract of the text in the document. Contains information about highlighting the relevant terms in the excerpt.
Text (string) --
The text to display to the user.
Highlights (list) --
The beginning and end of the text that should be highlighted.
(dict) --
Provides information that you can use to highlight a search result so that your users can quickly identify terms in the response.
BeginOffset (integer) --
The zero-based location in the response string where the highlight starts.
EndOffset (integer) --
The zero-based location in the response string where the highlight ends.
TopAnswer (boolean) --
Indicates whether the response is the best response. True if this is the best response; otherwise, false.
Type (string) --
The highlight type.
DocumentURI (string) --
The URI of the original location of the document.
DocumentAttributes (list) --
An array of document attributes assigned to a document in the search results. For example, the document author ( _author
) or the source URI ( _source_uri
) of the document.
(dict) --
A document attribute or metadata field. To create custom document attributes, see Custom attributes.
Key (string) --
The identifier for the attribute.
Value (dict) --
The value of the attribute.
StringValue (string) --
A string, such as "department".
StringListValue (list) --
A list of strings. The default maximum length or number of strings is 10.
LongValue (integer) --
A long integer value.
DateValue (datetime) --
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
ScoreAttributes (dict) --
Indicates the confidence that Amazon Kendra has that a result matches the query that you provided. Each result is placed into a bin that indicates the confidence, VERY_HIGH
, HIGH
, MEDIUM
and LOW
. You can use the score to determine if a response meets the confidence needed for your application.
The field is only set to LOW
when the Type
field is set to DOCUMENT
and Amazon Kendra is not confident that the result matches the query.
ScoreConfidence (string) --
A relative ranking for how well the response matches the query.
FeedbackToken (string) --
A token that identifies a particular result from a particular query. Use this token to provide click-through feedback for the result. For more information, see Submitting feedback.
TableExcerpt (dict) --
An excerpt from a table within a document.
Rows (list) --
A list of rows in the table excerpt.
(dict) --
Information about a row in a table excerpt.
Cells (list) --
A list of table cells in a row.
(dict) --
Provides information about a table cell in a table excerpt.
Value (string) --
The actual value or content within a table cell. A table cell could contain a date value of a year, or a string value of text, for example.
TopAnswer (boolean) --
TRUE
if the response of the table cell is the top answer. This is the cell value or content with the highest confidence score or is the most relevant to the query.
Highlighted (boolean) --
TRUE
means that the table cell has a high enough confidence and is relevant to the query, so the value or content should be highlighted.
Header (boolean) --
TRUE
means that the table cell should be treated as a header.
TotalNumberOfRows (integer) --
A count of the number of rows in the original table within the document.
FacetResults (list) --
Contains the facet results. A FacetResult
contains the counts for each attribute key that was specified in the Facets
input parameter.
(dict) --
The facet values for the documents in the response.
DocumentAttributeKey (string) --
The key for the facet values. This is the same as the DocumentAttributeKey
provided in the query.
DocumentAttributeValueType (string) --
The data type of the facet value. This is the same as the type defined for the index field when it was created.
DocumentAttributeValueCountPairs (list) --
An array of key/value pairs, where the key is the value of the attribute and the count is the number of documents that share the key value.
(dict) --
Provides the count of documents that match a particular attribute when doing a faceted search.
DocumentAttributeValue (dict) --
The value of the attribute. For example, "HR".
StringValue (string) --
A string, such as "department".
StringListValue (list) --
A list of strings. The default maximum length or number of strings is 10.
LongValue (integer) --
A long integer value.
DateValue (datetime) --
A date expressed as an ISO 8601 string.
It is important for the time zone to be included in the ISO 8601 date-time format. For example, 2012-03-25T12:30:10+01:00 is the ISO 8601 date-time format for March 25th 2012 at 12:30PM (plus 10 seconds) in Central European Time.
Count (integer) --
The number of documents in the response that have the attribute value for the key.
FacetResults (list) --
Contains the results of a document attribute that is a nested facet. A FacetResult
contains the counts for each facet nested within a facet.
For example, the document attribute or facet "Department" includes a value called "Engineering". In addition, the document attribute or facet "SubDepartment" includes the values "Frontend" and "Backend" for documents assigned to "Engineering". You can display nested facets in the search results so that documents can be searched not only by department but also by a sub department within a department. The counts for documents that belong to "Frontend" and "Backend" within "Engineering" are returned for a query.
TotalNumberOfResults (integer) --
The total number of items found by the search; however, you can only retrieve up to 100 items. For example, if the search found 192 items, you can only retrieve the first 100 of the items.
Warnings (list) --
A list of warning codes and their messages on problems with your query.
Amazon Kendra currently only supports one type of warning, which is a warning on invalid syntax used in the query. For examples of invalid query syntax, see Searching with advanced query syntax.
(dict) --
The warning code and message that explains a problem with a query.
Message (string) --
The message that explains the problem with the query.
Code (string) --
The code used to show the type of warning for the query.
SpellCorrectedQueries (list) --
A list of information related to suggested spell corrections for a query.
(dict) --
A query with suggested spell corrections.
SuggestedQueryText (string) --
The query with the suggested spell corrections.
Corrections (list) --
The corrected misspelled word or words in a query.
(dict) --
A corrected misspelled word in a query.
BeginOffset (integer) --
The zero-based location in the response string or text where the corrected word starts.
EndOffset (integer) --
The zero-based location in the response string or text where the corrected word ends.
Term (string) --
The string or text of a misspelled word in a query.
CorrectedTerm (string) --
The string or text of a corrected misspelled word in a query.
Exceptions
kendra.Client.exceptions.ValidationException
kendra.Client.exceptions.ConflictException
kendra.Client.exceptions.ResourceNotFoundException
kendra.Client.exceptions.ThrottlingException
kendra.Client.exceptions.AccessDeniedException
kendra.Client.exceptions.ServiceQuotaExceededException
kendra.Client.exceptions.InternalServerException