LexRuntimeV2 / Client / recognize_utterance
recognize_utterance#
- LexRuntimeV2.Client.recognize_utterance(**kwargs)#
Sends user input to Amazon Lex V2. You can send text or speech. Clients use this API to send text and audio requests to Amazon Lex V2 at runtime. Amazon Lex V2 interprets the user input using the machine learning model built for the bot.
The following request fields must be compressed with gzip and then base64 encoded before you send them to Amazon Lex V2.
requestAttributes
sessionState
The following response fields are compressed using gzip and then base64 encoded by Amazon Lex V2. Before you can use these fields, you must decode and decompress them.
inputTranscript
interpretations
messages
requestAttributes
sessionState
The example contains a Java application that compresses and encodes a Java object to send to Amazon Lex V2, and a second that decodes and decompresses a response from Amazon Lex V2.
If the optional post-fulfillment response is specified, the messages are returned as follows. For more information, see PostFulfillmentStatusSpecification.
Success message - Returned if the Lambda function completes successfully and the intent state is fulfilled or ready fulfillment if the message is present.
Failed message - The failed message is returned if the Lambda function throws an exception or if the Lambda function returns a failed intent state without a message.
Timeout message - If you don’t configure a timeout message and a timeout, and the Lambda function doesn’t return within 30 seconds, the timeout message is returned. If you configure a timeout, the timeout message is returned when the period times out.
For more information, see Completion message.
See also: AWS API Documentation
Request Syntax
response = client.recognize_utterance( botId='string', botAliasId='string', localeId='string', sessionId='string', sessionState='string', requestAttributes='string', requestContentType='string', responseContentType='string', inputStream=b'bytes'|file )
- Parameters:
botId (string) –
[REQUIRED]
The identifier of the bot that should receive the request.
botAliasId (string) –
[REQUIRED]
The alias identifier in use for the bot that should receive the request.
localeId (string) –
[REQUIRED]
The locale where the session is in use.
sessionId (string) –
[REQUIRED]
The identifier of the session in use.
sessionState (string) –
Sets the state of the session with the user. You can use this to set the current intent, attributes, context, and dialog action. Use the dialog action to determine the next step that Amazon Lex V2 should use in the conversation with the user.
The
sessionState
field must be compressed using gzip and then base64 encoded before sending to Amazon Lex V2.requestAttributes (string) –
Request-specific information passed between the client application and Amazon Lex V2
The namespace
x-amz-lex:
is reserved for special attributes. Don’t create any request attributes for prefixx-amz-lex:
.The
requestAttributes
field must be compressed using gzip and then base64 encoded before sending to Amazon Lex V2.requestContentType (string) –
[REQUIRED]
Indicates the format for audio input or that the content is text. The header must start with one of the following prefixes:
PCM format, audio data must be in little-endian byte order.
audio/l16; rate=16000; channels=1
audio/x-l16; sample-rate=16000; channel-count=1
audio/lpcm; sample-rate=8000; sample-size-bits=16; channel-count=1; is-big-endian=false
Opus format
audio/x-cbr-opus-with-preamble;preamble-size=0;bit-rate=256000;frame-size-milliseconds=4
Text format
text/plain; charset=utf-8
responseContentType (string) –
The message that Amazon Lex V2 returns in the response can be either text or speech based on the
responseContentType
value.If the value is
text/plain;charset=utf-8
, Amazon Lex V2 returns text in the response.If the value begins with
audio/
, Amazon Lex V2 returns speech in the response. Amazon Lex V2 uses Amazon Polly to generate the speech using the configuration that you specified in theresponseContentType
parameter. For example, if you specifyaudio/mpeg
as the value, Amazon Lex V2 returns speech in the MPEG format.If the value is
audio/pcm
, the speech returned isaudio/pcm
at 16 KHz in 16-bit, little-endian format.The following are the accepted values:
audio/mpeg
audio/ogg
audio/pcm (16 KHz)
audio/* (defaults to mpeg)
text/plain; charset=utf-8
inputStream (bytes or seekable file-like object) – User input in PCM or Opus audio format or text format as described in the
requestContentType
parameter.
- Return type:
dict
- Returns:
Response Syntax
{ 'inputMode': 'string', 'contentType': 'string', 'messages': 'string', 'interpretations': 'string', 'sessionState': 'string', 'requestAttributes': 'string', 'sessionId': 'string', 'inputTranscript': 'string', 'audioStream': StreamingBody(), 'recognizedBotMember': 'string' }
Response Structure
(dict) –
inputMode (string) –
Indicates whether the input mode to the operation was text, speech, or from a touch-tone keypad.
contentType (string) –
Content type as specified in the
responseContentType
in the request.messages (string) –
A list of messages that were last sent to the user. The messages are ordered based on the order that you returned the messages from your Lambda function or the order that the messages are defined in the bot.
The
messages
field is compressed with gzip and then base64 encoded. Before you can use the contents of the field, you must decode and decompress the contents. See the example for a simple function to decode and decompress the contents.interpretations (string) –
A list of intents that Amazon Lex V2 determined might satisfy the user’s utterance.
Each interpretation includes the intent, a score that indicates how confident Amazon Lex V2 is that the interpretation is the correct one, and an optional sentiment response that indicates the sentiment expressed in the utterance.
The
interpretations
field is compressed with gzip and then base64 encoded. Before you can use the contents of the field, you must decode and decompress the contents. See the example for a simple function to decode and decompress the contents.sessionState (string) –
Represents the current state of the dialog between the user and the bot.
Use this to determine the progress of the conversation and what the next action might be.
The
sessionState
field is compressed with gzip and then base64 encoded. Before you can use the contents of the field, you must decode and decompress the contents. See the example for a simple function to decode and decompress the contents.requestAttributes (string) –
The attributes sent in the request.
The
requestAttributes
field is compressed with gzip and then base64 encoded. Before you can use the contents of the field, you must decode and decompress the contents.sessionId (string) –
The identifier of the session in use.
inputTranscript (string) –
The text used to process the request.
If the input was an audio stream, the
inputTranscript
field contains the text extracted from the audio stream. This is the text that is actually processed to recognize intents and slot values. You can use this information to determine if Amazon Lex V2 is correctly processing the audio that you send.The
inputTranscript
field is compressed with gzip and then base64 encoded. Before you can use the contents of the field, you must decode and decompress the contents. See the example for a simple function to decode and decompress the contents.audioStream (
StreamingBody
) –The prompt or statement to send to the user. This is based on the bot configuration and context. For example, if Amazon Lex V2 did not understand the user intent, it sends the
clarificationPrompt
configured for the bot. If the intent requires confirmation before taking the fulfillment action, it sends theconfirmationPrompt
. Another example: Suppose that the Lambda function successfully fulfilled the intent, and sent a message to convey to the user. Then Amazon Lex V2 sends that message in the response.recognizedBotMember (string) –
The bot member that recognized the utterance.
Exceptions