create_application

KinesisAnalytics.Client.create_application(**kwargs)

Note

This documentation is for version 1 of the Amazon Kinesis Data Analytics API, which only supports SQL applications. Version 2 of the API supports SQL and Java applications. For more information about version 2, see Amazon Kinesis Data Analytics API V2 Documentation.

Creates an Amazon Kinesis Analytics application. You can configure each application with one streaming source as input, application code to process the input, and up to three destinations where you want Amazon Kinesis Analytics to write the output data from your application. For an overview, see How it Works.

In the input configuration, you map the streaming source to an in-application stream, which you can think of as a constantly updating table. In the mapping, you must provide a schema for the in-application stream and map each data column in the in-application stream to a data element in the streaming source.

Your application code is one or more SQL statements that read input data, transform it, and generate output. Your application code can create one or more SQL artifacts like SQL streams or pumps.

In the output configuration, you can configure the application to write data from in-application streams created in your applications to up to three destinations.

To read data from your source stream or write data to destination streams, Amazon Kinesis Analytics needs your permissions. You grant these permissions by creating IAM roles. This operation requires permissions to perform the kinesisanalytics:CreateApplication action.

For introductory exercises to create an Amazon Kinesis Analytics application, see Getting Started.

See also: AWS API Documentation

Request Syntax

response = client.create_application(
    ApplicationName='string',
    ApplicationDescription='string',
    Inputs=[
        {
            'NamePrefix': 'string',
            'InputProcessingConfiguration': {
                'InputLambdaProcessor': {
                    'ResourceARN': 'string',
                    'RoleARN': 'string'
                }
            },
            'KinesisStreamsInput': {
                'ResourceARN': 'string',
                'RoleARN': 'string'
            },
            'KinesisFirehoseInput': {
                'ResourceARN': 'string',
                'RoleARN': 'string'
            },
            'InputParallelism': {
                'Count': 123
            },
            'InputSchema': {
                'RecordFormat': {
                    'RecordFormatType': 'JSON'|'CSV',
                    'MappingParameters': {
                        'JSONMappingParameters': {
                            'RecordRowPath': 'string'
                        },
                        'CSVMappingParameters': {
                            'RecordRowDelimiter': 'string',
                            'RecordColumnDelimiter': 'string'
                        }
                    }
                },
                'RecordEncoding': 'string',
                'RecordColumns': [
                    {
                        'Name': 'string',
                        'Mapping': 'string',
                        'SqlType': 'string'
                    },
                ]
            }
        },
    ],
    Outputs=[
        {
            'Name': 'string',
            'KinesisStreamsOutput': {
                'ResourceARN': 'string',
                'RoleARN': 'string'
            },
            'KinesisFirehoseOutput': {
                'ResourceARN': 'string',
                'RoleARN': 'string'
            },
            'LambdaOutput': {
                'ResourceARN': 'string',
                'RoleARN': 'string'
            },
            'DestinationSchema': {
                'RecordFormatType': 'JSON'|'CSV'
            }
        },
    ],
    CloudWatchLoggingOptions=[
        {
            'LogStreamARN': 'string',
            'RoleARN': 'string'
        },
    ],
    ApplicationCode='string',
    Tags=[
        {
            'Key': 'string',
            'Value': 'string'
        },
    ]
)
Parameters
  • ApplicationName (string) --

    [REQUIRED]

    Name of your Amazon Kinesis Analytics application (for example, sample-app ).

  • ApplicationDescription (string) -- Summary description of the application.
  • Inputs (list) --

    Use this parameter to configure the application input.

    You can configure your application to receive input from a single streaming source. In this configuration, you map this streaming source to an in-application stream that is created. Your application code can then query the in-application stream like a table (you can think of it as a constantly updating table).

    For the streaming source, you provide its Amazon Resource Name (ARN) and format of data on the stream (for example, JSON, CSV, etc.). You also must provide an IAM role that Amazon Kinesis Analytics can assume to read this stream on your behalf.

    To create the in-application stream, you need to specify a schema to transform your data into a schematized version used in SQL. In the schema, you provide the necessary mapping of the data elements in the streaming source to record columns in the in-app stream.

    • (dict) --

      When you configure the application input, you specify the streaming source, the in-application stream name that is created, and the mapping between the two. For more information, see Configuring Application Input.

      • NamePrefix (string) -- [REQUIRED]

        Name prefix to use when creating an in-application stream. Suppose that you specify a prefix "MyInApplicationStream." Amazon Kinesis Analytics then creates one or more (as per the InputParallelism count you specified) in-application streams with names "MyInApplicationStream_001," "MyInApplicationStream_002," and so on.

      • InputProcessingConfiguration (dict) --

        The InputProcessingConfiguration for the input. An input processor transforms records as they are received from the stream, before the application's SQL code executes. Currently, the only input processing configuration available is InputLambdaProcessor.

        • InputLambdaProcessor (dict) -- [REQUIRED]

          The InputLambdaProcessor that is used to preprocess the records in the stream before being processed by your application code.

          • ResourceARN (string) -- [REQUIRED]

            The ARN of the AWS Lambda function that operates on records in the stream.

            Note

            To specify an earlier version of the Lambda function than the latest, include the Lambda function version in the Lambda function ARN. For more information about Lambda ARNs, see Example ARNs: AWS Lambda

          • RoleARN (string) -- [REQUIRED]

            The ARN of the IAM role that is used to access the AWS Lambda function.

      • KinesisStreamsInput (dict) --

        If the streaming source is an Amazon Kinesis stream, identifies the stream's Amazon Resource Name (ARN) and an IAM role that enables Amazon Kinesis Analytics to access the stream on your behalf.

        Note: Either KinesisStreamsInput or KinesisFirehoseInput is required.

        • ResourceARN (string) -- [REQUIRED]

          ARN of the input Amazon Kinesis stream to read.

        • RoleARN (string) -- [REQUIRED]

          ARN of the IAM role that Amazon Kinesis Analytics can assume to access the stream on your behalf. You need to grant the necessary permissions to this role.

      • KinesisFirehoseInput (dict) --

        If the streaming source is an Amazon Kinesis Firehose delivery stream, identifies the delivery stream's ARN and an IAM role that enables Amazon Kinesis Analytics to access the stream on your behalf.

        Note: Either KinesisStreamsInput or KinesisFirehoseInput is required.

        • ResourceARN (string) -- [REQUIRED]

          ARN of the input delivery stream.

        • RoleARN (string) -- [REQUIRED]

          ARN of the IAM role that Amazon Kinesis Analytics can assume to access the stream on your behalf. You need to make sure that the role has the necessary permissions to access the stream.

      • InputParallelism (dict) --

        Describes the number of in-application streams to create.

        Data from your source is routed to these in-application input streams.

        (see Configuring Application Input.

        • Count (integer) --

          Number of in-application streams to create. For more information, see Limits.

      • InputSchema (dict) -- [REQUIRED]

        Describes the format of the data in the streaming source, and how each data element maps to corresponding columns in the in-application stream that is being created.

        Also used to describe the format of the reference data source.

        • RecordFormat (dict) -- [REQUIRED]

          Specifies the format of the records on the streaming source.

          • RecordFormatType (string) -- [REQUIRED]

            The type of record format.

          • MappingParameters (dict) --

            When configuring application input at the time of creating or updating an application, provides additional mapping information specific to the record format (such as JSON, CSV, or record fields delimited by some delimiter) on the streaming source.

            • JSONMappingParameters (dict) --

              Provides additional mapping information when JSON is the record format on the streaming source.

              • RecordRowPath (string) -- [REQUIRED]

                Path to the top-level parent that contains the records.

            • CSVMappingParameters (dict) --

              Provides additional mapping information when the record format uses delimiters (for example, CSV).

              • RecordRowDelimiter (string) -- [REQUIRED]

                Row delimiter. For example, in a CSV format, 'n' is the typical row delimiter.

              • RecordColumnDelimiter (string) -- [REQUIRED]

                Column delimiter. For example, in a CSV format, a comma (",") is the typical column delimiter.

        • RecordEncoding (string) --

          Specifies the encoding of the records in the streaming source. For example, UTF-8.

        • RecordColumns (list) -- [REQUIRED]

          A list of RecordColumn objects.

          • (dict) --

            Describes the mapping of each data element in the streaming source to the corresponding column in the in-application stream.

            Also used to describe the format of the reference data source.

            • Name (string) -- [REQUIRED]

              Name of the column created in the in-application input stream or reference table.

            • Mapping (string) --

              Reference to the data element in the streaming input or the reference data source. This element is required if the RecordFormatType is JSON .

            • SqlType (string) -- [REQUIRED]

              Type of column created in the in-application input stream or reference table.

  • Outputs (list) --

    You can configure application output to write data from any of the in-application streams to up to three destinations.

    These destinations can be Amazon Kinesis streams, Amazon Kinesis Firehose delivery streams, AWS Lambda destinations, or any combination of the three.

    In the configuration, you specify the in-application stream name, the destination stream or Lambda function Amazon Resource Name (ARN), and the format to use when writing data. You must also provide an IAM role that Amazon Kinesis Analytics can assume to write to the destination stream or Lambda function on your behalf.

    In the output configuration, you also provide the output stream or Lambda function ARN. For stream destinations, you provide the format of data in the stream (for example, JSON, CSV). You also must provide an IAM role that Amazon Kinesis Analytics can assume to write to the stream or Lambda function on your behalf.

    • (dict) --

      Describes application output configuration in which you identify an in-application stream and a destination where you want the in-application stream data to be written. The destination can be an Amazon Kinesis stream or an Amazon Kinesis Firehose delivery stream.

      For limits on how many destinations an application can write and other limitations, see Limits.

      • Name (string) -- [REQUIRED]

        Name of the in-application stream.

      • KinesisStreamsOutput (dict) --

        Identifies an Amazon Kinesis stream as the destination.

        • ResourceARN (string) -- [REQUIRED]

          ARN of the destination Amazon Kinesis stream to write to.

        • RoleARN (string) -- [REQUIRED]

          ARN of the IAM role that Amazon Kinesis Analytics can assume to write to the destination stream on your behalf. You need to grant the necessary permissions to this role.

      • KinesisFirehoseOutput (dict) --

        Identifies an Amazon Kinesis Firehose delivery stream as the destination.

        • ResourceARN (string) -- [REQUIRED]

          ARN of the destination Amazon Kinesis Firehose delivery stream to write to.

        • RoleARN (string) -- [REQUIRED]

          ARN of the IAM role that Amazon Kinesis Analytics can assume to write to the destination stream on your behalf. You need to grant the necessary permissions to this role.

      • LambdaOutput (dict) --

        Identifies an AWS Lambda function as the destination.

        • ResourceARN (string) -- [REQUIRED]

          Amazon Resource Name (ARN) of the destination Lambda function to write to.

          Note

          To specify an earlier version of the Lambda function than the latest, include the Lambda function version in the Lambda function ARN. For more information about Lambda ARNs, see Example ARNs: AWS Lambda

        • RoleARN (string) -- [REQUIRED]

          ARN of the IAM role that Amazon Kinesis Analytics can assume to write to the destination function on your behalf. You need to grant the necessary permissions to this role.

      • DestinationSchema (dict) -- [REQUIRED]

        Describes the data format when records are written to the destination. For more information, see Configuring Application Output.

        • RecordFormatType (string) -- [REQUIRED]

          Specifies the format of the records on the output stream.

  • CloudWatchLoggingOptions (list) --

    Use this parameter to configure a CloudWatch log stream to monitor application configuration errors. For more information, see Working with Amazon CloudWatch Logs.

    • (dict) --

      Provides a description of CloudWatch logging options, including the log stream Amazon Resource Name (ARN) and the role ARN.

      • LogStreamARN (string) -- [REQUIRED]

        ARN of the CloudWatch log to receive application messages.

      • RoleARN (string) -- [REQUIRED]

        IAM ARN of the role to use to send application messages. Note: To write application messages to CloudWatch, the IAM role that is used must have the PutLogEvents policy action enabled.

  • ApplicationCode (string) --

    One or more SQL statements that read input data, transform it, and generate output. For example, you can write a SQL statement that reads data from one in-application stream, generates a running average of the number of advertisement clicks by vendor, and insert resulting rows in another in-application stream using pumps. For more information about the typical pattern, see Application Code.

    You can provide such series of SQL statements, where output of one statement can be used as the input for the next statement. You store intermediate results by creating in-application streams and pumps.

    Note that the application code must create the streams with names specified in the Outputs . For example, if your Outputs defines output streams named ExampleOutputStream1 and ExampleOutputStream2 , then your application code must create these streams.

  • Tags (list) --

    A list of one or more tags to assign to the application. A tag is a key-value pair that identifies an application. Note that the maximum number of application tags includes system tags. The maximum number of user-defined application tags is 50. For more information, see Using Tagging.

    • (dict) --

      A key-value pair (the value is optional) that you can define and assign to AWS resources. If you specify a tag that already exists, the tag value is replaced with the value that you specify in the request. Note that the maximum number of application tags includes system tags. The maximum number of user-defined application tags is 50. For more information, see Using Tagging.

      • Key (string) -- [REQUIRED]

        The key of the key-value tag.

      • Value (string) --

        The value of the key-value tag. The value is optional.

Return type

dict

Returns

Response Syntax

{
    'ApplicationSummary': {
        'ApplicationName': 'string',
        'ApplicationARN': 'string',
        'ApplicationStatus': 'DELETING'|'STARTING'|'STOPPING'|'READY'|'RUNNING'|'UPDATING'
    }
}

Response Structure

  • (dict) --

    TBD

    • ApplicationSummary (dict) --

      In response to your CreateApplication request, Amazon Kinesis Analytics returns a response with a summary of the application it created, including the application Amazon Resource Name (ARN), name, and status.

      • ApplicationName (string) --

        Name of the application.

      • ApplicationARN (string) --

        ARN of the application.

      • ApplicationStatus (string) --

        Status of the application.

Exceptions

  • KinesisAnalytics.Client.exceptions.CodeValidationException
  • KinesisAnalytics.Client.exceptions.ResourceInUseException
  • KinesisAnalytics.Client.exceptions.LimitExceededException
  • KinesisAnalytics.Client.exceptions.InvalidArgumentException
  • KinesisAnalytics.Client.exceptions.TooManyTagsException
  • KinesisAnalytics.Client.exceptions.ConcurrentModificationException