Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note: The given AWS credentials will get saved in DStream checkpoints if checkpointing is enabled. Make sure that your checkpoint directory is secure.
Java StreamingContext object
Kinesis application name used by the Kinesis Client Library (KCL) to update DynamoDB
Kinesis stream name
Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Name of region used by the Kinesis Client Library (KCL) to update DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
Storage level to use for storing the received objects. StorageLevel.MEMORY_AND_DISK_2 is recommended.
AWS AccessKeyId (if null, will use DefaultAWSCredentialsProviderChain)
AWS SecretKey (if null, will use DefaultAWSCredentialsProviderChain)
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note: The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain gets the AWS credentials.
Java StreamingContext object
Kinesis application name used by the Kinesis Client Library (KCL) to update DynamoDB
Kinesis stream name
Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Name of region used by the Kinesis Client Library (KCL) to update DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
Storage level to use for storing the received objects. StorageLevel.MEMORY_AND_DISK_2 is recommended.
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note: The given AWS credentials will get saved in DStream checkpoints if checkpointing is enabled. Make sure that your checkpoint directory is secure.
Java StreamingContext object
Kinesis application name used by the Kinesis Client Library (KCL) to update DynamoDB
Kinesis stream name
Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Name of region used by the Kinesis Client Library (KCL) to update DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
Storage level to use for storing the received objects. StorageLevel.MEMORY_AND_DISK_2 is recommended.
A custom message handler that can generate a generic output from a
Kinesis Record
, which contains both message data, and metadata.
Class of the records in DStream
AWS AccessKeyId (if null, will use DefaultAWSCredentialsProviderChain)
AWS SecretKey (if null, will use DefaultAWSCredentialsProviderChain)
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note: The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain gets the AWS credentials.
Java StreamingContext object
Kinesis application name used by the Kinesis Client Library (KCL) to update DynamoDB
Kinesis stream name
Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Name of region used by the Kinesis Client Library (KCL) to update DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
Storage level to use for storing the received objects. StorageLevel.MEMORY_AND_DISK_2 is recommended.
A custom message handler that can generate a generic output from a
Kinesis Record
, which contains both message data, and metadata.
Class of the records in DStream
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note: The given AWS credentials will get saved in DStream checkpoints if checkpointing is enabled. Make sure that your checkpoint directory is secure.
StreamingContext object
Kinesis application name used by the Kinesis Client Library (KCL) to update DynamoDB
Kinesis stream name
Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Name of region used by the Kinesis Client Library (KCL) to update DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
Storage level to use for storing the received objects. StorageLevel.MEMORY_AND_DISK_2 is recommended.
AWS AccessKeyId (if null, will use DefaultAWSCredentialsProviderChain)
AWS SecretKey (if null, will use DefaultAWSCredentialsProviderChain)
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note: The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain gets the AWS credentials.
StreamingContext object
Kinesis application name used by the Kinesis Client Library (KCL) to update DynamoDB
Kinesis stream name
Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Name of region used by the Kinesis Client Library (KCL) to update DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
Storage level to use for storing the received objects. StorageLevel.MEMORY_AND_DISK_2 is recommended.
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note: The given AWS credentials will get saved in DStream checkpoints if checkpointing is enabled. Make sure that your checkpoint directory is secure.
StreamingContext object
Kinesis application name used by the Kinesis Client Library (KCL) to update DynamoDB
Kinesis stream name
Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Name of region used by the Kinesis Client Library (KCL) to update DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
Storage level to use for storing the received objects. StorageLevel.MEMORY_AND_DISK_2 is recommended.
A custom message handler that can generate a generic output from a
Kinesis Record
, which contains both message data, and metadata.
AWS AccessKeyId (if null, will use DefaultAWSCredentialsProviderChain)
AWS SecretKey (if null, will use DefaultAWSCredentialsProviderChain)
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note: The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain gets the AWS credentials.
StreamingContext object
Kinesis application name used by the Kinesis Client Library (KCL) to update DynamoDB
Kinesis stream name
Url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Name of region used by the Kinesis Client Library (KCL) to update DynamoDB (lease coordination and checkpointing) and CloudWatch (metrics)
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
Storage level to use for storing the received objects. StorageLevel.MEMORY_AND_DISK_2 is recommended.
A custom message handler that can generate a generic output from a
Kinesis Record
, which contains both message data, and metadata.
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note:
- The AWS credentials will be discovered using the DefaultAWSCredentialsProviderChain
on the workers. See AWS documentation to understand how DefaultAWSCredentialsProviderChain
gets AWS credentials.
- The region of the endpointUrl
will be used for DynamoDB and CloudWatch.
- The Kinesis application name used by the Kinesis Client Library (KCL) will be the app name in
org.apache.spark.SparkConf.
Java StreamingContext object
Kinesis stream name
Endpoint url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Storage level to use for storing the received objects StorageLevel.MEMORY_AND_DISK_2 is recommended.
(Since version 1.4.0) use other forms of createStream
Create an input stream that pulls messages from a Kinesis stream.
Create an input stream that pulls messages from a Kinesis stream. This uses the Kinesis Client Library (KCL) to pull messages from Kinesis.
Note:
endpointUrl
will be used for DynamoDB and CloudWatch.StreamingContext object
Kinesis stream name
Endpoint url of Kinesis service (e.g., https://kinesis.us-east-1.amazonaws.com)
Checkpoint interval for Kinesis checkpointing. See the Kinesis Spark Streaming documentation for more details on the different types of checkpoints.
In the absence of Kinesis checkpoint info, this is the worker's initial starting position in the stream. The values are either the beginning of the stream per Kinesis' limit of 24 hours (InitialPositionInStream.TRIM_HORIZON) or the tip of the stream (InitialPositionInStream.LATEST).
Storage level to use for storing the received objects StorageLevel.MEMORY_AND_DISK_2 is recommended.
(Since version 1.4.0) use other forms of createStream