You can create an instance of a Kinesis data set in Pega Platform
to connect to an instance of Amazon Kinesis Data Streams. Amazon Kinesis Data Streams ingests a
large amount of data in real time, durably stores it, and makes it available for lightweight
processing. For Pega Cloud applications, you can use a Pega-provided Kinesis data stream or
connect to your own Kinesis data stream.
Before you begin: Make sure that the Identity and Access Management (IAM) policies in Amazon Web Services
(AWS) are set to allow access to Kinesis data streams. For more information, see the Amazon
Web Services (AWS) documentation about IAM policies. To use your own Kinesis account with data
streams, change the value of the useExternalKinesisAccount Dynamic System
Setting to true
-
Create an instance of the Kinesis
data set rule.
-
In the Connection section, select a Kinesis configuration
instance and a region. For more information about the available regions, see the Amazon Web Services (AWS)
documentation.
Note: This step is not available if you are running Pega Platform in a
cloud environment (the onPegaCloud Dynamic System Setting is set to
true) and you are using a Pega-provided Kinesis data
stream.
-
In the Stream section, select a stream that is available in your
Kinesis configuration instance.
Important: If you use a Kinesis data stream with Pega Platform on premises or a Kinesis data stream with Pega Platform in the cloud that are in different regions, you might
experience performance issues during data set operations. For optimal performance, use a
Kinesis data stream with Pega Platform in the cloud that are in the
same region.
- Optional:
In the Partition key(s) section, define the data set
partitioning.
By configuring partitioning, you ensure that related records are sent to the same
partition. If you do not define partition keys, the Kinesis data set randomly assigns
records to partitions, which can hinder its performance.
-
Click Add key.
-
In the Key field, press the Down Arrow key to select the
property that you want the Kinesis data set to use as a partitioning key.
Note:
By default, the available properties to be used as keys correspond to the
properties of the Applies To class of the Kinesis data set.
-
Click Save.