Managing Cassandra as the source of decision management data
Pega Platform operates Apache Cassandra as the underlying storage system for the Decision Data Store (DDS). Cassandra is an open source, column-oriented, and fault-tolerant database that handles large data workloads across multiple nodes.The following chapter provides guidelines on how to manage, maintain, and run Cassandra nodes as part of Decision Strategy Manager. It also provides procedures for optimizing Cassandra operations and lists the tools that you can use to perform such optimizations in Pega Platform and outside.
- Cassandra overview
Apache Cassandra is the primary means of storage for all of the customer, historical, and analytical data that you use in decision management. The following sections provide an overview of the most important Cassandra features in terms of scalability, data distribution, consistency, and architecture.
- Configuring the Cassandra cluster
Pega Platform comes with an internal Cassandra cluster to which you can connect through a Decision Data Store data set. Before connecting to the cluster through Pega Platform, perform the following steps to achieve optimal performance and data consistency across the nodes in the cluster.
- Monitoring Cassandra cluster health
You can secure the good health of a Cassandra cluster by monitoring the node status in Pega Platform and by running regular repair operations.
- Troubleshooting Cassandra
Identify the root cause of degraded performance by completing corresponding monitoring activities. Learn about the most commonly encountered Cassandra issues and how to address them.