This article summarizes some of the key capabilities and strategies used across the different layers of the Pega Cloud offering to ensure optimal availability and resiliency. Most of these capabilities and strategies are deployed and managed by Pega Cloud and are available for all customers.
For optimum availability, Pega Cloud Production Services uses the Amazon Web Services (AWS) Infrastructure-as-a-Service (IaaS). The AWS infrastructure is composed of Regions and Availability Zones (AZs). AWS Regions are geographically distant, logically defined, and jurisdictionally distinct operating areas. Each AWS Region consists of multiple Availability Zones, which are commonly managed, logically defined groups of redundant and geographically separate physical locations. Deploying across multiple Availability Zones protects against zone faults and localized service disruptions.
For more information about availability zones, see the Amazon Web Services website.
The following regions are currently available:
- Northern California
Customers select the region into which they want their virtual private cloud (VPC) to be deployed, either to meet legal requirements or to be located closer to their customers. Pega Cloud Production Services provisions customer sandbox and production environments into the VPC in that region, with components deployed across availability zones.
Load-balancing services are deployed across all Pega Cloud environments to manage traffic. In environments with multiple application servers spread across multiple availability zones, these load-balancing services drive traffic to healthy nodes, masking any application server failures that might occur.
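The routing behavior described above can be illustrated with a minimal sketch. It is not the Pega Cloud or AWS load-balancer implementation; the `Node` and `HealthAwareBalancer` names, the round-robin policy, and the zone labels are assumptions chosen only to show how traffic can be kept away from a failed application server.

```python
from dataclasses import dataclass
from itertools import count


@dataclass
class Node:
    """A hypothetical application server in a given availability zone."""
    name: str
    zone: str
    healthy: bool = True


class HealthAwareBalancer:
    """Round-robin over the nodes that are currently marked healthy."""

    def __init__(self, nodes):
        self.nodes = list(nodes)
        self._requests = count()  # monotonically increasing request counter

    def pick(self):
        # Unhealthy nodes are filtered out on every request, so a failed
        # server is masked from callers as soon as it is marked unhealthy.
        healthy = [n for n in self.nodes if n.healthy]
        if not healthy:
            raise RuntimeError("no healthy nodes available")
        return healthy[next(self._requests) % len(healthy)]
```

In this sketch, marking one node as unhealthy simply removes it from the rotation, and requests continue to be served by nodes in the remaining zones.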
Pega Cloud Database Replication and Data Backup services have robust Recovery Time Objective (RTO) and Recovery Point Objective (RPO) policies for production and sandbox environments.
- The Pega Cloud Database Replication service protects production environments by deploying a real-time database mirror in a different availability zone. Failover to the production mirror is automatic. The recovery point objective (RPO), which is the maximum window of time for which data might be lost from a customer database, is less than 1 minute. The recovery time objective (RTO), which is the targeted time to restore the customer's cloud service, is less than 4 minutes.
- The Pega Cloud Data Backup service backs up customer data in all sandbox and production environments on an ongoing basis with a recovery point objective (RPO) of less than 10 minutes. Database backup recovery time objective (RTO) depends on the size of the system deployed.
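As a worked illustration of how these objectives are read, the sketch below compares an observed data-loss window and an observed downtime against the figures quoted above. The function names and any timestamps passed to them are hypothetical; only the 1-minute, 4-minute, and 10-minute objectives come from this article.

```python
from datetime import datetime, timedelta

# Objectives quoted in this article.
REPLICATION_RPO = timedelta(minutes=1)
REPLICATION_RTO = timedelta(minutes=4)
BACKUP_RPO = timedelta(minutes=10)


def data_loss_window(last_recovery_point: datetime, failure_time: datetime) -> timedelta:
    """Time between the last replicated or backed-up state and the failure."""
    return failure_time - last_recovery_point


def downtime(failure_time: datetime, service_restored: datetime) -> timedelta:
    """Time between the failure and the restoration of service."""
    return service_restored - failure_time


def meets_objective(observed: timedelta, objective: timedelta) -> bool:
    """The objectives are stated as 'less than', so the comparison is strict."""
    return observed < objective
```

For example, if the mirror last received data 30 seconds before a failure and service resumed 3 minutes after it, both `meets_objective(timedelta(seconds=30), REPLICATION_RPO)` and `meets_objective(timedelta(minutes=3), REPLICATION_RTO)` evaluate to `True`.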
The Pega Cloud virtual private network (VPN) service implements a warm-standby channel within the primary tunnel. If a failure occurs in the primary channel, the service automatically fails over to the secondary channel.
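The failover behavior can be pictured with a small state model. This is only a sketch under stated assumptions: the `StandbyTunnel` class and its method names are hypothetical, and it models nothing about the actual VPN protocol beyond the primary-to-secondary switch described above. Fail-back to the primary channel is not covered by this article and is therefore left out.

```python
class StandbyTunnel:
    """Hypothetical model of a tunnel with a primary and a warm-standby channel."""

    def __init__(self):
        # Warm standby: the secondary channel is up but carries no traffic.
        self.up = {"primary": True, "secondary": True}
        self.active = "primary"

    def mark_down(self, channel: str) -> None:
        """Record a channel failure and fail over if the active primary failed."""
        self.up[channel] = False
        if channel == "primary" and self.active == "primary":
            if not self.up["secondary"]:
                raise RuntimeError("both channels are down")
            self.active = "secondary"
```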
Availability monitoring in Pega Cloud
Pega Cloud Operations continually monitors the cloud environment and generates internal alerts when a monitored value crosses its threshold or a check receives no response. Examples of the monitoring checks implemented to ensure optimal availability include:
- Server responsiveness – Proactive, regular ping
- CPU utilization – Proactive monitoring with alert when utilization exceeds threshold
- Memory utilization (buffered and cached) – Proactive monitoring with alert when utilization exceeds threshold
- Disk utilization – Proactive monitoring with alert when utilization exceeds threshold
- Application server accessibility – wget/curl, alert if no response
- Application server – wget/curl and JMX, alert if no response
- Application server heap utilization – wget/curl, alert when utilization exceeds threshold
- Database – Proactive, regular database query; alert if no response
- Database table space utilization – Proactive, regular database query; alert when utilization exceeds threshold
- Database errors – Proactive, regular database query; alert if error returned
Pega Cloud Production Services can alter these monitoring thresholds based on the customer environment and any Pegasystems application-specific parameters.
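The threshold-based checks listed above, together with per-environment adjustment of thresholds, can be sketched as follows. The metric names, the default threshold values, and the `evaluate` function are all assumptions made for illustration; this article does not publish the actual thresholds or tooling.

```python
# Hypothetical default thresholds, in percent utilization.
DEFAULT_THRESHOLDS = {"cpu_pct": 90.0, "memory_pct": 90.0, "disk_pct": 85.0}


def evaluate(metrics, overrides=None):
    """Return alert messages for one round of checks.

    metrics:   mapping of metric name to observed percentage, or None when
               the probe got no response.
    overrides: optional per-environment thresholds that replace the defaults.
    """
    thresholds = {**DEFAULT_THRESHOLDS, **(overrides or {})}
    alerts = []
    for name, limit in thresholds.items():
        value = metrics.get(name)
        if value is None:
            alerts.append(f"{name}: no response")
        elif value > limit:
            alerts.append(f"{name}: {value:.1f} exceeds threshold {limit:.1f}")
    return alerts
```

Passing an `overrides` mapping, for example `{"cpu_pct": 97.0}`, raises the CPU threshold for a single environment without changing the defaults used elsewhere.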
Support and maintenance
If customers encounter issues with the availability of their Pega Cloud service, they can open a service request in My Support Portal.
For questions about Pega Cloud maintenance policies, see Pega Cloud maintenance and types of system updates.