From the Connection tab, define all the connection details for the Hadoop host.
- In the Connection section, specify a master Hadoop host. This host must run both the HDFS NameNode and the HBase master node.
- Optional: To configure settings for the HDFS connection, select the Use HDFS configuration check box.
- Optional: To configure settings for the HBase connection, select the Use HBase configuration check box.
- Optional: Enable running external data flows on the Hadoop record. Configure the following objects:
  You can configure Pega Platform to run predictive models directly on a Hadoop record with an external data flow. Through Pega Platform, you can view the input for the data flow and its outcome.
  Using the Hadoop infrastructure lets you process large amounts of data directly on the Hadoop cluster and reduce the data transfer between the Hadoop cluster and Pega Platform.
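
Before you enter the master Hadoop host in the Connection section, it can help to confirm that the host accepts connections for both the HDFS NameNode and the HBase master node. The following Python sketch is not part of Pega Platform. It only attempts a TCP connection to each service. The port numbers (8020 for the NameNode RPC endpoint, 16000 for the HBase master) are assumed common defaults, and the function names are hypothetical; check your cluster configuration for the actual ports.

```python
import socket

# Assumed default ports; a cluster may override them in its own configuration.
ASSUMED_PORTS = {"HDFS NameNode": 8020, "HBase master": 16000}


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


def check_master_host(host, ports=ASSUMED_PORTS, timeout=3.0):
    """Map each service name to whether its port on the host is reachable."""
    return {name: port_is_open(host, port, timeout) for name, port in ports.items()}
```

For example, `check_master_host("hadoop-master.example.com")` (a placeholder host name) returns a dictionary with one entry per service. A `False` value means only that the port was not reachable from the machine running the check; it does not show which service, if any, is listening on a reachable port.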