Connection tab

On the Connection tab, define the connection details for the Hadoop host.

Note: Before you connect to an Apache HBase or HDFS data store, upload the relevant client JAR files to the application container that runs the Pega 7 Platform. The JAR files that you need depend on the version of the data store (for example, HBase 0.90 or HDFS 2.7.1).
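Before uploading, it can help to confirm which client JAR versions you have collected. The following is a minimal sketch, not part of the Pega 7 Platform: the function name list_client_jars, the staging directory, and the assumption that JAR files follow the common name-version.jar naming pattern are all illustrative.

```python
import re
from pathlib import Path

# Assumption: JAR files use the common "<artifact>-<version>.jar" naming,
# for example "hadoop-hdfs-2.7.1.jar" or "hbase-0.90.6.jar".
JAR_PATTERN = re.compile(
    r"^(?P<name>[A-Za-z][\w.-]*?)-(?P<version>\d+(?:\.\d+)*)\.jar$"
)


def list_client_jars(directory):
    """Return {artifact_name: version} for versioned JAR files in a directory.

    Hypothetical helper: files that do not match the naming pattern are skipped.
    """
    found = {}
    for path in sorted(Path(directory).iterdir()):
        match = JAR_PATTERN.match(path.name)
        if path.is_file() and match:
            found[match.group("name")] = match.group("version")
    return found
```

For example, calling list_client_jars on a local staging folder returns a dictionary that you can compare with the HBase and HDFS versions of your cluster before you upload the files.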

  1. In the Connection section, specify a master Hadoop host. This host must run both the HDFS NameNode and the HBase master node.
  2. Optional: To configure settings for the HDFS connection, select the Use HDFS configuration check box.
  3. Optional: To configure settings for the HBase connection, select the Use HBase configuration check box.
  4. Optional: Enable running external data flows on the Hadoop record. Configure the following objects:

    You can configure the Pega 7 Platform to run predictive models directly on a Hadoop record with an external data flow. Through the Pega 7 Platform, you can view the input for the data flow and its outcome.

    Note: Running the data flow on the Hadoop infrastructure lets you process large amounts of data directly on the Hadoop cluster, which reduces the data transfer between the Hadoop cluster and the Pega 7 Platform.
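
Step 1 above requires a master host that runs both the HDFS NameNode and the HBase master node. As a rough pre-check before you fill in the Connection section, you can test whether both services accept TCP connections on that host. This is a sketch under assumptions, not a Pega 7 Platform feature: the helper names are hypothetical, and the port numbers are common defaults (8020 for the NameNode RPC port in Hadoop 2.x, 16000 for the HBase master in HBase 1.0 and later, 60000 in older releases such as 0.90). Your cluster may use different values.

```python
import socket

# Assumed default ports; confirm them against your cluster configuration.
HDFS_NAMENODE_PORT = 8020   # common NameNode RPC port in Hadoop 2.x
HBASE_MASTER_PORT = 16000   # HBase 1.0+; older releases such as 0.90 used 60000


def is_port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


def check_master_host(host, ports=None):
    """Return {service_name: reachable} for the services expected on the host."""
    if ports is None:
        ports = {
            "HDFS NameNode": HDFS_NAMENODE_PORT,
            "HBase master": HBASE_MASTER_PORT,
        }
    return {service: is_port_open(host, port) for service, port in ports.items()}
```

A call such as check_master_host("hadoop-master.example.com"), where the host name is a placeholder, returns one True or False value per service. An open port shows only that something is listening on it; it does not confirm that the client JAR files match the server version.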