You are here: Reference > Rule types > Data Set > Creating an HBase data set record

Creating an HBase data set record

You must configure each instance of the HBase data set rule before it can read data from and save it to an external HBase storage.

  1. Create an instance of the HBase data set rule.

  2. Connect to an instance of the Data-Admin-Hadoop configuration rule.

    1. In the Hadoop configuration instance field, reference the Data-Admin-Hadoop configuration rule that contains HBase storage configuration.

    2. Click Test connectivity.

  3. Configure mapping between the fields that are stored in an HBase table and properties in the Pega Platform.

    1. Optional: Click Refresh table names.

    2. In the HBase table name field, select a table that is available in the HBase storage to which you are connected.

    3. Optional: Click Preview table to see the first 100 row IDs and all column families defined in the table schema.

      • Select a row ID and a column family to view data in the selected table. When you preview the data, it helps you to define the property mappings.

    4. In the Row ID field, specify a property that contains a row ID.

      A row ID uniquely identifies a single row in an HBase table. The HBase dataset rule instance that you are configuring will perform all operations on a row identified by the row ID.

    5. Click Add mapping.

    6. In the HBase column field, specify a name of the field that is stored in the HBase table. Use the following format <column_family>:<column_name>, for example, total:expenses.

      You can specify just a column family name and map it to the page list property of Embed-NameValuePair type or page group property of SingleValue-Text type. In this case, all the column values are put into a list, using the pyName or pxSubscript property for the column name, and pyValue for the value.

    7. In the Property Name field, specify a property in the Pega Platform.

    8. Repeat steps e-g to map more properties.

  4. Click Save.