Creating an HBase data set record
You must configure each instance of the HBase data set rule before it can read data from and save it to an external HBase storage.
- Create an instance of the HBase data set rule.
-
Connect to an instance of the
Data-Admin-Hadoop
configuration rule
by performing the following actions:
- In the Hadoop configuration instance field, reference the Data-Admin-Hadoop configuration rule that contains HBase storage configuration.
- Click Test connectivity.
-
Configure mapping between the fields that are stored in an HBase table and properties
in the
Pega Platform
by performing the following actions:
- Optional: Click Refresh table names.
- In the HBase table name field, select a table that is available in the HBase storage to which you are connected.
-
Click
Preview table
to see the first 100 row IDs and all
column families defined in the table schema, and then select a row ID and a column
family to view data in the selected table.
When you preview the data, it helps you to define the property mappings.
-
In the Row ID field, specify a property that contains a row ID.
A row ID uniquely identifies a single row in an HBase table. The HBase dataset rule instance that you are configuring will perform all operations on a row identified by the row ID.
- Click Add mapping.
-
In the HBase column field, specify a name of the field that is stored in the HBase
table. Use the following format
<column_family>:<column_name>, for example,
total:expenses.
Note: You can specify just a column family name and map it to the page list property of Embed-NameValuePair type or page group property of SingleValue-Text type. In this case, all the column values are put into a list, using the pyName or pxSubscript property for the column name, and pyValue for the value.
- In the Property Name field, specify a property in the Pega Platform.
- Repeat steps e-g to map more properties.
- Click Save.