You are here: Decision Strategy Manager > Managing predictive and text analytics > Text analytics

Text analytics

You can use the Pega 7 Platform to analyze unstructured text that may come in through different channels: emails, social networks, chat channels, and so on.

Pega Platform provides the following methods of text analysis:

Sentiment analysis – Detects positive, neutral, or negative sentiment. Supports machine learning.
Text classification – Classifies text into categories. Supports machine learning and rule-based classification that is based on taxonomy keywords.
Intent detection – Assigns underlying intents to text input. Does not support machine learning.
Entity extraction – Extracts named entities from text. Does not support machine learning. Entity extraction can be configured by using Apache RUTA scripts.

You can build machine learning models for sentiment analysis and text classification and deploy those models using Text Analyzer rules.

To train a text analytics model, you must upload training data with sample texts and associated outcomes. For a sentiment analysis, these sample records must be associated with a positive, neutral or negative outcome. For text classification, the outcome must be one of the categories in the taxonomy. In the process of creating a model, the data is split into a training sample and a test sample. The training sample is used to train the model. The test sample is the hold-out sample that is used validate the model. When a model is built, you can validate its performance.

If no training data is available, a text classification model can also be created based on category keywords included in the taxonomy (this is rule-based classification analysis). You cannot create your own machine learning models for intent detection or entity extraction. Entity extraction models can be defined by using RUTA scripts.

Text analytics in the Pega Platform is run through a Text Analyzer rule. A text analyzer parses text, automatically recognizes the language, and processes the models. A text analyzer rule may refer to one or more models of the methods that are listed above.

Open topic with navigation