Configuring text extraction analysis
Configure text extraction analysis by specifying tags, keywords, entity extraction models, and pattern extraction rules. Use tags and keywords to mark specific terms and their synonyms that you want to identify in the analyzed text. Text and pattern extraction models help to identify various types of named entities.
- In the Records panel, click .
- Open the Text Analyzer rule that you want to edit.
-
Perform any of the following actions:
- To detect the most relevant words or phrases in a document to, for example, create
word clouds, or perform a faceted search, in the Text extraction
section, select the Enable auto-tag extraction check box and
perform one of the following actions:
- To detect all significant tags in the document, select the Detect all tags radio button.
- To detect a specific number of tags in the document, select the Detect top N tag(s) radio button and specify the number of tags that you want to detect, starting from the most significant one.
- To extract keywords, named entities of specific types, or entities whose names follow various patterns, in the Text extraction section, select the Enable text extraction check box.
- To detect the most relevant words or phrases in a document to, for example, create
word clouds, or perform a faceted search, in the Text extraction
section, select the Enable auto-tag extraction check box and
perform one of the following actions:
-
If you enabled the Text extraction setting, configure one of the
following text extraction options:
- To extract a set of specific keywords and their synonyms, for example,
George Washington, US President,
POTUS, and so on, perform the following actions:
- In the Keywords section, click Add keywords.
- In the Name field, press the Down Arrow key and select a keyword-based text extraction model.
For more information, see Creating keyword-based text extraction models.
- To extract named entities that belong to an open dictionary of terms, for example,
names of people, locations, and so on, perform the following actions:
- In the Entity extraction models section, click Add model.
- In the Entity model field, press the Down Arrow key to select an entity extraction model.
For more information, see Creating machine learning-based text extraction models.
- To extract account numbers, ZIP codes, case numbers, telephone numbers, and so on,
perform the following actions:
- In the Pattern extraction models section, click Add model.
- In the Entity rule field, press the Down Arrow key to select a pattern extraction model
For more information, see Creating pattern extraction models.
- To extract a set of specific keywords and their synonyms, for example,
George Washington, US President,
POTUS, and so on, perform the following actions:
- Click Save.