Extract content from image-based files

With the new DocumentOcr component, you can convert an image-based document that contains text into text that you can manipulate in an automation. Use this component with image-only documents, such as a faxed document, or with documents that contain both text and images. The component works with image files (such as .png, .jpeg, and .tiff), PDF files, and Microsoft Word documents.

  • Use in automations to extract data from the supported file formats.

  • Use in automations for searching and copying data from unstructured documents.
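Once a document has been converted to text, searching for and copying values is ordinary string processing. The following Python sketch illustrates that second step only. It is not Pega code and does not call the DocumentOcr component; it assumes the extracted text is already available as a string. The sample text, field names, and patterns are hypothetical.

```python
import re

# Hypothetical OCR output; in practice this string would come from the OCR step.
ocr_text = """FAX TRANSMISSION
Invoice No: INV-20418
Date: 2018-09-20
Total Due: $1,250.00
"""

# Hypothetical field patterns, written for this sample text only.
FIELD_PATTERNS = {
    "invoice_number": r"Invoice No:\s*(\S+)",
    "date": r"Date:\s*(\d{4}-\d{2}-\d{2})",
    "total_due": r"Total Due:\s*\$([\d,]+\.\d{2})",
}


def extract_fields(text, patterns):
    """Return the first match for each pattern, or None when a field is absent."""
    result = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, text)
        result[name] = match.group(1) if match else None
    return result


fields = extract_fields(ocr_text, FIELD_PATTERNS)
for name, value in fields.items():
    print(f"{name}: {value}")
```

Returning None for a missing field, rather than raising an error, lets the calling logic decide how to handle text that was only partly recognized.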

For more information, see DocumentOcr Component.

Published September 20, 2018 — Updated October 3, 2018
