Extract content from image-based files
With the new DocumentOcr component you convert an image-based document that contains text into text that you can manipulate in an automation. Use this component with images, such as a faxed document, or documents that contain both text and images. This component works with image files (such as .png, .jpeg, .tiff), PDF files, and Microsoft Word documents.
-
Use in automations to extract data from the supported file formats.
-
Use in automations for searching and copying data from unstructured documents.
For more information, see DocumentOcr Component.