Skip to main content

Pega enVision

Unravel Multi Modal AI analytics of Google Gemini Pro Vision using Pega Gen AI features

By using or submitting listings via the Pega Marketplace, you agree to our Terms of Use.


Enhance your Pega application with image / video analysis capabilities of Google Gemini Pro Vision. Unlock a new level of automation by seamlessly integrating Gemini Pro Vision's object detection, image segmentation, and facial recognition features into your Pega processes. 1. Customer Experience Personalization: Analyse customer interactions, such as facial expressions and emotions, to provide personalized and empathetic experiences. 2. Fraud Detection and Prevention: Leverage object detection and image segmentation to identify suspicious patterns and anomalies in financial transactions and other sensitive processes. 3. Healthcare Process Optimization: Enhance medical diagnosis and treatment planning by automating image analysis tasks, such as medical image segmentation and disease detection. 4. Intelligent Document Processing: Automate the extraction of data from complex documents, such as invoices, contracts, and medical records, with precision.

Key Features

  • Use Pega “Connect-GenerativeAI” OOTB rule to invoke Google Gemini Pro Vision and unlock the world of Multi Modal AI.
  • Configure application settings in App Studio to leverage the power of Google Gemini to generate case type stages and steps, data models as well as auto fill attributes
  • Ability to send requests to Google AI both in text format and a combination of text, image, and videos.


=Maantic Inc.

Partner Name



Offering Type

Share this page Share via x Share via LinkedIn Copying...

Did you find this content helpful?

Want to help us improve this content?

We'd prefer it if you saw us at our best.

Pega Community has detected you are using a browser which may prevent you from experiencing the site as intended. To improve your experience, please update your browser.

Close Deprecation Notice
Contact us