100's of workflows

PDF Parsing with Multimodal Vision AI

Integrations
Edit Image
HTTP Request
If
Google Drive
Sticky Note
Manual Trigger
Basic LLM Chain
Structured Output Parser
Google Gemini Chat Model

This rantir workflow showcases the use of Multimodal LLMs for parsing and extracting information from PDF documents within rantir.

In this scenario, we’re reviewing a candidate’s CV/resume through an AI that filters out unqualified applications. However, the candidate has included a hidden prompt intended to bypass our filter! By leveraging AI Vision, we can effectively address this issue. Read on to learn how!

How it works

       
  • The candidate’s CV/resume, in PDF format, is downloaded from Google Drive for demonstration purposes.
  •    
  • The PDF is converted to a PNG image using Stirling PDF. Since the hidden prompt is in white font, it remains invisible in the converted image.
  •    
  • The image is processed by a Basic LLM node using a multimodal model, such as Google’s Gemini 1.5 Pro.
  •    
  • Within the Basic LLM node, a "User Message" with Binary type is configured, allowing us to send the image file directly in the request.
  •    
  • The LLM now successfully ignores the hidden prompt, yielding the expected response.

Requirements

       
  • Google Gemini API Key. Alternatively, GPT-4 can also handle this use case.
  •    
  • Stirling PDF or another service for converting PDFs to images. For privacy, consider self-hosting Stirling PDF to avoid using public APIs.

Customizing the workflow

       
  • Replace the manual trigger with a webhook or other trigger to integrate into your existing services.
  •    
  • This example validates qualifications; you can expand it to extract data points like years of experience, previous employers, etc.

Other Workflows like this one

Your connected stack awaits to automate AI workflows with 24-7 uptime performance and engagement

OpenAI Assistant workflow for chatting with a file or creating an Assistant.
Google Drive
OpenAI
Sticky Note
Manual Trigger
Chat Trigger
AI Agent to Analyze Stocks Workflow
Webhook
Google Drive
Respond to Webhook
Sticky Note
Manual Trigger
Prepare CSV files with GPT-4
OpenAI
Edit Fields (Set)
Loop Over Items (Split in Batches)
Spreadsheet File
Item Lists
Automate your Hubspot Chat using OpenAi and Airtable
Airtable
HTTP Request
Code
OpenAI
If
Multi-Agent PDF-to-Blog Content Generation
Merge
Ghost
Sticky Note
Code
AI Agent
Query Rantir Credentials with AI SQL Agent
Code
AI Agent
OpenAI Chat Model
Window Buffer Memory (easiest)
Code Tool

Compare features across plans

Computir Cloud Suite All Access

$99/m

Per team/per month, with 10 GB of data and storage
Everything in Free, and:
Icon
Host up to around 4-5 Applications
Icon
Advanced user roles
Icon
Unlimited AI applications & workflows
Icon
Custom onboarding & Customer management
Icon
Advanced integrations
Icon
International capabilities
Unlimited Team Plan & Custom Integration

$299/m

Per $1K Tokens or 1 TB added, custom integration (per month)
Everything in Professional, and:
Icon
Host up to around 20+ Applications
Icon
Tailored implementation services
Icon
Advanced ERP integration capabilities
Icon
Extra bandwidth and open-source AI models
Icon
Fine-tuning & data logic
Icon
SOX or integration customization
Icon
Dedicated premium support
Cloud Suite

$99/mo

Team Plan

$299

Computir Cloud

AI Application & Automation platform suite
Get access to generate dashboards, websites or content
Chat to Explore Data
Icon

Custom Develop  integrations

Chat to Transform Data
Icon
Direct or Enterprise application connections
Webflow, Wix or Wordpress
+ Acumatica, Microsoft, Netsuite & Sage
+ Oracle & Workday
Rules to automate AI
Basic
Advanced
Advanced

Custom Integrations

Build & Share Live Reports
Icon
Generated
Human-Led
Train Classification Models
Icon
Human-Led
Train Time Series Forecasts
Icon

"I highly recommend Computir, they are a great dev team with quick turn around on all projects and requests. We recently worked with them on updating our website and any changes, updates or modifications I needed were always taken care of quickly!"

Paige J, VP of Marketing, Heavy AI