GOOGLE VISION - OCR DOCUMENT

Google Vision - OCR Document

Published March 2021

    •    Updated February 2026

Plugin details

Improve your document understanding and save time by extracting text within a PDF, TIFF or GIF file that is provided as input, using Google Vision's powerful AI.

The following element is provided:
- GOOGLE VISION - OCR DOCUMENT (FRONT-END DESKTOP & NATIVE MOBILE)

The following action is provided:
- PERFORM OCR ON FILE (FRONT-END DESKTOP & NATIVE MOBILE)
- PERFORM OCR ON FILE (BACK-END)

⚠️ WARNING: This service provides OCR-specialised operations based on a document as input. If you intend to detect text in an image such as a scene, please refer to the "Google Vision - Detect Text" plugin.

A script is provided to automatically configure your Google Cloud settings.
 
The plugin returns a list of pages along with their full text annotations, dimensions, and confidence score.

Demo Link: https://gcpvisiondemotextocr.bubbleapps.io/version-test

Editor Link: https://app.bubble.io/page?name=index&id=gcpvisiondemotextocr-editor&tab=tabs-1

💡 𝗦𝘂𝗯𝘀𝗰𝗿𝗶𝗽𝘁𝗶𝗼𝗻𝘀 𝗮𝗿𝗲 𝗽𝗿𝗼𝗿𝗮𝘁𝗲𝗱. 𝗜𝗳 𝘆𝗼𝘂 𝗶𝗻𝘀𝘁𝗮𝗹𝗹 𝗮𝗻𝗱 𝘂𝗻𝘀𝘂𝗯𝘀𝗰𝗿𝗶𝗯𝗲 𝘁𝗵𝗶𝘀 𝗽𝗹𝘂𝗴𝗶𝗻 𝗶𝗻 𝗼𝗻𝗲 𝗱𝗮𝘆 𝘁𝗼 𝘁𝗲𝘀𝘁 𝗶𝘁 𝗼𝘂𝘁, 𝘆𝗼𝘂'𝗹𝗹 𝗼𝗻𝗹𝘆 𝗯𝗲 𝗰𝗵𝗮𝗿𝗴𝗲𝗱 𝟭/𝟯𝟬𝘁𝗵 𝗼𝗳 𝘁𝗵𝗲 𝗺𝗼𝗻𝘁𝗵𝗹𝘆 𝘀𝘂𝗯𝘀𝗰𝗿𝗶𝗽𝘁𝗶𝗼𝗻 𝗳𝗲𝗲.

📖 𝗦𝘁𝗲𝗽-𝗯𝘆-𝗦𝘁𝗲𝗽 𝗶𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻𝘀 𝗮𝗿𝗲 𝘁𝗵𝗲 "𝗜𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻𝘀" 𝘀𝗲𝗰𝘁𝗶𝗼𝗻 𝗮𝗻𝗱 𝗗𝗲𝗺𝗼 𝗘𝗱𝗶𝘁𝗼𝗿 𝗶𝘀 𝗶𝗻 𝘁𝗵𝗲 "𝗟𝗶𝗻𝗸𝘀" 𝘀𝗲𝗰𝘁𝗶𝗼𝗻 𝗼𝗳 𝘁𝗵𝗲 𝗣𝗹𝘂𝗴𝗶𝗻 𝗣𝗮𝗴𝗲.

Contact us at [email protected] for any additional feature you would require or support question.

$99

One time • Or $5/mo

4.0 stars   •   4 ratings

133 installs  

This plugin does not collect or track your personal data.

Platform

Web & Native mobile

Contributor details

wise:able

Joined 2020   •   122 Plugins

View contributor profile

Instructions

1️⃣: GOOGLE VISION - OCR DOCUMENT (FRONT-END DESKTOP & NATIVE MOBILE)
==================================================================

📋 ELEMENT DESCRIPTION
--------------------------------
GOOGLE VISION - OCR DOCUMENT (FRONT-END DESKTOP & NATIVE MOBILE) provides PERFORM OCR ON FILE action to extract text within a PDF, TIFF or GIF file provided as input, using Google Vision's powerful AI. The front-end element is suitable for applications when reactivity is desired, such as but not limited to, mobile applications. It supports multiple image formats and automatically optimizes documents to meet Google Cloud requirements.

🔧 STEP-BY-STEP SETUP
--------------------------------
ℹ️ The steps from 0) to 1) can be automatically performed by logging in into your Google Cloud Console, opening the Cloud Shell (top right corner of your page) and copy pasting this command and press enter:

  wget -q https://storage.googleapis.com/bubblegcpdemo/demo-assets/wiseable-gcp-vision.py && python3 wiseable-gcp-vision.py

Otherwise, follow these manual steps:

0) Set-up a project from Google Cloud Console: https://cloud.google.com/vision/docs/setup
  - Create or select a project
  - Enable the CLOUD VISION API for that project
  - Create a service account
  - Download a private key as JSON.

1) Open the private key JSON file with a text editor, copy/paste the following parameters from your file to the Plugin settings:
  - CLIENT_EMAIL
  - PROJECT_ID
  - PRIVATE_KEY, including the -----BEGIN PRIVATE KEY-----\\n prefix and \\n-----END PRIVATE KEY-----\\n suffix.

2) Register on plugins.wiseable.io. Create a new Credential which associates your BUBBLE APP URL, GOOGLE CLOUD credentials.
The registration service will generate your PUBLIC ACCESS KEY. This key serves as a secure proxy for your real API key. It allows your application to communicate with the service without exposing your real API key. Since this PUBLIC ACCESS KEY is explicitly tied to your registered BUBBLE APP URL, it can only be used from that domain, ensuring that even if the key is publicly visible, it remains safe and cannot be misused by unauthorized sources.

3) Enter in the PLUGIN SETTINGS your PUBLIC ACCESS KEY (used for the front-end element only).

4) Add the GOOGLE VISION - OCR DOCUMENT (FRONT-END DESKTOP & NATIVE MOBILE) element to the page on which the OCR feature must be integrated. Select the RESULT DATA TYPE as Returned type, must always be set to "RESULT (VISION - OCR DOCUMENT)".

5) Integrate the logic into your application using the following GOOGLE VISION - OCR DOCUMENT (FRONT-END DESKTOP & NATIVE MOBILE) element's states and actions:

FIELDS: 
- RESULT DATA TYPE: Returned type, must always be set to "RESULT (VISION - OCR DOCUMENT)".  

EVENTS:
- SUCCESS: Event triggered upon success
- ERROR: Event triggered upon error

EXPOSED STATES:
Use any element able to show/process the data of interest (such as a Group with a Text field) stored within the result of the following states of the GOOGLE VISION - OCR DOCUMENT (FRONT-END DESKTOP & NATIVE MOBILE) element:
- RESULTS: Populated upon SUCCESS event. Returns a list of pages along with their full text annotations, dimensions, and confidence score.
- ERROR MESSAGE: Populated upon ERROR event.
- IS PROCESSING: Set to true when processing is in progress, false otherwise.

ELEMENT ACTIONS - TRIGGERED IN WORKFLOW:
- PERFORM OCR ON FILE (FRONT-END DESKTOP & NATIVE MOBILE): Extract text from a document. Populate RESULTS state upon completion.

   Inputs Fields:
      - FILE: PDF, TIFF or GIF file from the Bubble.io picture uploader, a Protocol-relative URLs (//server/file.ext), or a HTTPS file URL (https://server/file.ext) or a Google Storage URL (gs://bucket/file.ext).

2️⃣: PERFORM OCR ON FILE (BACK-END)
==================================

📋 ACTION DESCRIPTION
--------------------------------
PERFORM OCR ON FILE extracts text within a PDF, TIFF or GIF file provided as input, using Google Vision's powerful AI.
The back-end action is suitable for applications when volume or asynchronous processing is desired.

🔧 STEP-BY-STEP SETUP
--------------------------------
ℹ️ If not already done, perform steps 0 and 1 of the first element setup. The back-end action does not require the PUBLIC ACCESS KEY.

1) Set up the action "PERFORM OCR ON FILE (BACK-END)" action in the workflow.

   Input Fields:
      - FILE: PDF, TIFF or GIF file from the Bubble.io picture uploader, a Protocol-relative URLs (//server/file.ext), or a HTTPS file URL (https://server/file.ext) or a Google Storage URL (gs://bucket/file.ext).
                  You can request online feature detection and annotation of 5 frames (GIF; "image/gif") or pages (PDF; "application/pdf", or TIFF; "image/tiff") of your choosing for each file.
      - MIME TYPE: Required for Google Storage URL (gs://bucket/file.ext) only, otherwise ignored. Valid values are: 'application/pdf', 'image/gif' or 'image/tiff'.
      - RESULT DATA TYPE: Returned type, must always be set to "RESULT (VISION - OCR DOCUMENT)".

   Output Fields:
      - RESULT: Returns a list of pages along with their full text annotations, dimensions, and confidence score.

🔍 IMPLEMENTATION EXAMPLE
======================
Feel free to browse the app editor in the Service URL for an implementation example.

ℹ️ ADDITIONAL INFORMATION
======================
> Supported file formats: PDF, TIFF or GIF for back-end actions. The front-end element supports additional formats and will automatically convert them to compatible formats.

> Supported languages: https://cloud.google.com/vision/docs/languages#supported-langs

> GOOGLE VISION service limits: https://cloud.google.com/vision/quotas

⚠️ TROUBLESHOOTING
================
Any plugin related error will be posted to the the Logs tab, "Server logs" section of your App Editor.
Make sure that "Plugin server side output" and "Plugin server side output" is selected in "Show Advanced".

For front-end actions, you can also open your browser's developer console (F12 or Ctrl+Shift+I in most browsers) to view detailed error messages and logs.

Always check the ERROR MESSAGE state of the element and implement error handling using the ERROR event to provide a better user experience.

> Server Logs Details: https://manual.bubble.io/core-resources/bubbles-interface/logs-tab#server-logs

⚡ PERFORMANCE CONSIDERATIONS
===========================

⏱️ BACK-END ACTION START DELAY
-----------------------------------------------
Each time a server-side action is called, Bubble initializes a small virtual machine to execute the action. If the same action is called shortly after, the caching mechanism kicks in, resulting in faster execution on subsequent calls.

A useful workaround is to fire a dummy execution at page load, which pre-warms the Bubble engine for the next few minutes, reducing the impact of cold starts for your users.

⏳ PROCESSING TIME LIMITS
-----------------------------------------------
For back-end actions, the maximum processing duration is capped at 30 seconds as per Bubble.io design. This time limitation does not apply to front-end actions.

FRONT-END VS BACK-END PROCESSING
----------------------------------------------------
The front-end element is designed to support and optimize multiple document formats and will automatically compress files to adhere to Google Cloud requirements. The back-end action doesn't perform this optimization, so be careful with file size and format when using it.

❓ QUESTIONS?
===========
Contact us at [email protected] for any additional feature you would require or support question.

Types

This plugin can be found under the following types:

Api   •   Background Services   •   Element   •   Event   •   Action

Resources

Support contact

Documentation

Tutorial

Rating and reviews

Average rating (4.0)

OCR Doc Plugin Review

July 9th, 2024

This plugin performs great! I haven't had a single hiccup in the 2 months I've been using it and it was super easy to set up.

pdf : working, gif not working

June 19th, 2024

when i upload pdf file, it work properly. but when uploading gif, error message pop up. Erroer said that : Invalid Argument, The only Meme types supported is blah blah.~~

The file you have tried is not in GIF format, even though it might bear this extension. Animated picture may be WEBP with renamed extension or printed as a GIF after compression in a different format.

Run your file through https://www.site24x7.com/fr/tools/mime-type-checker.html and you will notice it is not "image/gif" as required Mime-Type. To gain confidence this plugin works with a GIF, try with this one: https://commons.wikimedia.org/wiki/File:Animhorse.gif, although no text is present on the GIF anyway, it goes through.

On a side note, I would like to mention that the reviews section of this page is not a support channel and aims at sharing public reviews of the plugin quality and its support performance, which unfortunately we did not have a chance to address through our [email protected] channel prior to this review. (plugin author)

June 19th, 2024

•

wise:able

Does the Google Vision - OCR Document require me to buy the api access of google vision?

May 7th, 2023

I spent a large sum of money on the plugin thinking that it is a permanent solution to my ocr needs but now, i think it requires me to buy the api from google. Does the Google Vision - OCR Document require me to buy the api access of google vision?

Thanks for your review! 

The usage of this plugin is subject to the pricing of Google Vision APIs, as the Google Vision APIs credentials are linked to your account.

Google Vision APIs pricing is usage-based and depends on the volume, as indicated here: https://cloud.google.com/vision/pricing

All that being said, we would be grateful to use this review section to give a feedback on the quality of the plugin and its support. 
For any other queries, we are always happy to support you at [email protected] (plugin author)

May 7th, 2023

•

wise:able

Easy to Use (But Use The Auto Setup)

March 29th, 2023

At first when I was reading through the setup process, it seemed a little out of my league. But, I tried the auto setup shell code and it was really easy from there. It pretty much worked out of the box once I did this. I did have a small question and the dev was quick to answer to get my plugin 100% up and running. Thanks!

Thanks a lot! We are always happy to propel your projects forward 🚀! (plugin author)

May 7th, 2023

•

wise:able

Product

Bubble for

Discover

Learn

Resources

Community

Company

Legal

Google Vision - OCR Document

Plugin details

$99

One time • Or $5/mo

Other actions

Platform

Contributor details

Instructions

Types

Categories

Resources

Rating and reviews

Average rating (4.0)