AWS TEXTRACT - OCR TEXT & DATA

AWS Textract - OCR Text & Data

Published July 2020

    •    Updated February 2026

Plugin details

Amazon Textract is an OCR service that automatically extracts text and data from scanned images or PDFs documents.
Amazon Textract goes beyond simple optical character recognition (OCR) to also identify the contents of fields in forms and information stored in tables.

A script is provided to automatically configure your AWS account settings.

This plugin provides AWS Textract - OCR Text & Data services in two request modes: 
- Synchronous request mode both on front-end and back-end, useful for JPEG, PNG, TIFF and single-page PDF file and time-sensitive application.
- Asynchronous request mode, useful for JPEG, PNG, TIFF and large PDF file, time-insensitive application, requiring an AWS S3 Bucket.

⚠ WARNING: This service provides OCR-specialised operations based on a document as input. If you intend to detect text in an image such as a scene, please refer to the "AWS Rekognition - Text Recognition" plugin.

The sets of available actions are:

- EXTRACT TEXT (Front-End and Back-End for Synchronous, and Asynchronous request mode): It returns the lines and words of detected text, the relationships between the lines and words of detected text, the location of the lines and words of text on the document page.

- ANALYZE DOCUMENT (Front-End and Back-End for Synchronous and Asynchronous request mode): Analyze documents and forms for relationships between detected text.
 Amazon Textract analysis operations return 3 categories of text extraction — text, forms, and tables lines, including words, tables, forms, cells values the relationship between detected items, the page that the item was detected on, the location of the item on the document page.

In Asynchronous request mode, to interact with AWS S3 and AWS SQS, it is highly recommended to use this plugin in conjunction of our "AWS S3 & SQS UTILITIES" plugin to provide the PUT, GET, and DELETE a file from AWS S3, and POLL QUEUE from AWS SQS action, that you can find here: https://bubble.io/plugin/aws-s3--sqs-utilities-1615057147611x666191530957733900

Demo Link: https://awstextractdemo.bubbleapps.io/version-test

Editor Link: https://bubble.io/page?type=page&name=index&id=awstextractdemo-editor&tab=tabs-1

💡 𝗦𝘂𝗯𝘀𝗰𝗿𝗶𝗽𝘁𝗶𝗼𝗻𝘀 𝗮𝗿𝗲 𝗽𝗿𝗼𝗿𝗮𝘁𝗲𝗱. 𝗜𝗳 𝘆𝗼𝘂 𝗶𝗻𝘀𝘁𝗮𝗹𝗹 𝗮𝗻𝗱 𝘂𝗻𝘀𝘂𝗯𝘀𝗰𝗿𝗶𝗯𝗲 𝘁𝗵𝗶𝘀 𝗽𝗹𝘂𝗴𝗶𝗻 𝗶𝗻 𝗼𝗻𝗲 𝗱𝗮𝘆 𝘁𝗼 𝘁𝗲𝘀𝘁 𝗶𝘁 𝗼𝘂𝘁, 𝘆𝗼𝘂'𝗹𝗹 𝗼𝗻𝗹𝘆 𝗯𝗲 𝗰𝗵𝗮𝗿𝗴𝗲𝗱 𝟭/𝟯𝟬𝘁𝗵 𝗼𝗳 𝘁𝗵𝗲 𝗺𝗼𝗻𝘁𝗵𝗹𝘆 𝘀𝘂𝗯𝘀𝗰𝗿𝗶𝗽𝘁𝗶𝗼𝗻 𝗳𝗲𝗲.

📖 𝗦𝘁𝗲𝗽-𝗯𝘆-𝗦𝘁𝗲𝗽 𝗶𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻𝘀 𝗮𝗿𝗲 𝘁𝗵𝗲 "𝗜𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻𝘀" 𝘀𝗲𝗰𝘁𝗶𝗼𝗻 𝗮𝗻𝗱 𝗗𝗲𝗺𝗼 𝗘𝗱𝗶𝘁𝗼𝗿 𝗶𝘀 𝗶𝗻 𝘁𝗵𝗲 "𝗟𝗶𝗻𝗸𝘀" 𝘀𝗲𝗰𝘁𝗶𝗼𝗻 𝗼𝗳 𝘁𝗵𝗲 𝗣𝗹𝘂𝗴𝗶𝗻 𝗣𝗮𝗴𝗲.

Contact us at [email protected] for any additional feature you would require or support question.

$7

Per month

No ratings yet

130 installs  

This plugin does not collect or track your personal data.

Platform

Web & Native mobile

Contributor details

wise:able

Joined 2020   •   122 Plugins

View contributor profile

Instructions

Two sets of actions are available in this plugin both in synchronous and asynchronous modes, one to extract text only, and one to analyze a document structure, such as forms, tables, along with extracting text.

0️⃣a : AUTOMATED CONFIGURATION FOR SYNC & ASYNC
=============================================
If you do not have AWS S3 configured yet, the configuration steps can be automatically performed by using this deployment template:
 https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleS3&param_BucketName=BucketNameOfYourChoice&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSS3Plugin.yaml

  You will find the required parameters values used to configure your AWS S3 plugin, for which "AWS S3 DROPZONE & SQS UTILITIES" is suggested, in the "OUTPUT" tab of the created stack. 

 The steps from 0) to 3) b) of START & GET EXTRACT TEXT (ASYNC) & START & GET ANALYZE DOCUMENT (ASYNC) can be automatically performed by using this deployment template:
  https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleTextract&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSTextractAsync.yaml

  You will find the required parameters values used across the plugin in the "OUTPUT" tab of the created stack. 

0️⃣b : AUTOMATED CONFIGURATION FOR SYNC ONLY
=============================================
 The steps from 0) to 1) of EXTRACT TEXT (SYNC) & ANALYZE DOCUMENT (SYNC) can be automatically performed by using this deployment template:
  https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleTextractSyncOnly&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSTextractSyncOnly.yaml

  You will find the required parameters values used across the plugin in the "OUTPUT" tab of the created stack. 

1️⃣ : AWS TEXTRACT - OCR TEXT & DATA (FRONT-END)
===========================================

📋 ELEMENT DESCRIPTION
--------------------------------
AWS TEXTRACT - OCR TEXT & DATA element provides EXTRACT TEXT and ANALYZE DOCUMENT actions to extract text and analyze documents for relationships between detected text. The front-end element is suitable for applications when reactivity is desired, such as but not limited to, mobile applications.

🔧 STEP-BY-STEP SETUP
--------------------------------
ℹ️ The steps from 0) to 1) can be automatically performed by using this deployment template:
https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleTextractSyncOnly&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSTextractSyncOnly.yaml

You will find the required parameters values used across the plugin in the "OUTPUT" tab of the created stack.

0) Sign-up for AWS TEXTRACT by following this link: https://console.aws.amazon.com/textract/home?p=txt&cp=bn&ad=c

1) Create your AWS TEXTRACT ACCESS KEY & ACCESS KEY SECRET, then add to the credentials the AWS TEXTRACT READ-ONLY policy: https://docs.aws.amazon.com/general/latest/gr/aws-sec-cred-types.html#access-keys-and-secret-access-keys

If you intend to use AWS S3 URI (s3://) with your own BUCKET, attach the AWS S3 READ ONLY policy to the considered BUCKET.

2) Register on plugins.wiseable.io. Create a new Credential which associates your BUBBLE APP URL, AWS TEXTRACT ACCESS KEY & ACCESS KEY SECRET.
The registration service will generate your PUBLIC ACCESS KEY. This key serves as a secure proxy for your real API key. It allows your application to communicate with the service without exposing your real API key. Since this PUBLIC ACCESS KEY is explicitly tied to your registered BUBBLE APP URL, it can only be used from that domain, ensuring that even if the key is publicly visible, it remains safe and cannot be misused by unauthorized sources.

3) In the Plugin Settings, enter the following:
   - PUBLIC ACCESS KEY (generated from plugins.wiseable.io)
   - AWS SERVICE ENDPOINT REGION (if not provided, default endpoint is "us-east-1").

4) Add the AWS TEXTRACT - OCR TEXT & DATA (FRONT-END) element to the page. Select the RESULT DATA TYPE as "RESULT (TEXTRACT - OCR TEXT & DATA)".

5) Integrate the logic into your application using the following element's states and actions:

FIELDS:
- RESULT DATA TYPE: Returned type, must always be set to "RESULT (TEXTRACT - OCR TEXT & DATA)".

EVENTS:
- SUCCESS: Event triggered upon success
- ERROR: Event triggered upon error

EXPOSED STATES:
Use any element able to show/process the data of interest (such as a Group with a Text field) stored within the result of the following states:
- RESULTS: Populated upon SUCCESS event. Returns a list of Blocks. For each the text, words, lines, a bounding box of the element, confidence value, the polygon coordinates in which the text is contained, and relationships between the detected items.
- ERROR MESSAGE: Populated upon ERROR event.
- IS PROCESSING: Set to true when processing is in progress, false otherwise.
- REQUESTED ACTION: The latest requested action.

ELEMENT ACTIONS - TRIGGERED IN WORKFLOW:
- EXTRACT TEXT (SYNC) (FRONT-END): Extract text from a document.
  Inputs Fields:
    - IMAGE: Image from the Bubble.io uploader, or a Protocol-relative URLs (//server/file.ext), a HTTPS file URL (https://server/file.ext) or a AWS S3 URI (s3://bucket/image.jpg). For both Protocol-relative and HTTPS URL, the file must be accessible through HTTPS Protocol.

- ANALYZE DOCUMENT (SYNC) (FRONT-END): Analyze document to extract text, forms, and tables.
  Inputs Fields:
    - IMAGE: Image from the Bubble.io uploader, or a Protocol-relative URLs (//server/file.ext), a HTTPS file URL (https://server/file.ext) or a AWS S3 URI (s3://bucket/image.jpg). For both Protocol-relative and HTTPS URL, the file must be accessible through HTTPS Protocol.
    - TABLES ANALYSIS: Set to yes to extract tables and the cells in a table. For example, when the following table is detected on a form, Amazon Textract detects a table with four cells.
    - FORMS ANALYSIS: Set to yes to detect selection elements such as option buttons (radio buttons) and check boxes on a document page. Selection elements can be detected in form data and in tables.

2️⃣ : EXTRACT TEXT (SYNC) (BACK-END)
====================

📋 ACTION DESCRIPTION
--------------------------------
  EXTRACT TEXT from a JPEG, PNG, TIFF image or PDF (single-page) file to return the text (words, lines), positions and relationships between the elements.
  Operates in synchronous request mode, useful for small files and time-sensitive application.

🔧 STEP-BY-STEP SETUP
--------------------------------
ℹ️  The steps from 0) to 1) can be automatically performed by using this deployment template:
  https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleTextractSyncOnly&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSTextractSyncOnly.yaml

  You will find the required parameters values used across the plugin in the "OUTPUT" tab of the created stack.

  0) Sign-up for AWS TEXTRACT: https://console.aws.amazon.com/textract/home?p=txt&cp=bn&ad=c

  1) Create your AWS TEXTRACT ACCESS KEY & ACCESS KEY SECRET, then add to the credentials the AWS TEXTRACT READ-ONLY policy: https://docs.aws.amazon.com/general/latest/gr/aws-sec-cred-types.html#access-keys-and-secret-access-keys

  If you intend to use AWS S3 URI (s3://) with your own BUCKET, attach the AWS S3 READ ONLY policy to the considered BUCKET.

  2) In the Plugin Settings, enter the following:
      - AWS TEXTRACT ACCESS KEY & ACCESS KEY SECRET
      - AWS SERVICE ENDPOINT REGION (if not provided, default endpoint is "us-east-1").

  3) Set up the action "EXTRACT TEXT (SYNC) (BACK-END)" in the workflow.

    Inputs Fields:
      - IMAGE: JPEG, PNG image or PDF (single-page) from the Bubble.io uploader, or a Protocol-relative URLs (//server/file.ext), a HTTPS file URL (https://server/file.ext) or a AWS S3 URI (s3://bucket/image.jpg). For both Protocol-relative and HTTPS URL, the file must be accessible through HTTPS Protocol.
      - RESULT DATA TYPE: Returned type, must always be set to "RESULT (TEXTRACT - OCR TEXT & DATA)".
    Output Fields:
      - RESULTS: Returns a list of Blocks. For each the text, words, lines, confidence value, and relationships between the detected items.

3️⃣ : ANALYZE DOCUMENT (SYNC) (BACK-END)
==========================

📋 ACTION DESCRIPTION
--------------------------------
  ANALYZE DOCUMENT in a JPEG, PNG, TIFF image or PDF (single-page) to return the structure (forms, tables), text and values (words, lines, selection elements), positions and relationships between the elements.
  Operates in synchronous request mode, useful for small files and time-sensitive application.

🔧 STEP-BY-STEP SETUP
------------------------------
ℹ️   The steps from 0) to 1) can be automatically performed by using this deployment template:
  https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleTextractSyncOnly&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSTextractSyncOnly.yaml

  You will find the required parameters values used across the plugin in the "OUTPUT" tab of the created stack.

  0) Sign-up for AWS TEXTRACT: https://console.aws.amazon.com/textract/home?p=txt&cp=bn&ad=c

  1) Create your AWS TEXTRACT ACCESS KEY & ACCESS KEY SECRET, then add to the credentials the AWS TEXTRACT READ-ONLY policy: https://docs.aws.amazon.com/general/latest/gr/aws-sec-cred-types.html#access-keys-and-secret-access-keys

  If you intend to use AWS S3 URI (s3://) with your own BUCKET, attach the AWS S3 READ ONLY policy to the considered BUCKET.

  2) In the Plugin Settings, enter the following:
      - AWS TEXTRACT ACCESS KEY & ACCESS KEY SECRET
      - AWS SERVICE ENDPOINT REGION (if not provided, default endpoint is "us-east-1").

  3) Set up the action "ANALYZE DOCUMENT (SYNC) (BACK-END)" in the workflow.
    Inputs Fields:
      - IMAGE URL: JPEG, PNG image or PDF (single-page) from the Bubble.io uploader, or a Protocol-relative URLs (//server/file.ext), a HTTPS file URL (https://server/file.ext) or a AWS S3 URI (s3://bucket/image.jpg). For both Protocol-relative and HTTPS URL, the file must be accessible through HTTPS Protocol.
      - TABLES ANALYSIS: Set to yes to extract tables and the cells in a table. For example, when the following table is detected on a form, Amazon Textract detects a table with four cells.
      - FORMS ANALYSIS: Set to yes to detect selection elements such as option buttons (radio buttons) and check boxes on a document page. Selection elements can be detected in form data and in tables. For example, when the following table is detected on a form, Amazon Textract detects the check boxes in the table cells.
      - RESULT DATA TYPE: Returned type, must always be set to "RESULT (TEXTRACT - OCR TEXT & DATA)".
    Output Fields:
      - RESULTS: Returns a list of Blocks. For each the text, words, lines, tables, forms & cells values, confidence value and relationships between the detected items.

4️⃣ : START & GET EXTRACT TEXT (ASYNC)
================================

📋 ACTION DESCRIPTION
--------------------------------
  EXTRACT TEXT from a JPEG, PNG, PDF file to return the text (words, lines), positions and relationships between the elements.
  Asynchronous request mode, useful for large files and time-insensitive application.


🔧 STEP-BY-STEP SETUP
--------------------------------
ℹ️  The steps from 0) to 3) b) can be automatically performed by using this deployment template:
  https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleTextract&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSTextractAsync.yaml

  You will find the required parameters values used across the plugin in the "OUTPUT" tab of the created stack.

  0) Sign-up for AWS TEXTRACT: https://console.aws.amazon.com/textract/home?p=txt&cp=bn&ad=c

  1) Configure AWS TEXTRACT FOR ASYNCHRONOUS OPERATION by following ALL the instructions: https://docs.aws.amazon.com/textract/latest/dg/api-async-roles.html

    Write down your:
      - ACCESS KEY & ACCESS KEY SECRET
      - AWS SERVICE ENDPOINT REGION
      - NOTIFICATION ROLE ARN
      - SNS TOPIC ARN
      - QUEUE URL

  2) In the Plugin Settings, enter the following:
      - AWS TEXTRACT ACCESS KEY & ACCESS KEY SECRET
      - AWS SERVICE ENDPOINT REGION (if not provided, default endpoint is "us-east-1").

  3) Set-up in your workflow an action returning the BUCKET and KEY of your file to analyze.
    a) If you do not already have such action, install the plugin "AWS S3 & SQS UTILITIES"
    b) Create a AWS S3 BUCKET that will be used to store the file to analyze: https://docs.aws.amazon.com/AmazonS3/latest/userguide/creating-bucket.html
    c) Set up the "PUT FILE TO S3" action in the workflow.
      Inputs Fields:
        - FILE URL TO STORE: The file URL from the Bubble.io uploader, or a Protocol-relative URLs (//server/file.ext), or a HTTPS file URL (https://server/file.ext). The file must be accessible through the HTTPS protocol.
        - AWS S3 BUCKET NAME: AWS S3 Bucket Name to which the file will be saved.
        - AWS S3 FILE NAME: Path & Name of the file to put to AWS S3. The format must be [path/]filename.ext.
            Example 1: path1/path2/filename.ext.
            Example 2: filename.ext if the file is at the root of the bucket.

  4) Set up the "START EXTRACT TEXT JOB (ASYNC)" action in the workflow.
    Inputs Fields:
    - AWS S3 BUCKET NAME: AWS S3 bucket name from which the input file will be read.
    - AWS S3 FILE NAME: Path & Name of the JPEG, PNG, PDF file to get from AWS S3. The format must be [path/]filename.ext.
        Example 1: path1/path2/filename.ext.
        Example 2: filename.ext if the file is at the root of the bucket.
    - NOTIFICATION ROLE ARN: ARN of an IAM role giving AWS TEXTRACT publishing permissions to the AWS SNS topic.
    - SNS TOPIC ARN: AWS SNS topic ARN to which AWS TEXTRACT posts the completion status.
    Output Fields:
      - JOBID: ID of the Job, to be reused in the "GET JOB STATUS FROM SQS" and "GET EXTRACT TEXT RESULTS (ASYNC)".

  5) Install the plugin "AWS S3 & SQS UTILITIES"
    Set up the action "GET JOB STATUS FROM SQS" in a recurring workflow ('Do every x seconds'), to poll the job completion status on a regular basis.
    Configure this recurring workflow to execute the next step once the job status is SUCCEEDED, using 'Only When' Event Condition, to retrieve the results.
      Inputs Fields:
        - QUEUE URL: URL of AWS SQS you set up at step 1, used to poll for AWS TEXTRACT job status messages.
        - JOBID: ID of the job to poll, returned by "START EXTRACT TEXT JOB (ASYNC)" action.
      Output Fields:
        - JOB STATUS: Valid values are SUCCEEDED, POLLING, IN_PROGRESS, PARTIAL_SUCCESS and FAILED or ERROR, with error or failure messages being appended to the status.

  6) Set up the action "GET EXTRACT TEXT RESULTS (ASYNC)" in the workflow.
    Inputs Fields:
      - JOBID: ID of the job to poll, returned by "START EXTRACT TEXT JOB (ASYNC)" action.
      - MAX RESULTS: Maximum results per paginated calls from AWS. The largest value you can specify is 1000, any greater value will return 1000 results. The default value is 1000. This plugin auto-paginates AWS response based on this parameter.
      - RESULT DATA TYPE: Returned type, must always be set to "RESULT (TEXTRACT - OCR TEXT & DATA)".
    Output Fields:
      - RESULTS: Returns a list of Blocks. For each the text, words, lines, confidence value and relationships between the detected items.

5️⃣ : START & GET ANALYZE DOCUMENT (ASYNC)
========================================

📋 ACTION DESCRIPTION
--------------------------------
  ANALYZE DOCUMENT in a JPEG, PNG, PDF file stored in AWS S3 to return the structure (forms, tables), text and values (words, lines, selection elements), positions and relationships between the elements.
  Asynchronous request mode, useful for large files and time-insensitive application.

🔧 STEP-BY-STEP SETUP
--------------------------------

ℹ️  The steps from 0) to 3) b) can be automatically performed by using this deployment template:
  https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleTextract&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSTextractAsync.yaml

  You will find the required parameters values used across the plugin in the "OUTPUT" tab of the created stack.

  0) Sign-up for AWS TEXTRACT: https://console.aws.amazon.com/textract/home?p=txt&cp=bn&ad=c

  1) Configure AWS TEXTRACT FOR ASYNCHRONOUS OPERATION by following ALL the instructions: https://docs.aws.amazon.com/textract/latest/dg/api-async-roles.html

    Write down your:
      - ACCESS KEY & ACCESS KEY SECRET
      - AWS SERVICE ENDPOINT REGION
      - NOTIFICATION ROLE ARN
      - SNS TOPIC ARN
      - QUEUE URL

  2) In the Plugin Settings, enter the following:
      - AWS TEXTRACT ACCESS KEY & ACCESS KEY SECRET
      - AWS SERVICE ENDPOINT REGION (if not provided, default endpoint is "us-east-1").

  3) Set-up in your workflow an action returning the BUCKET and KEY of your file to analyze.
    a) If you do not already have such action, install the plugin "AWS S3 & SQS UTILITIES"
    b) Create a AWS S3 BUCKET that will be used to store the file to analyze: https://docs.aws.amazon.com/AmazonS3/latest/userguide/creating-bucket.html
    c) Set up the "PUT FILE TO S3" action in the workflow.
      Inputs Fields:
        - FILE URL TO STORE: The file URL from the Bubble.io uploader, or a Protocol-relative URLs (//server/file.ext), or a HTTPS file URL (https://server/file.ext). The file must be accessible through the HTTPS protocol.
        - AWS S3 BUCKET NAME: AWS S3 Bucket Name to which the file will be saved.
        - AWS S3 FILE NAME: Path & Name of the file to put to AWS S3. The format must be [path/]filename.ext.
            Example 1: path1/path2/filename.ext.
            Example 2: filename.ext if the file is at the root of the bucket.

  4) Set up the "START ANALYZE DOCUMENT JOB (ASYNC)" action in the workflow.
    Inputs Fields:
    - AWS S3 BUCKET NAME: AWS S3 bucket name from which the input file will be read.
    - AWS S3 FILE NAME: Path & Name of the JPEG, PNG, PDF file to get from AWS S3. The format must be [path/]filename.ext.
        Example 1: path1/path2/filename.ext.
        Example 2: filename.ext if the file is at the root of the bucket.
    - TABLES ANALYSIS: Set to yes to extract tables and the cells.
    - FORMS ANALYSIS: Set to yes to extract forms data. 
    - QUERIES & RESULT ALIAS: Each Alias - Query pair contains the question you want to ask in the Text. The result of the query will be associated with the Alias you give. Example: InvoiceNo = What is the invoice number? will associate the invoice number to invoiceNo Alias, which is searchable is the response.
    - NOTIFICATION ROLE ARN: The ARN of an IAM role giving AWS TEXTRACT publishing permissions to the Amazon SNS topic.
    - SNS TOPIC ARN: The AWS SNS topic ARN to which AWS TEXTRACT posts the completion status.
    Output Fields:
     - JOBID: ID of the Job, to be reused in the "GET JOB STATUS FROM SQS" and "GET ANALYZE DOCUMENT RESULTS (ASYNC)".

  5) Install the plugin "AWS S3 & SQS UTILITIES"
    Set up the action "GET JOB STATUS FROM SQS" in a recurring workflow ('Do every x seconds') AWS TEXTRACT job status messages with the JOBID.
    Configure this recurring workflow to execute the next step once the job status is SUCCEEDED, using 'Only When' Event Condition, to retrieve the results.
    Inputs Fields:
      - QUEUE URL: URL of AWS SQS you set up at step 1, used to poll for AWS TEXTRACT job status messages.
      - JOBID: ID of the job to poll, returned by "START ANALYZE DOCUMENT JOB (ASYNC)" action.
    Output Fields:
      - JOB STATUS: Valid values are SUCCEEDED, POLLING, IN_PROGRESS, PARTIAL_SUCCESS and FAILED or ERROR, with error or failure messages being appended to the status.

  6) Set up the action "GET ANALYZE DOCUMENT RESULTS (ASYNC)" in the workflow.
    Inputs Fields:
      - JOBID: ID of the job to poll, returned by "START ANALYZE DOCUMENT JOB (ASYNC)" action.
      - MAX RESULTS: Maximum results per paginated calls from AWS. The largest value you can specify is 1000, any greater value will return 1000 results. The default value is 1000. This plugin auto-paginates AWS response based on this parameter.
      - RESULT DATA TYPE: Returned type, must always be set to "RESULT (TEXTRACT - OCR TEXT & DATA)".
    Output Fields:
      - RESULTS: Returns a list of Blocks. For each the text, words, lines, tables, forms & cells values, confidence value, and relationships between the detected items.

🔍IMPLEMENTATION EXAMPLE
======================
Feel free to browse the app editor in the Service URL for an implementation example.

ℹ️ ADDITIONAL INFORMATION
======================
> Lines & Words objects details: https://docs.aws.amazon.com/textract/latest/dg/how-it-works-lines-words.html
> Forms objects details: https://docs.aws.amazon.com/textract/latest/dg/how-it-works-kvp.html
> Tables objects details: https://docs.aws.amazon.com/textract/latest/dg/how-it-works-tables.html
> Selection Elements objects details: https://docs.aws.amazon.com/textract/latest/dg/how-it-works-selectables.html

> AWS TEXTRACT service limits: https://docs.aws.amazon.com/textract/latest/dg/limits.html
> AWS services availability per region: https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/
> AWS Service endpoints list: https://docs.aws.amazon.com/general/latest/gr/rande.html

⚠️TROUBLESHOOTING
================

Any plugin related error will be posted to the the Logs tab, "Server logs" section of your App Editor.
Make sure "Plugin server side logging" and "Plugin client side logging" is selected in "Show Advanced".

For front-end actions, you can also open your browser's developer console (F12 or Ctrl+Shift+I in most browsers) to view detailed error messages and logs.

Always check the ERROR MESSAGE state of the element and implement error handling using the ERROR event to provide a better user experience.

> Server Logs Details: https://manual.bubble.io/core-resources/bubbles-interface/logs-tab#server-logs

⚡PERFORMANCE CONSIDERATIONS
===========================

GENERAL
-------------
For back-end actions, the maximum processing duration is capped at 30 seconds as per Bubble.io design. This time limitation does not apply to front-end actions.

⏱️ BACK-END ACTION START DELAY
-----------------------------------------------
Each time a server-side action is called, Bubble initializes a small virtual machine to execute the action. If the same action is called shortly after, the caching mechanism kicks in, resulting in faster execution on subsequent calls.

A useful workaround is to fire a dummy execution at page load, which pre-warms the Bubble engine for the next few minutes, reducing the impact of cold starts for your users.

FRONT-END VS BACK-END PROCESSING
----------------------------------------------------
The front-end element is designed to support and optimize multiple image formats and will automatically compress images to adhere to AWS requirements. The back-end action doesn't perform this optimization, so be careful with file size and format when using it.

❓QUESTIONS?
===========
Contact us at [email protected] for any additional feature you would require or support question.

Types

This plugin can be found under the following types:

Api   •   Background Services   •   Element   •   Event   •   Action

Resources

Support contact

Documentation

Tutorial

Rating and reviews

Average rating (0.0)

stack outdated

January 14th, 2024

not working - money gone

Thanks for your message. 

Please note that this public section is meant to share reviews on both plugin’s build and support quality.

Regarding the latter, we haven’t been able to find your support request on our [email protected] support email address, which can be found on the plugin’s description, instructions and demo page.

Feel free to reach us so we can solve your issue. (plugin author)

January 14th, 2024

•

wise:able

Product

Bubble for

Discover

How Bubble works

Examples

Pricing

Learn

Resources

Community

Company

Legal

Terms

Privacy

©  2026, Bubble Group, Inc. All rights reserved.

Bubble uses cookies

Bubble uses cookies. By using our service you consent to all cookies in accordance with our Cookie Policy. Read more

Save & Close

I agree

I disagree

Show details

Strictly necessary

Performance

Targeting

Functionality

Unclassified

Cookie declaration

About cookies

Strictly necessary

Performance

Targeting

Functionality

Unclassified

Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.

Cookie report
Name	Provider / Domain	Expiration	Description
usprivacy	.bubble.io	1 year	This cookie stores the user's consent state regarding tracking and privacy in compliance with the United States privacy regulation.
__cf_bm	Cloudflare Inc. .calendly.com	29 minutes 45 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
__cf_bm	Cloudflare Inc. .clutch.co	29 minutes 55 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
li_gc	LinkedIn Corporation .linkedin.com	5 months 4 weeks	Used to store guest consent to the use of cookies for non-essential purposes
__cf_bm	Cloudflare Inc. .transformo.io	29 minutes 57 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
meta_u1main	.bubble.io	Session
__cf_bm	Cloudflare Inc. .lu.ma	29 minutes 44 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
meta_live_u2main.sig	.bubble.io	2 days 23 hours
_GRECAPTCHA	Google LLC www.google.com	5 months 4 weeks	Google reCAPTCHA sets a necessary cookie (_GRECAPTCHA) when executed for the purpose of providing its risk analysis.
__cf_bm	Cloudflare Inc. .producthunt.com	29 minutes 57 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
_GRECAPTCHA	Google LLC www.recaptcha.net	5 months 4 weeks	Google reCAPTCHA sets a necessary cookie (_GRECAPTCHA) when executed for the purpose of providing its risk analysis.
_gd_session	bubble.io	4 hours	This cookie manages user sessions on the website, ensuring that user interactions are recognized across various web requests. This helps in providing a consistent and accurate user experience.
_scid	Snap Inc. .bubble.io	1 year 1 month
opt_out	.postrelease.com	1 year	This cookie is used to track the user's decision to opt out of cookies on the website, indicating they have chosen not to have their data used for tracking and personalisation purposes.
AWSALBCORS	Amazon.com Inc. storm.birdie.so	6 days 23 hours	For continued stickiness support with CORS use cases after the Chromium update, we are creating additional stickiness cookies for each of these duration-based stickiness features named AWSALBCORS (ALB).
VISITOR_PRIVACY_METADATA	YouTube .youtube.com	5 months 4 weeks	This cookie is used to store the user's consent and privacy choices for their interaction with the site. It records data on the visitor's consent regarding various privacy policies and settings, ensuring that their preferences are honored in future sessions.
__cf_bm	Cloudflare Inc. .lumacdn.com	29 minutes 58 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
_tt_enable_cookie	.bubble.io	2 months 4 weeks	This cookie is used to remember the user's preferences regarding the use of cookies on the website.
AWSALBCORS	Amazon.com Inc. app.birdie.so	1 week	For continued stickiness support with CORS use cases after the Chromium update, we are creating additional stickiness cookies for each of these duration-based stickiness features named AWSALBCORS (ALB).
__cf_bm	Cloudflare Inc. .t.co	29 minutes 44 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
__cf_bm	Cloudflare Inc. .twitter.com	29 minutes 44 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
__cf_bm	Cloudflare Inc. .vimeo.com	29 minutes 58 seconds	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.
__session	.gitbook.com	1 year 1 month	This cookie is used to maintain an anonymized user session by the server.
meta_live_u2main	.bubble.io	2 days 23 hours
CookieScriptConsent	CookieScript .bubble.io	1 year 1 month	This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
meta-firebase_workflow	.bubble.io	59 minutes 21 seconds

Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.

Cookie report
Name	Provider / Domain	Expiration	Description
X-AB	Stack Exchange Inc. sc-static.net	1 day
_ga_G4MHXCYE4T	.bubble.io	1 year 1 month	This cookie is used by Google Analytics to persist session state.
m	Stripe m.stripe.com	1 year 1 month	This cookie is generally used for performance and optimization of payment processing services, facilitating caching of content on the browser to make pages load faster.
_ttp	.bubble.io	2 months 4 weeks	This cookie is used to track user interaction and behavior on the website for site performance and usage analysis. This information is used to improve the user experience and optimize the website's functionality.
analytics_session_id	Cakemail .bubble.io	1 year	This cookie tracks user behavior throughout the session on the website, collecting data such as how long a visitor stays on a page and what links they click on. This information is used to improve user experience and website performance.
data-c	Media.net .media.net	4 weeks 2 days	This cookie is used to collect information on user behavior and interaction to enhance the user experience and measure website performance.
_gd_visitor	bubble.io	1 year 1 month	This cookie is used to track visitors' interactions with the website, collecting data on their behavior for analytics purposes. It helps in understanding how users engage with the site, which parts of the site are most visited, and how the navigation flow is structured, aiming to improve the user experience and site performance.
_ttp	.tiktok.com	2 months 4 weeks	This cookie is used to track user interaction and behavior on the website for site performance and usage analysis. This information is used to improve the user experience and optimize the website's functionality.
analytics_session_id.last_access	.bubble.io	1 year	This cookie is used to store the time of the last access by a user in an analytics session, helping in understanding user engagement and the effectiveness of the website content.
_ga_BFPVR2DEE2	.bubble.io	1 year 1 month	This cookie is used by Google Analytics to persist session state.
ahoy_visit	Teachable store.bubble.io	4 hours	This cookie is used to track a visitor's session, helping to understand how users interact with the site to improve user experience and functionality.
_ga_5Q4JP8E2X4	.bubble.io	1 year 1 month	This cookie is used by Google Analytics to persist session state.
_ga_CEPZJCHM3K	.bubble.io	1 year 1 month	This cookie is used by Google Analytics to persist session state.
data-c-ts	Media.net .media.net	4 weeks 2 days	This cookie is used to time-stamp and perform a time sync for users' sessions, ensuring accurate session time tracking.
_ga	Google LLC .bubble.io	1 year 1 month	This cookie name is associated with Google Universal Analytics - which is a significant update to Google's more commonly used analytics service. This cookie is used to distinguish unique users by assigning a randomly generated number as a client identifier. It is included in each page request in a site and used to calculate visitor, session and campaign data for the sites analytics reports. By default it is set to expire after 2 years, although this is customisable by website owners.
_ga_Y168TZXEQ5	.bubble.io	1 year 1 month	This cookie is used by Google Analytics to persist session state.
ajs_anonymous_id	Segment.io Inc. .bubble.io	1 year	These cookies are generally used for Analytics and help count how many people visit a certain site by tracking if you have visited before. This cookie has a lifespan of 1 year.
c	.bidswitch.net	1 year	This cookie is used to identify the frequency of visits and how the visitor accesses the website. It collects data on the user's visits to the website, such as which pages have been read.

Targeting cookies are used to identify visitors between different websites, eg. content partners, banner networks. Those cookies may be used by companies to build a profile of visitor interests or show relevant ads on other websites.

Cookie report
Name	Provider / Domain	Expiration	Description
YSC	Google LLC .youtube.com	Session	This cookie is set by YouTube to track views of embedded videos.
personalization_id	Twitter Inc. .twitter.com	1 year 1 month	This cookie carries out information about how the end user uses the website and any advertising that the end user may have seen before visiting the said website.
CMPRO	Casale Media Inc. .casalemedia.com	2 months 4 weeks	These cookies are linked to advertising and tracking the products users were looking at.
_scid_r	.bubble.io	1 year 1 month	This cookie is used for tracking purposes, helping to identify unique visitors across sessions and track their interactions and engagement on the website.
tv_UICR	.tremorhub.com	4 weeks 2 days	This cookie is used to track user interactions and engagement with the website's content to improve the service and content delivery. It can collect data on user behavior and preferences to facilitate targeted advertising and marketing strategies.
VISITOR_INFO1_LIVE	Google LLC .youtube.com	5 months 4 weeks	This cookie is set by Youtube to keep track of user preferences for Youtube videos embedded in sites;it can also determine whether the website visitor is using the new or old version of the Youtube interface.
tvid	Tremor Video DSP .tremorhub.com	1 year	This cookie is used for tracking user interaction and engagement with the website's content, helping in the improvement and optimization of online services provided. It may also be used for delivering personalized advertising experiences.
uid	.criteo.com	1 year	This cookie provides a uniquely assigned, machine-generated user ID and gathers data about activity on the website. This data may be sent to a 3rd party for analysis and reporting.
_uetsid	Microsoft Corporation .bubble.io	1 day	This cookie is used by Bing to determine what ads should be shown that may be relevant to the end user perusing the site.
lidc	Microsoft Corporation .linkedin.com	1 day	This is a Microsoft MSN 1st party cookie that ensures the proper functioning of this website.
_uetvid	Microsoft Corporation .bubble.io	1 year 3 weeks	This is a cookie utilised by Microsoft Bing Ads and is a tracking cookie. It allows us to engage with a user that has previously visited our website.
addshoppers	shop.pe	1 year 1 month	This cookie is used to track user interaction and sharing behavior on social media platforms, enabling personalized marketing and social media sharing capabilities.
_gcl_au	Google LLC .bubble.io	3 months	Used by Google AdSense for experimenting with advertisement efficiency across websites using their services
addshoppers.com	bubble.io	1 year 1 month	This cookie is associated with the AddShoppers social sharing platform, a technology that integrates with websites to enable tracking and sharing capabilities across social networks. It supports social media integration and can gather data regarding sharing and social interactions on the site to help understand user influence and to enhance marketing strategies.
CMID	Casale Media Inc. .casalemedia.com	1 year	These cookies are linked to advertising and tracking the products users were looking at.
bcookie	Microsoft Corporation .linkedin.com	1 year	This is a Microsoft MSN 1st party cookie for sharing the content of the website via social media.
tuuid_lu	.bidswitch.net	1 year	Contains a unique visitor ID, which allows Bidswitch.com to track the visitor across multiple websites. This allows Bidswitch to optimize advertisement relevance and ensure that the visitor does not see the same ads multiple times.
tuuid	.bidswitch.net	1 year	This cookie is mainly set by bidswitch.net to make advertising messages more relevant to the website visitor.
_fbp	Meta Platform Inc. .bubble.io	2 months 4 weeks	Used by Facebook to deliver a series of advertisement products such as real time bidding from third party advertisers
ab	.agkn.com	1 year	This cookie is generally provided by agkn.com and is used for advertising purposes.
demdex	Adobe Inc. .demdex.net	5 months 4 weeks	This cookie helps Adobe Audience Manger perform basic functions such as visitor identification, ID synchronization, segmentation, modeling, reporting, etc.
sc_at	Snap Inc. .snapchat.com	1 year
MUID	Microsoft Corporation .bing.com	1 year	This cookie is widely used my Microsoft as a unique user identifier. It can be set by embedded microsoft scripts. Widely believed to sync across many different Microsoft domains, allowing user tracking.
dpm	Adobe Inc. .dpm.demdex.net	5 months 4 weeks	Adobe Audience Manager - data management platform uses this cookie to record information around synchronisation of IDs.
CMPS	Casale Media Inc. .casalemedia.com	2 months 4 weeks	These cookies are linked to advertising and tracking the products users were looking at.

Functionality cookies are used to remember visitor information on the website, eg. language, timezone, enhanced content.

Cookie report
Name	Provider / Domain	Expiration	Description
ajs_anonymous_id	Segment.io Inc. .loom.com	1 year 1 month	These cookies are generally used for Analytics and help count how many people visit a certain site by tracking if you have visited before. This cookie has a lifespan of 1 year.
loom_anon_comment	Loom .loom.com	1 year 1 month	This cookie is used to differentiate anonymous users when they leave comments on holaspirit.com, enabling the identity of the commenter to remain unknown for privacy.
_cfuvid	.vimeo.com	Session	This cookie is used for purposes of tracking users across sessions to optimize user experience by maintaining session consistency and providing personalized services.
__stripe_mid	Stripe Inc. .bubble.io	1 year	This cookie is set by Stripe to distinguish users and enable secure payment processing during interactions with the website.
_cfuvid	.calendly.com	Session	This cookie is used for purposes of tracking users across sessions to optimize user experience by maintaining session consistency and providing personalized services.
ajs_anonymous_id	Segment.io Inc. demo.arcade.software	1 year	These cookies are generally used for Analytics and help count how many people visit a certain site by tracking if you have visited before. This cookie has a lifespan of 1 year.
__stripe_sid	Stripe Inc. .bubble.io	29 minutes 58 seconds	This cookie is set by Stripe to manage and process payments securely, allowing temporary storage of session related information during a users visit to the website.
loom_referral_video	Loom, Inc. .www.loom.com	Session	This cookie is used to track referrals and video plays across the website, enabling the website to attribute video views to the correct referral sources.
MSPTC	Microsoft .bat.bing.com	1 year	This cookie is used to track user engagement and interaction with the website to enhance customer experience and website functionality. It may collect information about how users navigate and use the site, helping to identify preferences and improve service delivery.
visitor-id	Media.net .media.net	1 year	This cookie is used to identify unique visitors across the website to provide a consistent and personalized experience.

Unclassified cookies are cookies that do not belong to any other category or are in the process of categorization.

Cookie report
Name	Provider / Domain	Expiration
__Secure-ROLLOUT_TOKEN	.youtube.com	5 months 4 weeks
ttcsid	.bubble.io	2 months 4 weeks
intercom-device-id-cz703g22	.bubble.io	8 months 4 weeks
intercom-session-cz703g22	.bubble.io	1 week
kwsu	ciqtracking.com	1 year 1 month
CrossDomainCookieScriptConsent_239	.crossdomain.cookie-script.com	1 year 1 month
__BROWSER__	store.bubble.io	Session
ahoy_visitor	store.bubble.io	1 year 1 month
bubble-certifications_live_u2main	.certification.bubble.io	3 days
_ScCbts	.bubble.io	1 week
_brilliant_session	.bubble.io	Session
bubble-certifications_live_u2main.sig	.certification.bubble.io	3 days
ttcsid_CRGSJIJC77UAMBH9S77G	.bubble.io	2 months 4 weeks
bubble-certifications_u1main	.certification.bubble.io	Session
_iub_cs-57859130-uspr	.bubble.io	1 year
intercom-id-cz703g22	.bubble.io	8 months 4 weeks
muc_ads	Twitter .t.co	1 year 1 month
upgrade_pv_1487181547537x364191731148390400	bubble.io	1 week

Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.

You can change your consent to cookie usage at any time on our Privacy Policy page.

Your consent will also apply to the following websites:

bubble.io
flusk.eu