AWS Transcribe - Audio & Video

Published October 2021
   •    Updated this month

Plugin details

Convert speech to text in an audio or video file. Using Automatic Speech Recognition (ASR) technology, this plugin supports a variety of business applications, including transcription of voice-based customer service calls, generation of subtitles for audio/video content, and text-based content analysis of audio/video content with multiple speakers.
This plugin returns the transcript, along with diarization information, of the audio or video file in FLAC, MP3, MP4, Ogg, WebM, AMR, or WAV format stored in AWS S3.

A speech recorder visual element is also provided to record speech from the user's device.

The supported languages are listed here: https://docs.aws.amazon.com/transcribe/latest/dg/supported-languages.html#table-language-matrix

To interact with AWS S3 and AWS SQS, it is highly recommended to use this plugin in conjunction with our "AWS S3 & SQS Utilities" plugin, which provides the Put, Get, and Delete file actions for AWS S3 and the Poll Queue action for AWS SQS. You can find it here: https://bubble.io/plugin/aws-s3--sqs-utilities-1615057147611x666191530957733900

The demo application link: https://awstranscribeaudiovideodemo.bubbleapps.io/version-test

💡 𝗦𝘂𝗯𝘀𝗰𝗿𝗶𝗽𝘁𝗶𝗼𝗻𝘀 𝗮𝗿𝗲 𝗽𝗿𝗼𝗿𝗮𝘁𝗲𝗱. 𝗜𝗳 𝘆𝗼𝘂 𝗶𝗻𝘀𝘁𝗮𝗹𝗹 𝗮𝗻𝗱 𝘂𝗻𝘀𝘂𝗯𝘀𝗰𝗿𝗶𝗯𝗲 𝘁𝗵𝗶𝘀 𝗽𝗹𝘂𝗴𝗶𝗻 𝗶𝗻 𝗼𝗻𝗲 𝗱𝗮𝘆 𝘁𝗼 𝘁𝗲𝘀𝘁 𝗶𝘁 𝗼𝘂𝘁, 𝘆𝗼𝘂'𝗹𝗹 𝗼𝗻𝗹𝘆 𝗯𝗲 𝗰𝗵𝗮𝗿𝗴𝗲𝗱 𝟭/𝟯𝟬𝘁𝗵 𝗼𝗳 𝘁𝗵𝗲 𝗺𝗼𝗻𝘁𝗵𝗹𝘆 𝘀𝘂𝗯𝘀𝗰𝗿𝗶𝗽𝘁𝗶𝗼𝗻 𝗳𝗲𝗲.

📖 𝗦𝘁𝗲𝗽-𝗯𝘆-𝗦𝘁𝗲𝗽 𝗶𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻𝘀 𝗮𝗿𝗲 𝗶𝗻 𝘁𝗵𝗲 "𝗜𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻𝘀" 𝘀𝗲𝗰𝘁𝗶𝗼𝗻 𝗮𝗻𝗱 𝘁𝗵𝗲 𝗗𝗲𝗺𝗼 𝗘𝗱𝗶𝘁𝗼𝗿 𝗶𝘀 𝗶𝗻 𝘁𝗵𝗲 "𝗟𝗶𝗻𝗸𝘀" 𝘀𝗲𝗰𝘁𝗶𝗼𝗻 𝗼𝗳 𝘁𝗵𝗲 𝗣𝗹𝘂𝗴𝗶𝗻 𝗣𝗮𝗴𝗲.

Contact us at [email protected] with any feature request or support question.

$99

One time  •  Or  $5/mo

3.0 stars   •   1 rating
27 installs  
This plugin does not collect or track your personal data.

Platform

Web & Native mobile

Contributor details

wise:able
Joined 2020   •   122 Plugins

Instructions

0: SPEECH RECORDER ELEMENT ==========================

ELEMENT DESCRIPTION
--------------------------------
 SPEECH RECORDER is a visual element that records voice in WAV, OGG, MP3, WEBM or PCM format on all desktop devices and browsers (on iOS, browser policy restrictions limit it to the Safari browser). After recording, the element stores the file in the app's storage and returns the file URL.

STEP-BY-STEP SETUP
--------------------------------
 1) Drag and drop the visual element SPEECH RECORDER into your app.

 2) Select the SPEECH RECORDER element and, in the APPEARANCE section, configure the following fields:
 
 FIELDS :
 - ENABLE AUTO-BINDING PARENT ELEMENT'S THING : If selected, SPEECH RECORDER updates the parent element's thing (evaluating to a FILE) once the recording is ready.
 - MAX FILE SIZE : Limits the file size of the recording (Megabytes).
 - FILE UPLOAD ENABLED : Must be set to yes.
 - CHANNELS : Select the number of channels to record.
 - FORMAT : Output format of the recording. Valid values are WAV | OGG | PCM | WEBM | MP3.
 - BACKGROUND WHEN OFF : Recorder background color when recording is off.
 - BACKGROUND WHEN ON : Recorder background color when recording is on.
 - RECORDER WHEN OFF : Recorder color when recording is off.
 - RECORDER WHEN ON : Recorder color when recording is on.

 3) Integrate the logic into your application using the following SPEECH RECORDER events, states and actions:

 EVENTS :
 - RECORD CAPTURED : Triggered when the recording has been captured.
 - RECORD ENCOUNTERED ERROR : Triggered when the recording encounters an error. The "ERROR MESSAGE" is then exposed as an element state.
 
 EXPOSED STATES:
 Use any element able to show/process the data of interest (such as a Group with a Text field) stored within the result of the following states of the SPEECH RECORDER element :
 - DURATION : Duration of the recording.
 - RECORDING : Returns yes while recording.
 - FILE SIZE : Size of the recording in bytes.
 - SAVING : Returns yes while recording is being saved to the app's storage.
 - PAUSED : Returns yes while paused.
 - RECORDING FILE : URL of the recording file, saved to the app's storage.
 - ERROR MESSAGE : Contains the error message upon the "RECORD ENCOUNTERED ERROR" event.

 ELEMENT ACTIONS - TRIGGERED IN WORKFLOW:
   - START - STOP
   - PAUSE - RESUME
   - CANCEL RECORDING
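
 The relationship between the FORMAT and MAX FILE SIZE fields and the FILE SIZE state can be illustrated with a short sketch. It is written in Python for readability only; in Bubble, this kind of check would live in workflow conditions. The function name is hypothetical, and the assumption that one megabyte equals 1,000,000 bytes is not taken from the plugin.

```python
# Hypothetical helper mirroring the FORMAT and MAX FILE SIZE fields described above.
VALID_FORMATS = {"WAV", "OGG", "PCM", "WEBM", "MP3"}  # values listed for the FORMAT field
BYTES_PER_MEGABYTE = 1_000_000  # assumption: the plugin may use 1,048,576 instead


def recording_is_acceptable(fmt: str, file_size_bytes: int, max_size_mb: float) -> bool:
    """Return True if the format is valid and the size is within the limit.

    file_size_bytes corresponds to the FILE SIZE state (bytes);
    max_size_mb corresponds to the MAX FILE SIZE field (megabytes).
    """
    if fmt.upper() not in VALID_FORMATS:
        raise ValueError(f"Unsupported format: {fmt}")
    return file_size_bytes <= max_size_mb * BYTES_PER_MEGABYTE
```

 For example, a 2,500,000-byte WAV recording passes a 5 MB limit, while a 6,000,000-byte recording does not.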

 4) Then, implement one of the following actions to trigger the speech transcription.

1 : START & GET TRANSCRIBE (ASYNC)
================================

ACTION DESCRIPTION
--------------------------------
 TRANSCRIBE JOB starts a transcription job from a FLAC, MP3, MP4, Ogg, WebM, AMR, or WAV file stored in AWS S3. The language is automatically detected by AWS Transcribe.
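
 Before starting a job, it can help to confirm that the file extension matches one of the formats listed above. The following is a minimal sketch in Python, assuming the format can be inferred from the extension of the S3 file name; the function name is hypothetical and not part of the plugin.

```python
from pathlib import PurePosixPath

# File formats listed above for the transcription job.
SUPPORTED_EXTENSIONS = {".flac", ".mp3", ".mp4", ".ogg", ".webm", ".amr", ".wav"}


def has_supported_extension(s3_file_name: str) -> bool:
    """Check the extension of an S3 file name such as 'path1/path2/filename.ext'."""
    return PurePosixPath(s3_file_name).suffix.lower() in SUPPORTED_EXTENSIONS
```

 The check is case-insensitive, so "calls/meeting.MP3" is accepted while "notes.txt" is rejected.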

STEP-BY-STEP SETUP
--------------------------------
The steps from 0) to 1) can be automatically performed by using this deployment template:
 https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=BubbleTranscribe&templateURL=https://bubble-resources.s3.amazonaws.com/deployment-assets/CloudFormation-AWSTranscribeAsync.yaml

 0) Sign-up for AWS TRANSCRIBE : https://console.aws.amazon.com/transcribe/home?p=rkn&cp=bn&ad=c

 1) Create your AWS TRANSCRIBE API KEY & SECRET and attach the AWS TRANSCRIBE FULL ACCESS and S3 READ ONLY policies: https://docs.aws.amazon.com/general/latest/gr/aws-sec-cred-types.html#access-keys-and-secret-access-keys

   Write down your:
     - KEY & KEY SECRET
     - AWS SERVICE ENDPOINT REGION

 2) In the Plugin Settings, enter the following :
     - AWS TRANSCRIBE API KEY & SECRET
     - AWS SERVICE ENDPOINT REGION (if not provided, default endpoint is "us-east-1").
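
 The region fallback described above can be sketched as follows. This is an illustration in Python, not the plugin's code; the function names are hypothetical, and the host name pattern assumes a standard AWS region (it does not cover partitions such as AWS China).

```python
DEFAULT_REGION = "us-east-1"  # default stated for the plugin settings


def resolve_region(region=None) -> str:
    """Fall back to the default region when the setting is left empty."""
    region = (region or "").strip()
    return region or DEFAULT_REGION


def transcribe_endpoint(region=None) -> str:
    """Build the regional AWS Transcribe endpoint URL (standard regions only)."""
    return f"https://transcribe.{resolve_region(region)}.amazonaws.com"
```

 Leaving the region empty resolves to the "us-east-1" endpoint; any other value is used as entered, so it should match a region where AWS Transcribe is available.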

 3) Set up an action in your workflow that returns the BUCKET and KEY of the file to analyze.
   a) If you do not already have such an action, install the plugin "AWS S3 & SQS UTILITIES"
   b) Create an AWS S3 BUCKET that will be used to store the file to analyze: https://docs.aws.amazon.com/AmazonS3/latest/userguide/creating-bucket.html
   c) Set up the "PUT FILE TO S3" action in the workflow.
     Input Fields:
       - FILE URL TO STORE : The file URL from the Bubble.io uploader, a protocol-relative URL (//server/file.ext), or an HTTPS file URL (https://server/file.ext). The file must be accessible through the HTTPS protocol.
       - AWS S3 BUCKET NAME : Name of the bucket to which the file will be saved.
       - AWS S3 FILE NAME : Path & File Name to save to. The format must be [path/]filename.ext.
           Example 1: path1/path2/filename.ext.
           Example 2: filename.ext if the file is at the root of the bucket.
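
 The two input conventions above (accepted URL forms and the [path/]filename.ext format) can be illustrated with a small Python sketch. Both function names are hypothetical, and treating a protocol-relative URL as HTTPS is an assumption based on the requirement that the file be accessible through HTTPS.

```python
def normalize_file_url(url: str) -> str:
    """Turn a protocol-relative URL into an HTTPS URL; reject other schemes."""
    url = url.strip()
    if url.startswith("//"):
        return "https:" + url  # assumption: protocol-relative URLs resolve to HTTPS
    if url.startswith("https://"):
        return url
    raise ValueError("The file must be accessible through the HTTPS protocol")


def build_s3_file_name(filename: str, path: str = "") -> str:
    """Join an optional path and a file name into the [path/]filename.ext format."""
    path = path.strip("/")
    return f"{path}/{filename}" if path else filename
```

 With these helpers, "//server/file.ext" becomes "https://server/file.ext", and a file stored at the root of the bucket keeps its bare file name.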

 4) Set up the "START TRANSCRIBE JOB" action in the workflow.
   Input Fields:
   - AWS S3 BUCKET NAME : AWS S3 bucket name from which the input file will be read.
   - AWS S3 FILE NAME : AWS S3 file name of the input file. Enter the file from the Bubble.io uploader, a protocol-relative URL (//server/filename.ext), or an HTTPS URL (https://server/filename.ext). The supported file formats are FLAC, MP3, MP4, Ogg, WebM, AMR, and WAV.
       Example 1 : path1/path2/filename.ext.
       Example 2 : filename.ext if the file is at the root of the bucket.
   - SHOW SPEAKER LABELS (Speaker Diarization) : Determines whether the transcription job uses speaker recognition to identify different speakers in the input audio.
   - MAX SPEAKERS : The maximum number of speakers to identify in the input audio. If there are more speakers in the audio than this number, multiple speakers are identified as a single speaker. Minimum value of 2. Maximum value of 10.
   Output Fields:
     - JOBID : ID of the job, to be reused in the "GET TRANSCRIBE RESULTS" action.
     - RESULTS: Returns the transcript in text format upon COMPLETED TRANSCRIPTION JOB STATUS, along with the detected language code, diarization attributes containing transcript timings and speaker identification.
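
 For readers who want to see how these inputs might map onto the underlying AWS request, here is a hedged Python sketch. How the plugin calls AWS internally is not documented here; the sketch only builds a dictionary using the parameter names of the AWS Transcribe StartTranscriptionJob API. The function name and the job name argument are hypothetical, and the 2 to 10 speaker range is the one stated above.

```python
def build_start_job_request(job_name, bucket, file_name,
                            show_speaker_labels=False, max_speakers=2):
    """Build StartTranscriptionJob parameters from the action's input fields."""
    request = {
        "TranscriptionJobName": job_name,
        # The input file is referenced by its S3 URI: s3://bucket/[path/]filename.ext
        "Media": {"MediaFileUri": f"s3://{bucket}/{file_name}"},
        # The language is detected automatically rather than set explicitly.
        "IdentifyLanguage": True,
    }
    if show_speaker_labels:
        # Range taken from the MAX SPEAKERS field description above.
        if not 2 <= max_speakers <= 10:
            raise ValueError("MAX SPEAKERS must be between 2 and 10")
        request["Settings"] = {
            "ShowSpeakerLabels": True,
            "MaxSpeakerLabels": max_speakers,
        }
    return request
```

 Outside Bubble, a dictionary like this could be passed to an AWS SDK call such as boto3's start_transcription_job(**request). That usage is an assumption for illustration and is not something the plugin requires.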

IMPLEMENTATION EXAMPLE
======================
 Feel free to browse the app editor in the Service URL for an implementation example.
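
 As a complement, the sketch below shows one way to turn diarization data into speaker turns. It assumes the raw AWS Transcribe transcript JSON layout (results.items and results.speaker_labels.segments); the exact shape of the plugin's RESULTS output is not documented here and may differ. The function name is hypothetical.

```python
def speaker_turns(transcript: dict) -> list:
    """Group transcribed words into (speaker_label, text) turns."""
    results = transcript["results"]

    # Map each word's start time to the speaker label of its segment.
    speaker_at = {}
    for segment in results.get("speaker_labels", {}).get("segments", []):
        for item in segment["items"]:
            speaker_at[item["start_time"]] = segment["speaker_label"]

    turns = []
    for item in results["items"]:
        content = item["alternatives"][0]["content"]
        if item["type"] == "punctuation":
            # Punctuation has no timing; attach it to the current turn.
            if turns:
                turns[-1] = (turns[-1][0], turns[-1][1] + content)
            continue
        speaker = speaker_at.get(item["start_time"], "unknown")
        if turns and turns[-1][0] == speaker:
            turns[-1] = (speaker, turns[-1][1] + " " + content)
        else:
            turns.append((speaker, content))
    return turns
```

 Each turn pairs a label such as "spk_0" with the words spoken until the speaker changes, which is a convenient shape for displaying a conversation in a repeating group.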

ADDITIONAL INFORMATION
======================

> Supported languages: https://docs.aws.amazon.com/transcribe/latest/dg/supported-languages.html#table-language-matrix
> AWS TRANSCRIBE service limits : https://docs.aws.amazon.com/transcribe/latest/dg/limits-guidelines.html#limits
> AWS services availability per region : https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/
> AWS Service endpoints list : https://docs.aws.amazon.com/general/latest/gr/rande.html

TROUBLESHOOTING
================
 Any plugin-related error will be posted to the Logs tab, "Server logs" section of your App Editor.
 Make sure that "Plugin server side output" is selected in "Show Advanced".

 > Server Logs Details: https://manual.bubble.io/core-resources/bubbles-interface/logs-tab#server-logs

PERFORMANCE CONSIDERATIONS
===========================

GENERAL
-------------
 The largest usable file size depends on the capacity units your app can use on Bubble.io (see Capacity Boost or Professional Plan). The upper limit is capped by Bubble.io's Workflow Action maximum execution time, within which the response parsing operation must complete.


QUESTIONS ?
===========
 Contact us at [email protected] with any feature request or support question.

Types

This plugin can be found under the following types:
Api   •   Background Services   •   Element   •   Event   •   Action

Categories

This plugin can be found under the following categories:
Media   •   Productivity   •   Video   •   AI   •   Input Forms

Resources

Support contact
Documentation
Tutorial

Rating and reviews

Average rating (3.0)

Very complicated
December 26th, 2022
Dear Sir, I purchased your plugin today for use in a specific web application, based on what I read in its description. However, I simply could not implement it without purchasing another plugin, which is also paid. For that reason I gave up on the purchase and requested a cancellation through my card. I count on your understanding.