MARKETPLACE
PLUGINS
PDF.JS TEXT EXTRACTION
PDF.js Text Extraction logo

PDF.js Text Extraction

Published April 2021
   •    Updated this week

Plugin details

Thin wrapper for server side execution of the Node.js pdfjs-dist package maintained by Mozilla, exposing actions to extract text from a PDF.

Free

For everyone

0.7 stars   •   3 ratings
502 installs  
This plugin does not collect or track your personal data.

Other actions

Contributor details

 logo
Joined 2022   •   9 Plugins
View contributor profile

Instructions

The plugin provides the following Server Side Actions:
"Extract All Text" extract text from all the pages of the base 64 encoded PDF. Returns a list of strings containing the text on each page.

"PDF Info" retrieve information and metadata from the base 64 encoded PDF. Returns the page count and strings containing the serialized JSON of the information and metadata.

"Extract Page Text" extract the text from a specific page of the base 64 encoded PDF. Returns a string containing the extracted text.

Types

This plugin can be found under the following types:

Categories

This plugin can be found under the following categories:
Web Scraping   •   Technical   •   Productivity   •   Data (things)

Resources

Support contact
Documentation
Tutorial

Rating and reviews

Average rating (0.7)
Doesn't work
September 19th, 2024
at BaseExceptionClosure (/var/task/node_modules/pdfjs-dist/legacy/build/pdf.js:543:29) at Array. (/var/task/node_modules/pdfjs-dist/legacy/build/pdf.js:546:2) at __w_pdfjs_require__ (/var/task/node_modules/pdfjs-dist/legacy/build/pdf.js:24153:41) at /var/task/node_modules/pdfjs-dist/legacy/build/pdf.js:24393:13 at /var/task/node_modules/pdfjs-dist/legacy/build/pdf.js:24444:3 at /var/task/node_modules/pdfjs-dist/legacy/build/pdf.js:24447:12 at webpackUniversalModuleDefinition (/var/task/node_modules/pdfjs-dist/legacy/build/pdf.js:25:20) at Object. (/var/task/node_modules/pdfjs-dist/legacy/build/pdf.js:32:3) at Module._compile (node:internal/modules/cjs/loader:1364:14) at Module._extensions..js (node:internal/modules/cjs/loader:1422:10)
Não serve.
April 15th, 2024
Poderia colocar um descrição de como funciona.
Plugin is not working
January 9th, 2024
Extract all text is not working nor extract text from page.
Bubble