jPDFText for Linux

From Software Infocard Wiki
Revision as of 12:48, 1 August 2021 by Padrepos (talk | contribs)
Infocard

Target Platform: Linux
Version: 2021R1
Release Date: January 29, 2021
License: Shareware
Price: USD 800
Publisher: Qoppa Software, LLC
Product Web Site: [External Link]
Short Description: Extract text from PDF documents
File Size: 76.51 MB

Description by the Publisher

jPDFText is a Java library for extracting text from PDF documents. With jPDFText, PDF documents can be processed to extract their textual content for archiving, storage, searching or indexing. jPDFText is built on top of Qoppa's proprietary PDF technology, so you do not have to install any third-party software or drivers. Because it is written in Java, it allows your application to remain platform independent and run on Windows, Linux, Unix (Solaris, HP-UX, IBM AIX), Mac OS X and any other platform that supports the Java runtime environment.

Main Features

Load PDF documents from files, network drives, URLs or input streams
Extract text in the logical reading order
Extract words as a vector of Strings
Works on Windows, Linux, Unix and Mac OS X (100% Java)
No need to install or configure additional drivers or software when deploying
Tested on JDK 1.4.2 and above
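
As a rough illustration of the indexing use case mentioned in the description, the sketch below counts word occurrences in a list of words held as a vector of Strings. It does not call jPDFText: the word list is hand-built sample data standing in for words extracted from a PDF, and the class and method names (WordIndexSketch, countWords) are hypothetical. It also assumes Java 8 or later, which is newer than the JDK 1.4.2 baseline listed above.

```java
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;
import java.util.Vector;

// Hypothetical helper, not part of jPDFText.
class WordIndexSketch {

    // Counts how often each word occurs, ignoring case.
    // Keys are lower-cased words, kept in alphabetical order by the TreeMap.
    static Map<String, Integer> countWords(Vector<String> words) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String word : words) {
            String key = word.toLowerCase(Locale.ROOT);
            counts.merge(key, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Sample data standing in for words extracted from a PDF document.
        // In a real application this list would come from the library.
        Vector<String> words = new Vector<>();
        words.add("Invoice");
        words.add("total");
        words.add("invoice");
        words.add("date");

        // Print each distinct word with its occurrence count.
        for (Map.Entry<String, Integer> entry : countWords(words).entrySet()) {
            System.out.println(entry.getKey() + ": " + entry.getValue());
        }
    }
}
```

A sorted map is used here only so that the printed index is easy to scan; a HashMap would work equally well if ordering does not matter.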

If you require any additional information, don't hesitate to contact us at [email protected].

jPDFText extracts text content that already exists in PDF documents. If you need to recognize text in scanned PDF documents or in PDF documents containing images, you may be interested in our Java OCR feature.

Limitations in the Downloadable Version

Watermark

Product Identity

Unique Product ID: PID-1100D09D3C1F

Unique Publisher ID: BID-AB00354BDF5C

[jPDFText for Linux PAD XML File]

Category