PDFTextStream | v1.1 | Chas Emerick | Free Evaluation / $1900.00 | Java | Java / File Manipulation
PDF v1.5 Support
PDFTextStream supports all versions of the PDF document specification, including v1.5 (used by Adobe Acrobat 6). We are aggressively keeping pace with Adobe's changes to the PDF file format, so you don't have to.
Unicode
Supports Unicode within Latin character sets (support for 2-byte languages [Arabic, Chinese, Japanese, Korean, etc.] may be included in future releases) and provides optional ASCII approximation of Unicode characters, which is ideal for search and indexing systems that cannot or do not handle Unicode character sets.
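To illustrate what an ASCII approximation of Unicode text can look like, here is a minimal sketch built only on the JDK's java.text.Normalizer. It is not PDFTextStream's own algorithm, which this listing does not describe; the class and method names (AsciiApproximation, toAscii) are hypothetical.

```java
import java.text.Normalizer;

class AsciiApproximation {
    // Decomposes each character (NFKD) and keeps only the ASCII parts,
    // so an accented letter is reduced to its base letter.
    // Assumption: characters with no ASCII decomposition are simply dropped.
    static String toAscii(String text) {
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFKD);
        StringBuilder out = new StringBuilder(decomposed.length());
        for (int i = 0; i < decomposed.length(); i++) {
            char c = decomposed.charAt(i);
            if (c < 128) {
                out.append(c); // ASCII survives; combining marks and others do not
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(toAscii("caf\u00e9"));  // e with acute accent
        System.out.println(toAscii("\ufb01le"));   // "fi" ligature followed by "le"
    }
}
```

A production-quality approximation would also need a substitution table for letters that do not decompose (for example the German sharp s), which this sketch silently drops.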
Familiar API
PDFTextStream subclasses the java.io.Reader class, which gives it a simple, familiar interface and lets it plug into existing components that expect a java.io.Reader instance.
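The practical benefit of a Reader subclass is that code written against java.io.Reader needs no changes to consume PDF text. The sketch below shows such a component; it uses a StringReader as a stand-in, because this listing does not show how a PDFTextStream instance is constructed. The class and method names (ReaderWordCount, countWords) are hypothetical.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

class ReaderWordCount {
    // Counts whitespace-separated words from any Reader, then closes it.
    // The method knows nothing about where the characters come from.
    static int countWords(Reader source) {
        int words = 0;
        try (BufferedReader in = new BufferedReader(source)) {
            String line;
            while ((line = in.readLine()) != null) {
                String trimmed = line.trim();
                if (!trimmed.isEmpty()) {
                    words += trimmed.split("\\s+").length;
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return words;
    }

    public static void main(String[] args) {
        // Stand-in source; any Reader subclass could be passed instead.
        Reader source = new StringReader("Text extracted\nfrom a PDF document");
        System.out.println(countWords(source)); // prints 6
    }
}
```

Because countWords accepts the abstract Reader type, swapping the StringReader for another Reader subclass would not require touching the method itself.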
Accesses PDF Document Metadata
PDF files can contain many different kinds of metadata, including who created them, when, what the source document(s) were, and what application was used to generate them. PDFTextStream automatically loads all metadata found in a PDF file and provides a simple, java.util.Map-like API for retrieving these properties, which can prove very useful when indexing and searching PDF documents.
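As a rough sketch of how Map-style metadata might feed an index, the example below turns key/value properties into named search fields. A plain java.util.Map stands in for the library's metadata object, whose actual API is not shown in this listing. The keys Author and Producer are standard PDF document information entries; the class name, the method name, the meta_ prefix, and the sample values are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

class MetadataFields {
    // Converts metadata entries into index field names and values,
    // skipping entries whose value is missing or blank.
    static Map<String, String> toIndexFields(Map<String, String> metadata) {
        Map<String, String> fields = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : metadata.entrySet()) {
            String value = entry.getValue();
            if (value == null || value.trim().isEmpty()) {
                continue;
            }
            String name = "meta_" + entry.getKey().toLowerCase(Locale.ROOT);
            fields.put(name, value.trim());
        }
        return fields;
    }

    public static void main(String[] args) {
        // Sample values only; a real document supplies its own.
        Map<String, String> metadata = new LinkedHashMap<>();
        metadata.put("Author", "Jane Doe");
        metadata.put("Producer", "  ");
        System.out.println(toIndexFields(metadata)); // prints {meta_author=Jane Doe}
    }
}
```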
Nothing Else is Faster
PDFTextStream is the fastest library available for Java for reading text out of PDF documents. Since it was built from the ground up to read text out of PDF files, PDFTextStream delivers measurable performance advantages compared to other Java PDF libraries.
Jakarta Lucene Integration
PDFTextStream provides a convenient component allowing for easy integration with Jakarta Lucene, the leading open source text indexing and searching library for Java.
Full Encryption Support
PDFTextStream seamlessly decrypts documents encrypted with 40- or 128-bit encryption technologies. Decryption support extends to PDF document text and document metadata, providing a full view of the contents of encrypted PDF files.