What is PDFxStream?
PDFxStream is a software component developed by Snowtide Informatics Systems, Inc. that extracts text, tables, images, and form data from PDF documents. According to the vendor, PDFxStream is suitable for organizations of all sizes and for users such as software development teams, business analysts, data analysts, legal professionals, and government agencies.
Key Features
PDFxStream Base: According to the vendor, this component provides complete PDF format compatibility and basic data extraction capabilities. It supports all versions of the PDF document specification, including decryption of encrypted PDF documents with various ciphers. It can also extract PDF annotations, embedded files, bookmarks, and document metadata, as well as raw character data and image metadata. Additionally, it supports merging of PDF files.
PDFTextStream: The vendor claims that this component allows users to extract Unicode text from PDF documents, including Chinese, Japanese, and Korean (CJK) text in both horizontal and vertical writing modes. It offers an OutputHandler API for customizing how extracted text is formatted and supports extracting text from specific regions of a page, which suits fixed-format forms. It also provides complete support for embedded and standard fonts and character encodings, automated layout processing, and extraction of rotated text and of text from "searchable image" PDFs. Furthermore, it supports extracting PDF tables, including export to CSV for use in Excel.
PDFImageStream: According to the vendor, this component offers comprehensive PDF image extraction capabilities. It can decompress and decode various types of PDF images, render images to on-screen graphics contexts, and save them to disk in formats such as JPEG, TIFF, GIF, PNG, and BMP. It also supports automatic stitching of image tiles and strips.
PDFFormStream: The vendor states that this component enables users to extract data from interactive (AcroForm) and XFA forms and to fill AcroForm fields. It supports extraction of AcroForm data from all types of fields, including text, dropdowns, radio buttons, checkboxes, push buttons, and signatures. It also supports extraction of XFA form data, and it can write updated PDF documents after AcroForm fields have been filled.
PDFxStream Complete: According to the vendor, this package includes all the components of PDFxStream, simplifying project management and minimizing development costs with a single dependency and API. Users only pay for the components they use.
Interoperability with Java and .NET languages: According to the vendor, PDFxStream can be used from Java and .NET languages, so developers can integrate it into their existing software systems.
Complete PDF format compatibility and access to basic PDF data: The vendor states that PDFxStream is fully compatible with the PDF format and provides access to basic PDF data such as annotations and bookmarks.
Wide range of supported PDF file format features: According to the vendor, the supported PDF file format features include decryption, repair, annotations, bookmarks, and more.
