A Java class library for parsing PDF documents including PDF to text extraction, merge PDF documents, PDF document decryption. [Open source, BSD license]
The library allows to create and edit PDF documents and includes native Unicode, TrueType, Type 1 font embedding support. Documentation and examples are provided. [Commercial w/ Evaluation]
A Java library that specializes in extracting text and metadata out of PDF documents. Supports extraction from encrypted PDF files, and integrates with Jakarta Lucene to enable indexing of PDF document content.
A Java library designed to quickly add a PDF generation functionality to end products. A key feature of this library is that it uses popular HTML 3.2 as an input format. For Web application developmers PD4ML is available as a JSP taglib. [Commercial]