Technical documentation

Here you can easily access all of the documentation, tutorials, and reference material available for PDFTextStream.

PDFTextStream API Reference

PDFTextStream's API reference can be found here. While this comes in "javadoc" form, PDFTextStream's API is identical whether you choose to use it on .NET (via C#, VB.Net, or F#) or on the JVM (via Java, Clojure, Scala, Groovy, JRuby, Jython, or any other JVM language).

PDFTextStream Developer's Guide

Learn how to get started with and then get the most out of PDFTextStream with a series of "long-form" introductory and detailed walkthroughs of all aspects of PDFTextStream's capabilities.

  1. Introduction
  2. Setting Up PDFTextStream
  3. Extracting text from PDF documents
  4. Controlling the formatting of extracted text
  5. Restricting PDF text extraction to only specific coordinates
  6. Unicode text and character sets
  7. Accessing PDF bookmarks
  8. Accessing PDF document metadata
  9. Accessing PDF annotations
  10. Extracting and updating PDF form data
  11. Reading encrypted PDF files
  12. Indexing PDF documents with Lucene and PDFTextStream
  13. Error handling
  14. Logging
  15. Using PDFTextStream from the command line
  16. PDFTextStream for .NET
  17. Using PDFTextStream in Multiple-CPU, Multithreaded Environments
  18. PDFTextStream Configuration Options
  19. Appendix: Selective PDF Text Extraction Based on Bookmark Coordinates
  20. Appendix: The Art of Reading PDF Text

Need technical, evaluation, or sales support?

Have you looked at PDFTextStream's technical documentation yet? There's a wealth of information there that may help you address your requirements and point you in the right direction if you're having problems.

Above all, please do not hesitate to contact us; we will do whatever we can to help, whatever the topic.

You are currently looking at the help for PDFTextStream v2.6.0 (the latest). Other options available include: