• Add a set of PDF readers: per-page (PagePdfDocumentReader) and per-paragraph (ParagraphPdfDocumentReader).
  • Use a PDFLayoutTextStripper fork and PDFLayoutTextStripperByArea extension to preserve the structure of the extracted document.
  • Provide PdfDocumentReaderConfig and PageExtractedTextFormatter as standalone classes.
  • Create a new top-level document-readers module with the pdf-reader module under it.
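
To illustrate the difference between per-page and per-paragraph granularity, here is a minimal sketch. It is not the code in this PR: it works on already-extracted plain text rather than on a PDF, and it assumes pages are separated by a form feed and paragraphs by a blank line. The class and method names (ChunkingSketch, perPage, perParagraph) are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; the readers in this PR operate on the PDF itself.
public class ChunkingSketch {

    // One chunk per page. Assumption: pages are separated by a form feed character.
    static List<String> perPage(String text) {
        List<String> chunks = new ArrayList<>();
        for (String page : text.split("\f")) {
            String trimmed = page.strip();
            if (!trimmed.isEmpty()) {
                chunks.add(trimmed);
            }
        }
        return chunks;
    }

    // One chunk per paragraph. Assumption: paragraphs are separated by a blank line.
    static List<String> perParagraph(String text) {
        List<String> chunks = new ArrayList<>();
        for (String page : perPage(text)) {
            // A line break, optional whitespace, then another line break.
            for (String paragraph : page.split("\\R\\s*\\R")) {
                String trimmed = paragraph.strip();
                if (!trimmed.isEmpty()) {
                    chunks.add(trimmed);
                }
            }
        }
        return chunks;
    }

    public static void main(String[] args) {
        String text = "Title\n\nFirst paragraph.\fSecond page, one paragraph.\n\nAnother paragraph.";
        System.out.println("pages: " + perPage(text).size());
        System.out.println("paragraphs: " + perParagraph(text).size());
    }
}
```

Per-page chunks keep everything on a page together, while per-paragraph chunks are smaller and follow the document's own breaks.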

Part of #12

Comment From: markpollack

merged in 1266c04