Skip to content

Latest commit

 

History

History
9 lines (8 loc) · 789 Bytes

pdf.md

File metadata and controls

9 lines (8 loc) · 789 Bytes

pypdf & pdfminer.six

pypdf

pdfminer.six

  • https://github.com/pdfminer/pdfminer.six
  • Pdfminer.six is a community maintained fork of the original PDFMiner. It is a tool for extracting information from PDF documents. It focuses on getting and analyzing text data. Pdfminer.six extracts the text from a page directly from the sourcecode of the PDF. It can also be used to get the exact location, font or color of the text.