Procedures and Tools

DEPRECATED: All of this work is done. The text is now under revision.

On Linux:

  1. Extract the chapter or page range:
$ pdftk infile.pdf cat 12-15 output outfile.pdf
  2. Convert to TXT, or directly to MD:
$ pdftotext outfile.pdf outfile.txt

    A good Python alternative that gives better results:

$ pdf2txt.py outfile.pdf > outfile.txt

    To convert the PDF directly to MD, use PDF to Markdown Converter (source on GitHub). Very good, recommended.

  3. In both cases, fix broken paragraphs with paragrapher.

  4. Append the text to the right chapter:

$ echo "" >> chapter.md
$ cat outfile.txt >> chapter.md
  5. Edit with Ghostwriter or another editor.
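
The paragraph-fixing step has no command shown above, so here is a minimal sketch of what that kind of cleanup might look like. It is not paragrapher itself. The function name `join_paragraphs` is hypothetical, and the sketch assumes that blank lines mark paragraph breaks and that a hyphen at the end of a line is a word split by the PDF layout.

```shell
#!/bin/sh
# Hypothetical stand-in for the paragraph-fixing step (NOT the paragrapher tool).
# Joins hard-wrapped lines into one line per paragraph.
# Assumption: blank lines separate paragraphs in the extracted text.
join_paragraphs() {
    awk '
        # Blank or whitespace-only line: flush the paragraph collected so far.
        /^[ \t\r]*$/ {
            if (para != "") { print para; print ""; para = "" }
            next
        }
        {
            # Strip trailing whitespace left by the PDF extractor.
            sub(/[ \t\r]+$/, "")
            if (para == "") {
                para = $0
            } else if (para ~ /-$/) {
                # Heuristic: a trailing hyphen is treated as a split word,
                # so it is dropped. This also merges real hyphenated compounds.
                para = substr(para, 1, length(para) - 1) $0
            } else {
                para = para " " $0
            }
        }
        # Flush the last paragraph if the input did not end with a blank line.
        END { if (para != "") print para }
    ' "$@"
}
```

For example, `join_paragraphs outfile.txt > fixed.txt` would write the rejoined text to `fixed.txt` (a hypothetical file name), which could then be appended to `chapter.md` in place of `outfile.txt`. Because the hyphen rule is only a heuristic, the result still needs a read-through in the editor.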