Encryption and Decryption of PDFs examples in documentation remove attachments, links, ... #2544

redfast00 · 2024-03-26T10:36:14Z

I want to decrypt a PDF document while keeping the document intact. The documentation gives an example for this (https://pypdf.readthedocs.io/en/stable/user/encryption-decryption.html), but it decrypts the document and then reconstructs it by copying it page by page into a new document.

This drops attachments, but also the table of contents, the links in the document pointing to other pages, ...

I would like an approach where the encrypted data is decrypted 'in-place', keeping the document structure intact.

Environment

Which environment were you using when you encountered the problem?

$ python -m platform
Linux-6.5.0-26-generic-x86_64-with-glibc2.35

$ python -c "import pypdf;print(pypdf._debug_versions)"
pypdf==4.1.0, crypt_provider=('cryptography', '42.0.5'), PIL=none
(modified by me to add support for the pubsec decryptor, but this doesn't affect anything)

Code + PDF

This is a minimal, complete example that shows the issue; it does not use encryption, but it has the same problem as the sample that does use encryption.

from pypdf import PdfReader, PdfWriter

reader = PdfReader("PN7160_PN7161.pdf")
writer = PdfWriter()

for idx, page in enumerate(reader.pages):
    writer.add_page(page)

print("Saving to file...")
# Save the new PDF to a file
with open("out.pdf", "wb") as f:
    writer.write(f)

PN7160_PN7161.pdf

Answered by j-t-1

j-t-1 · 2024-03-26T11:12:13Z

Create a backup of the PDF, then replace this:

writer = PdfWriter()

for idx, page in enumerate(reader.pages):
    writer.add_page(page)

with this:

writer = PdfWriter(clone_from=reader)

redfast00 · 2024-03-26T11:32:28Z

@j-t-1 that works! Should I make a PR to replace that in the documentation?

j-t-1 · 2024-03-26T12:06:22Z

Could do. Currently it is not obvious that adding each page is insufficient to produce an equivalent document; I had a similar problem in #2485.
