Is it possible to check a PDF for data corruption?

7,271

Browsing through PDF Reference sixth edition (2006), it appears that PDF files do not have an overall checksum, though embedded files within the PDF (similar to attachments in an email message) may optionally have an MD5 hash.

You should therefore archive your PDFs in a container which supports error detection / correction. For example, a zip file, or optical media (CD-R etc).

Share:
7,271

Related videos on Youtube

Francesco Turco
Author by

Francesco Turco

Updated on September 17, 2022

Comments

  • Francesco Turco
    Francesco Turco over 1 year

    I have some PDF documents and I'd like to check them for possibile data corruption, even if I'm able to display them without problems. I don't really know if PDF documents store an embedded checksum string for this kind of purposes. My operating system of choice is GNU/Linux. Thanks.

    • Hugh Allen
      Hugh Allen about 14 years
      If they display OK why do you suspect they're corrupt?
    • Francesco Turco
      Francesco Turco about 14 years
      I don't think they are corrupted. I just have to archive them and preserve them from future corruption. So I should choose between computing a MD5/SHA1/SHA2 checksum myself or relying on an embedded checksum.
    • Apache
      Apache almost 14 years
      Just use a free tool which gives you a checksum and provide it with the pdf (in a zip package for example).
  • Algific
    Algific about 14 years
    File compression(zip,rar) is also known to use CRC same as used on optical medias.