Is it possible to check a PDF for data corruption?
7,271
Browsing through PDF Reference sixth edition (2006), it appears that PDF files do not have an overall checksum, though embedded files within the PDF (similar to attachments in an email message) may optionally have an MD5 hash.
You should therefore archive your PDFs in a container which supports error detection / correction. For example, a zip file, or optical media (CD-R etc).
Related videos on Youtube
Author by
Francesco Turco
Updated on September 17, 2022Comments
-
Francesco Turco over 1 year
I have some PDF documents and I'd like to check them for possibile data corruption, even if I'm able to display them without problems. I don't really know if PDF documents store an embedded checksum string for this kind of purposes. My operating system of choice is GNU/Linux. Thanks.
-
Hugh Allen about 14 yearsIf they display OK why do you suspect they're corrupt?
-
Francesco Turco about 14 yearsI don't think they are corrupted. I just have to archive them and preserve them from future corruption. So I should choose between computing a MD5/SHA1/SHA2 checksum myself or relying on an embedded checksum.
-
Apache almost 14 yearsJust use a free tool which gives you a checksum and provide it with the pdf (in a zip package for example).
-
-
Algific about 14 yearsFile compression(zip,rar) is also known to use CRC same as used on optical medias.