63.4 PDF Comparisons

20190906 The diffpdf tool provides a graphical summary of the differences between two PDF documents.

$ diffpdf doc01.pdf doc02.pdf

This will not find text differences between image-based PDF files, such as those produced by a document scan. Such a PDF contains an image of the text rather than the text itself, so the text of two such documents cannot be compared as is. See Section 63.12 to recognise the text within an image-based PDF and to mark up the original PDF with that text, which then allows two image-based PDF documents to be compared.
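Before running a comparison it can help to check whether each document has any extractable text at all. The following is a minimal sketch, not part of diffpdf: it assumes the pdftotext command from poppler-utils is installed, and the function names has_text and pdf_has_text are hypothetical helpers introduced here for illustration.

```shell
# Succeed if standard input contains at least one non-whitespace character.
# Form feeds, which pdftotext emits between pages, count as whitespace.
has_text() {
    grep -q '[^[:space:]]'
}

# Hypothetical helper: succeed if the PDF given as $1 yields extractable text.
# Assumes pdftotext (poppler-utils) is available; "-" sends the text to stdout.
pdf_has_text() {
    pdftotext "$1" - 2>/dev/null | has_text
}
```

For example, `pdf_has_text doc01.pdf || echo "doc01.pdf looks image-based"` would flag a scanned document that probably needs text recognition before a text comparison is useful. A scan with a few stray recognised characters would still pass this check, so treat the result as a hint rather than a guarantee.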



Copyright © 1995-2021 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0.