I have a preprocessor that turns some input into LaTeX and then compiles it into a PDF.
I'd like to have some regression tests so that I can tell if I've accidentally broken something in the code.
What's the best way to check, from the command line, whether two LaTeX-generated files are the same? (I could check all the sources, but there are a few hundred of them, so I'd like a faster solution.)
I've been trying to use Python's filecmp.cmp function, but it always reports a freshly generated file as different from one generated seconds before. I'm assuming that's because there is a timestamp encoded somewhere in the PDF.
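
For reference, here is a rough sketch of the comparison I'm attempting, plus one way the timestamp assumption might be tested. The names masked_bytes and pdfs_match are hypothetical. It assumes the volatile fields are the standard PDF /CreationDate, /ModDate and /ID entries, and that they are stored uncompressed in the file, which may not hold for every LaTeX setup.

```python
import filecmp
import re

# Metadata entries assumed to change between runs (an assumption, not an
# exhaustive list). They are only visible here if stored uncompressed.
_VOLATILE = [
    re.compile(rb"/CreationDate\s*\([^)]*\)"),
    re.compile(rb"/ModDate\s*\([^)]*\)"),
    re.compile(rb"/ID\s*\[[^\]]*\]"),
]


def masked_bytes(path):
    """Return the file's bytes with the assumed volatile entries removed."""
    with open(path, "rb") as fh:
        data = fh.read()
    for pattern in _VOLATILE:
        data = pattern.sub(b"", data)
    return data


def pdfs_match(path_a, path_b):
    """Compare two PDFs, ignoring the assumed volatile metadata."""
    # Byte-for-byte check first; shallow=False forces a content comparison
    # whenever the os.stat() signatures differ.
    if filecmp.cmp(path_a, path_b, shallow=False):
        return True
    # Otherwise compare again with dates and document IDs stripped out.
    return masked_bytes(path_a) == masked_bytes(path_b)
```

If pdfs_match returns True where filecmp.cmp alone returns False, that would suggest the difference really is confined to those metadata entries. If it still returns False, something else in the file is changing between runs.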