The input text files are memory mapped, and the internal data is a pointer ... The code is a bit crude and it's sort of based on my recollection of Hamish Dewar's far more elegant "compare" program ...
There are tools that automatically compare the differences between two PDF files, but you can't detect the difference well unless they are almost the same text. When comparing PDF files with a revised ...