In this blog post, I’m going to show you how to extract text from scanned PDF files, or PDF files on which no text recognition was performed. (For PDFs on which text recognition was already performed, you can read my other blog post.)
The PDF I’m going to use can be downloaded from <a href=“http://www.luxemburgensia. …