I am looking for help with doing bulk OCR.
I have the following software: ABBYY FineReader for ScanSnap, DEVONthink 3, PDFelement, ABBYY FineReader, Evernote, EagleFiler, OCR Wizard, PDF Expert, Keyboard Maestro, Hazel, and MacSparky’s excellent book on going Paperless.
Part of me thinks that DEVONthink 3 may be the answer, but I have avoided it, as it seems like it will take quite an investment of time and effort to get it working well.
In the past I’ve tried using the Mac Power Users script along with Hazel to call PDFpen Pro and have it OCR about 500 files.
I’ve been trying off and on to get this to work for the last six years, so I’ve been through multiple changes of the operating system as well as new versions of PDFpen Pro. It will do several files and then just hang.
I don’t think it’s the PDF files themselves, as I have an old version of Adobe Acrobat Pro running on a Windows machine and it’s able to OCR the files just fine.
I was really excited when the latest version of PDFpen Pro included “OCR Files” right inside the application. Unfortunately, it exhibits the same behavior: it will OCR 10 to 20 files and then just hang.
Computers are complicated, so I’m sure there could be something on my system that is causing these problems. However, I don’t have any other applications that behave like this, i.e. work for a while and then hang.
I’m not sure if it makes any difference but my files are stored in iCloud.
I’ve asked Smile Software to see if they could provide an updated script to replace the one that I got from Mac Power Users. I wonder whether adding a few more delays, or longer ones, so that the script does not overwhelm the application, might make it more reliable.
I have sent Smile Software log files, but even after all this time we’ve not been able to get it working.
Here is the script that I am using.
tell application "PDFpenPro"
    open theFile as alias
    -- does the document need to be OCR'd?
    get the needs ocr of document 1
    if result is true then
        tell document 1
            ocr
            repeat while performing ocr
                delay 1
            end repeat
            delay 1
            close with saving
        end tell
        -- In PDFpen, when no documents are open, window 1 is "Preferences"
        -- If other documents are open, do not close the App.
        if name of window 1 is "Preferences" then
            tell application "PDFpenPro"
                quit
            end tell
        end if
    else
        -- Scan Doc was previously OCR'd or is already a text type PDF.
        tell document 1
            close without saving
        end tell
        -- In PDFpen, when no documents are open, window 1 is "Preferences"
        -- If other documents are open, do not close the App.
        if name of window 1 is "Preferences" then
            tell application "PDFpenPro"
                quit
            end tell
        end if
    end if
end tell
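For what it's worth, here is a rough sketch of the "more or longer delays" idea mentioned earlier. It uses only the PDFpenPro terms that already appear in the script above (needs ocr, ocr, performing ocr) plus AppleScript's standard with timeout statement. The variable names (settleDelay, maxWaitSeconds, waited, ocrFinished) and every number in it are my own guesses, not settings from Smile Software or Mac Power Users, and I have not confirmed that any of this stops the hang. It also leaves out the "quit when only Preferences is open" step to keep it short.

```applescript
-- Sketch only. theFile is supplied by Hazel, as in the script above.
-- The numbers below are guesses, not tested settings.
set settleDelay to 5 -- seconds to pause after opening and after OCR
set maxWaitSeconds to 600 -- longest time to wait for one file's OCR

tell application "PDFpenPro"
	-- Raise AppleScript's default 2-minute Apple event timeout for slow files.
	with timeout of (maxWaitSeconds + 60) seconds
		open theFile as alias
		delay settleDelay -- let the document finish loading
		if (needs ocr of document 1) is true then
			tell document 1
				ocr
				set waited to 0
				set ocrFinished to true
				repeat while performing ocr
					delay 2
					set waited to waited + 2
					if waited > maxWaitSeconds then
						-- Give up rather than loop forever if OCR stalls.
						set ocrFinished to false
						exit repeat
					end if
				end repeat
				delay settleDelay -- let PDFpenPro settle before closing
				if ocrFinished then
					close with saving
				else
					close without saving
				end if
			end tell
		else
			-- Already OCR'd or a text PDF: nothing to do.
			tell document 1 to close without saving
		end if
	end timeout
end tell
delay settleDelay -- breathing room before Hazel passes the next file
```

The point of the waited counter is that a stalled OCR gets abandoned after maxWaitSeconds instead of leaving the repeat loop spinning forever, which would at least show whether the hang happens inside the OCR step or somewhere else.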
I hate having to go back to Windows to OCR the files, and when those files are on iCloud it takes a while for them to be reflected on my Mac.
Is anyone else having the same problem with PDFpen Pro hanging?
Do you see any additions to the above script that might help?
Can you recommend any additional software that might do a better job at bulk OCR?
I purchased ABBYY FineReader Pro, which offers quite a few conversion utilities, but I haven’t figured out how to have it do bulk OCR yet.
As you can imagine, I’m beyond frustrated: I have spent a lot of money on software to try to solve this, as well as a lot of time scanning my documents, only to find that it’s difficult to locate what I need.