6
General Software Discussion / Re: Pdf Management
« on: February 09, 2013, 09:04 AM »
Interesting thread here (more than 4 pages I'm afraid):
http://answers.microsoft.com/en-us/windows/forum/windows_7-files/i-need-to-index-pdf-files-in-my-laptop-running/ef5fa7c3-bb64-4f3e-9607-3e01349dccca?page=5
Even Adobe products can search several pdf's in a row, but it doesn't seem to be fast.
Have a look here:
http://www.foxitsoftware.com/products/ifilter/performance.php
Foxit pdf iFilter is 20 bucks per seat; if it works, that should be very reasonable.
When having crawled many more "search pdf index" sites, I'll do perhaps some testing, just indexing my current pdf's of all sorts with multiple progs and compare the results (and no, there does NOT seem to be any such prog that will tell you it can't properly index a pdf...). Prob here, in order to not affect the indexing with one prog by the previous installation of a competing prog, I'd have to to reset my comp to a previous state between every trialling (takes my 30 minutes each time). But from the above thread you'll have understood that the pdf format is the worst file format you can get. All the more so it'd be helpful to know which indexing search tools (and in which global circumstances of your system) will deliver reliable results. I would have expected many more insights into this format from the "academic sw" side (where reliability of pdf searches would be crucial), but no...
http://answers.microsoft.com/en-us/windows/forum/windows_7-files/i-need-to-index-pdf-files-in-my-laptop-running/ef5fa7c3-bb64-4f3e-9607-3e01349dccca?page=5
Even Adobe products can search several pdf's in a row, but it doesn't seem to be fast.
Have a look here:
http://www.foxitsoftware.com/products/ifilter/performance.php
Foxit pdf iFilter is 20 bucks per seat; if it works, that should be very reasonable.
When having crawled many more "search pdf index" sites, I'll do perhaps some testing, just indexing my current pdf's of all sorts with multiple progs and compare the results (and no, there does NOT seem to be any such prog that will tell you it can't properly index a pdf...). Prob here, in order to not affect the indexing with one prog by the previous installation of a competing prog, I'd have to to reset my comp to a previous state between every trialling (takes my 30 minutes each time). But from the above thread you'll have understood that the pdf format is the worst file format you can get. All the more so it'd be helpful to know which indexing search tools (and in which global circumstances of your system) will deliver reliable results. I would have expected many more insights into this format from the "academic sw" side (where reliability of pdf searches would be crucial), but no...