The best Automatic Organizer for documents and other files

Contro:
WhereIsIt, to catalogue all types of files.

Do you know of any? -Contro (May 27, 2012, 08:46 AM)
--- End quote ---

Try searching DC for "whereisit"  :)
-rjbull (May 27, 2012, 11:22 AM)
--- End quote ---


Thanks a lot, rjbull.

https://www.donationcoder.com/forum/index.php?topic=7764.0
https://www.donationcoder.com/forum/index.php?topic=7756.0
https://www.donationcoder.com/forum/index.php?topic=7183.0
https://www.donationcoder.com/forum/index.php?topic=2684.0
https://www.donationcoder.com/forum/index.php?topic=15929.0
https://www.donationcoder.com/forum/index.php?topic=23805.0

One more question.

I would like a way to find the PDF files that are scanned images rather than searchable text.

How can I do that?

 :-*

rjbull:
I would like a way to find the PDF files that are scanned images rather than searchable text. -Contro (May 27, 2012, 12:27 PM)
--- End quote ---

I just saw what looks like your other thread on that.  I don't have an answer.  I wondered about using pdftotext from XPDF, and assuming that any file with zero text output was an image, but I don't think that will work if the file is encrypted.  If the information in pdfinfo (from the same XPDF package) is both correct and complete, there just doesn't seem to be a flag that says that the file is a scanned image.  Once you've found them, maybe the best thing is to rename them, or otherwise tag them, but that doesn't help find them in the first place.
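
A minimal sketch of the pdftotext idea described above, assuming the pdftotext command is installed and on the PATH. The function names, the min_chars threshold and the "unknown" category are illustrative assumptions, not part of XPDF. Any file that pdftotext cannot process (for example an encrypted one) is reported as "unknown" rather than guessed at, and a scanned PDF that carries even a small text layer would still be reported as "text".

```python
import subprocess
from pathlib import Path


def looks_scanned(text, min_chars=1):
    """True if the extracted text has fewer than min_chars non-whitespace characters."""
    # pdftotext separates pages with form feeds, which count as whitespace here.
    return sum(1 for ch in text if not ch.isspace()) < min_chars


def classify_pdf(path):
    """Return 'text', 'scanned', or 'unknown' for a single PDF file."""
    try:
        # "-" as the output file asks pdftotext to write to standard output.
        result = subprocess.run(
            ["pdftotext", str(path), "-"],
            capture_output=True,
            timeout=120,
        )
    except (OSError, subprocess.TimeoutExpired):
        # pdftotext is missing, could not be started, or took too long.
        return "unknown"
    if result.returncode != 0:
        # Damaged or protected files end up here; do not call them scanned.
        return "unknown"
    text = result.stdout.decode("utf-8", errors="replace")
    return "scanned" if looks_scanned(text) else "text"


def main(folder):
    """Print one line per PDF found under folder: category, tab, path."""
    for pdf in sorted(Path(folder).rglob("*.pdf")):
        print(f"{classify_pdf(pdf)}\t{pdf}")
```

Calling main() with a folder path lists every PDF below it with its category. The lines marked "scanned" are the candidates for renaming or tagging, and the ones marked "unknown" need a manual look.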

IainB:
There's an earlier thread on a similar topic to this that could be useful:
DONE: Check folder and tell me which PDFs are images (non-searchable)
@vevola: You might be interested in this.
I see you use Foxit and the "reference management" programme Mendeley Desktop.
I have recently started using another reference management programme called Qiqqa, having tried Zotero and Mendeley - I found the latter two did not meet my requirements.

I was blown away by Qiqqa though - it scans and indexes text-searchable PDF files and PDF files containing images - i.e., it OCRs any text in the images in the PDF files. I let Qiqqa loose on my library of about 650 PDF documents, and left it scanning and indexing the lot overnight.
It successfully OCRed and indexed all the imaged PDF files too. It seems quite intuitive to use and has lots of good features.

There's a good list and comparison of "reference management software" here: Comparison of reference management software

On the Qiqqa website, there is a good comparison between Qiqqa, EndNote, Zotero, and Mendeley: Qiqqa Features
The OCR capability stands out as a strength for Qiqqa.
-IainB (July 22, 2011, 11:30 AM)
--- End quote ---

Contro:
I would like a way to find the PDF files that are scanned images rather than searchable text. -Contro (May 27, 2012, 12:27 PM)
--- End quote ---

I just saw what looks like your other thread on that.  I don't have an answer.  I wondered about using pdftotext from XPDF, and assuming that any file with zero text output was an image, but I don't think that will work if the file is encrypted.  If the information in pdfinfo (from the same XPDF package) is both correct and complete, there just doesn't seem to be a flag that says that the file is a scanned image.  Once you've found them, maybe the best thing is to rename them, or otherwise tag them, but that doesn't help find them in the first place.
-rjbull (May 27, 2012, 03:52 PM)
--- End quote ---

I'll take a look at XPDF.
Best regards

Contro:
Check folder and tell me which PDFs are images (non-searchable)
-IainB (May 27, 2012, 06:03 PM)
--- End quote ---

Goingggg
 ;D
