I've been looking to scan and convert to pdf lots of receipts, statements etc and then index their CONTENTS. Innocently, I thought that would be simple, as X1, Copernicus et al say they index pdfs, jpgs etc. and I've a couple of print to pdf softwares (primo, dopdf)
None of the indexers seem to make it clear up front tho that they only index the file names, not the content of jpg/image files. (or have I got that wrong?)
For pdfs, it seems rather trickier eg, if I create a pdf using dopdf using a word doc as source, contents will be indexed. If I create the pdf using dopdf from an image, the contents won't be indexed.
(Probably if I use acrobat pro to create the pdfs it will work?)
What does get indexed depends on the search software too, as I discovered using both X1 and Copernicus
So, my question is, please: what's a simple, reliable, cheap way to create pdfs from a scanned doc whose CONTENTS are searchable by X1 or Copernicus or GDS