ATTENTION: You are viewing a page formatted for mobile devices; to view the full web page, click HERE.

Main Area and Open Discussion > General Software Discussion

Batch convert folders from .htm to .pdf ?

(1/1)

jity2:
Dear all,

I would like to batch convert folders (and many subfolders) full of .htm and .html files to pdf files with "pdf creator" ?

I guess that "pdf creator" can be used with a command line (http://www.pdfforge.org/content/setup-command-line-parameters) and maybe somebody can help me with a AutoHotkey script ? But I am not against another program as long as the pdf is OCRed.

I would like that it deletes the htm file once the job is done and keep the same filename (example001.htm => example001.pdf) inside the same folder(s).
I also would be much pleased if "pdf creator" can be run in silent mode for a specified main folder (does the job in the background).

Many thanks  in advance, ;)

ps: here is why I need this : Over the years I have saved many htm files in my computer with only one hand gesture in Firefox (see https://www.donationcoder.com/forum/index.php?topic=23782.msg215935)
I would like to be able to make keywords searches of my archives in my Google Drive. Alas for now Google Drive displays (and index) only the code of html files. ;(
I have already copied only my htm + html files with Syncback (free http://www.2brightsparks.com/freeware/freeware-hub.html) and Puresync (http://www.jumpingbytes.com/en/puresync.html).
Hopefully Google Drive index the content of pdf files (first 10 pages if no OCR in the pdf file and first 100 pages if the pdf file is already OCRed. See http://support.google.com/drive/bin/answer.py?hl=en&answer=2423485). Hence my idea to convert my htm(l) files into pdf ones. Furthermore, converting my files decrease their size (example: .htm size : 153 ko => .pdf size : 105ko only ! ;))

Curt:
deleted - I am too sleepy!  :-[

rjbull:
I don't really understand what you want.  Am I right in thinking you want to0 convert the HTMs into PDFs because you want to put them into a Google cloud service, which indexes and OCRs PDFs?  But only shows HTMs as raw code?

This doesn't directly answer your query, but I'm wondering; have you ever considered using a Web clipper, of which there are many?  See, e.g., the DC thread web clipping.  If you really need your data in the cloud, you might have to go to EverNote, or RightNote.

jity2:
(...) Am I right in thinking you want to convert the HTMs into PDFs because you want to put them into a Google cloud service, which indexes and OCRs PDFs?  But only shows HTMs as raw code?-rjbull (December 05, 2012, 02:13 PM)
--- End quote ---

Yes ! ;)

This doesn't directly answer your query, but I'm wondering; have you ever considered using a Web clipper, of which there are many?  See, e.g., the DC thread web clipping.  If you really need your data in the cloud, you might have to go to EverNote, or RightNote.

--- End quote ---
Thanks. I am aware of those products. I have started saving webpages (and their related images + audio + video..) about 13 years ago (I use Copernic Desktop pro for indexing my content right now). Those products did even exist at that time. But I prefer to be independent of them. Who knows if they still exist in the next x years ? If they go bankrupt I lose everything. ;(

Thanks anyway for the ideas. ;)
See ya ;)
Thanks also to "Curt" for his help. ;)

jity2:
Dear all,

I have tried to find a solution (http://www.autohotkey.com/board/topic/87965-batch-convert-files-from-htm-to-pdf-many-folders/).
Alas even if I am very pleased with all the help received. ;) I must confess that my starting idea was stupid (convert .htm to .pdf). ;(
It is better imho to forget about it !
Sorry for disturbing.
See ya ;)


Navigation

[0] Message Index

Go to full version