topbanner_forum
  *

avatar image

Welcome, Guest. Please login or register.
Did you miss your activation email?

Login with username, password and session length
  • Thursday March 28, 2024, 11:40 am
  • Proudly celebrating 15+ years online.
  • Donate now to become a lifetime supporting member of the site and get a non-expiring license key for all of our programs.
  • donate

Author Topic: Batch convert folders from .htm to .pdf ?  (Read 5456 times)

jity2

  • Charter Member
  • Joined in 2006
  • ***
  • default avatar
  • Posts: 126
    • View Profile
    • Donate to Member
Batch convert folders from .htm to .pdf ?
« on: December 04, 2012, 11:28 AM »
Dear all,

I would like to batch convert folders (and many subfolders) full of .htm and .html files to pdf files with "pdf creator" ?

I guess that "pdf creator" can be used with a command line (http://www.pdfforge....mand-line-parameters) and maybe somebody can help me with a AutoHotkey script ? But I am not against another program as long as the pdf is OCRed.

I would like that it deletes the htm file once the job is done and keep the same filename (example001.htm => example001.pdf) inside the same folder(s).
I also would be much pleased if "pdf creator" can be run in silent mode for a specified main folder (does the job in the background).

Many thanks  in advance, ;)

ps: here is why I need this : Over the years I have saved many htm files in my computer with only one hand gesture in Firefox (see https://www.donation...opic=23782.msg215935)
I would like to be able to make keywords searches of my archives in my Google Drive. Alas for now Google Drive displays (and index) only the code of html files. ;(
I have already copied only my htm + html files with Syncback (free http://www.2brightsp...re/freeware-hub.html) and Puresync (http://www.jumpingby...com/en/puresync.html).
Hopefully Google Drive index the content of pdf files (first 10 pages if no OCR in the pdf file and first 100 pages if the pdf file is already OCRed. See http://support.googl...n&answer=2423485). Hence my idea to convert my htm(l) files into pdf ones. Furthermore, converting my files decrease their size (example: .htm size : 153 ko => .pdf size : 105ko only ! ;))

« Last Edit: December 04, 2012, 12:08 PM by jity2 »

Curt

  • Supporting Member
  • Joined in 2006
  • **
  • Posts: 7,566
    • View Profile
    • Donate to Member
Re: Batch convert folders from .htm to .pdf ?
« Reply #1 on: December 04, 2012, 03:40 PM »
deleted - I am too sleepy!  :-[
« Last Edit: December 04, 2012, 03:46 PM by Curt »

rjbull

  • Charter Member
  • Joined in 2005
  • ***
  • default avatar
  • Posts: 3,199
    • View Profile
    • Donate to Member
Re: Batch convert folders from .htm to .pdf ?
« Reply #2 on: December 05, 2012, 02:13 PM »
I don't really understand what you want.  Am I right in thinking you want to0 convert the HTMs into PDFs because you want to put them into a Google cloud service, which indexes and OCRs PDFs?  But only shows HTMs as raw code?

This doesn't directly answer your query, but I'm wondering; have you ever considered using a Web clipper, of which there are many?  See, e.g., the DC thread web clipping.  If you really need your data in the cloud, you might have to go to EverNote, or RightNote.

jity2

  • Charter Member
  • Joined in 2006
  • ***
  • default avatar
  • Posts: 126
    • View Profile
    • Donate to Member
Re: Batch convert folders from .htm to .pdf ?
« Reply #3 on: December 06, 2012, 12:09 AM »
(...) Am I right in thinking you want to convert the HTMs into PDFs because you want to put them into a Google cloud service, which indexes and OCRs PDFs?  But only shows HTMs as raw code?

Yes ! ;)

This doesn't directly answer your query, but I'm wondering; have you ever considered using a Web clipper, of which there are many?  See, e.g., the DC thread web clipping.  If you really need your data in the cloud, you might have to go to EverNote, or RightNote.
Thanks. I am aware of those products. I have started saving webpages (and their related images + audio + video..) about 13 years ago (I use Copernic Desktop pro for indexing my content right now). Those products did even exist at that time. But I prefer to be independent of them. Who knows if they still exist in the next x years ? If they go bankrupt I lose everything. ;(

Thanks anyway for the ideas. ;)
See ya ;)
Thanks also to "Curt" for his help. ;)

jity2

  • Charter Member
  • Joined in 2006
  • ***
  • default avatar
  • Posts: 126
    • View Profile
    • Donate to Member
Re: Batch convert folders from .htm to .pdf ?
« Reply #4 on: December 12, 2012, 07:35 AM »
Dear all,

I have tried to find a solution (http://www.autohotke...to-pdf-many-folders/).
Alas even if I am very pleased with all the help received. ;) I must confess that my starting idea was stupid (convert .htm to .pdf). ;(
It is better imho to forget about it !
Sorry for disturbing.
See ya ;)