topbanner_forum
  *

avatar image

Welcome, Guest. Please login or register.
Did you miss your activation email?

Login with username, password and session length
  • Tuesday March 19, 2024, 1:12 am
  • Proudly celebrating 15+ years online.
  • Donate now to become a lifetime supporting member of the site and get a non-expiring license key for all of our programs.
  • donate

Author Topic: Screenshots all page content of openned pdf file in sumatraPDF in burst mode ?  (Read 4112 times)

jity2

  • Charter Member
  • Joined in 2006
  • ***
  • default avatar
  • Posts: 126
    • View Profile
    • Donate to Member
Dear all,

First some background for the idea :
I have some PDF files which are damaged. My goal is to OCR what can be repaired (I recently tested "Nuance Power PDF Advanced2". IMHO it can OCR many pdf that have problems that other OCR softwares can't even open. But alas it has still problem with some pdf files.)

I have tried several tools and techniques. The best ones so far being :
- 3-Heights™ PDF Analysis & Repair (they also sell a shell version).
https://www.pdf-tool...pdf-analysis-repair/
(The free version can be used here : https://www.pdf-onli....com/osa/repair.aspx )
The problem is that it doesn't repair all defects properly. ;(

- and a batch script using SumatraPDF and the printer Bullzip ( see https://www.donation...opic=42713.msg399623 ).
The problem here is that it takes a lot of time,CPU and memory. For instance a pdf of 100MB uses 16GB of temporary SSD space in order to produce ("print") finally, after 10 minutes, a 300MB pdf !
Also for several pdf files, the process is done and at the end no pdf file is created ! ;(

So I got this idea :
I realize that the nice thing is that I can open most of the pdf (that have errors) with SumatraPDF.  ;)
So it would be great if some software could once the pdf openned in SumatraPDF, take a screenshot of each pages in burst mode (one screenshot then turn to the next page, then repeat). Then I could probably make a pdf from the image files and OCR them very fast ?
I did test SCREENSHOT CAPTOR VERSION 4 https://www.donation...hotcaptor/index.html but I wasn't able to do it (the automatic page "down/up" did not work - win8.1 64) !


Thanks in advance ;)

Ath

  • Supporting Member
  • Joined in 2006
  • **
  • Posts: 3,610
    • View Profile
    • Donate to Member
I did test SCREENSHOT CAPTOR VERSION 4 https://www.donation...hotcaptor/index.html but I wasn't able to do it (the automatic page "down/up" did not work - win8.1 64) !

Have you let SCC try all methods of scrolling it supports? (Assume you have, but just asking to be sure)

A similar request led to the development of my WinSendKeys tool, though that doesn't do the OCR, but only the pagedown action

tomos

  • Charter Member
  • Joined in 2006
  • ***
  • Posts: 11,958
    • View Profile
    • Donate to Member
I did test SCREENSHOT CAPTOR VERSION 4 http://www.donationc...hotcaptor/index.html but I wasn't able to do it (the automatic page "down/up" did not work - win8.1 64) !

FWIW scrolling capture works in Adobe Reader, but that in Windows 7 here.

I'm not sure if it can be set to save separate shots per page (which I presume you'ld prefer), as opposed to making one long screenshot.
Tom

mouser

  • First Author
  • Administrator
  • Joined in 2005
  • *****
  • Posts: 40,896
    • View Profile
    • Mouser's Software Zone on DonationCoder.com
    • Read more about this member.
    • Donate to Member
There are a few methods available when attempting to get the automatic paging to work when doing a scrolling capture with screenshot captor..
You could also just set Screenshot Captor to capture and stay minimized and do page down manually, just hitting page down, prtscr, page down, prtscr, etc.. which shouldn't be too painful once you set Screenshot Captor to not pop up any dialog after each capture.

The other option would be to script the process using some tool and using a commandline screen capture like my MiniCap (https://www.donation...r/MiniCap/index.html).

"I'm not sure if it can be set to save separate shots per page (which I presume you'ld prefer), as opposed to making one long screenshot."

Screenshot Captor does now have an option to save each screen as a separate image (it's under the ... button after you finish the scrolling capture).