Author Topic: Redacting PDF Scans  (Read 8400 times)

JennyB

  • Supporting Member
  • Joined in 2006
  • **
  • Posts: 212
  • Test all things - hold fast to what is good
Redacting PDF Scans
« on: March 09, 2012, 09:25 AM »
I have some PDF scans of old computer magazines from the 1990s that I'd like to put online, but they contain names, addresses, and other private and probably out-of-date info that I'd rather not display.

Is there any easy way to black that out without rescanning?


If you don't see how it can fail -
you haven't understood it properly.

mouser

  • First Author
  • Administrator
  • Joined in 2005
  • *****
  • Posts: 40,900
Re: Redacting PDF Scans
« Reply #1 on: March 09, 2012, 09:46 AM »
That's a more interesting question than it may first appear. There are surely easy ways to put a black bar over the information from within a PDF editor, but you need to be very careful that the original text isn't still contained in the PDF and recoverable, which can be quite hard to tell from a casual examination of the file. There are lots of stories about people who thought they had removed or hidden some information in a document, only to find that it was still present in the file.
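
One rough way to check a file for leftover text, sketched in Python with only the standard library. This is a heuristic under stated assumptions, not a proof of safety: it only understands uncompressed and Flate-compressed streams, it does not handle encrypted files or other filters such as LZW, and the function names (`looks_like_text`, `streams_with_text_ops`) are made up for this example. A count above zero suggests a text layer is still sitting in the file; a count of zero does not guarantee the file is clean.

```python
import re
import zlib

# Raw bytes between the "stream" and "endstream" keywords of a PDF object.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)
# Tj and TJ are the PDF operators that paint a string (or array) as text.
TEXT_OP_RE = re.compile(rb"[)\]>]\s*T[jJ]\b")


def looks_like_text(data: bytes) -> bool:
    """Page content streams are mostly printable ASCII; image data is not."""
    sample = data[:4096]
    if not sample:
        return False
    printable = sum(1 for b in sample if 32 <= b < 127 or b in (9, 10, 13))
    return printable / len(sample) > 0.9


def streams_with_text_ops(pdf_bytes: bytes) -> int:
    """Count streams that appear to contain text-painting operators."""
    hits = 0
    for match in STREAM_RE.finditer(pdf_bytes):
        raw = match.group(1)
        try:
            # Most content streams use FlateDecode, which is plain zlib.
            data = zlib.decompressobj().decompress(raw)
        except zlib.error:
            # Not zlib data: treat it as an uncompressed stream.
            data = raw
        if looks_like_text(data) and TEXT_OP_RE.search(data):
            hits += 1
    return hits
```

To try it, read the file as bytes and pass them in, for example `streams_with_text_ops(open("scan.pdf", "rb").read())`, where `scan.pdf` is a placeholder name. A pure image scan would normally report 0, while an OCRed or born-digital PDF with a black bar drawn on top would normally report 1 or more.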

JennyB

Re: Redacting PDF Scans
« Reply #2 on: March 09, 2012, 10:02 AM »
I don't have them to hand, but I think that most of them are image scans from pre-PDF days, not even OCRed. Does that make a difference?

kunkel321

  • Supporting Member
  • Joined in 2009
  • **
  • Posts: 597
Re: Redacting PDF Scans
« Reply #3 on: March 09, 2012, 11:07 AM »
If they started as paper docs and were converted to PDF, I'll bet you can use a PDF tool to black out the parts, then reprint the result as a new PDF file, and the blacked-out info would be permanently gone. I saw a PDF tool at dottech recently
http://dottech.org/f...-limited-time-offer/
that can probably do the blocking.
I'm not sure whether it does virtual printing, but you can probably already do that. If not, check out the free http://www.cutepdf.c...s/CutePDF/writer.asp

EDIT: I'm not positive, but I think that even a Word file saved as PDF, then redacted as above and sent through a virtual printer, would be secure.

cmpm

  • Charter Member
  • Joined in 2006
  • ***
  • Posts: 2,026
Re: Redacting PDF Scans
« Reply #4 on: March 09, 2012, 11:20 AM »
Quote from JennyB:
not even OCRed. Does that make a difference?

Yes, that would make it easy, if it's converted to a .png or .jpg.
Use ScreenshotCapture's editor or another editor that can black out sections.
Then to be sure, take a screenshot of the finished product.
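
Why this works on a plain image: painting over pixels replaces the stored values, so nothing is left underneath to recover. Here is a minimal sketch in standard-library Python, assuming the page is already decoded into a row-major RGB buffer (3 bytes per pixel). The function name `black_out` and the tiny 4x3 "page" are made up for illustration; a real image editor does the equivalent internally.

```python
def black_out(pixels: bytearray, width: int, height: int,
              box: tuple[int, int, int, int]) -> None:
    """Overwrite box = (left, top, right, bottom) with black, in place.

    right and bottom are exclusive; the box is clipped to the image.
    """
    left, top, right, bottom = box
    left, top = max(left, 0), max(top, 0)
    right, bottom = min(right, width), min(bottom, height)
    if left >= right or top >= bottom:
        return  # box lies entirely outside the image
    for y in range(top, bottom):
        start = (y * width + left) * 3
        end = (y * width + right) * 3
        # The old pixel values are replaced, not covered by a layer.
        pixels[start:end] = bytes(end - start)


# A 4x3 all-white page; black out columns 1-2 of row 1.
width, height = 4, 3
page = bytearray([255]) * (width * height * 3)
black_out(page, width, height, (1, 1, 3, 2))
```

After the call, the six bytes for those two pixels are zero and every other byte is still 255. Saving that buffer to a new .png or .jpg carries no trace of what was under the box.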

bob99

  • Supporting Member
  • Joined in 2008
  • **
  • Posts: 345
Re: Redacting PDF Scans
« Reply #5 on: March 09, 2012, 12:03 PM »

It's fun to go back and look at old computer & electronics magazine articles just to see how things have changed over time. I kept some from the '70s and it's amazing.

No one has said it yet, though: I'd be careful what you put online, especially if any of it is copyrighted. While your intentions are good, too many people today try to make money the "new-fashioned" way... litigation and lawsuits.
Just a thought.

kunkel321

Re: Redacting PDF Scans
« Reply #6 on: April 02, 2012, 06:27 PM »
FYI there's a PDF OCR tool on http://www.giveawayoftheday.com/ today...

Renegade

  • Charter Member
  • Joined in 2005
  • ***
  • Posts: 13,288
  • Tell me something you don't know...
Re: Redacting PDF Scans
« Reply #7 on: April 02, 2012, 09:29 PM »
Quote from cmpm:
Yes, that would make it easy, if it's converted to a .png or .jpg.
Use ScreenshotCapture's editor or another editor that can black out sections.
Then to be sure, take a screenshot of the finished product.

+1

That's a good recommendation.

If it is OCR'd, you can remove text in an editor like Adobe Illustrator, then save the result.

However, you still have the potential for metadata to be present, so cmpm's recommendation that you take a screenshot of the final product is excellent.
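
If the final product ends up as a .png, a quick way to see whether any metadata came along is to list the file's chunks. This is a rough standard-library Python sketch, assuming PNG output; the function name `metadata_chunks` is made up for this example, and the chunk list covers only the standard text, EXIF, and timestamp chunk types, so private or unusual chunks would not be flagged.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
# Standard PNG chunk types that carry text, EXIF data, or a timestamp.
METADATA_CHUNKS = {b"tEXt", b"zTXt", b"iTXt", b"eXIf", b"tIME"}


def metadata_chunks(png_bytes: bytes) -> list[str]:
    """Return the types of metadata-bearing chunks found in a PNG file."""
    if not png_bytes.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found = []
    pos = len(PNG_SIGNATURE)
    # Each chunk is: 4-byte length, 4-byte type, data, 4-byte CRC.
    while pos + 8 <= len(png_bytes):
        length, chunk_type = struct.unpack(">I4s", png_bytes[pos:pos + 8])
        if chunk_type in METADATA_CHUNKS:
            found.append(chunk_type.decode("ascii"))
        if chunk_type == b"IEND":
            break
        pos += 8 + length + 4
    return found
```

An empty list means none of those chunk types are present. Anything else, such as `['tEXt']`, is worth opening in a metadata viewer before the file goes online.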

Slow Down Music - Where I commit thought crimes...

Freedom is the right to be wrong, not the right to do wrong. - John Diefenbaker