topbanner_forum
  *

avatar image

Welcome, Guest. Please login or register.
Did you miss your activation email?

Login with username, password and session length
  • Thursday April 18, 2024, 2:16 pm
  • Proudly celebrating 15+ years online.
  • Donate now to become a lifetime supporting member of the site and get a non-expiring license key for all of our programs.
  • donate

Author Topic: IDEA: get all images, tables from a pdf and save them as pdfName-fig-number.png  (Read 7350 times)

urlwolf

  • Charter Member
  • Joined in 2006
  • ***
  • Posts: 1,837
    • View Profile
    • Donate to Member
When I read APA formatted articles, I have to go back and forth to see the figures and tables (they are at the end) between the place where the fig. is mentioned and the figure itself.

Acrobat has a function to do what I want:
advanced > export images.

However, it fails in many occasions. It says that only images that contain raster or bitmap data can be exported. Most papers don't.

What I do "by hand" is to use screencaptor to take them (crop them) and save them. I then use the Dopus picture viewer to see them when they are mentioned.

A program that could do this automatically would be a godsend for academics around the word.

jgpaiva

  • Global Moderator
  • Joined in 2006
  • *****
  • Posts: 4,727
    • View Profile
    • Donate to Member
I'm not sure if it can do it, but just take a look at Mark0's BitmapRip (Bitmap Ripper).
I think it might be exactly what you're looking for.

skrommel

  • Fastest code in the west
  • Developer
  • Joined in 2005
  • ***
  • Posts: 933
    • View Profile
    • 1 Hour Software by skrommel
    • Donate to Member
 :) How about opening the pdf file twice?

I couldn't get BitmapRipper to work, but Carol Haynes mentioned PDF Explorer from http://homepage.onid.../pdfe/downloads.html, and this on extracts images like a charm. Also, GhostView from http://www.seas.ucla...u/~ee5cta/ghostView/ can convert pdf to jpg or tif, to ease the cutting and pasting.

Skrommel

urlwolf

  • Charter Member
  • Joined in 2006
  • ***
  • Posts: 1,837
    • View Profile
    • Donate to Member
I think the problem is that some of the graph-tables are not bitmaps.
For example, is the author drew a chart in word, it would be a vector graph in the final pdf and will be ignored by these tools. Same with tables.

I guess opening two copies is not a bad idea, although it loses a lot of screen real space (and I'm on a laptop to start with).

Thanks a lot

rjbull

  • Charter Member
  • Joined in 2005
  • ***
  • default avatar
  • Posts: 3,199
    • View Profile
    • Donate to Member
Sounds like you might like to try XPDF (freeware), specifically this component of it


DESCRIPTION
       Pdfimages saves images from  a  Portable  Document  Format
       (PDF)  file  as  Portable  Pixmap  (PPM),  Portable Bitmap
       (PBM), or JPEG files.

       Pdfimages reads the PDF file, scans  one  or  more  pages,
       PDF-file,  and  writes one PPM, PBM, or JPEG file for each
       image, image-root-nnn.xxx, where nnn is the  image  number
       and xxx is the image type (.ppm, .pbm, .jpg).



mouser

  • First Author
  • Administrator
  • Joined in 2005
  • *****
  • Posts: 40,900
    • View Profile
    • Mouser's Software Zone on DonationCoder.com
    • Read more about this member.
    • Donate to Member
oh nice find rjbull!