NANY 2013 Release: pdfautomv 0.3

ATTENTION: You are viewing a page formatted for mobile devices; to view the full web page, click HERE.

DonationCoder.com Software > N.A.N.Y. 2013

(1/3) > >>

phitsc:
NANY 2013 Entry Information
Application Name pdfautomv (or pdfautomv Robot, haven't decided yet ;)) Version 0.3 Short Description Moves PDF files into directories depending on embedded text Supported OSes Windows (and possibly Linux) Web Page https://bitbucket.org/phitsc/pdfautomv Download Link source available following above link. but I'll make a zip too. System Requirements
* Ruby runtimeVersion History
* 0.3 - Fixed a crash.
* 0.2 - Rule files now have to be UTF-8. Fixed crash with -v 3 option.

Description
pdfautomv will be a simple command line utility (although perfectly usable with a simple double-click on a desktop shortcut) for the paperless office aficionado. Its purpose is to move PDF files from one directory to another based on the text embedded in the PDF file. My own primary use case is as follows:

1. Put invoice, receipt, letter, bank statement, whatever on scanner
2. Start scanning process => this will produce a PDF file in directory A
3. Repeat 1 - 2 until everything is scanned and directory A is full of files like Document.pdf, Document001.pdf, Document002.pdf, etc.
4. Double-click shortcut to pdfautomv.rb => marvel how all the Document bla bla.pdf files get nicely and neatly renamed and sorted into the directories where they belong

Usage
Installation
The application will be written in Ruby. So the Ruby runtimes have to be installed if they are not already. The application itself is just one Ruby file.

Using the Application
The application will rely on some "rule" files which have to be supplied by the user. A rule file specifies what pdfautomv should look for in a PDF file and where to move it and how to rename it if it finds a matching PDF file.

phitsc:
I've now and again tried to find an application that does just what pdfautomv will do but failed to find any. If someone knows such an application, even commercial (if it's not absurdly expensive), please let me know.

rjbull:
I used to have to rename health & safety data sheet PDFs. I had a batch file that used pdftotext from xpdf to convert the PDF to text, with an AWK script to parse the file and find what should be the name, and make a temporary .BAT that actually renamed the file. That only worked because the files had predictable structure.

phitsc:
That is exactly what I'm doing. It's also supposed to work with documents with predictable structure (actually, predictable content), hence the rule files.

mouser:
will renaming be part of this?

if so, are we saying it might be possible to rename a pdf file based on the text content in the file -- like on the article title and author? i've long wanted such a thing for downloaded academic articles, which are often named things like fdj4893dfjk48.pdf

Navigation

[0] Message Index

[#] Next page

Go to full version