Hi,
I'd like to bring up a sub-topic concerning
tools/strategies for note taking related to reading PDF files (or locally saved, static text files more generally).
Like many others I read a lot of articles, reports, books that come as PDF files. Fairly often, I find myself falling back to a very primitive two part note taking system:
system 1. print PDF to paper, then jot down brief margin notes, underline, mark and so on during the first, quick read.
system 2. During a second, more thorough reading, type summary, copy&paste key quote sections, add thoughts/comments and so on in notepad or similar. Save notes for "article123.pdf" in "article123.txt" next to the PDF.
This common (perhaps apart from the last detail on naming/saving) system has advantages (+) and drawbacks (-).
system 1: notes on PDF-printout
+ quick for short notes
+ no computer needed
+ notes located right next to relevant source (no separate quoting needed, context is clear)
- no programmatic searching, indexing, comparison, editing, tagging
- timeconsuming for long notes
- physical archive needed
- costs related to paper (economical, environmental. Lessened, not removed, by FinePrint)
system 2: notes in separate plain-text files
inverse many +/- above. Also:
- rich formats lost when qouting
- for long notes, less overview due to plain text. Using manual sectioning (1 , 1.2 , 1.3 ... ) takes time and is error prone.
+ independent/portable between computers, operating systems, software and so on
My impression is that many of the applications discussed in this enormous thread replace system 2 with complex software that allow rich text formats, search, indexing, editing, tagging and so on. Such replacement is not my focus here so I'll just say that I've tried many such programs, find many of them impressive but have not yet settled on any single one. Mainly since first making a complete commitment to an application and then later wanting to switch after all could come at a high cost (time, format conversion issues, re-learning and so on). So I keep trying out different software. In the meantime I also often fall back on system 2, complemented with separate rich text files when needed (usually via the Open Office apps)
Now, what I really want to discuss is software that replaces system 1. Specifically, software that emulates and improves on manual notes on/around a printed source text and also ideally connects/combines systems 1 & 2.
I know of three types of software that do some of that: complex pdf-viewers/editors (Acrobat), bibliography tools (EndNote), general information aggregators (OneNote). I don't have a lot of experience with any of these tools but my vague impression is that they are all still pretty crude in this regard. They may lack in customizability, easy portability and/or cross-document searching, indexing, tagging of notes. Maybe the OneNote type of applications are most promising for now, especially combined with input tech like tablet PCs with pens and OCR for hand written text. Such tech is of course needed if we at some time want to import all previous, hand written notes.
Anyway, here are three features I think an future, ideal such system should have:
A. co-location: notes should be inputable/viewable right next to the relevant source section. So underlining, adding margin notes and so should be possible in or in the context of the pdf-viewer, not (only) in a separate document in a separate window. And even additional notes in a separate document should somehow be (hyper)linked to the source text parts that they concern.
B. independence/portability: notes should be easily portable between different users, computers, operating systems, applications.
C. systematicity: cross-document searching, indexing, tagging and so on of notes should be supported.
Combining A-C seems like a very hard task. B seems to require some standardized format for such notes, a format that different applications can follow or at least import/export from/to.
One way to tackle this design problem would be to find an area with similarities to on-the-page note taking and that already has working solutions with features A-C.
An interesting such comparison that I thought of is plain-text .srt-subtitle files for video: http://en.wikipedia.org/wiki/SubRip . These contain a list of paired timestamps and dialog text snippets. Put "movie123.srt" in the same folder as the movie file "movie123.avi" the subtitle was made for, start the video and chances are your media player will recognize and display the subtitle automatically.
Can a similar system be had for notes in PDF files? The .srt system relies on static video files and many pdf files are static enough too. For example pdf versions of professional journal articles. Where .srt files tie content (dialog text) to points in time (timestamps), the pdf notes can instead be tied to page number, X/Y coordinates and other properties of the original pdf text.
With such a system sharing of notes would be very easy if the same source pdf files are already available. Just copy/paste upload/download the note files. Specific sites could host large note archives. Notes could be tied to articles/books through filename and DOI (
http://www.doi.org/ ) or ISBN (
http://en.wikipedia....Standard_Book_Number ) numbers. For a very rough analogy on how such a site could work, see this german site
http://www.cutlist.de/ that seems to share timestamps for when to start/stop cropping when removing commercials from PVR recorded programs (my german is bad so I'm not completely sure though).
Also, multiple notes for one single pdf file could potentially be combined and switched on/off individually, like layers in Photoshop (
http://en.wikipedia....igital_image_editing) ). Imagine a pdf viewer window with two panes. The left displays the pdf and also has a row of tabs/checkboxes on top. Someone studying ancient literature has a free pdf version of the Iliad. She then adds and compares on-the-page notes on the Iliad from scholar 1, scholar 2 and so on by clicking on the different note layer tabs. It would be as easy as browsing through the various audio tracks on a DVD disc! The right pane contains more extensive and general notes. Some hyperlinking systems connects these panes, so double clicking on a section in the right pane jumps the left pane the the related pdf section and vice versa.
Also, users could in the pdf viewer customize the display style for one and the same basic note file (like what a .css can do for a .html). For example, switch underline display color from red to green.
Another advantage with such a standardized system is that it would allow different software developers to continually compete and allows users to switch back and forth between various software. The notes aren't locked into one specific software.
Ok, I've written long enough for now. I'd love some feedback on the ideas sketched above. I know they're a bit utopian.
Also, I have a more specific question to all readers of this thread:
Do you think that there already is some good software that does something close to A-C for pdf notes?