Been a looong time since I last checked in to look for a solution... and even longer since I posted a question, but if there's one place on the web where I will find a bunch of interesting and relevant replies, ...
I've got a set of self-edited web pages which link to each other in various ways, where anchors may be hyperlinks, classes or names, and 'bookmarks' using divs with name/id.
Ergo:
<a href="myserver.net/index.html">
<a href="#N0A12bf67">
<div id="tocN0A12bf67">
<a name="N0A12bf67">
<a class="file" href ="foo.txt">
I want to save that set of pages in a document, while keeping the links working, and it seems that I have to rework all the links so they stop pointing to the original web pages.
I have tried various ways of saving to PDF, have C&P into Word and OpenOffice, have even tried some pretty nifty e-book editing apps. So far, some of the links still point to the original URL.
I'm thinking that I may need to ...
a) combine all the pages into one document?
and/or
a) find every anchor.
b) decide if it's internal or external to the current document; then ignore internals as they'll continue to work.
c) modify the external links with new internal links, OR, if I have not combined all the documents into one, modify the links to erm, uh,
dunno!
This last bit may depend on whether I'm making a PDF, .doc, or odf. as I suppose I need to make the revised links point to the correct filetype.
I'm pretty sure I need to combine all pages into one document so that when I make the PDF/doc/odf, each link looks within the one document rather than heading off in a failed search of a separate one.
Am thinking I can use cmd.exe to create the concatenated version and save it as HTML.
Thereafter, I would change the URLs to point at itself in the new concatenated format, then print it as a PDF.
Or C&P to word, OO etc.
Am I right so far?
If so, all I ask is whether someone knows a tool that will do all this for me!
Happy New Year!