Any ideas for a save website crawler for offline reading?

Carol Haynes:
I am doing a university course at the moment and most of the course materials are available via a website.

I have to log in to the website, and programs like HTTrack don't seem to be able to download the pages for offline reading.

To make matters worse the website is optimised for Internet Explorer and doesn't seem to work well in other browsers.

Ideally I need an Internet Explorer 9 add-on that can download the website for offline reading (offline web pages were removed by MS from IE7 onwards).

There are too many pages to save them individually.

rjbull:
As long as you're willing to pay to register, it looks like Internet Download Manager (IDM) can. From the "Grabber Wizard. Creating a project" section of Help:
Step 1. Set a start page
 
On the first step of the wizard you should specify the start page. By default, http protocol is assumed; other protocols like https are required to be specified explicitly. The start page also sets the current site. For example if you specified http://www.tonec.com/support/index.html, the current site would be www.tonec.com with all supported protocols like ftp, https, http applied to this site name.
 
If a site requires authorization, you should also set login and password on this step. Some sites allow browsing/downloading only after authentication on a certain page. In this case you should press on "Advanced>>" button, check "Enter login and password manually" box, and specify the page to login to the site. Also if the site has a logout button, you should specify here the logout pages that the Grabber should not open. If you set the login page, the Grabber will open a browser window after the fourth step and let you login to the site manually before proceeding with exploring and downloading.
[...]
If you need to download all pictures, video or audio files from a website, or download a complete web site, you may select the appropriate template in Project template listbox. Project templates make it easy to start your projects quickly, because all required settings are made automatically.
--- End quote ---

Sanity check - I run IDM all the time, but haven't tried that particular feature.
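
For anyone who would rather script it, here is a rough Python sketch of the idea the Grabber help describes: start from one page, stay on that site whatever the protocol, never open the logout page, and fetch everything through a logged-in session. All the names in it (same_site_links, crawl, make_logged_in_fetch, the form fields) are made up for illustration and have nothing to do with how IDM or HTTrack work internally. It also assumes a plain form login that sets a cookie, which the course site may well not use.

```python
import http.cookiejar
import urllib.request
from html.parser import HTMLParser
from urllib.parse import urldefrag, urlencode, urljoin, urlsplit


class LinkParser(HTMLParser):
    """Collects the href of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def same_site_links(html, page_url, skip=()):
    """Return absolute, fragment-free links on the same host as page_url."""
    parser = LinkParser()
    parser.feed(html)
    host = urlsplit(page_url).netloc
    links = []
    for href in parser.hrefs:
        url, _fragment = urldefrag(urljoin(page_url, href))
        # Keep only links on the start page's host; drop logout pages etc.
        if urlsplit(url).netloc == host and url not in skip and url not in links:
            links.append(url)
    return links


def crawl(start_url, fetch, skip=(), max_pages=100):
    """Breadth-first crawl. fetch(url) must return the page HTML as a str."""
    pages = {}
    queue = [start_url]
    while queue and len(pages) < max_pages:
        url = queue.pop(0)
        if url in pages:
            continue
        pages[url] = fetch(url)
        queue.extend(u for u in same_site_links(pages[url], url, skip)
                     if u not in pages)
    return pages


def make_logged_in_fetch(login_url, form_fields):
    """Assumption: the site accepts a form POST and tracks the login by cookie."""
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    opener.open(login_url, data=urlencode(form_fields).encode()).close()

    def fetch(url):
        with opener.open(url) as response:
            return response.read().decode("utf-8", errors="replace")

    return fetch
```

You would call make_logged_in_fetch with the login page and whatever field names its form really uses, pass the result to crawl along with the logout URL in skip, and then write each entry of the returned dict to disk. It only follows ordinary links, so pages built by scripts, images and stylesheets are not picked up.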

Carol Haynes:
Nice idea, but unfortunately it crashes after about 14 pages every time I run it :-(

Renegade:
Try Teleport Pro.

Carol Haynes:
Try Teleport Pro.
-Renegade (April 02, 2012, 07:52 PM)
--- End quote ---

Nope, it doesn't have the ability to go past a login page - it can use a username and password, but only for the very limited number of websites that allow the username and password in the URL. It's also very basic, and expensive unless you want to spend a ridiculous amount on the non 'Pro' versions. HTTrack is free (open source) and is more fully featured than the Pro version.

IDM does what I need - just a shame it crashes every time I try to actually use it!
