Printing web pages
Net Llama!
netllama
Mon May 17 11:46:18 PDT 2004
On 04/04/03 19:39, Collins Richey wrote:
> Now that I've looked at the problem a little more closely, I probably
> need more that this. The root of what I want to retrieve is
> www.slackware.com/book which is a php beast. What I'm looking to do is
>
> 1. Retrieve the base page and follow all Next links, strip out all the
> extra crap on each page, retain and format the text, and store the
> result for printing.
>
> 2. I could do this with simple python tools for a normal html site, but
> the #$@! slackware site doesn't respond to simple http requests; even
> the links are php commands. A browser, of course, can wade through this
> with ease, but I don't want to have to save each individual page as html
> just to format it.
>
> 3. All this work because the Slack folks don't provide a printable
> version.
>
> Any thoughts?
wget with the mirror option?
--
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
L. Friedman netllama at linux-sxs.org
Linux Step-by-step & TyGeMo: http://netllama.ipfox.com
8:35pm up 26 days, 21:04, 3 users, load average: 0.27, 0.07, 0.02
More information about the Linux-users
mailing list