Printing web pages
Alan Jackson
ajackson
Mon May 17 11:46:18 PDT 2004
Run a perl script to follow all the links, and dump each page
with lynx. You might even be able to coax lynx into following
the links, I don't know.
On Fri, 4 Apr 2003 20:39:46 -0700
Collins Richey <erichey2 at attbi.com> wrote:
> On Fri, 04 Apr 2003 17:39:19 -0800
> "Net Llama!" <netllama at linux-sxs.org> wrote:
>
> > On 04/04/03 17:35, Collins Richey wrote:
> > > Are there any generalized utility programs that will grap a web
> > > page, extract the text, convert to a text (or fill-in-the-blanks)
> > > file for printing?
> > >
> > > I'm getting ready to work on some python code to do that for
> > > printing the Slackware users' manual, but it would be nice to have a
> > > real tool.
> >
> > html2jpeg creates jpegs (basically screenshots) of webpages:
> > http://freshmeat.net/projects/html2jpg/
> >
> > html2ps converts html to postscript
> > http://freshmeat.net/projects/html2ps/
> >
> > html2pdf
> > http://freshmeat.net/projects/html2pdf/
> >
>
> Thanks,
>
> Now that I've looked at the problem a little more closely, I probably
> need more that this. The root of what I want to retrieve is
> www.slackware.com/book which is a php beast. What I'm looking to do is
>
> 1. Retrieve the base page and follow all Next links, strip out all the
> extra crap on each page, retain and format the text, and store the
> result for printing.
>
> 2. I could do this with simple python tools for a normal html site, but
> the #$@! slackware site doesn't respond to simple http requests; even
> the links are php commands. A browser, of course, can wade through this
> with ease, but I don't want to have to save each individual page as html
> just to format it.
>
> 3. All this work because the Slack folks don't provide a printable
> version.
>
> Any thoughts?
>
> --
> Collins - Slack 9.0 EXT3
> _______________________________________________
> Linux-users mailing list
> Linux-users at linux-sxs.org
> Unsubscribe/Suspend/Etc -> http://www.linux-sxs.org/mailman/listinfo/linux-users
>
>
--
-----------------------------------------------------------------------
| Alan K. Jackson | To see a World in a Grain of Sand |
| alan at ajackson.org | And a Heaven in a Wild Flower, |
| www.ajackson.org | Hold Infinity in the palm of your hand |
| Houston, Texas | And Eternity in an hour. - Blake |
-----------------------------------------------------------------------
More information about the Linux-users
mailing list