Printing web pages

Alan Jackson ajackson
Mon May 17 11:46:18 PDT 2004


Run a perl script to follow all the links, and dump each page
with lynx. You might even be able to coax lynx into following
the links, I don't know.


On Fri, 4 Apr 2003 20:39:46 -0700
Collins Richey <erichey2 at attbi.com> wrote:

> On Fri, 04 Apr 2003 17:39:19 -0800
> "Net Llama!" <netllama at linux-sxs.org> wrote:
> 
> > On 04/04/03 17:35, Collins Richey wrote:
> > > Are there any generalized utility programs that will grap a web
> > > page, extract the text, convert to a text (or fill-in-the-blanks)
> > > file for printing?
> > > 
> > > I'm getting ready to work on some python code to do that for
> > > printing the Slackware users' manual, but it would be nice to have a
> > > real tool.
> > 
> > html2jpeg creates jpegs (basically screenshots) of webpages:
> > http://freshmeat.net/projects/html2jpg/
> > 
> > html2ps converts html to postscript
> > http://freshmeat.net/projects/html2ps/
> > 
> > html2pdf
> > http://freshmeat.net/projects/html2pdf/
> > 
> 
> Thanks,
> 
> Now that I've looked at the problem a little more closely, I probably
> need more that this.  The root of what I want to retrieve is
> www.slackware.com/book which is a php beast.  What I'm looking to do is
> 
> 1. Retrieve the base page and follow all Next links, strip out all the
> extra crap on each page, retain and format the text, and store the
> result for printing.
> 
> 2. I could do this with simple python tools for a normal html site, but
> the #$@! slackware site doesn't respond to simple http requests; even
> the links are php commands.  A browser, of course, can wade through this
> with ease, but I don't want to have to save each individual page as html
> just to format it.
> 
> 3. All this work because the Slack folks don't provide a printable
> version.
> 
> Any thoughts?
> 
> --
> Collins - Slack 9.0 EXT3
> _______________________________________________
> Linux-users mailing list
> Linux-users at linux-sxs.org
> Unsubscribe/Suspend/Etc -> http://www.linux-sxs.org/mailman/listinfo/linux-users
> 
> 


-- 
-----------------------------------------------------------------------
| Alan K. Jackson            | To see a World in a Grain of Sand      |
| alan at ajackson.org          | And a Heaven in a Wild Flower,         |
| www.ajackson.org           | Hold Infinity in the palm of your hand |
| Houston, Texas             | And Eternity in an hour. - Blake       |
-----------------------------------------------------------------------


More information about the Linux-users mailing list