Using wget to snapshot a web site
Michael Hipp
Michael
Wed Dec 8 19:21:31 PST 2004
Alan Jackson wrote:
> On Wed, 07 Dec 2005 15:00:49 -0600
> Michael Hipp <Michael at Hipp.com> wrote:
>
> The pages don't exist - they are created on the fly by a cgi program.
Yes. But since wget has no problem getting the first page, it seems it should
be able to get the rest as well (by just pretending to be a browser). It just
doesn't make any attempt to follow those cgi "links".
> One could presumably write a perl script that would act like a user
> going to each page and trap that, but it would be painful.
Yes. Guess that's what I'll have to do. But I'll reach for Python since it's
what I know best.
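
A rough sketch of what such a script might look like, using only the Python
standard library. It assumes the cgi pages link to one another with ordinary
<a href="..."> tags (not forms or JavaScript) and that they decode as UTF-8.
The names LinkParser, extract_links, fetch and snapshot, the User-Agent
string, and the max_pages limit are all made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag, urlparse
from urllib.request import Request, urlopen


class LinkParser(HTMLParser):
    """Collect the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(html, base_url):
    """Return absolute, fragment-free URLs for the links found in html."""
    parser = LinkParser()
    parser.feed(html)
    return [urldefrag(urljoin(base_url, h)).url for h in parser.hrefs]


def fetch(url):
    """Download one page, sending a browser-like User-Agent header."""
    req = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(req, timeout=30) as resp:
        return resp.read().decode("utf-8", errors="replace")


def snapshot(start_url, fetch_page=fetch, max_pages=100):
    """Breadth-first crawl of pages on the same host; returns {url: html}."""
    host = urlparse(start_url).netloc
    pages = {}
    queue = [start_url]
    while queue and len(pages) < max_pages:
        url = queue.pop(0)
        if url in pages:
            continue  # already fetched via another link
        pages[url] = fetch_page(url)
        for link in extract_links(pages[url], url):
            # Stay on the starting host and skip pages we already have.
            if urlparse(link).netloc == host and link not in pages:
                queue.append(link)
    return pages
```

Calling snapshot() with the address of the first page returns a dict mapping
each URL to its HTML, which can then be written to disk. The fetch_page
argument is there so the crawl logic can be tried against canned pages
without touching the network.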
Thanks,
Michael