Warrick

Announcements, maintenance etc. Boring.
jbikker
Posts: 175
Joined: Mon Nov 28, 2011 8:18 am

Warrick

Postby jbikker » Wed Nov 30, 2011 12:53 pm

Apparently, there is software that can recursively fetch webpages from the Google cache and from other sources, and that can combine sources when the individual copies are partial. The software is called Warrick:
http://frankmccown.blogspot.com/2011/08 ... tatus.html
Unfortunately, Warrick is currently undergoing a drastic update, which was required because of changes in the Google APIs and Archive.org. An updated version is expected 'in a couple of weeks' (see the URL above). Apparently, Google may delete a site from its cache when it fails to crawl it, so let's hope a couple of weeks is fast enough. Warrick may provide us with a full copy of ompf.
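
To make the "combinations of sources" idea concrete, here is a minimal sketch of merging partial copies. It is not Warrick's actual code: the function names, the dictionary layout, and the source ordering are all assumptions made for illustration, and it does no fetching at all. It only shows how pages recovered from several caches could be combined, with earlier sources taking priority.

```python
from typing import Dict, List, Optional


def merge_partial_copies(sources: List[Dict[str, Optional[str]]]) -> Dict[str, str]:
    """Combine partial site copies (hypothetical helper, not Warrick's API).

    Each source maps a page path to its recovered content, or to None when
    that source has no copy. Sources are listed in priority order.
    """
    merged: Dict[str, str] = {}
    for source in sources:
        for path, content in source.items():
            # Keep the first non-empty copy found for each page.
            if content and path not in merged:
                merged[path] = content
    return merged


def missing_pages(wanted: List[str], merged: Dict[str, str]) -> List[str]:
    """List the pages that no source could provide."""
    return [path for path in wanted if path not in merged]
```

With one dictionary per cache (say, one for Google and one for Archive.org), a page missing from the first is filled in from the second, and missing_pages reports whatever is still lost.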

nhm
Posts: 8
Joined: Thu Dec 01, 2011 9:57 pm

Re: Warrick

Postby nhm » Thu Dec 08, 2011 1:12 pm

I tried downloading and running Warrick but, as stated, it no longer works with the Internet Archive. I stopped playing with it pretty quickly, as I didn't want Google to ban my server's IP if I happened to get it working without IA. It might be worth looking into the technique they use here (ignore the spammy-sounding URL):

http://www.startuploans.org/archive-recovery/

Not sure I'll have time to look into this before the next version of Warrick is released, but I thought I'd mention it if anyone else has free time.

-nhm

davepermen
Posts: 48
Joined: Fri Dec 02, 2011 12:21 pm

Re: Warrick

Postby davepermen » Fri Dec 09, 2011 5:15 am

Just in case: does Bing have a web archive, if we mess up the Google one?

