[Az-Geocaching] Suggestion for statistics

Brian LaFrance listserv@azgeocaching.com
Fri, 5 Sep 2003 16:12:15 -0700


For the AZGeocaching gods:

If/When things get back to semi-normal, how open are you Snaptek guys to
allowing some of us that know what we are doing create tools/toys based
on the data you have retreived?  It would be nice to get some data in
XML format for personal use, as Jerry had mentioned earlier.  Also, are
there any tasks that volunteers could take on with the site to help take
some of the load off Snaptek?

Brian
Team AZEvil




-----Original Message-----
From: az-geocaching-admin@listserv.azgeocaching.com
[mailto:az-geocaching-admin@listserv.azgeocaching.com] On Behalf Of
Jason Poulter
Sent: Friday, September 05, 2003 2:43 PM
To: listserv@azgeocaching.com
Subject: Re: [Az-Geocaching] Suggestion for statistics


just got an email reply back from geocaching.com about the blocked 
ip.... he has been busy and neglecting emails... but he did get the ip 
UNBLOCKED so we can probably resume crawling  normally....

he did metion that there are scripts inplace to detect ips /crawlers 
that are beating the system to hard at anyone time.... so if  i throttle

back the crawling we should be ok.... so i have a little fine tuning to 
do until we get back to normal...

please be patient!!!!

thanks

Jason
Snaptek
AzGeocaching.com



Team Cache-Quest wrote:

>Most "crawlers" will obey a site's ROBOTS.TXT file.
>
>GEOCACHING.COM's ROBOTS.TXT disallows almost all crawling.  Here is a 
>copy...
>
>User-agent: *
># Disallow all unnecessary content from the search
>Disallow: /iis/*
>Disallow: /login/*
>Disallow: /admin/*
>Disallow: /map/*
>Disallow: /email/*
>Disallow: /my/*
>Disallow: /seek/nearest.asp*
>Disallow: /seek/nearest_cache.asp*
>Disallow: /seek/waypoint.asp*
>Disallow: /bait.asp
>
>Because of what Snaptek needs for azgeocaching.com, they have to ignore

>the robots.txt file.  Unfortunately most web sites are smart enough now

>days to recognize unauthorized crawling and will automatically prevent 
>it.  Some consider it a form of hacking because of what it does to the 
>server.
>
>You can't really blame geocaching.com.  They probably have many folks 
>trying to crawl the site and because of international interest there 
>really isn't a good time to do it.  What they really need is a decent 
>interface to allow folks to get bulk information.  The query generator 
>was a step in the right direction, but it falls short.  I'd like to 
>simply get a list of all the caches I've found in XML format to do with

>what I want, but the Query Generator can't even do that.
>
>Jerry (Cache-Quest)
>
>----- Original Message -----
>From: "Regan L Smith" <buggers@mindspring.com>
>To: <listserv@azgeocaching.com>
>Sent: Friday, September 05, 2003 11:38 AM
>Subject: Re: [Az-Geocaching] Suggestion for statistics
>
>
>  
>
>>There has to be other "crawlers" out there how are they dealing with 
>>it
>>    
>>
>and
>  
>
>>is there anything that we the beneficiaries of your fantastic work 
>>do???
>>----- Original Message ----- 
>>From: "Regan L Smith" <buggers@mindspring.com>
>>To: <listserv@azgeocaching.com>
>>Sent: Friday, September 05, 2003 10:42 AM
>>Subject: Re: [Az-Geocaching] Suggestion for statistics
>>
>>
>>    
>>
>>>do they consider AZGeocaching a threat? I don't understand you guys 
>>>make their stuff much easier to use, and understand.
>>>      
>>>
>
>____________________________________________________________
>Az-Geocaching mailing list listserv@azgeocaching.com
>To edit your setting, subscribe or unsubscribe visit: 
>http://listserv.azgeocaching.com/mailman/listinfo/az-geocaching
>
>Arizona's Geocaching Resource
>http://www.azgeocaching.com
>  
>


____________________________________________________________
Az-Geocaching mailing list listserv@azgeocaching.com
To edit your setting, subscribe or unsubscribe visit:
http://listserv.azgeocaching.com/mailman/listinfo/az-geocaching

Arizona's Geocaching Resource
http://www.azgeocaching.com