Jump to content

  • Log in with Facebook Log in with Twitter Log In with Google      Sign In   
  • Create Account

Subscribe to HRA Now!

 



SEO Class in Chicago, IL

Learn How To Optimize Your Website on July 26, 2013


Looking for personalized in-depth SEO training among your peers?



High Rankings is offering a 1-day customized SEO training class in Chicago. Class size is limited so please sign-up now if you want in!



 


Are you a Google Analytics enthusiast?

Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE! 

 



 

 www.CustomReportSharing.com 

From the folks who brought you High Rankings!



Photo

Web Stats Analysis


  • Please log in to reply
7 replies to this topic

#1 joker

joker

    HR 3

  • Active Members
  • PipPipPip
  • 64 posts

Posted 08 October 2003 - 02:09 PM

Hey all --

Just a quick question for you all, I'm not certain if this is the best location or not, so my apologies if this question shoudl appear in another forum.

Specifically, what I wanted to know is if anyone knew of a good list of the IP addresses of spiders: I have been hoping that someone may have put together a list that I could place in my web tracking software filter to help eliminate the lion share of automated visits to my site: does anyone have any suggestions?

Thanks!

#2 Matt B

Matt B

    The modem is the message.

  • Active Members
  • PipPipPipPipPipPip
  • 558 posts
  • Location:Canton, OH

Posted 08 October 2003 - 02:26 PM

a list that I could place in my web tracking software filter to help eliminate the lion share of automated visits to my site

Adding the IP address of spiders to a stats tracking program won't eliminate the actual visits. If that's what you are trying to do. Are you simply trying to filter the bot visits from the rest of the stats? If so, what tracking program are you currently using? I guess I'm not sure I understand the question, but most tracking software should automatically filter bot visits from human visits.

IMO, tracking bot IP's can be a full-time excersize in futility. Time that is better spent working on the site.

#3 Ron Carnell

Ron Carnell

    HR 6

  • Moderator
  • 959 posts
  • Location:Michigan USA

Posted 08 October 2003 - 03:18 PM

I agree with Matt. Whether you want to track spiders in your Stats Analysis program or Allow/Deny bad 'bots in your .htaccess file, you'll likely find the user-agent string much more useful than an IP address. Those are readily available and change very infrequently.

#4 joker

joker

    HR 3

  • Active Members
  • PipPipPip
  • 64 posts

Posted 08 October 2003 - 04:10 PM

Sorry for the confusion:

I monitor the traffic to my site using a combination of deepmetrix, Hi-Stats, and a home-grown Access database to monitor my website traffic.

I don't want to deny the bots from my site, rather, I just want to filter out the records of their visits when I review the website data. The only way I can do that is by IP# and/or IP range.

I have found a few lists on various websites that list the various robots as well as their IP ranges, but was wondering if there were any lists that people used and/or recommended.

thanks again

#5 Ron Carnell

Ron Carnell

    HR 6

  • Moderator
  • 959 posts
  • Location:Michigan USA

Posted 08 October 2003 - 05:58 PM

Assuming the user-agent string is making it to your raw logs, you should be able to filter on that?

If you really want IP addresses, instead, here's how to find it (and an indication of what you'll be facing). The only application where the IP address is absolutely required is for cloaking, so a search on "spider cloaking" (no quotes) should yield you some good results.

What you likely will discover is that most of the sites that sell such scripts also provide frequent updates to their IP list, because without such updates cloaking is useless. That the IP addresses change fairly frequently is a given, else they couldn't make money with updates.

If you're using a custom database (Access), you could also program your own search utility. Any time the robots.txt file is accessed, it's a pretty safe bet you got yourself a spider. Create a table to store that information, and you'll soon have a personalized list of IP addresses.

#6 joker

joker

    HR 3

  • Active Members
  • PipPipPip
  • 64 posts

Posted 09 October 2003 - 09:07 AM

thanks for the information guys -- just out of curiousity, why is it that the user-agent is so much more stable than the IP address?

#7 Ron Carnell

Ron Carnell

    HR 6

  • Moderator
  • 959 posts
  • Location:Michigan USA

Posted 09 October 2003 - 09:40 AM

An IP address is like a vehicle. You may use a number of different cars during the course of a month, but most people, being honest and having nothing to hide, don't change their name. ('Course, the ones not so honest are the ones we worry about, so the IP still has its uses.) :rolleyes:

#8 meta

meta

    HR 5

  • Active Members
  • PipPipPipPipPip
  • 301 posts
  • Location:Chicago

Posted 09 October 2003 - 04:15 PM

You might check your options for log analysis software. Some products have features to help filter this kind of information out of your logs.




0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users