Jump to content

  • Log in with Facebook Log in with Twitter Log In with Google      Sign In   
  • Create Account

Subscribe to HRA Now!

 



SEO Class in Chicago, IL

Learn How To Optimize Your Website on July 26, 2013


Looking for personalized in-depth SEO training among your peers?



High Rankings is offering a 1-day customized SEO training class in Chicago. Class size is limited so please sign-up now if you want in!



 


Are you a Google Analytics enthusiast?

Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE! 

 



 

 www.CustomReportSharing.com 

From the folks who brought you High Rankings!



Photo

Bad Spiders


  • Please log in to reply
3 replies to this topic

#1 falcon

falcon

    HR 2

  • Active Members
  • PipPip
  • 17 posts
  • Location:Pennsylvania

Posted 17 March 2004 - 01:56 PM

Hi.

I am still learning about SEO/SEM and have a question. Until recently, I was naive and did not know that there were such things as bad spiders that ignored robots.txt.

When I learned of these little beasties, I alerted my company's Networking department. They plan to put NT security or logon pages in place. Will this prevent the spiders from accessing the directories/pages? Or, should any information we don't want spidered be moved off a machine that's accessible to the outside world?

Thanks.

Falcon

PS I did search for information on this topic, but didn't find anything. If this has been covered elsewhere, please just point me in the right direction.

#2 bobsledbob

bobsledbob

    HR 3

  • Active Members
  • PipPipPip
  • 102 posts
  • Location:Ogden, Utah, USA

Posted 17 March 2004 - 06:27 PM

This is probably more a "technobabble" kind of discussion, rather than search engine friendly design.

However, yes adding a login page or "NT security" (basic authentication) to your web pages will prevent a bad robot from gaining access.

It would also obviously prevent anyone else from getting there so long as they don't have an appropriate username and password.

If you have super sensitive information, get it off the webserver. An IIS webserver is not necessarily the safest thing to have exposed to the internet. :cheers: But really, it doesn't matter which kind of server, if it's sensitive, get it off the network period.

For ways to deter bad robots, you might want to read this topic which somewhat addresses it: http://www.highranki...?showtopic=4254

#3 Scottie

Scottie

    Psycho Mom

  • Admin
  • 6,294 posts
  • Location:Columbia, SC

Posted 17 March 2004 - 11:53 PM

Is there a particular kind of beastie you are trying to block? The more you know about it, the more effective you can be at blocking it.

There are many techniques to try and foil e-mail harvesters, ranging from obscuring the e-mail addresses to feeding the bots a page full of spam-reporting e-mails.

For the most part, it's not worth wasting your time trying to thwart them as a group. If you see in your logs that some are eating up your bandwidth on a regular basis or doing other misbehaving, you can block them at the IP level. If they are really bad beasties, they won't pay attention to your robots.txt anyway.

#4 falcon

falcon

    HR 2

  • Active Members
  • PipPip
  • 17 posts
  • Location:Pennsylvania

Posted 18 March 2004 - 09:45 AM

Thanks very much for your replies.

No, there's not any one in particular that I've seen, but I will look into that.

I will speak to our Networking folks about your feedback.

Thanks again.

Falcon




0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users