Jump to content

  • Log in with Facebook Log in with Twitter Log In with Google      Sign In   
  • Create Account

Subscribe to HRA Now!

 



SEO Class in Chicago, IL

Learn How To Optimize Your Website on July 26, 2013


Looking for personalized in-depth SEO training among your peers?



High Rankings is offering a 1-day customized SEO training class in Chicago. Class size is limited so please sign-up now if you want in!



 


Are you a Google Analytics enthusiast?

Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE! 

 



 

 www.CustomReportSharing.com 

From the folks who brought you High Rankings!



Photo

Is Robot.txt Always Obeyed By Spiders?


  • Please log in to reply
8 replies to this topic

#1 mitash

mitash

    HR 2

  • Active Members
  • PipPip
  • 43 posts
  • Location:Australia

Posted 17 December 2006 - 11:19 PM

Could the experts clear my doubt pls...

Is it true that spiders do not always obey robots.txt..?

If that's true has someone done any testing?

Or how can we proove that robots.txt is not always obeyed by spiders / bots ..?

Thanks

#2 Jill

Jill

    High Rankings Advisor

  • Admin
  • 32,372 posts

Posted 17 December 2006 - 11:23 PM

It depends on the spider. The major search engines obey it.

#3 mitash

mitash

    HR 2

  • Active Members
  • PipPip
  • 43 posts
  • Location:Australia

Posted 17 December 2006 - 11:32 PM

Ok lets say google, msn and yahoo for example...

Would spiders from above engines ignore the robots.txt file...

Do you know what other engines will igonre the robots.txt...?

#4 projectphp

projectphp

    Lost in Translation

  • Moderator
  • 2,203 posts
  • Location:Sydney Australia

Posted 18 December 2006 - 12:08 AM

All those obey it, it just comes down to whether you get the syntax right.

Try signing up for sitemaps, I am pretty sure they have a sitemap checker.

#5 torka

torka

    Vintage Babe

  • Moderator
  • 4,408 posts
  • Location:Triangle area, NC, USA, Earth (usually)

Posted 18 December 2006 - 12:28 AM

Yes, at Google Webmaster Tools (formerly Google Sitemaps), they do offer a robots.txt checker.

--Torka mf_prop.gif

#6 Alan Perkins

Alan Perkins

    Token male admin

  • Admin
  • 1,566 posts
  • Location:UK

Posted 18 December 2006 - 04:56 AM

QUOTE(mitash)
Ok lets say google, msn and yahoo for example...

Would spiders from above engines ignore the robots.txt file...
No, they wouldn't ignore it ... but they may not treat it as you would expect.
  • Google may index URLs that are protected by robots.txt, without reading or indexing the content at those URLs. This leads to the so-called "PIPs" (partially indexed pages).
  • Google's AdwordsBot only obeys instructions directed specifically at it. It does not obey "User-Agent: *". This is only an issue if you you use Adwords on the domain.


#7 mitash

mitash

    HR 2

  • Active Members
  • PipPip
  • 43 posts
  • Location:Australia

Posted 18 December 2006 - 08:33 AM

QUOTE(Alan Perkins @ Dec 18 2006, 08:56 PM) View Post
No, they wouldn't ignore it ... but they may not treat it as you would expect.
  • Google may index URLs that are protected by robots.txt, without reading or indexing the content at those URLs. This leads to the so-called "PIPs" (partially indexed pages).
  • Google's AdwordsBot only obeys instructions directed specifically at it. It does not obey "User-Agent: *". This is only an issue if you you use Adwords on the domain.



Perhaps I should have said that do spiders ignore some of the statements made in the robots.txt..?

You said Google may index URL's that are protected by robots.txt without reading or indexing the content at those URLs.

How can we show that the above happens..?

Thanks

#8 Jill

Jill

    High Rankings Advisor

  • Admin
  • 32,372 posts

Posted 18 December 2006 - 08:35 AM

Do a search for the particular URL in question that you have blocked via robots.txt. If Google has done that, you'll still see it in the results, but without any title or description, just the URL.

#9 mitash

mitash

    HR 2

  • Active Members
  • PipPip
  • 43 posts
  • Location:Australia

Posted 18 December 2006 - 08:37 AM

Excellent,
Thanks for the quick reply Jill.

I will do that.

Thanks for your help.

Cheers




0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users