Hi,
I have robots.txt file for a domain. The robots.txt file contains 230 Disallow statements, which are all valid syntactically. Googlebot routinely reads this file, and WMT indicates that the processing of it is "successful".
My problem is that an entry that was in an old version of robots.txt several months back, is getting blocked, when in fact, I do not want it to get blocked.
For whatever reason, it seems like this old version of robots.txt is actually being used by Google, despite the fact that I've made many changes to it over the last month, and it has been spidered by Google.
Is there a standard period of time that typically needs to elapse, before a new version of robots.txt becomes the defacto standard for the site ? Is there something that I can do to force Google to use this new version ?
Thanks in advance !
SEO Class in Chicago, IL
Learn How To Optimize Your Website on July 26, 2013
Looking for personalized in-depth SEO training among your peers?
High Rankings is offering a 1-day customized SEO training class in Chicago. Class size is limited so please sign-up now if you want in!
Are you a Google Analytics enthusiast?
Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE!

www.CustomReportSharing.com
From the folks who brought you High Rankings!
More SEO Content
International SEM | Social Media | Search Friendly Design | SEO | Paid Search / PPC | Seminars | Forum Threads | Q&A | Copywriting | Keyword Research | Web Analytics / Conversions | Blogging | Dynamic Sites | Linking | SEO Services | Site Architecture | Search Engine Spam | Wrap-ups | Business Issues | HRA Questions | Online Courses
New Robots.txt File Not Being Used
Started by
doughayman
, May 07 2010 05:30 PM
3 replies to this topic
#1
Posted 07 May 2010 - 05:30 PM
#2
Posted 07 May 2010 - 09:05 PM
So, you checked GWT and they are still showing as "Restricted by robots.txt"?
#3
Posted 09 May 2010 - 09:01 PM
So, you checked GWT and they are still showing as "Restricted by robots.txt"?
Yes Scottie, precisely. Either the reporting in GWT is erroneous, or for some reason they have my old robots.txt file in cache, and despite downloading the new robots.txt, they are still not using it. Perplexing, to say the least.
#4
Posted 10 May 2010 - 12:40 AM
When you say that WMT is reporting the file as restricted, I take it you mean that it's listed as restricted under Diagnostics > Crawl Errors.
Have you tried using the Test Robots.txt tab under Site Configuration > Crawler Access? You enter the URL in question, and if it's blocked, it should report which line in the file is being read as blocking it. If Google is accessing the current robots.txt file (and you say it is) and unless you've got an error in there that's causing the file to be blocked (and you say you haven't), then I wonder if it's possible that one section of WMT would report a given URL as blocked and another section would report the opposite.
Have you tried using the Test Robots.txt tab under Site Configuration > Crawler Access? You enter the URL in question, and if it's blocked, it should report which line in the file is being read as blocking it. If Google is accessing the current robots.txt file (and you say it is) and unless you've got an error in there that's causing the file to be blocked (and you say you haven't), then I wonder if it's possible that one section of WMT would report a given URL as blocked and another section would report the opposite.
0 user(s) are reading this topic
0 members, 0 guests, 0 anonymous users










