Googlebot Found An Extremely High Number Of Urls On Your Site
Started by Andrew Gates, Oct 30 2011 11:33 PM
7 replies to this topic
#1
Posted 30 October 2011 - 11:33 PM
I receive this message in Google Webmaster Tools every 2-3 months:
"Googlebot found an extremely high number of URLs on your site"
We have many product pages, and some of them are blocked by our robots.txt file.
The problem is that the example URLs Google Webmaster Tools shows ("Here's a list of sample URLs with potential problems") are ones that are blocked by robots.txt.
Do you know why they do it? Thanks.
#2
Posted 31 October 2011 - 08:45 AM
Are you positive they're blocked via robots.txt?
If so, then I wouldn't worry about it. But I would double and triple check that you've done that correctly.
#3
Posted 31 October 2011 - 02:17 PM
robots.txt only tells Googlebot not to fetch those pages. It FINDS them through links on other pages, or through an XML sitemap. Make sure your XML sitemap doesn't include pages you're telling Googlebot and/or Bingbot not to crawl. It's okay to have navigational links that point to blocked pages.
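One way to run the sitemap check described above is sketched below. It is only an illustration: the robots.txt rules, the sitemap contents, the example.com URLs and the helper name `blocked_sitemap_urls` are all made up, and Python's standard `urllib.robotparser` does plain prefix matching, so it may not agree with Googlebot on rules that use wildcards.

```python
from urllib.robotparser import RobotFileParser
import xml.etree.ElementTree as ET

# Namespace used by standard XML sitemaps.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def blocked_sitemap_urls(robots_txt, sitemap_xml, user_agent="Googlebot"):
    """Return the sitemap URLs that robots.txt disallows for user_agent.

    Hypothetical helper for illustration; both inputs are plain strings.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())

    root = ET.fromstring(sitemap_xml)
    urls = [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]
    # Anything listed here is both submitted in the sitemap and blocked.
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


# Made-up sample data, not taken from the original poster's site.
ROBOTS = """User-agent: *
Disallow: /cart/
"""

SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.example.com/products/widget</loc></url>
  <url><loc>http://www.example.com/cart/add?id=1</loc></url>
</urlset>"""

for url in blocked_sitemap_urls(ROBOTS, SITEMAP):
    print("In sitemap but blocked by robots.txt:", url)
```

Any URL this prints is being offered to the crawlers in the sitemap while robots.txt tells them not to fetch it, which is the mismatch to clean up.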
#4
Posted 31 October 2011 - 05:22 PM
I was about to post the same as MM. Make sure you are not generating a list of pages someplace on your site. Many shopping carts will do this. It seems like Google is gearing up for a deep crawl every couple of months, and then generating this error.
Are you by any chance submitting these URLs to Google Products? That will do it.
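If it helps to see where a cart is generating page lists, here is a rough sketch that groups a set of URLs (for example, the sample URLs from the Webmaster Tools message) by path and counts how many variants each path has. The URLs and the function name `url_variants_per_path` are hypothetical, and it assumes the extra URLs differ only in their query strings, which may not be true for every cart.

```python
from collections import Counter
from urllib.parse import urlsplit


def url_variants_per_path(urls):
    """Count distinct URLs per (host, path), ignoring the query string."""
    counts = Counter()
    for url in set(urls):  # de-duplicate exact repeats first
        parts = urlsplit(url)
        counts[(parts.netloc, parts.path)] += 1
    return counts


# Made-up sample data for illustration only.
SAMPLE_URLS = [
    "http://www.example.com/shop/list?sort=price",
    "http://www.example.com/shop/list?sort=name",
    "http://www.example.com/shop/list?sort=name&page=2",
    "http://www.example.com/about",
]

# Paths with the most variants are the likeliest generated lists.
for (host, path), n in url_variants_per_path(SAMPLE_URLS).most_common():
    print(host + path, n)
```

A path that shows up with a large count is a good place to start looking for the script that is producing the list.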
#5
Posted 31 October 2011 - 09:30 PM
QUOTE
Are you positive they're blocked via robots.txt?
If so, then I wouldn't worry about it. But I would double and triple check that you've done that correctly.
I checked every page in Webmaster Tools (Crawler access), which shows whether a URL is blocked or not. It says that the pages are blocked.
This is why I'm surprised.
#6
Posted 31 October 2011 - 09:33 PM
QUOTE
robots.txt only tells Googlebot not to fetch those pages. It FINDS them through links on other pages, or through an XML sitemap. Make sure your XML sitemap doesn't include pages you're telling Googlebot and/or Bingbot not to crawl. It's okay to have navigational links that point to blocked pages.
We don't have such links in our XML sitemap, but there are many links from other pages on our web site. Though all links to those pages are "nofollow" links.
Why do they still keep such links in their database?
#7
Posted 01 November 2011 - 12:30 PM
I have seen Google keep links for years, but maybe that's only because it still finds them in rare, hard-to-find, obscure documents.
You should be careful about using "Nofollow" so extensively across your Website, however. It is essentially spreading your PageRank across the Web.
#8
Posted 02 November 2011 - 07:59 AM
QUOTE
but there are many links from other pages on our web site. Though all links to those pages are "nofollow" links.
Why???