Prevent SE Spider From Indexing A Website Page
#1
Posted 01 September 2003 - 01:37 AM
I am looking to prevent SE spiders from crawling and indexing one of the pages on my web site. Does anyone know of anything I can put in my HTML to do this? I ran across this: <meta name="MSSmartTagsPreventParsing" content="TRUE">
Will this prevent an SE spider from crawling/indexing a page? Any input is appreciated.
-Blue
#2
Posted 01 September 2003 - 02:51 AM
You can put a robots tag in the head section of the page you do not want spidered, like this:
<META NAME="ROBOTS" CONTENT="NOINDEX">
or there is a special tag that only works for Googlebot:
<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">
#3
Posted 01 September 2003 - 08:50 AM
Spiders have been known to ignore the Meta robots tag at times.
Jill
#4
Posted 01 September 2003 - 08:56 AM
Quote: I ran across this, <meta name="MSSmartTagsPreventParsing" content="TRUE">
Will this prevent an SE spider from crawling/indexing a page?
No. You should either use the robots meta tag (as described by Mel above) or the robots.txt protocol. Details of both can be found at http://www.robotstxt.org/
#5
Posted 01 September 2003 - 03:49 PM
<META NAME="ROBOTS" CONTENT="NOINDEX">
<META NAME="ROBOTS" CONTENT="NOFOLLOW">
<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">
Is it ok to use all three, or will it cause a problem?
#6
Posted 01 September 2003 - 04:17 PM
<meta name="robots" content="noindex, nofollow">
#7
Posted 01 September 2003 - 04:41 PM
Jill
#8
Posted 01 September 2003 - 04:48 PM
Quote: robotstxt.org was very helpful. This is what I am going to place in my header tags:
<META NAME="ROBOTS" CONTENT="NOINDEX">
<META NAME="ROBOTS" CONTENT="NOFOLLOW">
<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">
Is it ok to use all three, or will it cause a problem?
You just need this:
<META NAME="ROBOTS" CONTENT="NOINDEX,NOFOLLOW">
Forget the Googlebot tag. That does something slightly different: it allows indexing but prevents the cached page from being seen. You want to disallow indexing.
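If it helps to see why one combined tag is enough, here is a rough Python sketch of how a compliant spider might read the robots meta tags on a page. It is only an illustration: the class name RobotsMetaParser and the function robots_directives are made up for this example, and real search engines may handle the tags differently.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from every <meta name="robots"> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <META NAME=...> matches too.
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() != "robots":
            return
        # The content value is a comma-separated list such as "NOINDEX,NOFOLLOW".
        for directive in attr.get("content", "").split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.add(directive)


def robots_directives(html):
    """Return the set of robots directives found in the given HTML text."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


if __name__ == "__main__":
    separate = (
        '<META NAME="ROBOTS" CONTENT="NOINDEX">'
        '<META NAME="ROBOTS" CONTENT="NOFOLLOW">'
    )
    combined = '<META NAME="ROBOTS" CONTENT="NOINDEX,NOFOLLOW">'
    # In this sketch both forms produce the same set of directives.
    print(robots_directives(separate) == robots_directives(combined))
```

In this sketch the two separate tags and the single combined tag give the same result, which is why the combined form is the tidier choice.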
The Meta robots tag does not prevent robots from accessing your page; it prevents compliant search engines from indexing it, a subtle but important difference. Use the robots.txt file to prevent compliant robots from accessing your pages.
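To make the robots.txt side concrete, here is a small Python sketch that uses the standard library's urllib.robotparser to check a URL against a robots.txt file the way a compliant robot would. The rules, the path /private-page.html, and the www.example.com URLs are placeholders invented for this example, not anything from the original poster's site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; /private-page.html is a made-up path.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private-page.html
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if a compliant robot may fetch `url` under these rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The disallowed page is off limits to any compliant robot.
    print(is_allowed(ROBOTS_TXT, "Googlebot", "http://www.example.com/private-page.html"))
    # Other pages on the site are still allowed.
    print(is_allowed(ROBOTS_TXT, "Googlebot", "http://www.example.com/index.html"))
```

Keep in mind this only describes what a well-behaved robot does. A robot that ignores robots.txt can still request the page.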