SEO Class in Chicago, IL
Learn How To Optimize Your Website on July 26, 2013
Looking for personalized in-depth SEO training among your peers?
High Rankings is offering a 1-day customized SEO training class in Chicago. Class size is limited so please sign-up now if you want in!
Are you a Google Analytics enthusiast?
Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE!

www.CustomReportSharing.com
From the folks who brought you High Rankings!
More SEO Content
International SEM | Social Media | Search Friendly Design | SEO | Paid Search / PPC | Seminars | Forum Threads | Q&A | Copywriting | Keyword Research | Web Analytics / Conversions | Blogging | Dynamic Sites | Linking | SEO Services | Site Architecture | Search Engine Spam | Wrap-ups | Business Issues | HRA Questions | Online Courses
Robots.txt Syntax For Wildcards?
Started by
Marfola
, May 12 2008 06:08 AM
4 replies to this topic
#1
Posted 12 May 2008 - 06:08 AM
I would like to exclude all pages ending in /print.html and all pages with a ? in the url string in my robots.txt file. Is the following syntax correct for Yahoo, Google and MSN?
Disallow: /*print.html$
Disallow: /*?
Disallow: /*print.html$
Disallow: /*?
#2
Posted 12 May 2008 - 07:23 AM
Though *'d wildcards aren't part of the robots.txt specification, yes those examples will work for the search engines you mentioned. They do support such wildcards.
#3
Posted 13 May 2008 - 08:29 AM
Though *'d wildcards aren't part of the robots.txt specification, yes those examples will work for the search engines you mentioned. They do support such wildcards.
Thanks.
Do I need to have two seperate sets of directions in my robots.txt, one for google, yahoo and msn and a second for all others?
#4
Posted 13 May 2008 - 09:06 AM
Depends upon what you want.
If you want those instructions to apply to only certain spiders, then you need to set that in the User-agent field. If not, you can use a User-agent: * to cover them all.
Those spiders who don't understand/use wildcards will simply ignore those lines.
If you want those instructions to apply to only certain spiders, then you need to set that in the User-agent field. If not, you can use a User-agent: * to cover them all.
Those spiders who don't understand/use wildcards will simply ignore those lines.
#5
Posted 14 May 2008 - 02:45 AM
Thanks Randy. That is exactly what I wanted to know. ‘Those spiders who don't understand/use wildcards will simply ignore those lines.’
0 user(s) are reading this topic
0 members, 0 guests, 0 anonymous users









