Jump to content

  • Log in with Facebook Log in with Twitter Log In with Google      Sign In   
  • Create Account

Subscribe to HRA Now!

 



SEO Class in Chicago, IL

Learn How To Optimize Your Website on July 26, 2013


Looking for personalized in-depth SEO training among your peers?



High Rankings is offering a 1-day customized SEO training class in Chicago. Class size is limited so please sign-up now if you want in!



 


Are you a Google Analytics enthusiast?

Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE! 

 



 

 www.CustomReportSharing.com 

From the folks who brought you High Rankings!



Photo

Robots.txt Syntax For Wildcards?


  • Please log in to reply
4 replies to this topic

#1 Marfola

Marfola

    HR 1

  • Members
  • Pip
  • 4 posts

Posted 12 May 2008 - 06:08 AM

I would like to exclude all pages ending in /print.html and all pages with a ? in the url string in my robots.txt file. Is the following syntax correct for Yahoo, Google and MSN?

Disallow: /*print.html$
Disallow: /*?

#2 Randy

Randy

    Convert Me!

  • Moderator
  • 17,540 posts

Posted 12 May 2008 - 07:23 AM

Though *'d wildcards aren't part of the robots.txt specification, yes those examples will work for the search engines you mentioned. They do support such wildcards.

#3 Marfola

Marfola

    HR 1

  • Members
  • Pip
  • 4 posts

Posted 13 May 2008 - 08:29 AM

QUOTE(Randy @ May 12 2008, 01:23 PM) View Post
Though *'d wildcards aren't part of the robots.txt specification, yes those examples will work for the search engines you mentioned. They do support such wildcards.

Thanks.

Do I need to have two seperate sets of directions in my robots.txt, one for google, yahoo and msn and a second for all others?




#4 Randy

Randy

    Convert Me!

  • Moderator
  • 17,540 posts

Posted 13 May 2008 - 09:06 AM

Depends upon what you want.

If you want those instructions to apply to only certain spiders, then you need to set that in the User-agent field. If not, you can use a User-agent: * to cover them all.

Those spiders who don't understand/use wildcards will simply ignore those lines.

#5 Marfola

Marfola

    HR 1

  • Members
  • Pip
  • 4 posts

Posted 14 May 2008 - 02:45 AM

Thanks Randy. That is exactly what I wanted to know. ‘Those spiders who don't understand/use wildcards will simply ignore those lines.’





0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users