Just to confirm - can someone please check that my robots.txt file is correct:
User-agent: *
Disallow: /images/
Disallow: /img/
> what I am trying to do is allow all bots all access but I dont want the images being archived and indexed.
Thanks - appreciate it! (I guess it is vital to make sure the robots.txt is correctly written otherwise no bot visits!!)
Are you a Google Analytics enthusiast?
Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE!

www.CustomReportSharing.com
From the folks who brought you High Rankings!
More SEO Content
International SEM | Social Media | Search Friendly Design | SEO | Paid Search / PPC | Seminars | Forum Threads | Q&A | Copywriting | Keyword Research | Web Analytics / Conversions | Blogging | Dynamic Sites | Linking | SEO Services | Site Architecture | Search Engine Spam | Wrap-ups | Business Issues | HRA Questions | Online Courses
Quick Question On Robots.txt
Started by
lister
, May 14 2009 02:06 AM
5 replies to this topic
#1
Posted 14 May 2009 - 02:06 AM
#2
Posted 14 May 2009 - 04:03 AM
it all depends if the image files are in those directories.
You are blocking a directory with that command, and *IF* the spider honours the robots.txt protocol then it won't index anything in those folders, it has nothing to do with file type.
you can check out the syntax here http://www.robotstxt.org/ , G! also have a checker if you have a GWMT account.
You are blocking a directory with that command, and *IF* the spider honours the robots.txt protocol then it won't index anything in those folders, it has nothing to do with file type.
you can check out the syntax here http://www.robotstxt.org/ , G! also have a checker if you have a GWMT account.
#3
Posted 14 May 2009 - 07:28 AM
it all depends if the image files are in those directories.
You are blocking a directory with that command, and *IF* the spider honours the robots.txt protocol then it won't index anything in those folders, it has nothing to do with file type.
you can check out the syntax here http://www.robotstxt.org/ , G! also have a checker if you have a GWMT account.
You are blocking a directory with that command, and *IF* the spider honours the robots.txt protocol then it won't index anything in those folders, it has nothing to do with file type.
you can check out the syntax here http://www.robotstxt.org/ , G! also have a checker if you have a GWMT account.
perhaps it is better to allow the bots to do exactly what they want - indeed, is there any need for the robots.txt at all?
#4
Posted 14 May 2009 - 08:57 AM
well if you don't want your website images showing up on google image search, then it's a good idea to block the images folder.
there is only a need IMO for a robots.txt file *IF* you are trying to block specific content/folders from being indexed, otherwise there is no need for one.
there is only a need IMO for a robots.txt file *IF* you are trying to block specific content/folders from being indexed, otherwise there is no need for one.
#5
Posted 20 May 2009 - 10:08 AM
useful for wildcard blocks though, with all those pesky duplicated sortby? pages..
0 user(s) are reading this topic
0 members, 0 guests, 0 anonymous users









