Anyhow, while in Notepad doing a "find next" to locate google searches and bots, I noticed that several times, the bot appeared within fractions of a minute to spider a page a searcher had just clicked through to from the SERPs.
It could've been coincidence-- it was a high-traffic time, and Google was randomly (to me, it seemed) spidering great swathes of my site.
Google searcher came to /foo/bar.html and loaded the entire page, images, .CSS, .JS, and all. (08 Sept 16:06:25)
16:07:13, 18.104.22.168 Googlebot/2.1 shows up to get foo/bar.html, then at
16:11:03 bot 22.214.171.124 got robots.txt, then at
16:11:05 126.96.36.199 got fred/barney.html, then at
16:14:12 bot #188.8.131.52 got bar/foo/ HTTP/1.0.
So, it got the specific page the searcher had clicked on, then robots.txt, an unrelated page, and then the first page's entire directory.
I don't know what that means, but I thought I'd point it out. Isn't that interesting? I didn't know Google did that, and still don't have proof.
Googlebot came back repeatedly over the next ten minutes and spidered the first directory, the second directory, and a number of unrelated pages.
It also repeatedly got robots.txt, my .css file, and two different .js files once each.
But, again, this is just one inexperienced webmaster-newbie with Notepad and the Find function and a PR 5 site, so...
BTW, I'm not entirely sure what HTTP/1.0 and HTTP/1.1 actually mean, but it seems that it indicates that the visitor is examining the contents of a directory to see what to get? I'm not sure, so any enlightenment is welcome.
<edit-- spelling. I used the spellchecker and it didn't know Googlebot, spidered, HTTP, html, webmaster, I'm, or newbie. Hmm...>
[Moved to Statistics and Tracking ROI forum. - Jill]
Edited by Jill, 09 September 2003 - 06:17 PM.