| Important Announcement: ***Need an Affordable SEO Website Review?*** |
![]() ![]() |
Jun 3 2005, 03:04 AM
Post
#1
|
|
![]() Hong Kong SEO ![]() ![]() ![]() ![]() ![]() ![]() Group: Active Members Posts: 910 Joined: 31-May 05 User's local time: Feb 10 2010, 04:17 AM From: Hong Kong, China Member No.: 7,607 |
Does anyone had a look at Google Sitemaps?
www.google.com/webmasters/sitemaps Probably a good idea so we can better monitor our sites on whether they are crawled by the search engines successfully or not. Any thoughts? |
|
|
|
Jun 3 2005, 07:29 AM
Post
#2
|
|
![]() Convert Me! Group: Admin Posts: 17,379 Joined: 17-August 03 User's local time: Feb 9 2010, 02:17 PM Member No.: 551 |
It's a pretty cool idea, especially for dynamic sites that are having difficulty getting some of their inner pages pages crawled isn't it?
Never tried it myself, but I may when I release my next new site in a few weeks just to see if it makes a difference in how quickly some of those deeper pages get spidered. |
|
|
|
Jun 3 2005, 07:52 AM
Post
#3
|
|
![]() HR 6 ![]() ![]() ![]() ![]() ![]() ![]() Group: Active Members Posts: 850 Joined: 4-May 04 User's local time: Feb 9 2010, 04:17 PM From: Atlanta, GA Member No.: 3,454 |
Wow, that's interesting. Any idea how long it's been around?
|
|
|
|
Jun 3 2005, 07:53 AM
Post
#4
|
|
![]() High Rankings Advisor Group: Admin Posts: 29,201 Joined: 21-July 03 User's local time: Feb 9 2010, 03:17 PM From: Ashland, MA Member No.: 2 |
This is VERY interesting:
QUOTE 7. I have no easy way to extract URLs from my database of dynamic URLs. How can I generate a Sitemap? You can use any reasonably large access log (i.e., an Apache log) to submit your URLs. The Sitemap Generator allows you to generate a Sitemap from a list of URLs, from your access logs, or by pointing to a directory path hosting static files corresponding to URLs. From Google Sitemap Help |
|
|
|
Jun 3 2005, 08:38 AM
Post
#5
|
|
![]() HR 10 Group: Moderator Posts: 7,489 Joined: 24-July 03 User's local time: Feb 9 2010, 03:17 PM From: Somerville, MA Member No.: 22 |
Would anyone not say that this is at least a step in the direction of Google starting to accept trusted feeds?
|
|
|
|
Jun 3 2005, 09:36 AM
Post
#6
|
|
![]() High Rankings Advisor Group: Admin Posts: 29,201 Joined: 21-July 03 User's local time: Feb 9 2010, 03:17 PM From: Ashland, MA Member No.: 2 |
It absolutely IS, Bob.
They claim they will always be free though, so it isn't paid-inclusion. But this will go right along with the NPR stuff, etc. I think it's a great thing, and a great way to get stuff indexed that would otherwise be part of the invisible web. Google is smart to want to index whatever they can get their hands on. (IMG:http://www.highrankings.com/forum/style_emoticons/default/cheers.gif) |
|
|
|
Jun 3 2005, 10:00 AM
Post
#7
|
|
![]() HR 5 ![]() ![]() ![]() ![]() ![]() Group: Active Members Posts: 300 Joined: 24-July 03 User's local time: Feb 9 2010, 03:17 PM From: Friendswood, TX Member No.: 21 |
QUOTE(Jill @ Jun 3 2005, 08:36 AM) I think it's a great thing, and a great way to get stuff indexed that would otherwise be part of the invisible web. I always understood the term 'invisible web' to mean content that was password protected or disallowed from spiders via robots.txt. I didn't see where this XML sitemap info indicated that Google would crawl or index content that was password protected or disallowed by robots.txt. |
|
|
|
Jun 3 2005, 10:01 AM
Post
#8
|
|
![]() The modem is the message. ![]() ![]() ![]() ![]() ![]() ![]() Group: Active Members Posts: 558 Joined: 21-July 03 User's local time: Feb 9 2010, 05:17 PM From: Canton, OH Member No.: 4 |
This is a huge step forward - yes I believe it is essentially a trusted feed if they are allowing an apache log to generate URL paths. This breaks open thousands, if not millions more pages into the Google index.
Of course, it does rely on web managers to know this exists. Are we going to see another round of "submitting my site" type of questions as a result of this? (IMG:http://www.highrankings.com/forum/style_emoticons/default/hmm.gif) |
|
|
|
Jun 3 2005, 10:28 AM
Post
#9
|
|
![]() HR 4 ![]() ![]() ![]() ![]() Group: Active Members Posts: 173 Joined: 2-October 03 User's local time: Feb 9 2010, 03:17 PM From: Marlborough, MA Member No.: 963 |
Here it is in the news: http://news.com.com/Google+tests+tool+to+a...30744&subj=news Just got it off an RSS feed.
|
|
|
|
Jun 3 2005, 10:33 AM
Post
#10
|
|
![]() HR 10 Group: Moderator Posts: 7,489 Joined: 24-July 03 User's local time: Feb 9 2010, 03:17 PM From: Somerville, MA Member No.: 22 |
I wonder how common it is for a server to have Python installed, since it doesn't look like this will work without it.
|
|
|
|
Jun 3 2005, 11:02 AM
Post
#11
|
|
![]() Work is Fun Group: Moderator Posts: 4,642 Joined: 31-July 03 User's local time: Feb 9 2010, 04:17 PM From: Neath, South Wales, UK Member No.: 110 |
The snake is back then (IMG:http://www.highrankings.com/forum/style_emoticons/default/smile.gif)
This is very good news indeed, and is yet another step forward for google. This means that all those great content sites on forums and other dynamic sites can now carry google adsense ads. It also means that G has upped the anti with regard session id's etc. All geat stuff. Interesting to not (as an aside) that Googleguy has now out and out said that the use of '&ID' in any url effectively renders it useless. |
|
|
|
Jun 3 2005, 11:09 AM
Post
#12
|
|
![]() High Rankings Advisor Group: Admin Posts: 29,201 Joined: 21-July 03 User's local time: Feb 9 2010, 03:17 PM From: Ashland, MA Member No.: 2 |
QUOTE I always understood the term 'invisible web' to mean content that was password protected or disallowed from spiders via robots.txt. No, although that may be part of it (but I don't even think it is) the invisible web is often content that the engines simply can't index for one reason or another, very often because it's all contained in a database. I don't think that's the only stuff, but that's a big part of the invisible web. I'm pretty sure that Chris Sherman has written a lot on this subject, and in fact I believe one of his books is in regards to the invisible web. |
|
|
|
Jun 3 2005, 11:37 AM
Post
#13
|
|
![]() HR 3 ![]() ![]() ![]() Group: Active Members Posts: 64 Joined: 8-September 04 User's local time: Feb 9 2010, 12:17 PM From: San Diego, California Member No.: 4,979 |
I've been trying to get in all morning to have a look. Not a chance! LOL
|
|
|
|
Jun 3 2005, 12:01 PM
Post
#14
|
|
![]() HR 9 Group: Moderator Posts: 4,356 Joined: 13-August 03 User's local time: Feb 9 2010, 08:17 PM From: Blackpool UK Member No.: 492 |
|
|
|
|
Jun 3 2005, 02:58 PM
Post
#15
|
|
![]() HR 5 ![]() ![]() ![]() ![]() ![]() Group: Active Members Posts: 300 Joined: 24-July 03 User's local time: Feb 9 2010, 03:17 PM From: Friendswood, TX Member No.: 21 |
QUOTE(Jill @ Jun 3 2005, 10:09 AM) No, ... the invisible web is often content that the engines simply can't index for one reason or another, ... That's what I meant - content that they can't index (because it is restricted in some way) as opposed to just won't index or haven't found. Your original post made it seem like restricted content might be indexed under this new scheme, but I read elsewhere earlier today that the Google Engineer confirmed that Google will respect robots.txt over the XML. I just wanted to draw a clarification. |
|
|
|
![]() ![]() ![]() |
|
Lo-Fi Version | Time is now: 9th February 2010 - 03:17 PM |