High Rankings Search Engine Optimization ForumHigh Rankings Advisor Search Marketing Newsletter

Welcome Guest ( Log In | Register )

Important Announcement: ***Need an Affordable SEO Website Review?***
4 Pages V   1 2 3 > »   
Reply to this topicStart new topic
> Any Thoughts On Google Sitemap?, Alleges making crawling more effective
ewc21
post Jun 3 2005, 03:04 AM
Post #1


Hong Kong SEO
******

Group: Active Members
Posts: 910
Joined: 31-May 05
User's local time:
Feb 10 2010, 04:17 AM
From: Hong Kong, China
Member No.: 7,607



Does anyone had a look at Google Sitemaps?

www.google.com/webmasters/sitemaps

Probably a good idea so we can better monitor our sites on whether they are crawled by the search engines successfully or not.

Any thoughts?
Go to the top of the page
 
+Quote Post
Randy
post Jun 3 2005, 07:29 AM
Post #2


Convert Me!
Group Icon

Group: Admin
Posts: 17,379
Joined: 17-August 03
User's local time:
Feb 9 2010, 02:17 PM
Member No.: 551



It's a pretty cool idea, especially for dynamic sites that are having difficulty getting some of their inner pages pages crawled isn't it?

Never tried it myself, but I may when I release my next new site in a few weeks just to see if it makes a difference in how quickly some of those deeper pages get spidered.
Go to the top of the page
 
+Quote Post
Shane
post Jun 3 2005, 07:52 AM
Post #3


HR 6
******

Group: Active Members
Posts: 850
Joined: 4-May 04
User's local time:
Feb 9 2010, 04:17 PM
From: Atlanta, GA
Member No.: 3,454



Wow, that's interesting. Any idea how long it's been around?
Go to the top of the page
 
+Quote Post
Jill
post Jun 3 2005, 07:53 AM
Post #4


High Rankings Advisor
Group Icon

Group: Admin
Posts: 29,201
Joined: 21-July 03
User's local time:
Feb 9 2010, 03:17 PM
From: Ashland, MA
Member No.: 2



This is VERY interesting:

QUOTE
7. I have no easy way to extract URLs from my database of dynamic URLs. How can I generate a Sitemap?

You can use any reasonably large access log (i.e., an Apache log) to submit your URLs. The Sitemap Generator allows you to generate a Sitemap from a list of URLs, from your access logs, or by pointing to a directory path hosting static files corresponding to URLs.


From Google Sitemap Help
Go to the top of the page
 
+Quote Post
qwerty
post Jun 3 2005, 08:38 AM
Post #5


HR 10
Group Icon

Group: Moderator
Posts: 7,489
Joined: 24-July 03
User's local time:
Feb 9 2010, 03:17 PM
From: Somerville, MA
Member No.: 22



Would anyone not say that this is at least a step in the direction of Google starting to accept trusted feeds?
Go to the top of the page
 
+Quote Post
Jill
post Jun 3 2005, 09:36 AM
Post #6


High Rankings Advisor
Group Icon

Group: Admin
Posts: 29,201
Joined: 21-July 03
User's local time:
Feb 9 2010, 03:17 PM
From: Ashland, MA
Member No.: 2



It absolutely IS, Bob.

They claim they will always be free though, so it isn't paid-inclusion. But this will go right along with the NPR stuff, etc.

I think it's a great thing, and a great way to get stuff indexed that would otherwise be part of the invisible web. Google is smart to want to index whatever they can get their hands on.

(IMG:http://www.highrankings.com/forum/style_emoticons/default/cheers.gif)
Go to the top of the page
 
+Quote Post
Bernard
post Jun 3 2005, 10:00 AM
Post #7


HR 5
*****

Group: Active Members
Posts: 300
Joined: 24-July 03
User's local time:
Feb 9 2010, 03:17 PM
From: Friendswood, TX
Member No.: 21



QUOTE(Jill @ Jun 3 2005, 08:36 AM)
I think it's a great thing, and a great way to get stuff indexed that would otherwise be part of the invisible web.


I always understood the term 'invisible web' to mean content that was password protected or disallowed from spiders via robots.txt. I didn't see where this XML sitemap info indicated that Google would crawl or index content that was password protected or disallowed by robots.txt.
Go to the top of the page
 
+Quote Post
Matt B
post Jun 3 2005, 10:01 AM
Post #8


The modem is the message.
******

Group: Active Members
Posts: 558
Joined: 21-July 03
User's local time:
Feb 9 2010, 05:17 PM
From: Canton, OH
Member No.: 4



This is a huge step forward - yes I believe it is essentially a trusted feed if they are allowing an apache log to generate URL paths. This breaks open thousands, if not millions more pages into the Google index.
Of course, it does rely on web managers to know this exists.

Are we going to see another round of "submitting my site" type of questions as a result of this? (IMG:http://www.highrankings.com/forum/style_emoticons/default/hmm.gif)
Go to the top of the page
 
+Quote Post
don1
post Jun 3 2005, 10:28 AM
Post #9


HR 4
****

Group: Active Members
Posts: 173
Joined: 2-October 03
User's local time:
Feb 9 2010, 03:17 PM
From: Marlborough, MA
Member No.: 963



Here it is in the news: http://news.com.com/Google+tests+tool+to+a...30744&subj=news Just got it off an RSS feed.
Go to the top of the page
 
+Quote Post
qwerty
post Jun 3 2005, 10:33 AM
Post #10


HR 10
Group Icon

Group: Moderator
Posts: 7,489
Joined: 24-July 03
User's local time:
Feb 9 2010, 03:17 PM
From: Somerville, MA
Member No.: 22



I wonder how common it is for a server to have Python installed, since it doesn't look like this will work without it.
Go to the top of the page
 
+Quote Post
OldWelshGuy
post Jun 3 2005, 11:02 AM
Post #11


Work is Fun
Group Icon

Group: Moderator
Posts: 4,642
Joined: 31-July 03
User's local time:
Feb 9 2010, 04:17 PM
From: Neath, South Wales, UK
Member No.: 110



The snake is back then (IMG:http://www.highrankings.com/forum/style_emoticons/default/smile.gif)

This is very good news indeed, and is yet another step forward for google. This means that all those great content sites on forums and other dynamic sites can now carry google adsense ads. It also means that G has upped the anti with regard session id's etc. All geat stuff.
Interesting to not (as an aside) that Googleguy has now out and out said that the use of '&ID' in any url effectively renders it useless.
Go to the top of the page
 
+Quote Post
Jill
post Jun 3 2005, 11:09 AM
Post #12


High Rankings Advisor
Group Icon

Group: Admin
Posts: 29,201
Joined: 21-July 03
User's local time:
Feb 9 2010, 03:17 PM
From: Ashland, MA
Member No.: 2



QUOTE
I always understood the term 'invisible web' to mean content that was password protected or disallowed from spiders via robots.txt.


No, although that may be part of it (but I don't even think it is) the invisible web is often content that the engines simply can't index for one reason or another, very often because it's all contained in a database. I don't think that's the only stuff, but that's a big part of the invisible web.

I'm pretty sure that Chris Sherman has written a lot on this subject, and in fact I believe one of his books is in regards to the invisible web.
Go to the top of the page
 
+Quote Post
SpeedyPin
post Jun 3 2005, 11:37 AM
Post #13


HR 3
***

Group: Active Members
Posts: 64
Joined: 8-September 04
User's local time:
Feb 9 2010, 12:17 PM
From: San Diego, California
Member No.: 4,979



I've been trying to get in all morning to have a look. Not a chance! LOL
Go to the top of the page
 
+Quote Post
chrishirst
post Jun 3 2005, 12:01 PM
Post #14


HR 9
Group Icon

Group: Moderator
Posts: 4,356
Joined: 13-August 03
User's local time:
Feb 9 2010, 08:17 PM
From: Blackpool UK
Member No.: 492



the "don't use &id" has been added to the guidelines as well

See this thread
Go to the top of the page
 
+Quote Post
Bernard
post Jun 3 2005, 02:58 PM
Post #15


HR 5
*****

Group: Active Members
Posts: 300
Joined: 24-July 03
User's local time:
Feb 9 2010, 03:17 PM
From: Friendswood, TX
Member No.: 21



QUOTE(Jill @ Jun 3 2005, 10:09 AM)
No, ... the invisible web is often content that the engines simply can't index for one reason or another, ...


That's what I meant - content that they can't index (because it is restricted in some way) as opposed to just won't index or haven't found. Your original post made it seem like restricted content might be indexed under this new scheme, but I read elsewhere earlier today that the Google Engineer confirmed that Google will respect robots.txt over the XML. I just wanted to draw a clarification.
Go to the top of the page
 
+Quote Post

4 Pages V   1 2 3 > »    
Fast ReplyReply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

 



This forum is sponsored by High Rankings, a Boston SEO Agency
- Lo-Fi Version Time is now: 9th February 2010 - 03:17 PM