High Rankings Search Engine Optimization Forum

> Matt Cutts Giving Out Clues, Google Webmaster Central
jehochman
post Jul 23 2007, 12:18 PM
Post #1


Jonathan Hochman
Group: Moderator
Posts: 1,554
Joined: 27-November 05
From: Connecticut - Land of Steady Habits
Member No.: 9,569



Matt's posted a thread on his blog where he's asking us to vote for features we most want for Google Webmaster Tools. One of the proposed features would tell the webmaster which pages on their site don't validate.

Why would Google have that information? Are they validating pages just for the fun of it? I found this rather shocking because most of us think validation has nothing to do with rankings. Maybe it has something to do with indexing, or the method of indexing used.

In Providence last month Dan Crow told us that Google's ability to index the Internet is limited by power and bandwidth, not money. He suggested that Google only allocates so much time and bandwidth to spider each site, so for large sites it's in the site owner's interest to do things that make the indexing process more efficient, because that may lead to more pages being indexed.

One way to make a site easier to index is to reduce code bloat. Another might be clearing coding errors so that the indexing program can quickly figure out the page, rather than having to make guesses when the code is all screwy. Guessing and compensating for errors takes time, processor power, and ultimately electricity. When indexing billions of pages, trivial things that eat up CPU cycles can create a significant burden.
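
To make the budget argument concrete, here is a minimal Python sketch. It assumes a fixed per-site crawl budget measured in kilobytes, which is purely a hypothetical model: neither the budget figure nor the page sizes come from Google, and `pages_fetched` is an illustrative name.

```python
def pages_fetched(page_sizes_kb, budget_kb):
    """Count how many pages fit into a fixed byte budget, in crawl order."""
    fetched = 0
    used = 0
    for size in page_sizes_kb:
        if used + size > budget_kb:
            break  # budget exhausted; the crawler moves on to another site
        used += size
        fetched += 1
    return fetched

budget = 2000            # hypothetical budget in KB, not a real Google number
bloated = [120] * 50     # fifty pages at 120 KB each
lean = [40] * 50         # the same fifty pages trimmed to 40 KB each

print(pages_fetched(bloated, budget))  # 16
print(pages_fetched(lean, budget))     # 50
```

Under these made-up numbers the leaner site gets every page fetched, while the bloated one stops at sixteen. Real crawl scheduling is certainly more involved than this.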
Michael Martinez
post Jul 23 2007, 01:00 PM
Post #2


HR 8
Group: Active Members
Posts: 3,719
Joined: 5-April 05
From: Seattle, WA
Member No.: 7,091



I think validation is necessary or at least helpful with Accessible Search (a point I have neglected to address in past discussions about validation). It makes sense that Google would want to ask Webmasters if that is important to them.
mcanerin
post Jul 23 2007, 01:06 PM
Post #3


HR 7
Group: Moderator
Posts: 2,241
Joined: 31-July 03
From: Calgary, Alberta, Canada
Member No.: 170



I'm not sure that Google is validating already and just not reporting it. Many of the other options are related to things Google isn't doing now, as well, such as a tool to move from one domain to another. I was looking at it as an extension/variation of an earlier choice, "Score the crawlability or accessibility of pages".

Come to think of it, this is all assuming that he meant W3C validating, and not some sort of "Google" validating, which may or may not be more useful, depending on your viewpoint, and would not surprise me given Google's infamous "we can do it better than anyone else" approach to things.

Ian
1dmf
post Jul 23 2007, 05:26 PM
Post #4


Keep Asking, Keep Questioning, Keep Learning
Group: Active Members
Posts: 1,950
Joined: 24-May 07
From: Worthing - England
Member No.: 17,339



lol - non-standard / invalid code might be coming to bite some people in the behind sooner than you think.

QUOTE
Come to think of it, this is all assuming that he meant W3C validating, and not some sort of "Google" validating,
Let's hope they don't go down the path MS did with standards, making it up as they went along in IE6. It drives us coders mad when you have to incorporate different code for different browsers!

Crikey, if it became a choice between W3C standards and not getting indexed, or Google standards and getting indexed, all hell is gonna break loose in the 'standards compliant' community, and I for one won't be happy, having just W3C'd all my sites. Oh well, it might keep me out of mischief having to serve different pages to Google than to Joe Public. Oh, hang on, isn't that 'black hat' SEO?

And what standard does Google's own website validate to? Because it certainly isn't W3C. Stop it, my sides are hurting.

This post has been edited by 1dmf: Jul 23 2007, 05:31 PM
Jill
post Jul 23 2007, 06:41 PM
Post #5


High Rankings Advisor
Group: Admin
Posts: 29,201
Joined: 21-July 03
From: Ashland, MA
Member No.: 2



QUOTE
Why would Google have that information?


I wouldn't read anything into it, Jonathan, other than people have probably been asking Matt for that sort of thing because they read in so many places that valid code will help rankings. (Which we all know, it doesn't.)

jehochman
post Jul 23 2007, 07:27 PM
Post #6


Jonathan Hochman
Group: Moderator
Posts: 1,554
Joined: 27-November 05
From: Connecticut - Land of Steady Habits
Member No.: 9,569



But Google has limited computing power and bandwidth. Why would they waste it to provide this feature? It doesn't make sense unless they're already doing the calculation for some other purpose. Perhaps this is just a red herring and they have no intention of providing the feature.
mcanerin
post Jul 23 2007, 07:31 PM
Post #7


HR 7
Group: Moderator
Posts: 2,241
Joined: 31-July 03
From: Calgary, Alberta, Canada
Member No.: 170



Or they intend to scrape the results from the W3C, using THEIR bandwidth, etc...

Ian
maleman
post Jul 23 2007, 08:36 PM
Post #8


HR 6
Group: Active Members
Posts: 669
Joined: 9-October 04
Member No.: 5,329



QUOTE
(Which we all know, it doesn't)


But... if G has limited computing power and bandwidth, it may use up whatever is allocated for a given site crawl on a few bloated pages and split.

I always try to keep my main pages short and sweet (300 to 400 words) and get them to validate to whatever doctype. Also I put intersite links to those pages where they can be found fast.

This post has been edited by maleman: Jul 23 2007, 08:41 PM
projectphp
post Jul 23 2007, 09:31 PM
Post #9


Lost in Translation
Group: Moderator
Posts: 2,202
Joined: 5-August 03
From: Sydney Australia
Member No.: 283



QUOTE
But Google has limited computing power and bandwidth. Why would they waste it to provide this feature?

The answer is simple: they don't need to recrawl your site to validate it. They just need to run a parser over the downloaded HTML.
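
As a rough illustration of running a check over HTML that is already stored, here is a minimal Python sketch. It uses the standard library's `html.parser` to flag mismatched or unclosed tags. This is only a stand-in: it is not the W3C validator (which checks a page against its doctype), it is not anything Google is known to run, and `NestingChecker` and `check_stored_html` are hypothetical names. It is also cruder than real HTML rules, since it will flag closing tags that HTML allows you to omit, such as `</p>`.

```python
from html.parser import HTMLParser

# Elements that never take a closing tag in HTML.
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "param", "source", "track", "wbr"}

class NestingChecker(HTMLParser):
    """Collects mismatched or unclosed tags; not a full validation."""

    def __init__(self):
        super().__init__()
        self.stack = []
        self.errors = []

    def handle_starttag(self, tag, attrs):
        if tag not in VOID:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag in VOID:
            return
        if self.stack and self.stack[-1] == tag:
            self.stack.pop()
        else:
            self.errors.append(f"unexpected </{tag}>")

def check_stored_html(html):
    """Check a page body that was fetched earlier; no network access needed."""
    checker = NestingChecker()
    checker.feed(html)
    checker.close()
    checker.errors.extend(f"unclosed <{tag}>" for tag in checker.stack)
    return checker.errors

print(check_stored_html("<html><body><p>Hi</p></body></html>"))
print(check_stored_html("<div><b>oops</div>"))
```

The point is only that the input is a string already on disk, so the check costs CPU time but no extra crawl bandwidth.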

QUOTE
It doesn't make sense unless they're already doing the calculation for some other purpose.

The parser is likely open source and written in Perl, meaning it would be trivial to change it to read from a database or filesystem and spit its output into the Webmaster Central template, and it is already pretty battle tested.

Google need never know the outcome, and might not even bother to record the output / result.

The time thing is pretty obvious, and pretty useful to know. One wonders why so many sites are so code heavy. Stripping out newlines, tabs and comments always seemed a good idea to me.
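
A minimal Python sketch of that kind of stripping, under some loud assumptions: it treats the page as plain text, so it would also mangle `<pre>` blocks, inline scripts and IE conditional comments, and `strip_bloat` is a hypothetical helper rather than any established tool.

```python
import re

def strip_bloat(html):
    """Remove HTML comments and collapse runs of whitespace to one space."""
    no_comments = re.sub(r"<!--.*?-->", "", html, flags=re.DOTALL)
    return re.sub(r"\s+", " ", no_comments).strip()

page = (
    "<html>\n"
    "\t<!-- nav start -->\n"
    "\t<body>\n"
    "\t\t<p>Hello</p>\n"
    "\t</body>\n"
    "</html>\n"
)

lean = strip_bloat(page)
saved = len(page) - len(lean)
print(lean)
print(f"saved {saved} of {len(page)} characters")
```

On a real site the saving depends heavily on how the templates are written, so it is worth measuring on your own pages before assuming it matters.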
Jill
post Jul 23 2007, 09:51 PM
Post #10


High Rankings Advisor
Group: Admin
Posts: 29,201
Joined: 21-July 03
From: Ashland, MA
Member No.: 2



QUOTE(mcanerin @ Jul 23 2007, 08:31 PM)
Or they intend to scrape the results from the W3C, using THEIR bandwidth, etc...

Ian


Exactly.

Anyway, who says they're going to offer it? They're just askin' what people think is important, no?
1dmf
post Jul 24 2007, 05:42 AM
Post #11


Keep Asking, Keep Questioning, Keep Learning
Group: Active Members
Posts: 1,950
Joined: 24-May 07
From: Worthing - England
Member No.: 17,339



It certainly would seem odd, re-inventing the wheel, when the acknowledged standard is the W3C's and they already offer validation services. Seeing as they set and draft the standards, I think Google simply needs to stick to what they are good at.

It wouldn't hurt for there to be a button next to each page in the Webmaster Tools section where you could click validate and it just passes the page to the W3C in a new window, leaving the W3C to validate with their tool and bandwidth, and giving Google a quick and easy access option in WMT.
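
Such a button would only need a link. A minimal Python sketch of building one, assuming the W3C Markup Validation Service's `check?uri=` address form; `validator_link` is a hypothetical helper and the example page URL is made up.

```python
from urllib.parse import urlencode

# Assumed entry point of the W3C Markup Validation Service.
VALIDATOR = "https://validator.w3.org/check"

def validator_link(page_url):
    """Return a link that asks the W3C service to fetch and check page_url."""
    return f"{VALIDATOR}?{urlencode({'uri': page_url})}"

print(validator_link("http://www.example.com/page.html"))
```

The W3C's servers do the fetching and the checking here, so the site offering the button spends nothing beyond rendering the link.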

Maybe they could even record the result next to each page; then Google wouldn't have to validate while crawling, just cross-reference against their DB to see if a page is valid, then do whatever accordingly. Seems to me a more sensible approach.
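
The record-and-look-up idea could be sketched like this in Python, using an in-memory SQLite table. The table layout and the names `record_result` and `is_known_valid` are assumptions made for illustration, not a description of anything Google stores.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE validation ("
    "url TEXT PRIMARY KEY, is_valid INTEGER, error_count INTEGER)"
)

def record_result(url, error_count):
    """Store the latest validation outcome for a page, replacing any old one."""
    conn.execute(
        "INSERT OR REPLACE INTO validation (url, is_valid, error_count) "
        "VALUES (?, ?, ?)",
        (url, int(error_count == 0), error_count),
    )

def is_known_valid(url):
    """Return True or False if the page has a stored result, else None."""
    row = conn.execute(
        "SELECT is_valid FROM validation WHERE url = ?", (url,)
    ).fetchone()
    return None if row is None else bool(row[0])

record_result("http://www.example.com/", 0)
record_result("http://www.example.com/old.html", 3)
print(is_known_valid("http://www.example.com/"))
print(is_known_valid("http://www.example.com/old.html"))
print(is_known_valid("http://www.example.com/never-checked.html"))
```

A stored result goes stale as soon as the page changes, so a real version would also need to track when each page was last checked.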

piskie
post Jul 24 2007, 06:14 AM
Post #12


HR 6
Group: Active Members
Posts: 798
Joined: 16-September 03
From: Cornwall
Member No.: 824



Or just maybe they are already doing it, or are part way towards it (serious, time-consuming errors only), and as such want to know if it would be a popular addition to offer.

Code Bloat and Errors could be a factor if not now then sometime in the future. Head in Sand Merchants can ignore the "Possibility" at their own risk.

"They will never do that" is a brave assumption because after all Never is a long time.
projectphp
post Jul 24 2007, 06:45 AM
Post #13


Lost in Translation
Group: Moderator
Posts: 2,202
Joined: 5-August 03
From: Sydney Australia
Member No.: 283



Seriously, why theorise without investigating? The W3C validator is open source (I don't add links pointlessly), which means Google are free to use it (here is the licence) to validate pages they have already downloaded. Minimal bandwidth costs. No development costs (beyond changing a template and a simple DB call). Seems a useful tool to offer and a PR win for minimal output.

Occam's razor: simplest answer is often the best.

No conspiracy. No new knowledge. Nothing to learn here. Move along.
1dmf
post Jul 24 2007, 07:52 AM
Post #14


Keep Asking, Keep Questioning, Keep Learning
Group: Active Members
Posts: 1,950
Joined: 24-May 07
From: Worthing - England
Member No.: 17,339



I for one couldn't be doing with keeping open source code up-to-date every time it changes; a form that submits to their already compiled and up-to-date website code seems far more logical.

But hey, like you say nothing to learn here.


This post has been edited by 1dmf: Jul 24 2007, 08:41 AM
projectphp
post Jul 24 2007, 08:22 AM
Post #15


Lost in Translation
Group: Moderator
Posts: 2,202
Joined: 5-August 03
From: Sydney Australia
Member No.: 283



Huh? I have no idea what that means! I don't even know who "they" are: Google or W3C? Nor do I know what "form that submits to their already compiled and up-to-date website code" means.

I think you are a bit confused as to what the code does, but then, not looking at the code creates that issue ;)