SEO Class in Chicago, IL
Learn How To Optimize Your Website on July 26, 2013
High Rankings is offering a 1-day customized SEO training class in Chicago. Class size is limited so please sign-up now if you want in!
Are you a Google Analytics enthusiast?
Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE!

www.CustomReportSharing.com
From the folks who brought you High Rankings!
More SEO Content
Are All Ses Now Reading Dynamic Urls?
#1
Posted 28 August 2003 - 04:46 PM
I understand that Google is now listing dynamic urls - I checked out a site I'm working on that is over half dynamic pages (content management system) and Google reported 74 pages, lots of which have dynamic urls.
Then I go over to Alltheweb and I get just 8 pages, only one of which is a dynamic url. And MSN reports just 4 pages, none of which are dynamic urls.
The difference is so vast that I find it difficult to believe that MSN and Alltheweb are searching dynamic urls thoroughly, especially as Google and Alltheweb both search roughly 3 billion pages.
What am I missing??
Thanks.
BrianR
#2
Posted 28 August 2003 - 05:12 PM
I have a forum program that uses 5 criteria in the URL- I've eliminated every other possibility for why it is not getting spidered and the URL is definitely the issue.
So- not all dynamic URLs are spiderable.
#3
Posted 28 August 2003 - 05:39 PM
www.mydomain.com/about/press_page.asp?ID=123
So, while Google may read and list the more simple dynamic urls, my little test seems to indicate that some other major SEs do not to any significant degree, if at all.
Scottie - do you (or indeed, anyone else) have any experience of SEs other than Google doing a thorough listing job of simple dynamic urls?
What I'm trying to get a fix on is which SEs we will lose out on even if we present properly optimised dynamic pages with simple urls.
Thanks.
BrianR
#4
Posted 28 August 2003 - 07:35 PM
1. For reasons explored in other threads, every search engine is going to avoid anything resembling a session ID and the potentially infinite number of pages a session ID can present. Of the three considerations, this is the only one that makes perfect sense to me and the only one I am CERTAIN is true. What I'm not certain about is just how paranoid the spiders will be. Googleguy as been quoted as saying we shouldn't use ID as a parameter in a dynamic URL. I've talked to people at other forums who have found it impossible to get pages with long strings of numbers (which definitely looks like PHP or ASP generated session IDs) indexed. Beyond that, everything seems to be a guess.
2. Common wisdom is that Googlebot will spider a page with up to three parameters. That really makes no sense to me, since the number of parameters has very little relationship to anything Google would care about. Three parameters could produce a gazillion pages just as easily as can seven or eight parameters. Just as importantly, strictly from a programming standpoint, I see absolutely no reason the spider would ever waste time parsing the parameters from the query string.
3. Common wisdom also suggests that a dynamic URL linked to from a static page is much more likely to be indexed than the same dynamic URL linked to from another dynamic page. This makes some sense, especially if both dynamic URLs are calling the same exact script. The potential for an infinite loop becomes very real.
A few years ago, of course, it was impossible to get any dynamic pages indexed. So I guess we're lucky we what we have. But I think the current situation is still too new, and probably still in too much flux, to make many hard and fast rules. As usual, we're limited to guesses, and this list of three concerns is what I've seen for us to guess about.
Anyone think of a fourth?
#5
Posted 29 August 2003 - 03:25 AM
BrianR
#6
Posted 02 September 2003 - 06:43 AM
Not from the query string of the page it is accessing, but from the query strings it finds in links (i.e. the pages it will, or at least might, access).strictly from a programming standpoint, I see absolutely no reason the spider would ever waste time parsing the parameters from the query string.
I've written a spider that crawls dynamic sites, and it does parse the query parameters. It then sorts them into alpha order and reformulates the URL. This helps classify something like these two as the same URL:
www.site.com?a=1&b=2
www.site.com?b=2&a=1
#7
Posted 02 September 2003 - 08:07 AM
I'm curious if your program counted the parameters and refused to follow a link with too many?
If one were to believe that the number of parameters was the only problem with an URL, it would sure be easy to circumvent. The ampersand delimiter, after all, is merely a convention. If you're not using a utility to parse the string (like cgi.pm), there's no reason the delimiter can't be changed to something else entirely. Let Google try to count the parameter THEN.
#8
Posted 02 September 2003 - 08:21 AM
Certainly Inktomi and FAST can index dynamic pages as they used to do it all the time.
Food for thought...
Jill
#9
Posted 02 September 2003 - 08:43 AM
No.I'm curious if your program counted the parameters and refused to follow a link with too many?
#10
Posted 02 September 2003 - 04:31 PM
The rough stats I indicated at the beginning of this thread seem to indicate there's some truth in the PFI rumour.
But, with my non-paranoid hat on, it runs counter to the search engines' promise to deliver relevant and reasonanly comprehensive results.
But perhaps I'm just being naive...
BrianR
#11
Posted 02 September 2003 - 09:42 PM
But, with my non-paranoid hat on, it runs counter to the search engines' promise to deliver relevant and reasonanly comprehensive results.
Yeah, no kidding.
Which is why they'll never catch Google in a trilliion, zillion, quadrillion years.
Jill
#12
Posted 03 September 2003 - 05:14 AM
JillNot sure if it's true or not, but it seems that some of the engines that charge for inclusion may purposely NOT be including dynamic pages in hopes that you'll pay for trusted feed on those.
Certainly Inktomi and FAST can index dynamic pages as they used to do it all the time.
Food for thought...
Jill
I have wondered that too. Of course it seems to me that in some cases if you submit via paid inclusion (to Inktomi in particular) it seems to have a negative effect on them actually spidering your site naturally, as was discussed in another thread.
I wonder if it actually a dynamic pages issue rather than just about PFI. I am in the process of getting a site converted from working on the principle of URL parameters to using a URL rewriter to server "static looking" pages. I will post again if this makes a difference to Inktomi crawling the site.
Simon
#13
Posted 03 September 2003 - 10:10 AM
#14
Posted 03 September 2003 - 04:48 PM
Your interesting post raises a couple of questions for me:
1. Which search engines are you having success with?
2. Can you give me an example of a typical dynamic url from your site?
Thanks for your help
BrianR
#15
Posted 03 September 2003 - 06:11 PM
A typical query string for this site looks like this: www.website.edu/graduate/ categorymain.php?cid=1&sid=1406
0 user(s) are reading this topic
0 members, 0 guests, 0 anonymous users









