Are you a Google Analytics enthusiast?
Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE!

www.CustomReportSharing.com
From the folks who brought you High Rankings!
More SEO Content
How To Find Out Number Of Pages In A Website?
#1
Posted 16 January 2007 - 07:34 PM
I've been lurking more than posting these last few months but I'm resurfacing to ask this question:
I know that to check the number of pages in a website, you enter site:www.nameofsite.comornetorwhatever I did this in Google and it returned the exact # of pages in my own website...then checked for another site of which I am aware of how many pages they have (9000+) and again the results came back 9000+. Also, my site is static, the other is dynamic.
I've got a possible new client who told me they had XX,XXX pages, which is very believable just by looking at their website. When I searched Google using the mentioned query...only 55 pages were reported. This site is 2+ years old so it's not like they are in the sandbox. This is a retail site that has thousands of products for sale, so the product pages do have the "add to cart" feature. Does this have anything to do with the query results?
I will mention that their URL's are very cleanly structured, for what it's worth.
I want to learn what this "strange low count return" means as far as possible SE spidering problems, etc. At this point with only 55 pages showing up, something's got to be wrong! I've read about orphaned pages. Is this an instance of it? Even if only 500 pages were reported...I'd still say "what's going on?"
I do look forward to being educated on this. I also wonder then, how do you check out the number of pages in a particular website?
Claudia
#2
Posted 16 January 2007 - 08:00 PM
#3
Posted 16 January 2007 - 08:01 PM
allinurl: domain.com
Other SEs have a similar search such as "site: domain.co.uk" etc
#5
Posted 17 January 2007 - 03:49 AM
#6
Posted 17 January 2007 - 04:44 AM
#7
Posted 17 January 2007 - 09:55 AM
#8
Posted 17 January 2007 - 11:16 AM
#9
Posted 18 January 2007 - 02:30 AM
Yeah, Xenu works well even though it's slow. Another good program I use is GSite Crawler by GSoft (I think). It's slow if you have a lot of links, too, but it gives you a nice Google Site Map at the end.
#10
Posted 18 January 2007 - 08:31 AM
DUH me...I had tried the allinurl query but had mistakenly inlcuded "www.". Using this correctly, Google shows 9,270 pages.
Jill: not to challenge or argue with you at all, but since you said to "be sure not to put the space after the colon" just for the heck of it out of curiousity I searched both ways and the same (9,270) results were returned. So maybe a space doesn't make a difference?
Another question: alright...so now I have a result of 9,270 pages. But the client says "we've got 50,000+ pages". How do I really investigate this? How do I determine if they're just totally over-exaggerating, if they are correct, or if the Google result of 9,270 is much closer to the actual page count?
If they're coming to me for SEO purposes, though I'm not going to be personally working on each and every page...I'd still like to know the page-size of their website. I do need this bit of information.
Claudia
#11
Posted 18 January 2007 - 08:53 AM
Feel free to challenge or argue with me anytime. That's not a problem!
It's good that they're showing the same results now for that. There are times, and perhaps it's with other commands such as the link: one, where they don't do the same thing. It's usually a good idea to do it without the space as I believe that's the true command.
#12
Posted 18 January 2007 - 10:38 AM
Same thing with link. I got an email yesterday morning from a client with links to a couple of searches that he thought indicated that a competitor had a lot more links than he did. Nope.
#13
Posted 18 January 2007 - 01:56 PM
Google doesn't necessarily index every page on every domain.
You can find out how many pages each major search engine (Ask, Google, Windows Live, and Yahoo!) has indexed independently of the others and guess that the largest number may be correct. However, with dynamic content sites, often the same content shows up on multiple transient URLs. Since you don't say what kind of site this is, it's hard for people to give you much useful advice.
The chief reason Google would not index many pages on a large content site is a lack of trusted backlinks to the deep content. That is becoming more of an issue now (and has, in my opinion, been more of an issue since about a year ago).
#14
Posted 21 January 2007 - 02:40 AM
Cheers
0 user(s) are reading this topic
0 members, 0 guests, 0 anonymous users









