High Rankings Search Engine Optimization ForumHigh Rankings Advisor Search Marketing Newsletter

Welcome Guest ( Log In | Register )

Important Announcement: ***Need an Affordable SEO Website Review?***
2 Pages V   1 2 >  
Reply to this topicStart new topic
> Can Google Index Unlinked Pages?, Can Google index unlinked pages?
klewando
post Nov 22 2005, 09:51 AM
Post #1


HR 1
*

Group: Members
Posts: 3
Joined: 22-November 05
User's local time:
Feb 9 2010, 03:57 PM
Member No.: 9,501



Hi. There is a discussion on my Web design mailing list about whether Google can index pages to which there are no links, such as those under development which are not intended to be public yet. Is this possible?

Thanks,
Kim
Go to the top of the page
 
+Quote Post
torka
post Nov 22 2005, 10:02 AM
Post #2


Vintage Babe
Group Icon

Group: Moderator
Posts: 4,142
Joined: 31-July 03
User's local time:
Feb 9 2010, 02:57 PM
From: Triangle area, NC, USA, Earth (usually)
Member No.: 89



It is possible for them to spider those pages, assuming there's no password to prevent it or robots.txt to disallow the spider. It apparently has happened in the past; there are a number of theories around about means they might use to find the page in the first place, although I don't know if anyone's ever determined for sure how the spider finds these orphan pages.

It's also possible the page might show up briefly in the index, but unless the page gets links pointing to it, the page won't stay in the index, and even during the potential time it might be indexed it's doubtful that it would rank very well for any even mildly competitive term with no inbound links.

That said, that's no guarantee that no one will ever find the page -- people enter obscure/oddball search terms all the time, and you can't always predict what phrases a page might show up for. We all have the experience of finding referers in our logs from totally off-the-wall searches. So "security through obscurity" is unreliable at best -- all it takes is one freaky search during that time when the page is in the index and the proverbial cat could be out of the proverbial bag.

If the concern is keeping the SEs away from a page or preventing the general public from stumbling across it, IMHO the only way to insure that is to password-protect it. There are enough bots out there that don't obey robots.txt that you can't rely on robots.txt to "protect" a page from indexing.

If the idea is to create an orphan page and somehow get it to rank for something, that won't happen unless the page gets inbound links from somewhere. Without links, it won't "stick" in Google's index even if it does briefly show up initially.

I think we've got a thread or two around here where this was tested...

--Torka (IMG:http://www.highrankings.com/forum/style_emoticons/default/mf_prop.gif)
Go to the top of the page
 
+Quote Post
Alan Perkins
post Nov 22 2005, 10:05 AM
Post #3


Token male admin
Group Icon

Group: Admin
Posts: 1,436
Joined: 28-July 03
User's local time:
Feb 9 2010, 07:57 PM
From: UK
Member No.: 45



Welcome to the forums klewando (IMG:http://www.highrankings.com/forum/style_emoticons/default/bye1.gif)


QUOTE
There is a discussion on my Web design mailing list about whether Google can index pages to which there are no links, such as those under development which are not intended to be public yet. Is this possible?
Yes.

Here are some possible ways (not saying that any of these are probable):

1) The URL appears up as a referrer in a pubic referrer log, or in trackbacks, etc.
2) Google crawls ISP logs
3) The Google Toolbar (with Pagerank display enabled) reports the existence of the URL to Google
4) Somebody submits the URL to Google using their Add URL form
Go to the top of the page
 
+Quote Post
davidbrett
post Nov 22 2005, 10:29 AM
Post #4


HR 4
****

Group: Active Members
Posts: 200
Joined: 15-January 04
User's local time:
Feb 9 2010, 03:57 PM
From: Limerick, Ireland
Member No.: 1,936



Hi Kim,

This has happened to me once. We were developing a site last year and we had nearly finished the job. However we weren't happy with it and in the end we didn't go live with the site.

A couple of months later I saw that all of the search engines had picked up the site - even though no links were pointing at it. It even gained good rankings in MSN and Yahoo.

This post has been edited by davidbrett: Nov 22 2005, 11:01 AM
Go to the top of the page
 
+Quote Post
sonnyyu
post Nov 22 2005, 09:29 PM
Post #5


HR 4
****

Group: Active Members
Posts: 157
Joined: 7-April 04
User's local time:
Feb 9 2010, 04:57 PM
Member No.: 3,158



I would like add few more;-

1. Google Web Accelerator
2. Google Sitemap Xml Submite
3. Froogle Submite



my $0.0199
Go to the top of the page
 
+Quote Post
Michael Martinez
post Nov 22 2005, 10:17 PM
Post #6


HR 8
********

Group: Active Members
Posts: 3,719
Joined: 5-April 05
User's local time:
Feb 9 2010, 11:57 AM
From: Seattle, WA
Member No.: 7,091



Matt Cutts has reportedly stated that they don't get URLs from the Toolbar.

A few months ago, several of us here submitted orphaned pages to Google through their ADD URL feature and none of the pages were spidered.

The other suggestions seem plausible. You can, on rare occasions, find some referer data in open logs, for example.

I'm not sure Web accelerator feeds the index. I haven't seen any discussions of this possibility. A number of people have speculated that AdSense might feed the index, but Google engineers have denied that being so.
Go to the top of the page
 
+Quote Post
chrishirst
post Nov 23 2005, 03:25 AM
Post #7


HR 9
Group Icon

Group: Moderator
Posts: 4,356
Joined: 13-August 03
User's local time:
Feb 9 2010, 07:57 PM
From: Blackpool UK
Member No.: 492



Submitting doesn't get you in Google, eveyone from here who tried it came up with the same result, only the Yahoo crawler turned up.

The Google, MSN and Yahoo toolbars. I've used all three and checked various unlinked pages on a couple of crawled sites. Not a crawler in sight.

Adsense, Had it on several unlinked pages only the Media-Partners bot visited. Same with using Opera with the ads running.
Go to the top of the page
 
+Quote Post
qwerty
post Nov 23 2005, 09:08 AM
Post #8


HR 10
Group Icon

Group: Moderator
Posts: 7,489
Joined: 24-July 03
User's local time:
Feb 9 2010, 02:57 PM
From: Somerville, MA
Member No.: 22



Ditto that (except I didn't try the Yahoo or MSN toolbars).

And my submitted page, after about half a year, is still in the Yahoo index, so I guess one could say that's a legitimate way to get into Yahoo. It was dropped for a little while, but it came back without any action on my part.
Go to the top of the page
 
+Quote Post
michaelhood
post Nov 24 2005, 05:58 AM
Post #9


HR 1
*

Group: Members
Posts: 7
Joined: 23-October 05
User's local time:
Feb 9 2010, 12:57 PM
Member No.: 9,116



QUOTE(Michael Martinez @ Nov 22 2005, 08:17 PM)
Matt Cutts has reportedly stated that they don't get URLs from the Toolbar.


I've had pages with:

- no inbound links
- no web-facing server logs
- etc etc

3 of us were working on a beta of a site, similar to the story above, and this site was indxed in G, and remained in the index. This happened in Oct 04.
Go to the top of the page
 
+Quote Post
Jill
post Nov 25 2005, 07:58 PM
Post #10


High Rankings Advisor
Group Icon

Group: Admin
Posts: 29,201
Joined: 21-July 03
User's local time:
Feb 9 2010, 02:57 PM
From: Ashland, MA
Member No.: 2



QUOTE
3 of us were working on a beta of a site, similar to the story above, and this site was indxed in G, and remained in the index. This happened in Oct 04.


I'd have to see it with my own eyes to believe it.
Go to the top of the page
 
+Quote Post
sonnyyu
post Nov 28 2005, 03:55 PM
Post #11


HR 4
****

Group: Active Members
Posts: 157
Joined: 7-April 04
User's local time:
Feb 9 2010, 04:57 PM
Member No.: 3,158



I would like add few more;-

1. Google VPN
2. Port 443 (SSL Certificate Scan)
3. Register .gov, .mil, .edu domain.



my $0.01999
Go to the top of the page
 
+Quote Post
making-it-big
post Nov 29 2005, 11:21 AM
Post #12


HR 2
**

Group: Active Members
Posts: 40
Joined: 29-November 05
User's local time:
Feb 9 2010, 01:57 PM
Member No.: 9,585



To comment on the discussion above I have seen similar results from google on new sites. No link and just starting it gets me to wonder if they are looking at domain registration or something. Google was the first to pick up the site and this even BEFORE we started to design the site. All we did was buy the domain name and the hosting package. No I do not know for sure - even though it didn't any backlinks - but our server COULD have a list accessable to bots of new sites on their server. To this day I do not even see anything from our server as backlinks (google or yahoo). It is strange indeed.

I also see that Google is picking up MSN results and have seen Google trying to crawl funky broken up URL's like (/directory/file...) with the 3 dots. The only sites that has URLs like that (when doing a search in Google and Yahoo) is msn results and scrapers of those results. Thes links are cut off and are at the end of search results but are not linked. But who is to say there isn't something somewhere that is actually linked like that but is not showing up in searches - I highly doubt it but it could be possible.

Just my few cents worth.

This post has been edited by Jill: Nov 29 2005, 06:32 PM
Go to the top of the page
 
+Quote Post
Randy
post Nov 29 2005, 02:04 PM
Post #13


Convert Me!
Group Icon

Group: Admin
Posts: 17,378
Joined: 17-August 03
User's local time:
Feb 9 2010, 01:57 PM
Member No.: 551



QUOTE
No link and just starting it gets me to wonder if they are looking at domain registration or something.


Well, there's probably no way to prove it one way or another. But since Google became an official ICANN Accredited Registrar earlier this year...

Let's just say that it's a pretty safe bet that they have a lot more information regarding domains and their ownership now than they had in the past. (IMG:http://www.highrankings.com/forum/style_emoticons/default/wink.gif)
Go to the top of the page
 
+Quote Post
making-it-big
post Nov 29 2005, 04:22 PM
Post #14


HR 2
**

Group: Active Members
Posts: 40
Joined: 29-November 05
User's local time:
Feb 9 2010, 01:57 PM
Member No.: 9,585



I wouldn't doubt that they would use that information. It was rather quick that they picked up the site that is for sure. Yahoo was quick to do it also. MSN didn't until we gathered a few links.
Go to the top of the page
 
+Quote Post
dmcconkey
post Nov 29 2005, 04:46 PM
Post #15


HR 4
****

Group: Active Members
Posts: 150
Joined: 31-October 05
User's local time:
Feb 9 2010, 03:57 PM
From: Charlotte, NC, USA
Member No.: 9,217



QUOTE
Let's just say that it's a pretty safe bet that they have a lot more information regarding domains and their ownership now than they had in the past.


Yeah, big brother is starting to scare me. Next thing we know, our tax filings will go through them.

But I have a hard time believing in G crawling new domain registrations. Sure they could take a guess at the index address and crawl from there. But what about some of the claims in this thread of G crawling non linked folders in established sites?

Best idea I've heard is that G gets these pages from our PR toolbar queries.

-Dan
Go to the top of the page
 
+Quote Post

2 Pages V   1 2 >   
Fast ReplyReply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

 



This forum is sponsored by High Rankings, a Boston SEO Agency
- Lo-Fi Version Time is now: 9th February 2010 - 02:57 PM