| Important Announcement: ***Need an Affordable SEO Website Review?*** |
![]() ![]() |
Nov 22 2005, 09:51 AM
Post
#1
|
|
|
HR 1 ![]() Group: Members Posts: 3 Joined: 22-November 05 User's local time: Feb 9 2010, 03:57 PM Member No.: 9,501 |
Hi. There is a discussion on my Web design mailing list about whether Google can index pages to which there are no links, such as those under development which are not intended to be public yet. Is this possible?
Thanks, Kim |
|
|
|
Nov 22 2005, 10:02 AM
Post
#2
|
|
![]() Vintage Babe Group: Moderator Posts: 4,142 Joined: 31-July 03 User's local time: Feb 9 2010, 02:57 PM From: Triangle area, NC, USA, Earth (usually) Member No.: 89 |
It is possible for them to spider those pages, assuming there's no password to prevent it or robots.txt to disallow the spider. It apparently has happened in the past; there are a number of theories around about means they might use to find the page in the first place, although I don't know if anyone's ever determined for sure how the spider finds these orphan pages.
It's also possible the page might show up briefly in the index, but unless the page gets links pointing to it, the page won't stay in the index, and even during the potential time it might be indexed it's doubtful that it would rank very well for any even mildly competitive term with no inbound links. That said, that's no guarantee that no one will ever find the page -- people enter obscure/oddball search terms all the time, and you can't always predict what phrases a page might show up for. We all have the experience of finding referers in our logs from totally off-the-wall searches. So "security through obscurity" is unreliable at best -- all it takes is one freaky search during that time when the page is in the index and the proverbial cat could be out of the proverbial bag. If the concern is keeping the SEs away from a page or preventing the general public from stumbling across it, IMHO the only way to insure that is to password-protect it. There are enough bots out there that don't obey robots.txt that you can't rely on robots.txt to "protect" a page from indexing. If the idea is to create an orphan page and somehow get it to rank for something, that won't happen unless the page gets inbound links from somewhere. Without links, it won't "stick" in Google's index even if it does briefly show up initially. I think we've got a thread or two around here where this was tested... --Torka (IMG:http://www.highrankings.com/forum/style_emoticons/default/mf_prop.gif) |
|
|
|
Nov 22 2005, 10:05 AM
Post
#3
|
|
![]() Token male admin Group: Admin Posts: 1,436 Joined: 28-July 03 User's local time: Feb 9 2010, 07:57 PM From: UK Member No.: 45 |
Welcome to the forums klewando (IMG:http://www.highrankings.com/forum/style_emoticons/default/bye1.gif)
QUOTE There is a discussion on my Web design mailing list about whether Google can index pages to which there are no links, such as those under development which are not intended to be public yet. Is this possible? Yes.Here are some possible ways (not saying that any of these are probable): 1) The URL appears up as a referrer in a pubic referrer log, or in trackbacks, etc. 2) Google crawls ISP logs 3) The Google Toolbar (with Pagerank display enabled) reports the existence of the URL to Google 4) Somebody submits the URL to Google using their Add URL form |
|
|
|
Nov 22 2005, 10:29 AM
Post
#4
|
|
|
HR 4 ![]() ![]() ![]() ![]() Group: Active Members Posts: 200 Joined: 15-January 04 User's local time: Feb 9 2010, 03:57 PM From: Limerick, Ireland Member No.: 1,936 |
Hi Kim,
This has happened to me once. We were developing a site last year and we had nearly finished the job. However we weren't happy with it and in the end we didn't go live with the site. A couple of months later I saw that all of the search engines had picked up the site - even though no links were pointing at it. It even gained good rankings in MSN and Yahoo. This post has been edited by davidbrett: Nov 22 2005, 11:01 AM |
|
|
|
Nov 22 2005, 09:29 PM
Post
#5
|
|
|
HR 4 ![]() ![]() ![]() ![]() Group: Active Members Posts: 157 Joined: 7-April 04 User's local time: Feb 9 2010, 04:57 PM Member No.: 3,158 |
I would like add few more;-
1. Google Web Accelerator 2. Google Sitemap Xml Submite 3. Froogle Submite my $0.0199 |
|
|
|
Nov 22 2005, 10:17 PM
Post
#6
|
|
|
HR 8 ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Group: Active Members Posts: 3,719 Joined: 5-April 05 User's local time: Feb 9 2010, 11:57 AM From: Seattle, WA Member No.: 7,091 |
Matt Cutts has reportedly stated that they don't get URLs from the Toolbar.
A few months ago, several of us here submitted orphaned pages to Google through their ADD URL feature and none of the pages were spidered. The other suggestions seem plausible. You can, on rare occasions, find some referer data in open logs, for example. I'm not sure Web accelerator feeds the index. I haven't seen any discussions of this possibility. A number of people have speculated that AdSense might feed the index, but Google engineers have denied that being so. |
|
|
|
Nov 23 2005, 03:25 AM
Post
#7
|
|
![]() HR 9 Group: Moderator Posts: 4,356 Joined: 13-August 03 User's local time: Feb 9 2010, 07:57 PM From: Blackpool UK Member No.: 492 |
Submitting doesn't get you in Google, eveyone from here who tried it came up with the same result, only the Yahoo crawler turned up.
The Google, MSN and Yahoo toolbars. I've used all three and checked various unlinked pages on a couple of crawled sites. Not a crawler in sight. Adsense, Had it on several unlinked pages only the Media-Partners bot visited. Same with using Opera with the ads running. |
|
|
|
Nov 23 2005, 09:08 AM
Post
#8
|
|
![]() HR 10 Group: Moderator Posts: 7,489 Joined: 24-July 03 User's local time: Feb 9 2010, 02:57 PM From: Somerville, MA Member No.: 22 |
Ditto that (except I didn't try the Yahoo or MSN toolbars).
And my submitted page, after about half a year, is still in the Yahoo index, so I guess one could say that's a legitimate way to get into Yahoo. It was dropped for a little while, but it came back without any action on my part. |
|
|
|
Nov 24 2005, 05:58 AM
Post
#9
|
|
|
HR 1 ![]() Group: Members Posts: 7 Joined: 23-October 05 User's local time: Feb 9 2010, 12:57 PM Member No.: 9,116 |
QUOTE(Michael Martinez @ Nov 22 2005, 08:17 PM) Matt Cutts has reportedly stated that they don't get URLs from the Toolbar. I've had pages with: - no inbound links - no web-facing server logs - etc etc 3 of us were working on a beta of a site, similar to the story above, and this site was indxed in G, and remained in the index. This happened in Oct 04. |
|
|
|
Nov 25 2005, 07:58 PM
Post
#10
|
|
![]() High Rankings Advisor Group: Admin Posts: 29,201 Joined: 21-July 03 User's local time: Feb 9 2010, 02:57 PM From: Ashland, MA Member No.: 2 |
QUOTE 3 of us were working on a beta of a site, similar to the story above, and this site was indxed in G, and remained in the index. This happened in Oct 04. I'd have to see it with my own eyes to believe it. |
|
|
|
Nov 28 2005, 03:55 PM
Post
#11
|
|
|
HR 4 ![]() ![]() ![]() ![]() Group: Active Members Posts: 157 Joined: 7-April 04 User's local time: Feb 9 2010, 04:57 PM Member No.: 3,158 |
I would like add few more;-
1. Google VPN 2. Port 443 (SSL Certificate Scan) 3. Register .gov, .mil, .edu domain. my $0.01999 |
|
|
|
Nov 29 2005, 11:21 AM
Post
#12
|
|
|
HR 2 ![]() ![]() Group: Active Members Posts: 40 Joined: 29-November 05 User's local time: Feb 9 2010, 01:57 PM Member No.: 9,585 |
To comment on the discussion above I have seen similar results from google on new sites. No link and just starting it gets me to wonder if they are looking at domain registration or something. Google was the first to pick up the site and this even BEFORE we started to design the site. All we did was buy the domain name and the hosting package. No I do not know for sure - even though it didn't any backlinks - but our server COULD have a list accessable to bots of new sites on their server. To this day I do not even see anything from our server as backlinks (google or yahoo). It is strange indeed.
I also see that Google is picking up MSN results and have seen Google trying to crawl funky broken up URL's like (/directory/file...) with the 3 dots. The only sites that has URLs like that (when doing a search in Google and Yahoo) is msn results and scrapers of those results. Thes links are cut off and are at the end of search results but are not linked. But who is to say there isn't something somewhere that is actually linked like that but is not showing up in searches - I highly doubt it but it could be possible. Just my few cents worth. This post has been edited by Jill: Nov 29 2005, 06:32 PM |
|
|
|
Nov 29 2005, 02:04 PM
Post
#13
|
|
![]() Convert Me! Group: Admin Posts: 17,378 Joined: 17-August 03 User's local time: Feb 9 2010, 01:57 PM Member No.: 551 |
QUOTE No link and just starting it gets me to wonder if they are looking at domain registration or something. Well, there's probably no way to prove it one way or another. But since Google became an official ICANN Accredited Registrar earlier this year... Let's just say that it's a pretty safe bet that they have a lot more information regarding domains and their ownership now than they had in the past. (IMG:http://www.highrankings.com/forum/style_emoticons/default/wink.gif) |
|
|
|
Nov 29 2005, 04:22 PM
Post
#14
|
|
|
HR 2 ![]() ![]() Group: Active Members Posts: 40 Joined: 29-November 05 User's local time: Feb 9 2010, 01:57 PM Member No.: 9,585 |
I wouldn't doubt that they would use that information. It was rather quick that they picked up the site that is for sure. Yahoo was quick to do it also. MSN didn't until we gathered a few links.
|
|
|
|
Nov 29 2005, 04:46 PM
Post
#15
|
|
|
HR 4 ![]() ![]() ![]() ![]() Group: Active Members Posts: 150 Joined: 31-October 05 User's local time: Feb 9 2010, 03:57 PM From: Charlotte, NC, USA Member No.: 9,217 |
QUOTE Let's just say that it's a pretty safe bet that they have a lot more information regarding domains and their ownership now than they had in the past. Yeah, big brother is starting to scare me. Next thing we know, our tax filings will go through them. But I have a hard time believing in G crawling new domain registrations. Sure they could take a guess at the index address and crawl from there. But what about some of the claims in this thread of G crawling non linked folders in established sites? Best idea I've heard is that G gets these pages from our PR toolbar queries. -Dan |
|
|
|
![]() ![]() ![]() |
|
Lo-Fi Version | Time is now: 9th February 2010 - 02:57 PM |