SEO Class in Chicago, IL
Learn How To Optimize Your Website on July 26, 2013
High Rankings is offering a 1-day customized SEO training class in Chicago. Class size is limited so please sign-up now if you want in!
Are you a Google Analytics enthusiast?
Share and download Custom Google Analytics Reports, dashboards and advanced segments--for FREE!

www.CustomReportSharing.com
From the folks who brought you High Rankings!
More SEO Content
Most Links To Duplicate Page
#1
Posted 26 May 2012 - 02:26 AM
I have just discovered an old unloved website whereby urls are internally linked without the .html extension, but (to my horror) all of the external backlinks to these inner pages are WITH the .html.
Both versions serve up the same page, so is duplicate content.
Ideally, I would like to stick with the site's normal url protocol of using no .html.
If I 301 all .html versions to non .html, then most of the website's 15k links will also be 301'ing to the non .html versions.
Would I be better to ditch the entire site structure and stick get it totally reindexed with the .html versions (so the links are correct), or can I trust that 301 will pass the link juice through the 301's? Baring mind these pages have been live for years and rank well!
Thanks
#2
Posted 26 May 2012 - 10:42 AM
Baring mind these pages have been live for years and rank well!
Then why not leave well enough alone? Sounds like Google has already worked it out for you.
#3
Posted 26 May 2012 - 12:11 PM
#4
Posted 26 May 2012 - 04:23 PM
If I do a site: command the few incorrectly .html extensioned pages show really low down (2 or 3rd page). Despite this they do still rank well, presumably as they have a ton of inbounds.
This to me indicates they aren't being given their due credit, perhaps as the juice is divided across 2 pages? They "could" rank even higher.
The problem for me though is how much is lost through 301'ing the majority of the website's backlinks.
#5
Posted 26 May 2012 - 04:33 PM
#6
Posted 27 May 2012 - 01:32 AM
#7
Posted 27 May 2012 - 05:20 AM
In this case, using the word protocol despite it being absolutely grammatically correct and appropriate for the statement. It may cause some confusion for readers without a good working knowledge of the English language thus leading to some mis-understanding how "protocol" is being used. Here being employed in the diplomatic sense of:OK thanks Jill, I think I will 301 to the correct non html protocol. Fingers crossed!
"customs or regulations when dealing with behaviour or formality"
For this topic,and for the purpose of avoiding any ambiguity or confusion with "communication protocol" the ".html" version of the URI would/should be referred to as the "canonical version"
And yes this is me in my pedantic and "nit-picking" "primary school teacher" persona.
Edited by chrishirst, 27 May 2012 - 05:21 AM.
0 user(s) are reading this topic
0 members, 0 guests, 0 anonymous users








