Are you a Google Analytics enthusiast?
More SEO Content
File Extensions And Crawlers?
Posted 22 December 2003 - 02:28 PM
I wouldn't use the html extsion on my actual SCRIPT scripts, I would just use it for my PHP included headers and footers.
I guess another question would be, if I do change my .htaccess file to parse PHP with a .html extension...am I still able to use the .php extension? Do they both work together?
Posted 22 December 2003 - 04:12 PM
I think I am still going to go ahead and config my htacess file to parse php via .html, for usability sake. I think some users get confused when they see .php extensions vs .html extensions...and it just looks cleaner IMO.
And if anyone needs to know...if you config your htacess file to:
AddType application/x-httpd-php .htmlPHP will work with .html and .php extensions.
Posted 22 December 2003 - 04:26 PM
The reasoning for this is that spiders (the polite ones anyway) will crawl a dynamic site considerably more slowly than a dynamic one. There's very little processing overhead to send an HTML file, so the spiders will fly through it at a nice clip. A dynamic (live generated) page requires a lot more server side resources for it to be displayed. It has to execute code and process everything - plus, if it's tied into a database it's got to make a call or two to that as well. At the end of the day, there are only so many calls per second that your server can handle before it says, "I give up!" And, when it does say that, it'll take some human intervention, in most cases to get things back up.
So, if the spider thinks it's looking at a static HTML file, it's going to hit the site a lot faster (more hits per second/minute) than it would one that is obviously dynamic.
A server, as it reaches the limit of what it can do, even if it doesn't crash, it'll start working rather like a deli line. Your browser sends its request and the server says, "Take a number." It'll then process the request when it gets to it. The faster the requests keep coming, the longer the wait and it affects pageload times across the board. All of your visitors will be "taking a number" that can delay the return of their request for measurable seconds - even before the server totally gives up.
If your site is small, this won't really matter as the spider(s) will retrieve the site and be gone fairly quickly. If it's large, though, it could take the spiders days full of continuous requests to get it all. So, during those days, your response times will be considerably slowed - for everyone involved.
Posted 22 December 2003 - 07:56 PM
Posted 22 December 2003 - 09:14 PM
Posted 03 January 2004 - 06:54 AM
However, with this input, I'm thinking that renaming to .php or .html would be better in the long run, and one could simply use 301 Redirects to send people & spiders to the newly-named pages. Of course, some of those redirects would probably have to be there forever.
Posted 03 January 2004 - 09:34 AM
Then you can put a content management system on top of the thing for new stuff. From there, there are many options depending upon what you want to do and how consistent you want everything.
I owe ya a favor or two from over the past year - drop me a line if you want some help getting started.
0 user(s) are reading this topic
0 members, 0 guests, 0 anonymous users