Every site needs a robot.txt file located in its root folder. This simple text file tells the search engine spider what content it is allowed to index, and what it is not.
If your website doesn't have one you could have real problems, as it is the first thing a spider looks for on its arrival.
Type in www.your-site-name.com/robots.txt (where 'your-site-name' is, of course, your domain name) and you should see some text that starts with "User-agent". If not you don't have one.
You would be surprised how many professional web design companies forget this simple step!
Let's suppose our site has an admin folder, as well as a members folder with sub folders that we don't want to show up in Google searches.
Simply update your robots.txt file to read:
User-agent: *
Disallow: /admin
Disallow: /members
And if we wanted to disallow access to a specific page as well (eg. login.html) then
we put:
User-agent: *
Disallow: /admin
Disallow: /members
Disallow: login.html
Quite why you would want to do this we're not sure, as we all want more traffic... but a good example of how to block certain engines and allow others through can be found at: www.robotstxt.org/robots.txt
![]()
Do...
|
|
Don't...
|
![]()
|
|
[ sitemap ] Seo Book © 2008 - 2009
|