Is Google Reading Your Robots.txt File?
Posted by Mitch Mitchell on Mar 16, 2009
Last week, I happened upon a post by our buddy Caleb of The Market Secrets Blog titled How Search Engine Robots Read Your Site. Once I got past the image of the stupid spider (had to be a spider, didn’t it, Caleb?), I saw that the article includes a box for something called Search Engine Spider Simulator. You put in your URL (that’s the link to your blog or website, for the uninitiated), click “submit”, and it shows you how the search engines go through your site.
I put in the URL for this blog, and I got nothing. I knew that couldn’t be right, because I have the WordPress Google XML Sitemap plugin on the blog. But there it was: not a single thing. I then put in the URLs of my two business sites, and all sorts of pages came up.
At that point, I decided to come back to my blog and go into the settings for the plugin. Truthfully, I’d never been in there before; I had just accepted whatever its default settings gave me. Near the top, under the Basic Options area, there’s a box, unchecked in my case, labeled “Add sitemap URL to the virtual robots.txt file.” I checked that, saved it, and waited a day. I then went back to Caleb’s article (forgetting about that stupid spider) and put in my blog’s URL once again. This time, it all worked perfectly; wow!
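For anyone curious what that checkbox changes, here is a minimal sketch in Python of how you might check whether a robots.txt file advertises a sitemap. The sample file contents and the example.com address are made up for illustration; they are not what the plugin actually writes for this blog. The helper name `sitemap_urls` is also my own invention.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents; the domain and paths are placeholders.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Sitemap: https://example.com/sitemap.xml
"""


def sitemap_urls(robots_txt):
    """Return the sitemap URLs listed in the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() (Python 3.8+) returns None when no Sitemap line is present.
    return parser.site_maps() or []


print(sitemap_urls(SAMPLE_ROBOTS_TXT))
```

If the list comes back empty, the file has no Sitemap line, which is roughly the situation I seem to have been in before ticking the box.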
It makes me wonder now if, later on, I might actually start attaining some kind of page rank on some of my internal pages, now that I know for sure that the search engines will be going through them more often than just when I first wrote them. I don’t know for sure, but I guess we’ll find out.
If you want to learn more about robots.txt files, you can check out the Web Robots Pages site, but I’ll tell you the truth: I don’t have a robots.txt file for most of my sites. Instead, I use a site called XML Sitemaps.org, where you can create many different types of sitemap files, then upload them to your site to help the search engines “spider” it. It works just fine for my regular sites, but it doesn’t seem to work well for my blogs, probably because I’m uploading that file to the wrong place. No matter, now that I know better how to use the WordPress Google XML Sitemap plugin.
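To give a rough idea of what one of those sitemap files contains, here is a small Python sketch that builds a bare-bones XML sitemap. The page addresses and the function name `build_sitemap` are placeholders of mine, and a real generator would normally add optional details such as last-modified dates, so treat this as an illustration rather than what any particular tool produces.

```python
import xml.etree.ElementTree as ET

# Namespace used by the sitemap protocol published at sitemaps.org.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(page_urls):
    """Return a minimal XML sitemap, as a string, listing the given pages."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for page_url in page_urls:
        url_element = ET.SubElement(urlset, "url")
        ET.SubElement(url_element, "loc").text = page_url
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical pages; swap in your own addresses.
print(build_sitemap(["https://example.com/", "https://example.com/about/"]))
```

The resulting text would be saved as a file such as sitemap.xml and uploaded to the site; the file name and location here are assumptions on my part.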
Copyright secured by Digiprove © 2010 Mitch Mitchell



I'm Just Sharing is where I share my thoughts on internet marketing, writing, blogging and many other things. You never know what I'll be posting on. So keep coming back, read, enjoy, and buy something! ;)


And I’m more than sure your PR is about to rise!
BTW: Learn the ways of Peter Parker, Mitch… Spiders are our friends
(First time I commented it didn’t seem to go through, but if it pops up just delete it)
Caleb Reply:
March 17th, 2009 at 6:39 PM
I see you’re not the only one who didn’t know about the “tick box”.
I suspected others might have been experiencing this and not even realize it.
Caleb (Market Secrets Blogger) Reply:
March 20th, 2009 at 2:19 PM
So who’s down for telling them ‘wassup’…Sire perhaps??
Mitch Reply:
March 17th, 2009 at 6:40 PM
By the way, what country do you live in? I’m unfamiliar with Rhondda Cynon Taff.