<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Google ? Lying ? Noooo&#8230;&#8230;</title>
	<atom:link href="http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/</link>
	<description>Eclectic.</description>
	<pubDate>Fri, 05 Dec 2008 00:47:10 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7-almost-beta</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: MacManX</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3951</link>
		<dc:creator>MacManX</dc:creator>
		<pubDate>Tue, 28 Jun 2005 18:34:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3951</guid>
		<description>&lt;blockquote&gt;Why do you ban Googlebot in the first place?&lt;/blockquote&gt;

Oh, Mark has plenty of words to say on that subject.  ^_^

http://www.tamba2.org.uk/T2/archives/2005/03/23/google-steals/
http://www.tamba2.org.uk/T2/archives/2005/03/27/more-on-google/
http://www.tamba2.org.uk/T2/archives/2005/04/24/google-screws/</description>
		<content:encoded><![CDATA[<blockquote><p>Why do you ban Googlebot in the first place?</p></blockquote>
<p>Oh, Mark has plenty of words to say on that subject.  ^_^</p>
<p><a href="http://www.tamba2.org.uk/T2/archives/2005/03/23/google-steals/" rel="nofollow">http://www.tamba2.org.uk/T2/archives/2005/03/23/google-steals/</a><br />
<a href="http://www.tamba2.org.uk/T2/archives/2005/03/27/more-on-google/" rel="nofollow">http://www.tamba2.org.uk/T2/archives/2005/03/27/more-on-google/</a><br />
<a href="http://www.tamba2.org.uk/T2/archives/2005/04/24/google-screws/" rel="nofollow">http://www.tamba2.org.uk/T2/archives/2005/04/24/google-screws/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3942</link>
		<dc:creator>Mark</dc:creator>
		<pubDate>Tue, 28 Jun 2005 11:13:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3942</guid>
		<description>I had a PR of 6 or 7 - I forget.

I have pointed this post out to them on the day I made the post. I have heard nothing from them.</description>
		<content:encoded><![CDATA[<p>I had a PR of 6 or 7 - I forget.</p>
<p>I have pointed this post out to them on the day I made the post. I have heard nothing from them.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Angsuman Chakraborty</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3941</link>
		<dc:creator>Angsuman Chakraborty</dc:creator>
		<pubDate>Tue, 28 Jun 2005 11:06:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3941</guid>
		<description>Why do you ban Googlebot in the first place? Is it because you weren't getting high ranking as you seem to indicate in comments?

I think you should let them know. They do respond fairly quickly.</description>
		<content:encoded><![CDATA[<p>Why do you ban Googlebot in the first place? Is it because you weren&#8217;t getting high ranking as you seem to indicate in comments?</p>
<p>I think you should let them know. They do respond fairly quickly.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: MacManX.com &#187; Blogroll Dive: 6/27/05</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3929</link>
		<dc:creator>MacManX.com &#187; Blogroll Dive: 6/27/05</dc:creator>
		<pubDate>Mon, 27 Jun 2005 07:08:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3929</guid>
		<description>[...] nt of view on the upcoming 9/11 memorial. Tom ruminates on RSS and its possible uses. And, Mark discovers that the Googlebot is disobeying his robots.txt file.       [...]</description>
		<content:encoded><![CDATA[<p>[...] nt of view on the upcoming 9/11 memorial. Tom ruminates on RSS and its possible uses. And, Mark discovers that the Googlebot is disobeying his robots.txt file.    </p>
<p> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: MacManX</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3928</link>
		<dc:creator>MacManX</dc:creator>
		<pubDate>Mon, 27 Jun 2005 00:54:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3928</guid>
		<description>&lt;blockquote&gt;The standard says we should obey the first applicable rule, whereas Googlebot obeys the longest (that is, the most specific) applicable rule.&lt;/blockquote&gt;

Now that's just plain idiotic.  Googlebot should obey any rules set to "User-agent: Googlebot" and any rules set to "User-agent: *", not just the longest.  Basically, Google's help pages say two things:

1. "Use robots.txt to block the Googlebot."

2. "The Googlebot will not obey the standard syntax of robots.txt file."

This remind me of the average American:

1. The average American will not vote for the President that's best for him/her.

2. He/she will vote for the President with the biggest smile.</description>
		<content:encoded><![CDATA[<blockquote><p>The standard says we should obey the first applicable rule, whereas Googlebot obeys the longest (that is, the most specific) applicable rule.</p></blockquote>
<p>Now that&#8217;s just plain idiotic.  Googlebot should obey any rules set to &#8220;User-agent: Googlebot&#8221; and any rules set to &#8220;User-agent: *&#8221;, not just the longest.  Basically, Google&#8217;s help pages say two things:</p>
<p>1. &#8220;Use robots.txt to block the Googlebot.&#8221;</p>
<p>2. &#8220;The Googlebot will not obey the standard syntax of robots.txt file.&#8221;</p>
<p>This remind me of the average American:</p>
<p>1. The average American will not vote for the President that&#8217;s best for him/her.</p>
<p>2. He/she will vote for the President with the biggest smile.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3925</link>
		<dc:creator>Mark</dc:creator>
		<pubDate>Sun, 26 Jun 2005 16:29:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3925</guid>
		<description>TigerDE2 - that could well be true ....
And this leads to what made me ban Googlebot in the first place - go to google and look for "Mark tamba2 wordpress". Now if Googlebot has been taking my data, why will it not return that data in a search ?

I'm going to look again.</description>
		<content:encoded><![CDATA[<p>TigerDE2 - that could well be true &#8230;.<br />
And this leads to what made me ban Googlebot in the first place - go to google and look for &#8220;Mark tamba2 wordpress&#8221;. Now if Googlebot has been taking my data, why will it not return that data in a search ?</p>
<p>I&#8217;m going to look again.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cameron aka desk003</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3924</link>
		<dc:creator>Cameron aka desk003</dc:creator>
		<pubDate>Sun, 26 Jun 2005 16:21:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3924</guid>
		<description>Mark, you were just owned by TigerDE2. :razz:

I think (s)he's correct.</description>
		<content:encoded><![CDATA[<p>Mark, you were just owned by TigerDE2. :razz:</p>
<p>I think (s)he&#8217;s correct.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: TigerDE2</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3923</link>
		<dc:creator>TigerDE2</dc:creator>
		<pubDate>Sun, 26 Jun 2005 16:16:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3923</guid>
		<description>Well, I'm not a native English speaker, but to me, your last quote says
&lt;blockquote&gt;Googlebot is taking the longest rule applicable it can find.&lt;/blockquote&gt;
So, basically, Googlebot sees:
&lt;blockquote&gt;User-agent: Googlebot
Disallow: /&lt;/blockquote&gt;

which is very specific but two lines long, and then it finds this:
&lt;blockquote&gt;User-agent: *
Disallow: /gallery
Disallow: /games
Disallow: /images
Disallow: /nota
Disallow: /stats
Disallow: /upb
Disallow: /getout.php&lt;/blockquote&gt;

And that rule includes Googlebot and is &lt;i&gt;way&lt;/i&gt; longer than the first one, so it's picked...
At least that's what I think... :wink:</description>
		<content:encoded><![CDATA[<p>Well, I&#8217;m not a native English speaker, but to me, your last quote says</p>
<blockquote><p>Googlebot is taking the longest rule applicable it can find.</p></blockquote>
<p>So, basically, Googlebot sees:</p>
<blockquote><p>User-agent: Googlebot<br />
Disallow: /</p></blockquote>
<p>which is very specific but two lines long, and then it finds this:</p>
<blockquote><p>User-agent: *<br />
Disallow: /gallery<br />
Disallow: /games<br />
Disallow: /images<br />
Disallow: /nota<br />
Disallow: /stats<br />
Disallow: /upb<br />
Disallow: /getout.php</p></blockquote>
<p>And that rule includes Googlebot and is <i>way</i> longer than the first one, so it&#8217;s picked&#8230;<br />
At least that&#8217;s what I think&#8230; :wink:</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3922</link>
		<dc:creator>Mark</dc:creator>
		<pubDate>Sun, 26 Jun 2005 08:01:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3922</guid>
		<description>From the help pages:
&lt;blockquote&gt;The standard says we should obey the first applicable rule, whereas Googlebot obeys the longest (that is, the most specific) applicable rule.&lt;/blockquote&gt;
In my robots.txt, the longest does not apply, the second is as specific as you can get.
Robots.txt is created to allow flexibility and that is an option I wish to exercise. MSNBot is welcome here, Googlebot is not.

I want the Google person to tell me how to exclude their bot because even though I am following what i think are the rules, Googlebot is disobeying them.
If I have not heard a decent reply in a few days I will post this over at Webmasterworld and other such forums to both publicise it and get more information.

Fact is that even when Google did crawl my site they refuse to return me in results. They've taken 100meg of my data - go search for 'tamba2' - you will not find a single direct link. Not one. Hardly fair is it ?</description>
		<content:encoded><![CDATA[<p>From the help pages:</p>
<blockquote><p>The standard says we should obey the first applicable rule, whereas Googlebot obeys the longest (that is, the most specific) applicable rule.</p></blockquote>
<p>In my robots.txt, the longest does not apply, the second is as specific as you can get.<br />
Robots.txt is created to allow flexibility and that is an option I wish to exercise. MSNBot is welcome here, Googlebot is not.</p>
<p>I want the Google person to tell me how to exclude their bot because even though I am following what i think are the rules, Googlebot is disobeying them.<br />
If I have not heard a decent reply in a few days I will post this over at Webmasterworld and other such forums to both publicise it and get more information.</p>
<p>Fact is that even when Google did crawl my site they refuse to return me in results. They&#8217;ve taken 100meg of my data - go search for &#8216;tamba2&#8242; - you will not find a single direct link. Not one. Hardly fair is it ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cameron aka desk003</title>
		<link>http://www.tamba2.org.uk/T2/2005/06/25/google-lying-noooo/#comment-3921</link>
		<dc:creator>Cameron aka desk003</dc:creator>
		<pubDate>Sun, 26 Jun 2005 01:17:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.tamba2.org.uk/T2/?p=1485#comment-3921</guid>
		<description>this is mine:
User-agent: *
Disallow: /cgi-bin/
Disallow: /private/
Disallow: /scgi-bin/
Disallow: /old/
Disallow: /new/
Disallow: /backup/
Disallow: /_images/
Disallow: /webalizer/
Disallow: /willoway/
Disallow: /stuff/
Disallow: /images/

and I think that keeps googlebot at bay.</description>
		<content:encoded><![CDATA[<p>this is mine:<br />
User-agent: *<br />
Disallow: /cgi-bin/<br />
Disallow: /private/<br />
Disallow: /scgi-bin/<br />
Disallow: /old/<br />
Disallow: /new/<br />
Disallow: /backup/<br />
Disallow: /_images/<br />
Disallow: /webalizer/<br />
Disallow: /willoway/<br />
Disallow: /stuff/<br />
Disallow: /images/</p>
<p>and I think that keeps googlebot at bay.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
