Posted by Melanie Phung on Monday, April 10, 2006 at 8:25 pm
Can anyone tell me what’s wrong with this picture?
It’s what appears on the first page of Google results today if you do a search on the term “technorati.” It’s a link to the del.icio.us page of items tagged “technorati.” But del.icio.us pages all use < name="robots" content="noarchive,nofollow,noindex">. In other words, the robots instructions on the page tell the search engines not to index the page! Noindex means it shouldn’t show up in search results.
What gives? Has Google started ignoring noindex?
Two thoughts: 1) Get ready for some aggressive del.icio.us tag spamming, and 2) how do we avoid getting in trouble for duplicate content if we can’t keep Google from indexing dupe pages using the standard robots exclusion?
Update: This question was answered in my subsequent post, Google’s Interpretation of ‘noindex’.
Updated June 19: Looks like Google is indeed indexing pages that are tagged “no follow.” See this recent Webmaster World discussion.
No comments yet.
Sorry, the comment form is closed at this time.