The second session on my liveblogging hitllist for SES Chicago is the “Duplicate Content and Multiple Sites” moderated by Adam Audette. Unfortunatley, Michael Gray was not able to be there, so things started up with Susan Moskwa, Webmaster Trends Analyst from Google, and Shari Thurow was added on the fly.
What is duplicate content? Identical or substantially similar content. Also multiple URLs with the same content. Google realizes that duplication can be deliberate or accidental.
Basically, duplicate content in the context of a search engine is publishing different URLs that present the same content. (My Example:)
www.domainname.com
domainname.com
www.domainname.com/index.php
www.domainname.com/index.php?sid=123
Why does Google care about duplicate content? Users don’t like to see 10 nearly identical results. Also, there’s no benefit in crawling multiple urls with the same content. It’s a waste of resources for Googlebot to do that.



More and more website owners are concerned that they might get penalized accidentally or overtly because of duplicate content. For example, if you run mirror sites, will search engines ban you? If you have listings that are similar in nature, is that an issue?
Did you realize that search engines have gone full circle on URLs in variables? It used to be considered something to avoid, now search engines are saying variables in URLs are good, as long as you use the canonical meta tag. Google is pushing them with FeedBurner and if webmasters aren’t careful, they could fall victim to a new onslaught of duplicate content issues.







6 Comments