<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>AuthorityLabs &#187; duplicate content</title>
	<atom:link href="http://authoritylabs.com/blog/tag/duplicate-content/feed/" rel="self" type="application/rss+xml" />
	<link>http://authoritylabs.com</link>
	<description></description>
	<lastBuildDate>Mon, 14 May 2012 20:44:45 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Alternatives to rel=&quot;canonical&quot;</title>
		<link>http://authoritylabs.com/blog/alternatives-to-rel-canonical/</link>
		<comments>http://authoritylabs.com/blog/alternatives-to-rel-canonical/#comments</comments>
		<pubDate>Mon, 17 Oct 2011 17:57:31 +0000</pubDate>
		<dc:creator>Dawn Wentzell</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Optimization]]></category>
		<category><![CDATA[canonical tag]]></category>
		<category><![CDATA[canonical URL]]></category>
		<category><![CDATA[duplicate content]]></category>

		<guid isPermaLink="false">http://authoritylabs.com/?p=2902</guid>
		<description><![CDATA[There has been some recent discussion in the SEO community about whether Google and Bing have different rules for the use of the rel=”canonical” tag. Google has said it is fine to have self-referential canonical tags (ie. the rel=&#8221;canonical&#8221; tag specifies the same URL as the page you are on), whereas Bing indicates they’d prefer [...]]]></description>
			<content:encoded><![CDATA[<p></p><div id="attachment_2904" class="wp-caption alignleft" style="width: 320px">
	<img class="size-full wp-image-2904  " src="http://authoritylabs.com/wp-content/uploads/bandaid.jpg" alt="" width="320" height="240" />
	<p class="wp-caption-text">Don&#039;t use rel=&quot;canonical&quot; as a bandaid solution</p>
</div>
<p>There has been <a href="http://nickroshon.com/seo/google-bing-disagree-on-relcanonical-implementation">some</a> <a href="http://www.johnfdoherty.com/do-bing-and-google-treat-relcanonical-differently/">recent</a> <a href="http://sphinn.com/story/225015">discussion</a> in the SEO community about whether Google and Bing have different rules for the use of the rel=”canonical” tag. Google has said it is fine to have self-referential canonical tags (ie. the rel=&#8221;canonical&#8221; tag specifies the same URL as the page you are on), whereas Bing indicates they’d prefer the canonical tag be left blank in that case.</p>
<p>The proper use of rel=&#8221;canonical&#8221; can be confusing at best, and can produce <a href="http://www.seomoz.org/blog/catastrophic-canonicalization">devastating results</a> at worst. So what is an SEO to do?</p>
<p>First of all, realize that using rel=&#8221;canonical&#8221; isn’t necessary in many cases of duplicate content. The canonical tag is a great tool for extreme situations and enterprise-level sites, but on small to medium sized websites there are often other solutions.<br />
<span id="more-2902"></span></p>
<h2>Choose Non-WWW to WWW, or Vice Versa</h2>
<p>A lot of canonical issues arise because a website is available at both the WWW and non-WWW versions of the domain, and other sites may end up linking to either version. Using your favorite method, <a href="../blog/solving-canonical-problems/">redirect the non-WWW to the WWW</a>, or vice versa. If you do this when the site is initially built, you can eliminate most instances of the wrong version being linked to (people tend to just grab whatever URL is in the address bar, anyway). Make the choice early on, and stick with it.</p>
<h2>Don’t Use URL Parameters</h2>
<p>If possible, try to avoid using parameters in URLs.</p>
<ul>
<li>If you run an e-commerce or community-based site, store all session information in a cookie rather than as parameters. This is a programming best practice to ensure users don’t get access to each others’ information.</li>
<li>Avoid specifying sort order or viewing options of a search results or product page in the URL. It is better to display the page with a <a href="http://searchnewscentral.com/20110601167/General-SEO/solving-duplicate-content-issues-arising-from-faceted-navigation.html">static URL</a>, and make use of AJAX for sorting and filtering.</li>
<li>If you use tracking parameters for referrals, replace the question mark (?) in the URL with a hashtag (#). Don’t forget to adjust your Google Analytics tracking code <a href="http://www.lunametrics.com/blog/2009/02/02/hashing-it-out-referral-tracking/">to allow hashtags</a>!</li>
</ul>
<h2>Don’t Generate URLs on the Fly</h2>
<p>Some content management systems generate page URLs on the fly, based on how the user navigated to it. I suppose the idea behind it is that the URL then becomes sort of a breadcrumb trail so the user can easily figure out how get to higher levels, but this is just a bad idea on so many levels.</p>
<p>Ideally, you want each product, article, post or resource to be available at a single, static URL, regardless of how the user got there.</p>
<h2>301 Redirects</h2>
<p>No matter how hard you try to avoid it, you sometimes end up with pages on your site being linked to in strange ways, potentially causing duplicate content issues. In cases like this, you may want to set up redirect rules that specify certain parameters (or all parameters, if you want to be extreme) get redirected back to the root page. If it only occurs on a few pages across your site, you can individually redirect them to the preferred URL (an advantage small sites have over enterprise ones).</p>
<p>In general, you should try to prevent duplicate content issues before they happen, or fix them when they do. Use of the rel=&#8221;canonical&#8221; tag is advanced and shouldn’t be applied as a blanket solution to all canonicalization problems.</p>
<p><em>Photo: <a href="http://www.flickr.com/photos/streamishmc/133584518/">Jason Tester</a>/Flickr</em></p>
]]></content:encoded>
			<wfw:commentRss>http://authoritylabs.com/blog/alternatives-to-rel-canonical/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Solving Canonical Problems with WWW</title>
		<link>http://authoritylabs.com/blog/solving-canonical-problems/</link>
		<comments>http://authoritylabs.com/blog/solving-canonical-problems/#comments</comments>
		<pubDate>Thu, 29 Sep 2011 17:31:25 +0000</pubDate>
		<dc:creator>Dawn Wentzell</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Optimization]]></category>
		<category><![CDATA[canonical URL]]></category>
		<category><![CDATA[duplicate content]]></category>
		<category><![CDATA[htaccess]]></category>
		<category><![CDATA[IIS]]></category>
		<category><![CDATA[redirects]]></category>

		<guid isPermaLink="false">http://authoritylabs.com/?p=2845</guid>
		<description><![CDATA[One of the most common problems I see in websites is the same content being available at both the WWW and non-WWW versions of a domain. I’ve encountered this in nearly every website I’ve done an SEO audit for, and I see it every day when browsing the web. Despite it being so prevalent, it [...]]]></description>
			<content:encoded><![CDATA[<p></p><p><img class="alignright size-full wp-image-2855" src="http://authoritylabs.com/wp-content/uploads/directions1.jpg" alt="" width="350" height="265" />One of the most common problems I see in websites is the same content being available at both the WWW and non-WWW versions of a domain. I’ve encountered this in nearly every website I’ve done an SEO audit for, and I see it every day when browsing the web. Despite it being so prevalent, it is indeed a problem.</p>
<p>Having the same content available on both the WWW and non-WWW versions of a domain (such as authoritylabs.com and www.authoritylabs.com) is called canonicalization. While you and I might realize they are in fact the same page, search engines mistake them to be unique pages.</p>
<p>Most of the time, search engines can figure out that they are the same page and only include the canonical URL in their index. SEObook <a href="http://www.seobook.com/glossary/#canonical-url">explains the canonical URL</a> as:</p>
<blockquote><p>The canonical version of any URL is the single most authoritative version indexed by major search engines. Search engines typically use PageRank or a similar measure to determine which version of a URL is the canonical URL.</p></blockquote>
<p>Regardless, canonicalization can result in indexing problems and duplicate content issues. Most importantly, canonicalization will split the link juice between each version as people link to and share both.</p>
<p>What you want to see is a redirection from the WWW to the non-WWW, or vice versa, so that if the wrong version is entered or linked to, the user is automatically taken to the canonical URL. Fortunately, this is relatively easy to set up.</p>
<p><span id="more-2845"></span></p>
<h2>Google Webmaster Tools</h2>
<p>If you’ve verified your site with Google Webmaster Tools, you can set your preferred domain by going to Site Configuration &gt; Settings, and selecting either ‘Display URLs as www.yourdomain.com’ or ‘Display URLs as yourdomain.com’.</p>
<p><img class="aligncenter size-medium wp-image-2846" src="http://authoritylabs.com/wp-content/uploads/preferred-domain-575x260.png" alt="" width="575" height="260" /></p>
<p>This will make sure that Google only indexes your preferred canonical URL. However, it doesn’t fix the problem of splitting your link juice so you should still set up a redirect using one of the following methods.</p>
<p>&nbsp;</p>
<h2>Redirect Using .htaccess</h2>
<p>If your site is hosted on Apache, you can redirect from the WWW to the non-WWW, or vice versa, with a few lines in your .htaccess file.</p>
<p><strong>Redirect WWW to non-WWW:</strong></p>
<pre>RewriteEngine On
RewriteCond %{HTTP_HOST} !^(yourdomain\.com)?$
RewriteRule ^(.*)$ http://yourdomain.com/$1 [R=301,L]</pre>
<p><strong>Redirect non-WWW to WWW:</strong></p>
<pre>RewriteEngine On
RewriteCond %{HTTP_HOST} !^(www\.yourdomain\.com)?$
RewriteRule ^(.*)$ http://www.yourdomain.com/$1 [R=301,L]</pre>
<p>&nbsp;</p>
<h2>Redirect Using cPanel</h2>
<p><strong>aka The Lazy Way to Redirect Using .htaccess</strong></p>
<p>If your website is hosted with a provider that uses cPanel, you can even set up your redirects without touching a line of code. This actually adds the redirect rule directly to the .htaccess file, but sometimes I’d rather not get my hands dirty. To do this, log in to your cPanel, and go to Redirects.</p>
<p><strong>Redirect WWW to non-WWW:</strong></p>
<p><img class="aligncenter size-medium wp-image-2847" src="http://authoritylabs.com/wp-content/uploads/cpanel-redirect1-575x192.png" alt="" width="575" height="192" /></p>
<p><strong>Redirect non-WWW to WWW:</strong></p>
<p><strong><img class="aligncenter size-medium wp-image-2848" src="http://authoritylabs.com/wp-content/uploads/cpanel-redirect2-575x188.png" alt="" width="575" height="188" /></strong></p>
<p>&nbsp;</p>
<h2>Redirect Using IIS7</h2>
<p>With IIS7, there are actually two ways to do this. The <a href="http://www.iis.net/download/URLRewrite">URL Rewrite extension</a> is required for this.</p>
<p>The first method involves adding the following as the first rule in the system.webServer section of the web.config file of the site in question.</p>
<p><strong>Redirect WWW to non-WWW:</strong></p>
<pre>&lt;rewrite&gt;
   &lt;rules&gt;
      &lt;rule name="www to non www"" enabled="true"&gt;
         &lt;match url="(.*)" /&gt;
         &lt;conditions&gt;
            &lt;add input="{HTTP_HOST}" negate="true" pattern="^www\.yourdomain\.com$"  /&gt;
         &lt;/conditions&gt;
         &lt;action type="Redirect" url=http://www\.yourdomain\.com/{R:1}” redirectType="Permanent" /&gt;
      &lt;/rule&gt;
   &lt;/rules&gt;
&lt;/rewrite&gt;</pre>
<p><strong>Redirect non-WWW to WWW:</strong></p>
<pre>&lt;rewrite&gt;
   &lt;rules&gt;
      &lt;rule name="non www to www" enabled="true"&gt;
         &lt;match url="(.*)" /&gt;
         &lt;conditions&gt;
            &lt;add input="{HTTP_HOST}" negate="true" pattern="^www\.youdomain\.com$" /&gt;
         &lt;/conditions&gt;
         &lt;action type="Redirect" url="http://www\.yourdomain.\com/{R:0}" redirectType="Permanent" /&gt;
      &lt;/rule&gt;
   &lt;/rules&gt;
&lt;/rewrite&gt;</pre>
<p>The second way is using the user interface of the URL Rewrite module. You can follow the <a href="http://weblogs.asp.net/owscott/archive/2009/11/27/iis-url-rewrite-rewriting-non-www-to-www.aspx">steps outlined</a> on Scott Forsyth’s blog. I suppose you could call that the lazy way to redirect in IIS7.</p>
<p><img class="aligncenter size-full wp-image-2849" src="http://authoritylabs.com/wp-content/uploads/rewrite-extension.png" alt="" width="531" height="522" /></p>
<p>&nbsp;</p>
<h2>Redirect Using nginx</h2>
<p><a title="nginx" href="http://nginx.org/">Nginx</a> is starting to gain popularity due to lower overhead and higher performance than other servers. For the redirect, you will add one of the following to the top of your site&#8217;s config file.</p>
<p><strong>Redirect WWW to non-WWW</strong></p>
<pre>server {
    listen 80;
    server_name www.yourdomain.com;
    rewrite ^/(.*) http://yourdomain.com/$1 permanent;
}</pre>
<p><strong>Redirect non-WWW to WWW</strong></p>
<pre>server {
    listen 80;
    server_name yourdomain.com;
    rewrite ^/(.*) http://www.yourdomain.com/$1 permanent;
}</pre>
<p>&nbsp;</p>
<p>Whether you’re on Apache, IIS, or nginx these methods really only take a few minutes to set up, so you really don’t have much of an excuse not to.</p>
<p><em>Photo: <a href="http://www.flickr.com/photos/_fabrizio_/1578087736/">Fabrizio Sciami</a>/Flickr</em></p>
]]></content:encoded>
			<wfw:commentRss>http://authoritylabs.com/blog/solving-canonical-problems/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>Canonical Tag is Useless</title>
		<link>http://authoritylabs.com/blog/canonical-tag-is-useless/</link>
		<comments>http://authoritylabs.com/blog/canonical-tag-is-useless/#comments</comments>
		<pubDate>Fri, 13 Feb 2009 18:05:52 +0000</pubDate>
		<dc:creator>Chase Granberry</dc:creator>
				<category><![CDATA[Optimization]]></category>
		<category><![CDATA[canonical tag]]></category>
		<category><![CDATA[duplicate content]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Live]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Yahoo!]]></category>

		<guid isPermaLink="false">http://blog.authoritylabs.com/?p=175</guid>
		<description><![CDATA[Canonical tag is a must in terms of 'best practices' but mostly it's a way for search engines to put an end to all the duplicate content questions.]]></description>
			<content:encoded><![CDATA[<p></p><p>The canonical tag is simply a way to silence webmasters.<img class="alignright size-full wp-image-176" title="Webmasters and Duplicate Content" src="http://blog.authoritylabs.com/wp-content/uploads/2009/02/webmasters.jpg" alt="Webmasters and Duplicate Content" width="120" height="143" /></p>
<p>Yesterday at SMX West it was announced that Google, Yahoo! and Live have agreed upon a method to help webmasters define original content within a site. The <strong>canonical tag</strong> is a meta tag used to tell search engines which page to list in SERPs when multiple listings of the same, or very similar content exist in their databases.</p>
<p><span id="more-175"></span></p>
<p>On-site duplicate content has been a big issue since the beginning of the web. There are lots of things that could cause search engine spiders to index the same content at multiple locations. URL canonical variations (http://www.exmaple.com vs. http://example.com) is the most common issues, but there are others. This is a <a title="Canonical Tag Explaination" href="http://www.seomoz.org/blog/canonical-url-tag-the-most-important-advancement-in-seo-practices-since-sitemaps">good explanation of the canonical tag</a> and how it&#8217;s implemented.</p>
<p>In september the Google Webmaster Central Blog published a fairly extensive post on <a title="How Search Engines Handle Duplicate Content" href="http://googlewebmastercentral.blogspot.com/2008/09/demystifying-duplicate-content-penalty.html">how search engines handle on-site duplicate content</a> in hopes of dispelling a lot of the rumors and speculation circling the issue. Despite how clear the article was, and all the other resources on the web, there were still LOTS of questions about the issue.</p>
<p>As new webmasters come into the space trying to understand SEO and mitigating duplicate content the easy thing to do is ask Google. If you&#8217;re site is set up correctly duplicate content shouldn&#8217;t be an issue. The canonical tag, is not the end-all be-all of duplicate content. You still need a sitemap, good URLs, redirection from the www or the non-www to the correct path, etc. This tag is simply an easy answer to the duplicate content question.</p>
<p>What do you think? Have you implimented the canonical tag yet? Is it really THAT important considering all the other things that still need to be done?</p>
]]></content:encoded>
			<wfw:commentRss>http://authoritylabs.com/blog/canonical-tag-is-useless/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Content Tags</title>
		<link>http://authoritylabs.com/blog/content-tags/</link>
		<comments>http://authoritylabs.com/blog/content-tags/#comments</comments>
		<pubDate>Wed, 21 Jan 2009 17:41:01 +0000</pubDate>
		<dc:creator>Chase Granberry</dc:creator>
				<category><![CDATA[Housekeeping]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[duplicate content]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[LuckyStartups]]></category>
		<category><![CDATA[Obama]]></category>
		<category><![CDATA[relevance]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[tags]]></category>
		<category><![CDATA[usability]]></category>
		<category><![CDATA[video]]></category>
		<category><![CDATA[video SEO]]></category>

		<guid isPermaLink="false">http://blog.authoritylabs.com/?p=78</guid>
		<description><![CDATA[Tag your content. Use tags that are relevant to that content. Don't stuff your tags with keywords you want to rank for. Again, be relevant, but always think about what people might search for.  Don't use all your tags in every post.  Use tags repeatedly across posts when it's appropriate.]]></description>
			<content:encoded><![CDATA[<p></p><p>Tags are all the rage these days.  We&#8217;re seeing them everywhere, but why?</p>
<p>Tags offer a few different benefits. Essentially, tags are a way to quickly visualize what a certain type of content is about. Tag clouds can convey a lot of information very legibly. Take a look, for example, at this tag cloud of Obama&#8217;s inauguration speech. Which words stand out to you?</p>
<p><img class="alignnone size-full wp-image-83" title="obamatagcloud" src="http://blog.authoritylabs.com/wp-content/uploads/2009/01/obamatagcloud.jpg" alt="obamatagcloud" width="450" height="252" /><span id="more-78"></span></p>
<p>I see: new, nation, America, every, people and common. If I hadn&#8217;t heard the speech I would have a good idea of what he spoke about and why.  Read Write Web has a great post with a few <a title="Word Cloud Innaguration Speeches" href="http://www.readwriteweb.com/archives/tag_clouds_of_obamas_inaugural_speech_compared_to_bushs.php">word clouds from different inauguration speeches</a>. These tag clouds summarize what speech writers at the time found important to the American public.</p>
<p>From a human usability perspective hopefully you can see why this is important. With a tag cloud people can quickly get an idea of what a post or blog is about. With these tags as links, they can drill down to interesting posts in only a couple clicks. From a search engine&#8217;s standpoint it&#8217;s easy to tell what your content is about also. If you look at the source code of a WordPress blog, for example, you can see that each tag in a tag cloud is explicity said what size it should be. WordPress has implemented tags in a very search friendly way. A lot of SEO is common sense usability. If you build it right for humans, and pay attention to the details, it&#8217;s going to be easy for search engines to understand.</p>
<p>The number of tags you use to describe a post is important. Again though, think about it from a usability standpoint. If you litter your content with tags, then the majority of those tags aren&#8217;t going to be relevant to the post. It&#8217;s the same reason why keywords stuffing doesn&#8217;t work. If you litter your content with keywords, just to include keywords &#8230; you&#8217;re going to confuse your readers, and you&#8217;re going to confuse the search engines. How can you tell what a post is really about when the important parts of it are diluted. <a title="Good Content Tags" href="http://www.convinceandconvert.com/"></a></p>
<p><a title="Good Content Tags" href="http://www.convinceandconvert.com/">Jason Baer over at Convince and Convert</a> has implemented tags well. On average he uses less than 10 associate with each post. He capitalizes tags that should be capitalized (according to a what looks like an AP writing style) and he includes them at the bottom of each post. I wish he would put a cloud in the right hand column, but having them at the bottom of each post gives people a way to continue reading about topic they&#8217;re interested in, and it reinforces the content within the post. I also wouldn&#8217;t use both categories and tags. Pick one or the other and stick with it. If you use both you run the risk of duplicate content, or having the same content accessible at different URLs. If you&#8217;re the New York Times, it would be something to worry about, but the average blogger should spend time on quality content, not duplicate content. Check out <a title="Official Google Duplicate Content Statement" href="http://googlewebmastercentral.blogspot.com/2008/06/duplicate-content-due-to-scrapers.html">Google&#8217;s official statement on duplicate content</a> for more info.</p>
<p>When tagging video content, there are a few other things to worry about. When most people use video within a blog post it&#8217;s uploaded at a third party provider like YouTube, or UStream for instance. There&#8217;s an entire niche around video SEO, but when it comes to content tags the important thing to worry about is making sure that your post is ranking higher than the actual video for the things you&#8217;re optimizing for. So make sure the content tags you used on your video, are on your actual post somewhere, in a way search engines can understand them. In the case of LuckyStartups, the <a title="Startup Interviews" href="http://www.luckystartups.com/">startup that interviews startups</a>, their tags are embedded in a flash file, which is useless for search purposes. Their lucky Ustream isn&#8217;t optimized that well, and that they have more original text content on the site, or they run the risk of their Ustream videos out ranking their posts.</p>
<p>In summary &#8230; Tag your content. Use tags that are relevant to that content. Don&#8217;t stuff your tags with keywords you want to rank for. Again, be relevant, but always think about what people might search for.  Don&#8217;t use all your tags in every post.  Use tags repeatedly across posts when it&#8217;s appropriate.</p>
]]></content:encoded>
			<wfw:commentRss>http://authoritylabs.com/blog/content-tags/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using apc (User agent is rejected)
Object Caching 372/375 objects using apc

Served from: authoritylabs.com @ 2012-05-17 04:12:20 -->
