Duplicate Content
-
Hello All, my first web crawl has come back with a duplicate content warning for
and
slightly mystified!
thanks
paul
-
If you're still in contact with a web developer, that would be great. If you're not, a note to everyone else on this thread that the website in question is using IIS 6.0, so apache info isn't going to help in this case.
-
Just a 301 from 7index to /
-
Hi Cesar,
there is no drawback. technically www.simodal.com and www.simodal.com/ are different pages just like www.simodal.com/randompage and www.simodal.com/randompage/ would be considered different. Most people would consider /randompage a page and /randompage/ a directory. But from a SEO perspective .com and .com/ are equally good.
What you should do is to decide whether you want to use a trailing slash or not and stick to it. if you dedide not to use / on your sites root page use it consistently everywhere.
Generally speaking there are 3 often seen ways: use .html for pages and / for directorys vs. no suffix for pages (domain.tld/page ) and / for directorys vs. / for all pages and directories (wordpress uses / AFAIK). It doesnt realy matter much, take one and stick to it.
-
which is the drawback of the 301 redirect without the "/"?
-
Hi Paul
I can fully identify with your frustrations - been there!
A simple question may help you. Did you have a web developer, and are you still in relationship with him/her. If so, get them to do a 301 redirect from the www.simodal.com/index.htm to your chosen version. Most seem to do www.simodal.com/ - but with a trailing forward slash at the end. Someone else might like to comment on that.
Also as Aaron says also do it for the version without the www's ie: http://simodal.com/ and do a 301 to exactly the same URL as the above.
If you haven't got a developer there is some info around telling you exactly how to do it.
Hope this helps
-
Hey Paul,
here is the explanation:
www.simodal.com and www.simodal.com/index.htm are considered separate pages by google, although both are your sites "starting point". Some Content Management Systems (CMS) make thiis mistake, i.e. delivering the same page and not distinguishing between simodal.com/ and simodal.com/index.htm.
As said before, you should decide whether all your pages should be www.simodal oder just simodal.com. There is a great Whiteboard-Friday Video by Rand on this toppic. Then you should rewrite your URLs to either version.
Additionally you might want to add a rel canonical to your page, maybe just to your starting page. a
<link rel="canonical" href="http://www.simodal.com/" />
on your starting page would tell google to ignore the /index.htm and use /
But watch out, rel canonical is somewhat tricky...but there are good tutorials here.
To be honest: I know quiet a lot of pages, that make this mistake. Google should be able to correct this, so dont qorry about rankings. You should however do the redirect www. (or the opposite) as this will trigger googles DC filter. Also: if you plan to use SSL (https:// ) make sure that these pages are also not indexed, best by using rel canonical.
-
Hello Paul!
Because the URL is different, the crawlers look them as different pages, but as you know, they're not! It's just two ways to get there!
To solve this, you have to redirect the /index page to the non-/index, using the 301 redirection code.
Tutorial here: http://www.tamingthebeast.net/articles3/spiders-301-redirect.htm
Got it?
Hope it helps! =]
-
ThanksAaron, this is very new to me and you will have to forgive my DOH! moments.
Still don't get it. Can you point me in any direction so I can understand.
best
paul
-
It is indeed duplicate content! You might want to consider doing a redirect. I also noticed that you haven't done a redirect from the non www. domain either!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do you think my client is being hit for duplicate content?
Wordpress website. The client's website is http://www.denenapoints.com/ The URL that we purchase so that we could setup the hosting account is http://houston-injury-lawyers.com, which shows 1 page indexed in Google when I search for site:http://houston-injury-lawyers.com On http://www.denenapoints.com/ there is <link rel="<a class="attribute-value">canonical</a>" href="http://houston-injury-lawyers.com/"> But on http://houston-injury-lawyers.com it says the same thing, <link rel="<a class="attribute-value">canonical</a>" href="http://houston-injury-lawyers.com/" /> Is this how it should be setup, assuming that we want everything to point to http://denenapoints.com/? Maybe we should do a 301 redirect to be 100% Sure? Hopefully I explained this well enough. Please let me know if anyone has any thoughts, thanks!
Technical SEO | | georgetsn0 -
Duplicate content warning for a hierarchy structure?
I have a series of pages on my website organized in a hierarchy, let's simplify it to say parent pages and child pages. Each of the child pages has product listings, and an introduction at the top (along with an image) explaining their importance, why they're grouped together, providing related information, etc.
Technical SEO | | westsaddle
The parent page has a list of all of its child pages and a copy of their introductions next to the child page's title and image thumbnail. Moz is throwing up duplicate content warnings for all of these pages. Is this an actual SEO issue, or is the warning being overzealous?
Each child page has tons of its own content, and each parent page has the introductions from a bunch of child pages, so any single introduction is never the only content on the page. Thanks in advance!0 -
Development Website Duplicate Content Issue
Hi, We launched a client's website around 7th January 2013 (http://rollerbannerscheap.co.uk), we originally constructed the website on a development domain (http://dev.rollerbannerscheap.co.uk) which was active for around 6-8 months (the dev site was unblocked from search engines for the first 3-4 months, but then blocked again) before we migrated dev --> live. In late Jan 2013 changed the robots.txt file to allow search engines to index the website. A week later I accidentally logged into the DEV website and also changed the robots.txt file to allow the search engines to index it. This obviously caused a duplicate content issue as both sites were identical. I realised what I had done a couple of days later and blocked the dev site from the search engines with the robots.txt file. Most of the pages from the dev site had been de-indexed from Google apart from 3, the home page (dev.rollerbannerscheap.co.uk, and two blog pages). The live site has 184 pages indexed in Google. So I thought the last 3 dev pages would disappear after a few weeks. I checked back late February and the 3 dev site pages were still indexed in Google. I decided to 301 redirect the dev site to the live site to tell Google to rank the live site and to ignore the dev site content. I also checked the robots.txt file on the dev site and this was blocking search engines too. But still the dev site is being found in Google wherever the live site should be found. When I do find the dev site in Google it displays this; Roller Banners Cheap » admin <cite>dev.rollerbannerscheap.co.uk/</cite><a id="srsl_0" class="pplsrsla" tabindex="0" data-ved="0CEQQ5hkwAA" data-url="http://dev.rollerbannerscheap.co.uk/" data-title="Roller Banners Cheap » admin" data-sli="srsl_0" data-ci="srslc_0" data-vli="srslcl_0" data-slg="webres"></a>A description for this result is not available because of this site's robots.txt – learn more.This is really affecting our clients SEO plan and we can't seem to remove the dev site or rank the live site in Google.Please can anyone help?
Technical SEO | | SO_UK0 -
What could be the cause of this duplicate content error?
I only have one index.htm and I'm seeing a duplicate content error. What could be causing this? IUJvfZE.png
Technical SEO | | ScottMcPherson1 -
Duplicate content in Magento
Hi all We got some serious issues with duplicate content on a Magento site that we are marketing. For example: http://www.citcop.se/varmepumpar-luft-luft/panasonic/panasonic-nordic-ce9nke-5-0kw http://www.citcop.se/panasonic/panasonic-nordic-ce9nke-5-0kw http://www.citcop.se/panasonic-nordic-ce9nke-5-0kw All of the above seem to work just fine as it is now but since they are excatly the same product they should ofcourse do a 301 redirect to the main page. Any ideas on how to sort this out in Magnto without having to resort to manual work in .htaccess? Have a great day Fredrik
Technical SEO | | Resultify0 -
Large Scale Ecommerce. How To Deal With Duplicate Content
Hi, One of our clients has a store with over 30,000 indexed pages but less then 10,000 individual products and make a few hundred static pages. Ive crawled the site in Xenu (it took 12 hours!) and found it to by a complex mess caused by years of hack add ons which has caused duplicate pages, and weird dynamic parameters being indexed The inbound link structure is diversified over duplicate pages, PDFS, images so I need to be careful in treating everything correctly. I can likely identify & segment blocks of 'thousands' of URLs and parameters which need to be blocked, Im just not entirely sure the best method. Dynamic Parameters I can see the option in GWT to block these - is it that simple? (do I need to ensure they are deinxeded and 301d? Duplicate Pages Would the best approach be to mass 301 these pages and then apply a no-index tag and wait for it to be crawled? Thanks for your help.
Technical SEO | | LukeyJamo0 -
Are RSS Feeds deemed duplicate content?
If a website content management system includes built-in feeds of different categories that the client can choose from, does that endanger them of having duplicate content if their categories are the same as another client's feed? These feeds appear on templated home page designs by default. Just trying to figure out how big of an issue these feeds are in terms of duplicate content across clients' sites. Should I be concerned? Obviously, there's other content on the home page besides the feed and have not really seen negative effects, but could it be impacting results?
Technical SEO | | KyleNeuberger0 -
Duplicate content and tags
Hi, I have a blog on posterous that I'm trying to rank. SEOMoz tells me that I have duplicate content pretty much everywhere (4 articles written, 6 errors at the last crawl). The problem is that I tag my posts, and apparently SEOMoz thinks that it's duplicate content only because I don't have so many posts, so pages end up being very very similar. What can I do in these situations ?
Technical SEO | | ngw0