Duplicate Content
-
Hello All, my first web crawl has come back with a duplicate content warning for
and
slightly mystified!
thanks
paul
-
If you're still in contact with a web developer, that would be great. If you're not, a note to everyone else on this thread: the website in question is running IIS 6.0, so Apache info isn't going to help in this case.
-
Just a 301 from /index to /
-
Hi Cesar,
There is no drawback. Technically www.simodal.com and www.simodal.com/ are different pages, just like www.simodal.com/randompage and www.simodal.com/randompage/ would be considered different. Most people would consider /randompage a page and /randompage/ a directory. But from an SEO perspective .com and .com/ are equally good.
What you should do is decide whether you want to use a trailing slash or not and stick to it. If you decide not to use / on your site's root page, apply that consistently everywhere.
Generally speaking, there are three common conventions: .html for pages and / for directories; no suffix for pages (domain.tld/page) and / for directories; or / for all pages and directories (WordPress uses / AFAIK). It doesn't really matter much; pick one and stick to it.
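To make the "pick one and stick to it" advice concrete, here is a minimal Python sketch that rewrites a URL to follow one trailing-slash policy. The function name `apply_slash_policy` and the rule that a last segment containing a dot counts as a file are assumptions for illustration, not something from this thread; the sketch also always writes the site root as `/`.

```python
from urllib.parse import urlsplit, urlunsplit


def apply_slash_policy(url: str, trailing_slash: bool) -> str:
    """Return url with its path rewritten to follow one trailing-slash policy.

    Hypothetical helper: file-like paths (last segment contains a dot,
    e.g. /index.htm) are returned without a trailing slash, and the
    site root is always written as "/".
    """
    parts = urlsplit(url)
    stripped = parts.path.rstrip("/")
    if not stripped:
        path = "/"                                  # site root
    elif "." in stripped.rsplit("/", 1)[-1]:
        path = stripped                             # looks like a file
    else:
        path = stripped + ("/" if trailing_slash else "")
    return urlunsplit(parts._replace(path=path))


# Example: enforce "trailing slash everywhere" on a few variants.
for candidate in (
    "http://www.simodal.com",
    "http://www.simodal.com/randompage",
    "http://www.simodal.com/randompage/",
):
    print(candidate, "->", apply_slash_policy(candidate, trailing_slash=True))
```

Running every internal link through something like this before publishing is one way to keep the chosen convention consistent; the actual 301s still have to be set up on the server.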
-
What is the drawback of the 301 redirect without the "/"?
-
Hi Paul
I can fully identify with your frustrations - been there!
A simple question may help you: did you have a web developer, and are you still in contact with him/her? If so, get them to do a 301 redirect from www.simodal.com/index.htm to your chosen version. Most seem to use www.simodal.com/ with a trailing forward slash at the end. Someone else might like to comment on that.
Also, as Aaron says, do it for the version without the www, i.e. http://simodal.com/, and 301 it to exactly the same URL as above.
If you haven't got a developer, there is some info around telling you exactly how to do it.
Hope this helps
-
Hey Paul,
here is the explanation:
www.simodal.com and www.simodal.com/index.htm are considered separate pages by Google, although both are your site's "starting point". Some content management systems (CMS) make this mistake, i.e. delivering the same page and not distinguishing between simodal.com/ and simodal.com/index.htm.
As said before, you should decide whether all your pages should be on www.simodal.com or just simodal.com. There is a great Whiteboard Friday video by Rand on this topic. Then you should rewrite your URLs to whichever version you chose.
Additionally, you might want to add a rel canonical to your page, maybe just to your starting page. A
<link rel="canonical" href="http://www.simodal.com/" />
on your starting page would tell Google to ignore the /index.htm and use /
But watch out, rel canonical is somewhat tricky...but there are good tutorials here.
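If you want to check what a page actually declares, here is a rough Python sketch that pulls the rel canonical URLs out of a page's HTML using only the standard library. The class and function names (`CanonicalFinder`, `find_canonicals`) are made up for this example, and the sample HTML is just the tag from above wrapped in a bare page.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collects the href of every <link rel="canonical"> tag it sees."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        # Attribute values can be None when written without a value.
        if tag == "link" and (attr.get("rel") or "").lower() == "canonical":
            self.canonicals.append(attr.get("href"))


def find_canonicals(html: str) -> list:
    """Return all canonical URLs declared in the given HTML (hypothetical helper)."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonicals


sample = (
    "<html><head>"
    '<link rel="canonical" href="http://www.simodal.com/" />'
    "</head><body></body></html>"
)
print(find_canonicals(sample))
```

A page should normally end up with exactly one entry in that list; zero or several is worth a second look.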
To be honest, I know quite a lot of sites that make this mistake. Google should be able to correct for it, so don't worry about rankings. You should, however, redirect to www. (or the opposite), as leaving both versions live will trigger Google's duplicate content (DC) filter. Also, if you plan to use SSL (https://), make sure that those pages are not indexed as well, ideally by using rel canonical.
-
Hello Paul!
Because the URLs are different, the crawlers see them as different pages, but as you know, they're not! It's just two ways to get there!
To solve this, you have to redirect the /index page to the non-/index version with a 301 redirect.
Tutorial here: http://www.tamingthebeast.net/articles3/spiders-301-redirect.htm
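To show the logic behind such a redirect (independent of the web server, which still has to be configured separately), here is a small Python sketch that decides where a given URL should 301 to. The preferred host, the list of index file names and the function name `redirect_target` are assumptions chosen for this example.

```python
from urllib.parse import urlsplit, urlunsplit

CANONICAL_HOST = "www.simodal.com"            # assumed preferred host
BARE_HOST = "simodal.com"                     # non-www variant to fold in
INDEX_NAMES = {"index.htm", "index.html"}     # assumed default-document names


def redirect_target(url: str):
    """Return the URL this request should 301 to, or None if it is already canonical."""
    parts = urlsplit(url)
    original_host = parts.netloc.lower()
    original_path = parts.path or "/"         # empty path means the root

    host = CANONICAL_HOST if original_host == BARE_HOST else original_host

    # Drop a trailing index file so /index.htm and / collapse to one URL.
    directory, _, filename = original_path.rpartition("/")
    path = directory + "/" if filename.lower() in INDEX_NAMES else original_path

    if host == original_host and path == original_path:
        return None                           # nothing to redirect
    return urlunsplit((parts.scheme, host, path, parts.query, parts.fragment))


for candidate in (
    "http://www.simodal.com/index.htm",
    "http://simodal.com/",
    "http://www.simodal.com/",
):
    print(candidate, "->", redirect_target(candidate))
```

Every duplicate variant should point at the same final URL in a single hop; chaining one 301 into another is best avoided.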
Got it?
Hope it helps! =]
-
Thanks Aaron, this is very new to me and you will have to forgive my DOH! moments.
Still don't get it. Can you point me in any direction so I can understand?
best
paul
-
It is indeed duplicate content! You might want to consider doing a redirect. I also noticed that you haven't set up a redirect from the non-www domain either!
Related Questions
-
Duplicate content when working with makes and models?
Okay, so I am running a store on Shopify at the address https://www.rhinox-group.com. This store is reasonably new, so it is being updated constantly! The thing that is really annoying me at the moment, though, is that I am getting errors in the form of duplicate content. This seems to be because we organise by machine make and model, which is obviously imperative, but then we have various products for each machine make and model. Has anyone got any suggestions on how I can cut down on these errors, as the last thing I want is to be penalised by Google for this! Thanks in advance, Josh
Technical SEO | | josh.sprakes1 -
Is there a percentage of duplicate content required before you should use a canonical tag?
Is there a percentage (approximate or exact) of duplicate content you should have before you use a canonical tag? Similarly, how does Google handle canonical tags if the pages aren't 100% duplicate? I've added some background and an example below. Nike Trainer model 1 has an overview page that also links to a sub-page about cushioning, one about Gore-Tex and one about breathability. Nike Trainer models 2, 3, 4 and 5 each have an overview page that also links to sub-pages about cushioning, Gore-Tex and breathability. Each sub-page URL is a child of its parent, so the pages are distinct from each other, e.g. /nike-trainer/model-1/gore-tex and /nike-trainer/model-2/gore-tex. There are some differences in material composition, some different images, and of course the product name is referred to multiple times. This makes each page in the region of 80% unique.
Technical SEO | | punchseo0 -
Headers & Footers Count As Duplicate Content
I've read a lot of information about duplicate content across web pages and was interested in finding out how that affects the header and footer of a website. A lot of my pages have a good amount of content, but there are some shorter articles on my website. Since my website has a header, footer, and sidebar that are static, could that hurt my ranking? My only concern is that sometimes there's more content in the header/footer/sidebar than in the article itself, since I have an extensive amount of navigation. Is there a way to tell Google what the header and footer are so that they don't consider them to be duplicate content?
Technical SEO | | CyberAlien0 -
Why are some pages now duplicate content?
It is probably a silly question, but all of a sudden, the following pages of one of my clients are reported as duplicate content. I cannot understand why. They weren't before...
http://www.ciaoitalia.nl/product/pizza-originale/mediterranea-halal
http://www.ciaoitalia.nl/product/pizza-originale/gyros-halal
http://www.ciaoitalia.nl/product/pizza-originale/döner-halal
http://www.ciaoitalia.nl/product/pizza-originale/vegetariana
http://www.ciaoitalia.nl/product/pizza-originale/seizoen-pizza-estate
http://www.ciaoitalia.nl/product/pizza-originale/contadina
http://www.ciaoitalia.nl/product/pizza-originale/4-stagioni
http://www.ciaoitalia.nl/product/pizza-originale/shoarma
Thanks for any help in the right direction 🙂
Technical SEO | | MarketingEnergy0 -
Duplicate content problem?
Hello! I am not sure if this is a problem or if I am just making something too complicated. Here's the deal. I took on a client who has an existing site in something called homestead. Files cannot be downloaded, making it tricky to get out of homestead. The way it is set up is new sites are developed on subdomains of homestead.com, and then your chosen domain points to this subdomain. The designer who built it has kindly given me access to her account so that I can edit the site, but this is awkward. I want to move the site to its own account. However, to do so Homestead requires that I create a new subdomain and copy the files from one to the other. They don't have any way to redirect the prior subdomain to the new one. They recommend I do something in the html, since that is all I can access. Am I unnecessarily worried about the duplicate content consequences? My understanding is that now I will have two subdomains with the same exact content. True, over time I will be editing the new one. But you get what I'm sayin'. Thanks!
Technical SEO | | devbook90 -
Thin/Duplicate Content
Hi Guys, So here's the deal: my team and I just acquired a new site that was built using some questionable tactics. Only about 5% of the entire site is actually written by humans; the rest of the 40k+ pages (increasing by 1-2k auto-generated pages a day) are all autogen + thin content. I'm trying to convince the powers that be that we cannot continue to do this. Now, I'm aware of the issue, but my question is: what is the best way to deal with it? Should I noindex these pages at the directory level? Should I 301 them to the most relevant section where actual valuable content exists? So far it doesn't seem like Google has caught on to this yet, and I want to fix the issue while not raising any more red flags in the process. Thanks!
Technical SEO | | DPASeo0 -
Duplicate Content Issue
Hi Everyone, I ran into a problem I didn't know I had (thanks to the SEOmoz tool) regarding duplicate content. My site is oxford ms homes.net, and when I built the site, the web developer used PHP to build it. After he was done I saw that the URLs looked like this, "/blake_listings.php?page=0", and I wanted them like this, "/blakes-listings". He changed them with no problem, and he did the same with all 300 pages or so that I have on the site. I just found using the crawl diagnostics tool that I have about 3,000 duplicate content issues. Is there an easy fix for this at all, or does he have to go in and 301 redirect EVERY SINGLE URL? Thanks for any help you can give.
Technical SEO | | blake-766240 -
Large Scale Ecommerce. How To Deal With Duplicate Content
Hi, One of our clients has a store with over 30,000 indexed pages but fewer than 10,000 individual products and maybe a few hundred static pages. I've crawled the site in Xenu (it took 12 hours!) and found it to be a complex mess caused by years of hacked-on add-ons, which have led to duplicate pages and weird dynamic parameters being indexed. The inbound link structure is spread over duplicate pages, PDFs and images, so I need to be careful to treat everything correctly. I can likely identify and segment blocks of 'thousands' of URLs and parameters which need to be blocked; I'm just not entirely sure of the best method. Dynamic parameters: I can see the option in GWT to block these. Is it that simple? (Do I need to ensure they are deindexed and 301'd?) Duplicate pages: would the best approach be to mass 301 these pages and then apply a noindex tag and wait for it to be crawled? Thanks for your help.
Technical SEO | | LukeyJamo0