Duplicate pages or note? Variations just due to language changes?
-
I have some pages marked as duplicates, so I want to do what I can to solve the issues concerned.
One issue concerns duplicates where the page content is indeed the same except for the language that the content is offered in.
The URL for example of the documentation page of the site, in English is as follows:
http://www.domain.com/support/documentationWe then have the same content in German, French, Russian using the following URLs.
http://www.domain.com/de/support/documentation
http://www.domain.com/fr/support/documentation
http://www.domain.com/ru/support/documentationEach page has links to PDFs which are all in fact in English so the links to the docs are the same. Moz is flagging up all these pages as being duplicate content (which it is when translated back into English, but is not if you just consider that they are using completely different languages!)
Has anyone any thoughts on how to solve this? Or is this something not to worry about / disregard?
Many thanks
Simon
-
Ryan - thank you too for taking the time to respond.
Had a quick peek at the blog you noted - going to go back and read it v-e-r-y slowly!
Ditto, many thanks re the webconfs link - plenty of fun tools to try out there. Am sure it will all deepen my learning / confusion!Thanks again!
-
Don - thank you so much for responding
I think you may well have identified the issue too! The documentation page is the same in each case - has a table with e.g. 4 columns and 20 or so rows - and I guess much of the structural content of the pages, irrespective of the linguistic variations of the text shown on screen, is the same.
Will look closer at this.
Assuming this is the case - the next logical questions would be: Does it matter in terms of SEO? Or is it a kind of 'false positive' which can be noted but ignored? What could I do about it anyway? I guess the answer is implied in your answer above: change the template for each language?
Allied to this, is the fact that since the site is 'growing' with multiple language versions, the problem seen with this sample page will potentially be replicated all over the site. Again, the big question is about the effect on SEO. Web pages are scoring well for brand terms and other important words, and while there are new phrases and words to focus on, I am unsure whether correcting these is to prevent strict penalties or simply to make already-decent rankings as good as they can be.
Thanks in advance for any further points you care to make.
-
Hi Simon. Don has given you some good guidance. Here's a recent Moz Dev Blog post on the subject: https://moz.com/devblog/near-duplicate-detection/. Note their images explaining much of what Don described. Two pages having enough shared phrases (because of the header, footer, nav, etc) can trigger the duplicate warning. While the latter part of the dev blog post certainly gets technical, it should explain why you might be getting duplicate content warnings even further if that's your bent.
Since each tool is a bit different you can also check your pages with other tools, such as: http://www.webconfs.com. Cheers!
-
Hi Simon,
Okay so crawlers can crawl PDF's unless they are encrypted / encoded. However since they link to the PDF that shouldn't be the issue.Ref: googleblog
How much content are on these pages? I ask because when there is thin content you may find that the template itself is causing the duplication problem, unless of course you are using different templates for each language as well.
Take for example a page that reads.
en: The woman eats frozen fruit daily.
de: Die Frau isst gefrorenes Gemüse jegen tag.
es: La mujer come las verduras congeladas diariaNow surround each of those pages with a header content, footer content, right / left column content same images same alt tags and the deviation of content is so small it is not noticed.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is it easier to rank high with a front page than a landing page?
My product is laptop and of cause, I like to rank high for the keyword "laptop". Do any of you know if the search engines tends to rank a front page higher than a landing page? Eg. www.brand.com vs. www.brand.com/laptop
Technical SEO | | Debitoor0 -
Duplicate Titles on Wordpress blog pages
Hi, I have an issue where I am getting for duplicate page titles for pages that shouldn't exist. The issue is on the blog index page's (from 0 - 16) and involves the same set of attachment_id for each page, i.e. /blog/page/10/?attachment_id=minack /blog/page/10/?attachment_id=ponyrides /blog/page/11/?attachment_id=minack /blog/page/11/?attachment_id=ponyrides There are 6 attachment_id values (and they are not ID values either) which repeat for every page on the index now what I can't work out is where those 6 links are coming from as on the actual blog index page http://www.bosinver.co.uk/blog/page/10/ there are no links to it and the links just go to blog index page and it ignores the attachment_id value. There is no sitemap.xml file either which I thought might have contained the links. Thanks
Technical SEO | | leapSEO0 -
Pageing page and seo meta tag questions
Hi if i am using paging in my website there is lots of product in my website now in paging total paging is 1000 pages now what title tag i need to add for every paging page or is there any good way we can tell search engine all page or same ?
Technical SEO | | constructionhelpline0 -
Is there an easy solution for duplicate page content on a drupal CMS?
I have a drupal 7 site www.australiacounselling.com.au that has over 5000 crawl errors (!). The main problem - close to 3000 errors- is I have duplicate page content. When I create a page I can create a URL alias for the page that is SEO friendly, however every time I do this, it is registering there are 2 pages with the same content. Is there a module that you're aware of that I can have installed that would allow me to show what is the canonical page? My developers seemed stumped and have given up trying to find a solution, but I'm not convinced that it should be that hard. Any ideas from those familiar with drupal 7 would be greatly appreciated!
Technical SEO | | ClintonP0 -
What's the best way to eliminate duplicate page content caused by blog archives?
I (obviously) can't delete the archived pages regardless of how much traffic they do/don't receive. Would you recommend a meta robot or robot.txt file? I'm not sure I'll have access to the root directory so I could be stuck with utilizing a meta robot, correct? Any other suggestions to alleviate this pesky duplicate page content issue?
Technical SEO | | ICM0 -
If you only want your home page to rank, can you use rel="canonical" on all your other pages?
If you have a lot of pages with 1 or 2 inbound links, what would be the effect of using rel="canonical" to point all those pages to the home page? Would it boost the rankings of the home page? As I understand it, your long-tail keyword traffic would start landing on the home page instead of finding what they were looking for. That would be bad, but might be worth it.
Technical SEO | | watchcases0 -
How do I fix Duplicate Content/Title going to memberlist.php page?
I have over 6,000 duplicate title and duplicate content errors going to this link: http://community.mautofied.com/memberlist.php?mode=viewprofile&u=100299 How do I fix this?
Technical SEO | | mautofied0