Need help fixing the duplicate content that keeps growing
-
Need help fixing the duplicate content that keeps growing
-
can you do me a favour and check if http://www.............magazine.com is duplicate with the above domain that we are talking about i used your similar page checker and it said 100% we have a redirect on the domain, but i am concerned that it may not be effective
-
Thank you for your help, So these pages have no content on them yet, So I guess I need to put some content on them. Do you think that these issues affect my google ranking?
T
-
Hey T,
I want to let you know that I removed the link to your campaign from your last post and I would recommend that you don't post those types of links publicly. Our site is secure, and only admins and account owners can access data through those links, but you never know what someone may try to do maliciously with that information.
For the link you provided, we are reporting those pages as duplicate content because they are pretty much 100% similar in the code and content for the page: http://www.screencast.com/t/qMgveoj4i. (Our campaign tolerance is 90% similarity.) The only difference is the state name the pages refer to, which is not enough to make the pages different. You can verify that using this tool here: http://smallseotools.com/similar-page-checker/
Here is a great resource for learning about canonical tags: https://moz.com/learn/seo/canonicalization
Here is a post about how we detect duplicate content:https://moz.com/devblog/near-duplicate-detection/ And for more information on using canonical tags, check out this great post by our very own Dr. Pete: http://moz.com/blog/rel-confused-answers-to-your-rel-canonical-questions -
Hi Chiaryn, and Bryan
thanks for your help..I did not want to be too specific initially as I did not know how open this help request would be.The site is as mentioned above.
One question, will all these duplicate content affect the site in Google?
I can apply the User-agent: *
Disallow: ?attachmentWhat is the code to perform the canonical tag?Also please can you look at this duplicate content.. I do t understand why it is duplicate content
If I fix these issues will my site ranking improve?
Thanks again
T
-
Hey There! It looks like you are trying get some assistance without specifically naming the site you are concerned about and I definitely understand that, but it is really difficult to give advice on this issue without more detailed information. However, I took a look at your campaigns and I am going to address the issue I am seeing with the site that had the largest increase of duplicate content over the last couple of crawls. I apologize if this isn't the site you are referring to.
The campaign I'm looking at is the 5 Star campaign. It looks like a large number of the pages with duplicate content are related to ?attachment parameters in the URL, such as www.site.com/?attachment_id=77899. There is very little content on these pages and it looks like they are added to the site pretty regularly, since all of the ones I looked at are dated closely together.
I'm not an SEO expert, so Bryan may have better advice for you, but I can give a few suggestion of how to resolve this issue. I don't entirely understand the purpose of these pages, so that would affect which of these options might be best for your personal strategy for the site.
You can add a canonical tag to these pages to point to one specific page as the most important page with this content. For this option, they would have to point back to the same page or our crawler will still show them as duplicates because we assume that the two canonical pages are then also likely to be duplicates. Google, however will stop indexing these pages.
You can also block these pages from being accessed using the robots.txt file for this site. For example, it would look something like this:
User-agent: *
Disallow: ?attachmentetc., until you have covered all of the parameters you would like to block. The User-agent: * blocks all crawlers from accessing those pages, but you can also use User-agent: rogerbot to specifically block only our crawler.
I hope this helps! Please let me know if I can help you with anything else.
-
That's still not quite enough to go on. Could you provide the message they're giving you, and/or URLs regarding the duplicate content? Examples in either should prove helpful.
-
Hi thanks for the heads up.. just been very frustrating.
Moz and Google show dup internal content.
Using WordPress and Yoest SEO plugin to try to fix some of the issues.
Blog content
Regards T
-
You'll have to be a lot more specific. Try answering at least some of these questions so we can help you:
- What content is being duplicated?
- Where are the duplicates?
- Is this all internal (on your site only)?
- Are you receiving any duplicate content warnings from Moz, Google, etc.?
- In what way does it "keep[s] growing?"
- What kind of content is this?
Once you provide answers to some of these questions, I'm sure we'll be able to help you fix the issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How can I identify most relevant websites in Mexico that create content about a specific term?
Hi. I need to define the most relevant sites which are talking about a specific keyword ir order to create an PR strategy based on that term. How can I identify those sites?
Getting Started | | HarolRuiz0 -
5xx Crawl Issue might not be issues at all. Help
Hi, I ran a crawl test on our website and it came back with 900 5xx potential errors. When I started opening these links 1 by 1 I could see they were actually working. So i exported the full list of 900 and went to the website: https://httpstatus.io/ pasted the links by 100 and used that. They came back with status codes of 301 / 301 / 200 which i believe means they are okay. After reading it says that my programmer may need to see if we are blocking the MOZ BOT or to slow the MOZ BOT down. I guess I'm wondering if this is not done is the site actually having these 5xx errors when Google is Crawling or is it just showing 900 errors because of MOZ BOT but actually things are okay? I know the simple answer is to get the programmer to fix the MOZ BOT issue to know for sure but getting programmers to do things take a lot of time so I'm trying to get a better idea here. Thanks for your input.
Getting Started | | Cfarcher1 -
Do I need meta descriptions for category pages in Wordpress?
Do I need meta descriptions for category pages in Wordpress? They show up as errors in the Moz site audit. What should I do about it? Thanks,
Getting Started | | Jarod45450 -
Why is my wp-uploads folder flagged as thin content?
SIte Crawl is flagging multiple entries from my wp-uploads directory as being 'thin content': these are dated folders that contain the images for the site blog, and as such don't have any text/html content at all. Should these directories be crawled at all? How would I go about correcting these warnings?
Getting Started | | lostmotionassembly0 -
Our website has 8 subdomains for each country, do i need to set up a campaign for each? And have to upgrade to have more than 5 campaigns?
Hello there, Our website is set up to have one main domain with 8 subdomains for each country. To be able to track each subdomain do i need 8 campaigns when setting up in moz?
Getting Started | | cwpang0 -
How do I interpret Duplicate Content in a Crawl Report, when it only gives me a URL? How do I know what is duplicated on that page somewhere else?
I need help interpreting the Crawl Report for Duplicate Content. It gives me the URLs of pages that have duplicate content, but how do I know what content exactly is duplicated elsewhere? And how do I figure out where it is duplicated? Also, are there Moz Analytics articles or videos teaching you how to use each component of the analytics programs? Thanks!
Getting Started | | NancyBryan0 -
Moz's official stance on Subdomain vs Subfolder - does it need updating?
Hi, I am drawing your attention to Moz's Domain basics here: http://moz.com/learn/seo/domain It reads: "Since search engines keep different metrics for domains than they do subdomains, it is recommended that webmasters place link-worthy content like blogs in subfolders rather than subdomains. (i.e. www.example.com/blog/ rather than blog.example.com) The notable exceptions to this are language-specific websites. (i.e., en.example.com for the English version of the website)." I am wondering if this is still Moz's current recommendation on the subfolders vs subdomains debate, given that the above (sort of) implies that SE's may not combine ranking factors to the domain as a whole if subdomains are used - which (sort of) contradicts Matt Cutts last video on the matter ( http://www.youtube.com/watch?v=_MswMYk05tk ) which implies that this is not the case and there is so little difference that their recommendation is to use whatever is easiest. It would also seem to me that if you were looking through the eyes of Google, it would be silly to treat them differently if there were no difference at all other than subdomain vs subfolder as one of the main reasons a user would use a sud-domain is a technical on for which it would not make sense for Google to treat differently in terms of its algorithm. I notice that in terms of Moz, while most of the site uses subfolders, you do have http://devblog.moz.com/ - and I was wondering if this is due to a technical reason or conscious decision, as it would seem to me that the content within this section is indeed linkworthy (as it has external links pointing to it from external sources), therefore it would seem to not be following the initial advice that is posted in Moz's basics on domains. Therefore I am assuming it is due to a technical reason - or that Moz's adive is out of date with current Moz thinking, and is indeed in line with Matt C in that it doesn't matter. Cheers
Getting Started | | James773 -
Is there a way MOZ can help me get HQ links?
I'm new to MOZ, I'm on the niche sites building. Is there an easy way to find HQ pages to post to with MOZ? Like it's with Market samurai.
Getting Started | | bishop230