SEO Myth-Busters -- Isn't there a "duplicate content" penalty by another name here?
-
Where is that guy with the mustache in the funny hat and the geek when you truly need them?
So SEL (SearchEngineLand) said recently that there's no such thing as "duplicate content" penalties.
http://searchengineland.com/myth-duplicate-content-penalty-259657
by the way, I'd love to get Rand or Eric or others Mozzers aka TAGFEE'ers to weigh in here on this if possible.
The reason for this question is to double check a possible 'duplicate content" type penalty (possibly by another name?) that might accrue in the following situation.
1 - Assume a domain has a 30 Domain Authority (per OSE)
2 - The site on the current domain has about 100 pages - all hand coded. Things do very well in SEO because we designed it to do so.... The site is about 6 years in the current incarnation, with a very simple e-commerce cart (again basically hand coded). I will not name the site for obvious reasons.
3 - Business is good. We're upgrading to a new CMS. (hooray!) In doing so we are implementing categories and faceted search (with plans to try to keep the site to under 100 new "pages" using a combination of rel canonical and noindex. I will also not name the CMS for obvious reasons.
In simple terms, as the site is built out and launched in the next 60 - 90 days, and assume we have 500 products and 100 categories, that yields at least 50,000 pages - and with other aspects of the faceted search, it could create easily 10X that many pages.
4 - in ScreamingFrog tests of the DEV site, it is quite evident that there are many tens of thousands of unique urls that are basically the textbook illustration of a duplicate content nightmare. ScreamingFrog has also been known to crash while spidering, and we've discovered thousands of URLS of live sites using the same CMS.
There is no question that spiders are somehow triggering some sort of infinite page generation - and we can see that both on our DEV site as well as out in the wild (in Google's Supplemental Index).
5 - Since there is no "duplicate content penalty" and there never was - are there other risks here that are caused by infinite page generation?? Like burning up a theoretical "crawl budget" or having the bots miss pages or other negative consequences?
6 - Is it also possible that bumping a site that ranks well for 100 pages up to 10,000 pages or more might very well have a linkuice penalty as a result of all this (honest but inadvertent) duplicate content? In otherwords, is inbound linkjuice and ranking power essentially divided by the number of pages on a site? Sure, it may be some what mediated by internal page linkjuice, but what's are the actual big-dog issues here?
So has SEL's "duplicate content myth" truly been myth-busted in this particular situation?
???
Thanks a million!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Optimising meta tags: How to write them perfectly without duplicating? Impact of using different keywords?
Hi friends, Generally most of the articles about tags are either title rag or header tags, but not about both. I would like to know how to write perfect title and header tags. How much they must be relevant and different? Can we use the same tags for title and H1? If we are planning to rank for different keywords, can that different keywords can be used? I'm really curious to see some interesting answers for this. Thanks
Algorithm Updates | | vtmoz0 -
Does Google considers the cached content of a page if it's redirected to new page?
Hi all, If we redirect an old page to some new page, we know that content relevancy between source page and this new page matters at Google. I just wonder if Google is looking at the content relevancy of old page (from cache) and new page too. Thanks
Algorithm Updates | | vtmoz0 -
Content strategy for landing pages: Topics vs Features
Hi all, We are going to create new landing pages and optimise existing pages. We have a confusion on how to employ content on these pages....whether these will be filled with content to rank for "topics" and "keywords" or direclty jump into the features are are providing. If we go with first, users may feel boring about teaching them about that topic, if we go with latter...it's hard to rank being no related content to rank for that topic. I have seen some of the websites are employing multiple landing pages where they fill with topic related content and then link to features pages. I need suggestions here. Thank you
Algorithm Updates | | vtmoz1 -
Can we ignore "broken links" without redirecting to "new pages"?
Let's say we have reaplced www.website.com/page1 with www.website.com/page2. Do we need to redirect page1 to page2 even page1 doesn't have any back-links? If it's not a replacement, can we ignore a "lost page"? Many websites loose hundreds of pages periodically. What's Google's stand on this. If a website has replaced or lost hundreds of links without reclaiming old links by redirection, will that hurts?
Algorithm Updates | | vtmoz0 -
How do I code SEO for a secondary site without impacting the main site?
We have a secondary site for our online magazine, how do I code the SEO so I don't steal links from the main site?
Algorithm Updates | | gacwebteam0 -
Google not crawling click to expand content - suggestions?
It seems like Google confirmed this week in a G+ hangout that content in click to expand content e.g. 'read more' dropdown and tabbed content scenarios will be discounted. The suggestion was if you have content it needs to be visible on page load. Here's more on it https://www.seroundtable.com/google-index-click-to-expand-19449.html and the actual hangout, circa 11 mins in https://plus.google.com/events/cjcubhctfdmckph433d00cro9as. From a UX and usability point of view having a lot of content that was otherwise tabbed or in click to expand divs can be terrible, especially on mobile. Does anyone have workable solutions or can think of examples of really great landing pages (i'm mostly thinking ecommerce) that also has a lot of visible content? Thanks Andy
Algorithm Updates | | AndyMacLean0 -
Does adding lots of new content on a site at one time actually hurt you?
When speaking with a client today, he made the comment that he didn't want all of the new content we'd been working to be added to the site all at once for fear that he would get penalized for flooding the site with new content. I don't have any strong data to confirm or refute the claim, is there any truth to it?
Algorithm Updates | | JordanRussell0 -
Regarding google panda: would it be wise to use automatic generated content when there is no content.
Hi guys, i am currently creating a local business directory and was deciding when we first start there will be a lot of business listings without a business decription until the owner of that business come to submit a description. so when if a business listing have no business description would it be better to have an automatic generated business description like this:
Algorithm Updates | | usaccess608
www.startlocal.com.au/retail/books/tas_hobartandsouth/Scene_Magazine_2797040.html the automated genrated description for this listing on that page is:
Scene Magazine is a business that is based in Kingston, 7050, TAS: Hobart And South. Scene Magazine is listed in 2 categories including: Magazines and Periodicals Shops and Book Stores and Shops. Within the Magazines and Periodicals Shops category there are 5 businesses within 25 km of Scene Magazine. Some of those businesses included within the radius of 25 km are Island Magazine, Artemis Publishing Consultants and Bride Tasmania Magazine. would google panda affect this or not and would it be wise to use this auto content when there is no description for a business?0