Duplicate Page Errors
-
Hey guys,
I'm wondering if anyone can help... Here is my issue...
Our website:
http://www.cryopak.com
It's built on Concrete 5 CMSI'm noticing a ton of duplicate page errors (9530 to be exact). I'm looking at the issues and it looks like it is being caused by the CMS. For instance the home page seems to be duplicating..
http://www.cryopak.com/en/
http://www.cryopak.com/en/?DepartmentId=67
http://www.cryopak.com/en/?DepartmentId=25
http://www.cryopak.com/en/?DepartmentId=4
http://www.cryopak.com/en/?DepartmentId=66Do you think this is an issue? Is their anyway to fix this issue? It seems to be happening on every page.
Thanks
Jim
-
Thanks everyone for the help. This should def. help clean up some of the problems that I've been having with the website.
-
I ran a crawl with Xenu (similar to what Donna did with Screaming Frog), and came across some deep page that may be causing this problem. For example, on this page...
...the last link to "phase change material" goes to:
http://www.cryopak.com/product_line/default.aspx?DepartmentId=67
...which then redirects to...
http://www.cryopak.com/en/?DepartmentId=67
It seems like multiple pages share that template, so one canonical tag might clean up a lot. I'd have to understand the site structure a lot better to advise, though. Google doesn't seem to be indexing these URLs, so they probably aren't a huge problem, but they could be diluting your ranking power. It's worth cleaning them up.
-
James,
I did a scan of your site. Your problem appears to have several sources. Do you know how to use the screamingfrog scan utility? It's free for sites with less than 500 pages. When I ran a scan on your site, looking only at the html pages, i came up with 283.
- You have search result pages indexed that shouldn't be. They'll look like duplicates to Google.
- You have product pages that contain a lot of the same content, for example http://www.cryopak.com/en/cold-chain-packaging/pre-qualified-shipping-containers/timesaver-2-8-c-series/timesaver24-24-hour-pre-qualified-shipper/ and http://www.cryopak.com/en/cold-chain-packaging/pre-qualified-shipping-containers/timesaver-2-8-c-series/timesaver48-48-hour-pre-qualified-shipper/ (24-24-hour vs 48-48-hour).
- You have different pages with the exact same title tag.
- You have some pages that are identical with one extra character in the URL e.g. http://www.cryopak.com/en/about and http://www.cryopak.com/en/about/. See that extra slash at the end?
I suggest you run and scan and inventory to get a good idea of where your problems are.
I'm not seeing your http://www.cryopak.com/en/?DepartmentId=xx (where xx represents 67, 25, 4 and 66) in the scan results. They're not redirecting and I don't see a canonical tag in the source code so I don't know what to tell you about those.
If it helps, I can direct message you a CSV file with the results of my scan.
-
Okay.. well I don't see any duplicate page issues in webmaster tools but I only see them in the SEO Moz Craw Errors report. So if they aren't showing up in webmaster tools should I really worry about this???
I can't edit those pages individually because those pages don't exist they are just a product of the CMS system generating those URL strings with the numbers. So I don't think I can canonical tag those pages.
I guess I can group them together and do 301 redirects??
Yes.. http://cryopak.spydertrapdev.com/ is just a dev environment.
-
Hi James,
I suggest you canonical the duplicate pages rather than 301 redirect them. Using canonical tags instead of 301 redirects will allow you to preserve any incoming link equity from external links to those pages. With a 301 redirect, you'll lose that equity.
David may have run your site through Open Site Explorer (OSE) and seen that there's very few incoming links to the duplicate pages and therefore felt it unnecessary to canonicalize them. I see only 8 from the example you gave us above, but don’t want to assume that’s all there is, especially when you're saying you see duplicates on the site, If you have webmaster tools set up, you can get a more exhaustive list of incoming links there.
The other thing I noticed is that the incoming links to the sample pages are coming from a cryopak subdomain on another site. Here are the ones I can see using OSE.
|
http://cryopak.spydertrapdev.com/product_line/default.aspx?DepartmentId=25
http://cryopak.spydertrapdev.com/product_line/default.aspx?DepartmentId=4
http://cryopak.spydertrapdev.com/product_line/default.aspx?DepartmentId=66
http://cryopak.spydertrapdev.com/product_line/default.aspx?DepartmentId=67
|
I get an error when I try to look at spydertrapdev.com so can't tell if that's a development environment that's been set up for your site or what. These may not be links you want to maintain. You’ll have to decide.
Good luck.
Donna
-
There are two ways to fix this.
First is to redirect all the pages to the proper home page, using a 301. Duplicate pages are bad for seo. Google likes to see one set of content, for each URL. See the webmaster tools article on duplicate content here.
Second is to go into webmaster tools, and set the true URL for this page, using the "URL parameters" function. This way, you can set the proper version of the page, so Google knows what to index. Be very careful when doing this, as you can mess up the way Google sees your site. There is a video on the link, I would watch it, and do a bit of reading first.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Search Console Showing 404 errors for product pages not in sitemap?
We have some products with url changes over the past several months. Google is showing these as having 404 errors even though they are not in sitemap (sitemap shows the correct NEW url). Is this expected? Will these errors eventually go away/stop being monitored by Google?
Technical SEO | | woshea0 -
404 Errors for Form Generated Pages - No index, no follow or 301 redirect
Hi there I wonder if someone can help me out and provide the best solution for a problem with form generated pages. I have blocked the search results pages from being indexed by using the 'no index' tag, and I wondered if I should take this approach for the following pages. I have seen a huge increase in 404 errors since the new site structure and forms being filled in. This is because every time a form is filled in, this generates a new page, which only Google Search Console is reporting as a 404. Whilst some 404's can be explained and resolved, I wondered what is best to prevent Google from crawling these pages, like this: mydomain.com/webapp/wcs/stores/servlet/TopCategoriesDisplay?langId=-1&storeId=90&catalogId=1008&homePage=Y Implement 301 redirect using rules, which will mean that all these pages will redirect to the homepage. Whilst in theory this will protect any linked to pages, it does not resolve this issue of why GSC is recording as 404's in the first place. Also could come across to Google as 100,000+ redirected links, which might look spammy. Place No index tag on these pages too, so they will not get picked up, in the same way the search result pages are not being indexed. Block in robots - this will prevent any 'result' pages being crawled, which will improve the crawl time currently being taken up. However, I'm not entirely sure if the block will be possible? I would need to block anything after the domain/webapp/wcs/stores/servlet/TopCategoriesDisplay?. Hopefully this is possible? The no index tag will take time to set up, as needs to be scheduled in with development team, but the robots.txt will be an quicker fix as this can be done in GSC. I really appreciate any feedback on this one. Many thanks
Technical SEO | | Ric_McHale0 -
Home Page Blog Snippets - Duplicate Content Help?
Afternoon Folks- I have been asked to contribute to a new site that has a blogfeed prominently displayed on the home page. It's laid out like this: Logo | Menu HOME PAGE SLIDER Blog 1 Title about 100 words of blog 1 Text Blog 2 Title about 100 words of blog 2 Text Blog 3 Title about 100 words of blog 3 Text Footer: -- This seems like an obvious duplicate content situation but also a way I have seen a lot of blogs laid out. (I.E. With blog content snippets being a significant portion of the home page content) I want the blogs to rank and I want the home page to rank, so I don't feel like a rel canonical on the blog post's is the correct option unless I have misunderstood their purpose. Anyone have any ideas or know how this is usually handled?
Technical SEO | | CRO_first0 -
How can I fix this home page crawl error ?
My website shows this crawl error => 612 : Home page banned by error response for robots.txt. I also did not get any page data in my account for this website ... I did get keyword rankings and traffic data, I am guessing from the analytics account. url = www.mississaugakids.com Not sure really what to do with this ! Any help is greatly appreciated.
Technical SEO | | jlane90 -
Why is it the crawler saying I have 9 Duplicate Page Titles?
Hi, I received my weekly web crawl and it is saying this: | 4 | Duplicate Page Content |
Technical SEO | | afrohairsolutions
| 22 | Missing Meta Description Tag |
| 9 | Duplicate Page Title |
| 1 | Title Element Too Long (> 70 Characters) |
| 1 | Title Element Too Short |
| 1 | 301 (Permanent Redirect) | I'm new to SEO and don't know how to fix this, I don't really see how I have Duplicate Page Content or Duplicate Page Title. This is my website: afrohairsolutions.co.uk Thank you in advance.0 -
Advice on Duplicate Page Content
We have many pages on our website and they all have the same template (we use a CMS) and at the code level, they are 90% the same. But the page content, title, meta description, and image used are different for all of them. For example - http://www.jumpstart.com/common/find-easter-eggs
Technical SEO | | jsmoz
http://www.jumpstart.com/common/recognize-the-rs We have many such pages. Does Google look at them all as duplicate page content? If yes, how do we deal with this?0 -
How to remove the 4XX Client error,Too many links in a single page Warning and Cannonical Notices.
Firstly,I am getting around 12 Errors in the category 4xx Client error. The description says that this is either bad or a broken link.How can I repair this ? Secondly, I am getting lots of warnings related to too many page links of a single page.I want to know how to tackle this ? Finally, I don't understand the basics of Cannonical notices.I have around 12 notices of this kind which I want to remove too. Please help me out in this regard. Thank you beforehand. Amit Ganguly http://aamthoughts.blogspot.com - Sustainable Sphere
Technical SEO | | amit.ganguly0 -
Duplicate content error from url generated
We are getting a duplicate content error, with "online form/" being returned numerous times. Upon inspecting the code, we are calling an input form via jQuery which is initially called by something like this: Opens Form Why would this be causing it the amend the URL and to be crawled?
Technical SEO | | pauledwards0