How can I see the pages that cause duplicate content?
-
SEOmoz PRO is giving me back duplicate content errors. However, I don't see how I can get a list of the pages that are duplicates of the one shown. If I don't know which pages/URLs cause the issue, I can't really fix it. The only way would be placing canonical tags, but that's not always the best solution.
Is there a way to see the actual duplicate pages?
-
The only other thing I can think of is that there's a duplicate page content column and a duplicate title column. If it says true in either of those columns but there are no URLs in the columns to the right of it (headed duplicate_page_content or duplicate_title), then I'd contact Moz and work with them. Mine populate fine.
-
That surely makes sense! But when I look at the column that says duplicate_page_content, nothing is shown, even for rows that are marked as true. I must be missing something...
-
OK, within that Excel file, there's a column header with "duplicate page content". The URL in question will be in the far-left column (URL), then there's a column that says "duplicate page" (with true/false as the options), and if it's true, there's another column with "duplicate page content" as a header and URLs in it. Those should be the ones that Moz caught duplicating the URL in the "URL" column, if that makes any sense at all!
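If scanning the spreadsheet by eye gets tedious, the column layout described above can be read with a short script. This is only a rough sketch: the header names (URL, duplicate_page, duplicate_page_content) are assumptions taken from this thread and may not match your export exactly, the idea that several duplicate URLs share one cell separated by commas is also a guess, and the sample rows are made up.

```python
import csv
import io


def duplicates_by_url(csv_text, url_col="URL", flag_col="duplicate_page",
                      dupes_col="duplicate_page_content", sep=","):
    """Map each URL flagged as a duplicate to the URLs listed as its duplicates.

    Column names and the in-cell separator are assumptions; adjust them to
    match the headers in your own export.
    """
    result = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Only keep rows whose true/false flag column says "true".
        if (row.get(flag_col) or "").strip().lower() != "true":
            continue
        raw = row.get(dupes_col) or ""
        # Split the cell into individual URLs, dropping empty pieces.
        result[row[url_col].strip()] = [u.strip() for u in raw.split(sep) if u.strip()]
    return result


# Made-up sample data in the assumed layout.
sample = (
    "URL,duplicate_page,duplicate_page_content\n"
    'http://example.com/a,true,"http://example.com/b, http://example.com/c"\n'
    "http://example.com/d,false,\n"
    "http://example.com/e,true,\n"
)

for url, dupes in duplicates_by_url(sample).items():
    print(url, "->", dupes if dupes else "flagged, but no duplicate URLs listed")
```

A row that is flagged true but comes back with an empty list is the "marked as true but nothing shown" case discussed in this thread, so the script also gives you a quick list of rows worth raising with Moz.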
-
True! It's really helpful! I might have one more question regarding this. When I export to CSV I get a ton of data. I open the file in Excel and separate the data into columns. The pages that have duplicate content issues are marked as "true". But how can I see within this document which pages are duplicates of a specific page?
-
No shame! There's a ton of data here and it can be a bit of a needle in a haystack at first to figure out. That's why these forums are so helpful!
-
Exactly. The download gives much deeper data; however, with the few clicks that Netlogiq suggested you can find it without downloading.
-
Ummm.. I just found it. Not having bright moments today. Shame. You must click on the number in the "Other URLs" column. I was clicking on the page title shown in the "Page title url" column.
It didn't really spring to mind to click on the number.
Everything in order! Thx for responding everyone!
-
Hmmm, not quite clear yet..
When I click on the issue in the overview, a list of pages that have a duplicate content issue opens. Then, when I click on one of those links, the only thing I see is a bold URL and some information about the duplicate content. But I don't see the URL that is a duplicate of the one displayed in bold.
-
Now, I'll preface this by saying I don't know what documents you may be looking at vs. what I have access to. I see duplicate links from SEOmoz, so you can get to it.
For example, when I log into my SEOmoz campaign information and click on the red errors box, then the duplicate content box, there's a selection of duplicate URLs right below the chart. My current one indicates it caught 29 duplicate pages of content for my Spanish signs product section, and I can see all the URLs it sees as duplicates listed out.
Granted, SEOmoz only crawls 10,000 URLs at a time, so for a major site like mine that's only part of what we have, but it's an indicator of stuff we need to fix. I download my campaign report into a CSV file, and there are columns in that identifying what's duplicate, too.
-
You can also export the document:
Crawl Diagnosis - Duplicate page content - export to CSV. Or click on the +x number of duplicate pages, and you will see all the duplicate pages for that URL.
-
Yes, you can click on the error/duplicate content link and the pages will list. It will list the other pages below the bolded listing. Hope that helps.