Duplicate content pages
-
Crawl Diagnostics Summary shows around 15,000 duplicate content errors for one of my projects, It shows the list of pages with how many duplicate pages are there for each page. But i dont have a way of seeing what are the duplicate page URLs for a specific page without clicking on each page link and checking them manually which is gonna take forever to sort.
When i export the list as CSV, duplicate_page_content column doest show any data.
Can anyone please advice on this please.
Thanks
<colgroup><col width="1096"></colgroup>
| duplicate_page_content | -
Hey there!
Thanks for writing in.
I downloaded the CSV from your Travel Pack campaign. It looks like all of the duplicate content pages are in the CSV that I exported. I found them by sorting the the rows in Excel. Here is a good guide on how to get started sorting in Excel: http://office.microsoft.com/en-us/excel-help/sort-data-in-a-range-or-table-HP010073947.aspx
Thanks!
Nick
-
Sorry if my English was not clear, it's not my first language. My issue is I can't get the list of duplicate URLs of my site...
-
If they are attached to specific strings ( String: After the URL it looks like this: /?alwer.ei.we ) you can block the string(s) in your robot.txt file.
Lets say there are 100 duplicates that start with"/?osifos.sdjvnksdj" block out the "?osifos" in your robot txt.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why would someone go to same 404 page over and over?
Good morning, I've been using the redirection plugin on my wordpress site and noticed i have multiple IP addresses going to the same folder on my site - like "mydomain.com/folder-name/". The "folder-name" is obviously not anything remotely like any folder or file name I have on my domain - so it's obviously spammy in nature. And, there are multiple IP addresses going to this same URL address every 3 hours on the dot, so it's appears automated. Is this something to be concerned about? Should I "do" anything? Thanks in advance for reading and replying!
Moz Pro | | mlm120 -
Videos on duplicate content editing
Hi, I am looking for good videos with visual examples on how to edit duplicate content issues. I am editing a law firms website, and for the most part the duplicate issues seem to show up in tag URL's on the blog. I feel like I have maybe half of the picture figured out, but I am not sure how or where to make changes. I have gone through the crawl diagnostic issues and a few articles, but I know I am a visual learner. Therefore a video might be helpful. Does anyone have any suggestions on where to get started? Thanks.
Moz Pro | | DigitalEnvy0 -
Is www.domain.com/page the same url as www.domain.com/page/ for Google? (extra slash at end of url)
Dear all, in open site explorer there is a difference the url's 'www.domain.com/page' and 'www.domain.com/page/' (extra slash at end). There can be different values in pageauthority etc. in the open site explorer tool, but is this also the case for Google? Thanks for replying, Regards, Ben
Moz Pro | | HMK-NL0 -
Transfering Page Authority
Hi, I have recently change my url architecture with site redesign and was just doing some analysis of the old and new pages. I seem to be losing a little bit of Organic Search because of it. As an example this old diving page in open site explorer shows a Page Authority of 46 whilst the new diving page shows a Page Authority of 22. I have a 301 redirect going from the old page to the new, but that seems to be quite a drop in Page Authority. Is there anything else I can be doing to improve upon it? Thanks, Adam
Moz Pro | | NaescentAdam0 -
Excel tips or tricks for duplicate content madness?
Dearest SEO Friends, I'm working on a site that has over 2,400 instances of duplicate content (yikes!). I'm hoping somebody could offer some excel tips or tricks to managing my SEOMoz crawl diagnostics summary data file in a meaningful way, because right now this spreadsheet is not really helpful. Here's a hypothetical situation to describe why: Say we had three columns of duplicate content. The data is displayed thusly: | Column A | Column B | Column C URL A | URL B | URL C | In a perfect world, this is easy to understand. I want URL A to be the canonical. But unfortunately, the way my spreadsheet is populated, this ends up happening: | Column A | Column B | Column C URL A | URL B | URL C URL B | URL A | URL C URL C | URL A | URL B | Essentially all of these URLs would end up being called a canonical, thus rendering the effect of the tag ineffective. On a site with small errors, this has never been a problem, because I can just spot check my steps. But the site I'm working on has thousands of instances, making it really hard to identify or even scale these patterns accurately. This is particularly problematic as some of these URLs are identified as duplicates 50+ times! So my spreadsheet has well over 100K cells!!! Madness!!! Obviously, I can't go through manually. It would take me years to ensure the accuracy, and I'm assuming that's not really a scalable goal. Here's what I would love, but I'm not getting my hopes up. Does anyone know of a formulaic way that Excel could identify row matches and think - "oh! these are all the same rows of data, just mismatched. I'll kill off duplicate rows, so only one truly unique row of data exists for this particular set" ? Or some other work around that could help me with my duplicate content madness? Much appreciated, you Excel Gurus you!
Moz Pro | | FMLLC0 -
Finding page authority for a list of sites
I went to Distilled's linklove conference earlier this week. Many of the presenters showed spreadsheets where they had pulled in Page Authority for a long list of sites. What tool can you use to quickly do this? For instance, I want to be able to copy a list of 100 sites into a tool and have it spit out the PA for every site...anybody know how to do this?
Moz Pro | | znotes0 -
Why aren't canonical tags reducing duplicate page title/content?
We have canonical tags set up for a feature page on one of our sites. This site has an image gallery controlled by javascript. To aid the user experience the image can also be specified by a URL parameter (the javascript also uses this URL to fetch the images). The SEOMoz report complains that the links to these images have duplicate page titles and content. To try and combat this we set canonical tags to point only to the original page, without the slideshow parameter. e.g. http://www.example.com/feature-page/ http://www.example.com/feature-page/?slideshow=1 -> canonical tag set to http://www.example.com/feature-page/ http://www.example.com/feature-page/?slideshow=2 -> canonical tag set to http://www.example.com/feature-page/ The latest SEOMoz report has come back and the errors still exist. What can we do to remove these error messages? Thanks
Moz Pro | | TJSSEO1 -
How To Solve Too Many On-Page Links In Blogger?
Hi, I Have An Issue Too Many On-Page Links In My Site And I Saw That There Are More Than 300 On Page Links On My Home Page URL. My Site Is Hosted On Blogger. So Please Tell Me How To Fix This Problem In Blogger.
Moz Pro | | MaherHackers0