7,608 High Priority Crawl Diagnostic problems
-
Hey There,
I have an e-commerce site that is showing 7,608 High Priorities to fix - 7,536 are duplicate content. What's the most effective process to start with?
I'm open to outsourcing some of the work to an expert - email me on dave@emanbee.com
Thanks for your time,
Dave
-
Cheers Kate.
From doing more reading, MOZ/ Google views thin content (300 words or less) or webpages with 95% of the same HTML code as duplicate. That will be the majority of what is showing in my crawl diagnostics.
That means I'm back to your original advice of fixing up duplicate page titles from GWT.
Currently, the canonical tags are generated sitewide through a template function. Without full control over the canonical tag I can't fix or structure things as easily as I'd like so I will see if a web dev can help out with this. We should be able to add the whole link too.
Thanks again,
Dave
-
Looks like moz isn't taking the canonical into effect, as long as it's there, you're fine. But I'd warn you not to use relative canonical links ( /directory/page/ vs http://www.domain.com/directory/page/), link to the whole thing. I've seen this go wrong in the past. It's not causing issues now but could in the future.
-
Hi Katemorris,
Thanks again for getting back to me.
I have started going through and fixing up pages. I'm hoping you can clarify something from MOZ for me?
In MOZ > crawl diagnostics> duplicate page content (the largest and only high priority issue listed for me) > the first link in the list > show the duplicate pages
Below is an example of 4 links that are all showing as duplicates of http://www.mooloolabamusic.com.au/page/brands in the moz software:
http://www.mooloolabamusic.com.au/live-sound-lighting/lighting/atmospheric-effects/?pr=72-82&rf=pr
http://www.mooloolabamusic.com.au/live-sound-lighting/lighting/atmospheric-effects/?pr=0-72&rf=pr
http://www.mooloolabamusic.com.au/studio-production/?pr=1732-1828&rf=pr
http://www.mooloolabamusic.com.au/studio-production/?pr=1770-1827&rf=pr
Can you please clarify how these pages have duplicate content and how to fix this? There are thousands like this.
When I have a look at them using the moz search bar there is already a cononical tag in the header which is either not working or the moz software does not pick it up or is the site template creating 'duplicate content'?
Thanks so much for your time,
Dave
-
Start in Google Webmaster Tools or in the Moz Crawl. Identify those pages with the same title tag and work through that list. The title tag is usually a good indication of duplicate content.
If the content is duplicate for sure, determine if it's a useful duplicate. If so, use a canonical from the duplicate to the original. If it's just duplicated with no real reason, find out how to get rid of the duplicate. This can be anything from unnecessary parameters, to tag pages, and so many more.
You'll start to see trends in the data, try to fix the bigger problems as you see them.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
1 page crawled - again
Just had to let you know that it happend again. So right now we are at 2 out of the last 4 crawls. Uptime here is 99,8% for the last 30 days, with a small downtime due to an update process at the 18/5 from around 2:30 to 4:30 GMT In relation to: http://moz.com/community/q/1-page-crawled-and-other-errors
Moz Pro | | alsvik0 -
Still Cant Crawl My Site
I've removed all blocks but two from our htaccess. They are for amazonaws.com to block amazon from crawling us. I did a fetch as google in our WM tools on our robots txt with success. SEOMoz crawler here hit's our site and gets a 403. I've looks in our blocked request logs and amazon is the only one in there. What is going on here?
Moz Pro | | martJ0 -
Crawl Stats Have Dissapeared
Hi SEOmoz I received an email today that another scan has been performed but when I log into my account all the tracking details have disappeared? States Pages crawled N/A. Can someone please help? Temporary problem? Website www.vintageheirloom.com Thanks
Moz Pro | | well-its-1-louder0 -
Crawl Test - Taking too long
The last crawl test I invoked seems to be in progress for over 24 hours. The one before that completed in a few hours. Wish there was a progress indicator or an option to cancel. The crawl (from Tool > Crawl Test) should not take this long. Any ideas or suggestions? Also, the keyword research tool (plus a few others) have been down ever since I signed up. Is this a normal?
Moz Pro | | MomoMasta0 -
I've got quite a few "Duplicate Page Title" Errors in my Crawl Diagnostics for my Wordpress Blog
Title says it all, is this an issue? The pages seem to be set up properly with Rel=Canonical so should i just ignore the duplicate page title erros in my Crawl Diagnostics dashboard? Thanks
Moz Pro | | SheffieldMarketing0 -
Tool for tracking actions taken on problem urls
I am looking for tool suggestions that assist in keeping track of problem urls, the actions taken on urls, and help deal with tracking and testing a large number of errors gathered from many sources. So, what I want is to be able to export lists of url's and their problems from my current sets of tools (SEOmoz campaigns, Google WM, Bing WM,.Screaming Frog) and input them into a type of centralized DB that will allow me to see all of the actions that need to be taken on each url while at the same time removing duplicates as each tool finds a significant amount of the same issues. Example Case: SEOmoz and Google identify urls with duplicate title tags (example.com/url1 & example.com/url2) , while Screaming frog sees that example.com/url1 contains a link that is no longer valid (so terminates in a 404). When I import the three reports into the tool I would like to see that example.com/url1 has two issues pending, a duplicated title and a broken link, without duplicating the entry that both SEOmoz and Google found. I would also like to see historical information on the url, so if I have written redirects to it (to fix a previous problem), or if it used to be a broken page (i.e. 4XX or 5XX error) and is now fixed. Finally, I would like to not be bothered with the same issue twice. As Google is incredibly slow with updating their issues summary, I would like to not important duplicate issues (so the tool should recognize that the url is already in the DB and that it has been resolved). Bonus for any tool that uses Google and SEOmoz API to gather this info for me Bonus Bonus for any tool that is smart enough to check and mark as resolved issues as they come in (for instance, if a url has a 403 error it would check on import if it still resolved as a 403. If it did it would add it to the issue queue, if not it would be marked as fixed). Does anything like this exist? how do you deal with tracking and fixing thousands of urls and their problems and the duplicates created from using multiple tools. Thanks!
Moz Pro | | prima-2535090 -
Only 1 page has been crawled. Why?
I set a new profile up a fortnight ago. Last week seomoz crawled the entire site (10k pages), and this week has only crawled 1 page. Nothing's changed on the site that I'm aware of, so what's happened?
Moz Pro | | tompollard0 -
Only 1 page is being crawled by SEOmoz for the last 2 crawls
I would like to ask for the possible problem plus solution on one of our campaigns. Only 1 page is being crawled by SEOmoz for the last 2 crawls. Before the last two crawls, SEOmoz crawls numerous pages and we can’t think of a possible reason for this error. For this particular campaign , there are no data --- no errors, warnings and notices. Thanks!
Moz Pro | | TheNorthernOffice790