How can I clean up my crawl report from duplicate records?
-
I am viewing my Crawl Diagnostics Report.
My report is filled with data which really shouldn't be there. For example I have a page:
http://www.terapvp.com/forums/Ghost/
This is a main forum page. It contains a list of many threads. The list can be sorted on many values. The page is canonicalized, and has been since it was created.
My crawl report shows this page listed 15 times.
http://www.terapvp.com/forums/Ghost/?direction=asc
http://www.terapvp.com/forums/Ghost/?direction=desc
http://www.terapvp.com/forums/Ghost/?order=post_date
and so forth. Each of those pages uses the same canonicalization reference shared above.
I have three questions:
-
Why is this data appearing in my crawl report? These pages are properly canonicalized.
-
If these pages are supposed to appear in the report for some reason, how can I remove them? My desire is to focus on any pages which may have an issue which needs to be addressed.
This site has about 50 forum pages and when you add an extra 15 pages per forum, it becomes a lot harder to locate actionable data. To make matters worse, these forum indexes often have many pages. So if I have a "Corvette" forum there that is 10 pages long, then there will be 150 extra pages just for that particular forum in my crawl report.
- Is there anything I am missing? To the best of my knowledge everything is set up according to the best SEO practices. If there is any other opinions, I would like to hear them.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Can you highlight a list of keywords in reports?
Hello all, My client recently asked me if there was a way to highlight a specific list of keywords to put on the first page of every report. These would be the highest priority keywords that we would always want to know the status of for his site. In the reports section, I am only seeing options for organizing the keywords by rankings improved/declined and comparing to competitors. Would anyone know how to label/categorize this list of keywords and then produce them as part of the monthly reports? Thank you,
Moz Pro | | Level2Designs
Daniel0 -
SEO moz Report Card
I just ran some on page report cards. As I was playing around with the tool I noticed that I would get different results if I used my primary domain vs a 2nd domain. The main difference was in how the tool was counting keywords on the page. The keyword used was 'vehicle inventory' Primary domain: www.brand-state.com/inventory.htm Title = 1, URL = 0, Meta = 1, H1 = 1, H2-4 = 1 Body =1, Strong = 1, IMG Alt = 1 Total = 7 2nd domain: www.company-name-brand.com/inventory.htm Title = 1, URL = 0, Meta = 1, H1 = 1, H2-4 = 2 Body =5, Strong = 4, IMG Alt = 2 Total = 13 I can understand if the keyword was in the domain, but it's not. So I'm wondering what is going on here - any help or suggestions on what to research would be a great help. Thank you!
Moz Pro | | gormaniavt0 -
How can i export a historical ranking report which contains keywords with special characters.
How can i export a historical ranking report which contains keywords with special characters? Previously it only turned the keywords into jumbled format (forgot the tech term) in excel i.e. онлайн чат which are in this case russian characters. The way i got around this was to import it into google docs but that also is now converting it into this format. Due to this i 1 do not know what the keywords are and 2 all my formulas do not work.
Moz Pro | | ColumK0 -
Can't find duplicate page content
Hi all. I'm trying to create a report to list all of my site's duplicate content that SEOmoz says we have. However when I click on the link it just shows me the title and description of the page. I don't know what the other page is that has duplicate content or what the duplicate content is. Where do I find this information? Thanks in advance!
Moz Pro | | Info12340 -
Crawl Report Technical Issue
I'm having a problem with our campaign especially the crawl report. Because the last update is Nov 4 and isn't supposed to be updated weekly? I already submit a helpdesk support ticket and even send a seperate e-mail regarding this issue but until now the report is still not updated. Anybody here can help me raise this issue/ Thanks.
Moz Pro | | shebinhassan0 -
Cleaning up link profile
I've recently been trying to tidy up my link profile. We have been link building for a number of years and I decided to check it out and see how good our profile looks. I used OSE to give me a report of all external links pointing to pages on my www sub-domain. The results are scary! I have hundreds of links point to my site that originate from URLs such as the following url which attempts to start a download, use caution!!! <colgroup><col width="768"></colgroup>
Moz Pro | | Entrusteddev
| http://ftp.codeweavers.com/pub/crossover/cxlinux/demo/install-crossover-pro-demo-9.0.1.sh?m=pc&a=bookmarkList.view&target_user_id=1&search_type=tag&keyword=減量 | Another example (this one attampts to start a .jar download!! Be cautious) <colgroup><col width="794"></colgroup>
| http://www.breastreconstructionguide.com/compare.jar?page=1892&username=DXDZSW | All the other metrics reported for the offending URLs seem ok, such as PA and DA. Also, many have meaningful page titles (as opposed to random characters) and nicely formed anchor text. What I'd like to know is; Are these links having a detrimental effect on my SERPs? How does OSE find them since its a URL to a download? Has anyone else had a similar experience? Thanks for your time. Regards Aran <colgroup><col width="794"></colgroup>0 -
Only Crawling 1 page?
Hi Guys, Any advice much appreciated on this! Recently set up a new campaign on my dashboard with just 5 keywords. The domain is brammer.co.uk and a quick Google site:brammer.co.uk shows a good amount of indexed pages. However - first seomoz tool crawl has only crawled 1 url!! "Last Crawl Completed: Apr. 12th, 2011 Next Crawl Starts: Apr. 17th, 2011" Any ideas what's stopping the tool crawl anymore of the site?? Cheers in advance.. J
Moz Pro | | lovealbatross0