Tool for scanning the content of the canonical tag
-
Hey All,
Question for you: what is your favorite tool/method for scanning a website for specific tags? Specifically (as my situation dictates now) for canonical tags?
I am looking for a tool that is flexible, hopefully free, and highly customizable (for instance, you can specify the tag to look for). I like the concept of using Google Docs with the ImportXML function, but as you can only use 50 of those commands at a time it is very limiting (http://www.distilled.co.uk/blog/seo/how-to-build-agile-seo-tools-using-google-docs/).
I do have a campaign set up using the tools, which is great, but I need something that returns a response faster and can get data from more than 10,000 links. Our CMS unfortunately puts out some odd canonical tags depending on how a page is rendered, and I am trying to catch them quickly before they get indexed and cause problems. Eventually I would also like to be able to scan for other specific tags, hence the customizable concern. If we have to write a VB script to get it into Excel, I suppose we can do that.
Cheers,
Josh
-
No idea on that one - it's still pretty new. The developers actually chimed in on the post, so you could ask them in the comments.
-
Thanks Dr. Pete and Marcus.
I just finished reading the post. I have looked at Screaming Frog before but was hoping to find a way to do it myself. I just didn't want to plop money down on something that seemed like it should be doable with tools I already had. But the software does look good. Any thoughts on whether they will come out with a one-time purchase instead of a yearly subscription?
Cheers!
Josh
-
Hey Dr. Pete, Joshua
I was just coming here to say that I had read the Dr. Pete post and this may do the job. It's a paid bit of software, but I will be picking it up later. I have my guys knocking up a canonical checker that will be free for all, but that may take a day or so to get perfect.
Let me know if you have a play with Screaming Frog!
Marcus
-
I'm pretty sure that Screaming Frog SEO Spider will do it, but you need the paid version to custom-filter on the canonical tag. I've got a post going up about it tomorrow.
-
Great, really appreciate it! Many thumbs up
-
Hey Josh,
Right, cool. I have got a few jobs to sort out but I am going to have a bash at knocking this up this afternoon. Should be easy enough (he said, damning himself to hours of problems).
Leave it with me for 24 hours.
Marcus
-
Hey Marcus,
Thanks for the quick response. That is exactly what I would be looking for. I do have a list of URLs, and that is also simple enough to get from something like Xenu. Would love to work with you on this.
Thanks.
Josh
-
Hey, I am not aware of any such tool, but it should not be too hard to put one together, and it could be a useful little tool as well.
If you have all of your pages in a spreadsheet or database, it should be easy enough to write a little script that cycles through them.
Start Loop
- request page
- parse code to get canonical URL
- compare page to canonical
- output problem URLs
End Loop
Slightly oversimplified, and it requires a list of all your URLs, but I would be willing to help put something like this together. It could be useful for all of us, especially for those (like me) who work with a lot of CMS sites.
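For anyone who wants to try the loop described above, here is a minimal sketch in Python using only the standard library. It is not the checker mentioned in this thread; the names `CanonicalParser`, `extract_canonicals`, `fetch_html`, and `find_problem_urls` are made up for illustration, and it assumes that a "problem" means a missing canonical, more than one canonical, or a canonical that is not exactly the requested URL. That last rule is deliberately strict: plenty of pages legitimately point their canonical elsewhere, so treat the output as a list to review.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import Request, urlopen


class CanonicalParser(HTMLParser):
    """Collects the href of every <link rel="canonical"> in a page."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "link":
            return
        attr = dict(attrs)
        rel = (attr.get("rel") or "").lower().split()
        if "canonical" in rel and attr.get("href"):
            self.canonicals.append(attr["href"].strip())


def extract_canonicals(html, base_url):
    """Parse the page and return absolute canonical URLs (possibly none)."""
    parser = CanonicalParser()
    parser.feed(html)
    return [urljoin(base_url, href) for href in parser.canonicals]


def fetch_html(url, timeout=10):
    """Request the page; the User-Agent string is an arbitrary placeholder."""
    request = Request(url, headers={"User-Agent": "canonical-check-sketch"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def find_problem_urls(urls, fetch=fetch_html):
    """Loop over the URLs and return (url, reason) pairs worth reviewing."""
    problems = []
    for url in urls:
        try:
            html = fetch(url)
        except OSError as exc:  # covers URLError, HTTPError and timeouts
            problems.append((url, f"fetch failed: {exc}"))
            continue
        canonicals = extract_canonicals(html, url)
        if not canonicals:
            problems.append((url, "no canonical tag"))
        elif len(canonicals) > 1:
            problems.append((url, "multiple canonical tags: " + ", ".join(canonicals)))
        elif canonicals[0] != url:
            problems.append((url, "canonical points to " + canonicals[0]))
    return problems
```

Feeding it the URL list exported from a crawler (one URL per line in a text file, say) and writing the returned pairs out with Python's `csv` module would give something that opens directly in Excel. To scan for a different tag later, the check inside `handle_starttag` is the only part that should need changing.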
Cheers
Marcus
-