Tool for scanning the content of the canonical tag
-
Hey All,
question for you. What is your favorite tool/method for scanning a website for specific tags? Specifically (as my situation dictates now) for canonical tags?
I am looking for a tool that is flexible, hopefully free, and highly customizable (for instance, you can specify the tag to look for). I like the concept of using google docs with the import xml feature but as you can only use 50 of those commands at a time it is very limiting (http://www.distilled.co.uk/blog/seo/how-to-build-agile-seo-tools-using-google-docs/).
I do have a campaign set up using the tools which is great! but I need something that returns a response faster and can get data from more than 10,000 links. Our cms unfortunately puts out some odd canonical tags depending on how a page is rendered and I am trying to catch them quickly before it gets indexed and causes problems. Eventually I would also like to be able to scan for other specific tags, hence the customizable concern. If we have to write a vb script to get it into excel I suppose we can do that.
Cheers,
Josh
-
No idea on that one - it's still pretty new. The developers actually chimed in on the post, so you could ask them in the comments.
-
Thanks Dr. Pete and Marcus.
I just finished reading the post. I have looked at Screaming Frog before but was hoping to be able to find a way to do it myself. Just didn't want to plop money down on something that seemed like it should be able to be done using tools I already had. But the software does look good. Any thought on if they will come out with a one time purchase instead of a yearly subscription?
Cheers!
Josh
-
Hey Dr. Pete, Joshua
I was just coming here to say that I had read the Dr. Pete post and this may do the job. It's a paid bit of a software but I will be picking it up later. I have my guys knocking up a canonical checker that will be free for all but that may take a day or so to get perfect.
Let me know if you have a play with Screaming Frog!
Marcus
-
I'm pretty sure that Screaming Frog SEO Spider will do it, but you need the paid version to custom-filter on the canonical tag. I've got a post going up about it tomorrow.
-
Great, really appreciate it! Many thumbs up
-
Hey Josh,
Right, cool. I have got a few jobs to sort out but I am going to have a bash at knocking this up this afternoon. Should be easy enough (he said, damning himself to hours of problems).
Leave it with me for 24 hours.
Marcus
-
Hey Marcus,
thanks for the quick response. That is exactly what I would be looking for. I do have a list of url's and that is also simple enough to get from something like xenu. Would love to work with you on this.
Thanks.
Josh
-
Hey, I am not aware of any such tool, but it should not be too hard to put one together, maybe a useful little tool as well.
If you have all of your pages in spreadsheet or database, it should be easy enough to write a little script that cycles through them.
Start Loop
-
request page
-
parse code to get canonical URL
-
compare page to canonical
-
output problem URLs
End Loop
Slightly over simplified and requires a list of all your URLs but would be willing to help put something like this together, could be useful for all of us, especially for those (like me) that work with a lot of CMS sites.
Cheers
Marcus
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Are there any tools to give a value STRICTLY for Quantity of Content on your website?
I am trying to put a value to all the work I do and want to put a very specific value to the number of pages of unique content I have. I know everyone says its about quality, and sure it is but quantity is still a factor and looked at. (Can't argue with if you prefer 100 semi-optimized pages versus 1 optimized page- and is unfair for a tool to rate the website the 1 optimized page higher) I use a ton of tools but yet to find something that puts a value on quantity of CONTENT ONLY (Please don't respond with PA or DA because that encompasses all the inherit value)
Moz Pro | | SEOEnthusiast0 -
Mac Alternatives for Netpeak & SEO Tools For Excel?
Does anybody know of any mac alternatives for Netpeak & SEO tools for excel? I haven't been able to find any. I just need something to pull PA & DA quickly for a list of domains and URLs. Will I just have to create something custom with the Moz API?
Moz Pro | | kking41200 -
What is the Best Local Ranking Tool?
I'm trying to track down a tool that will provide localized rankings within Google Maps/Places, Yahoo Local, Bing Local as well as major local directories such as Yelp, Yellow Pages, etc. Additionally, I'm looking for the results to provide the address being displayed in the ranking. Any suggestions?
Moz Pro | | JonClark150 -
Blogger ain't working with research tools...
I'm trying to do link research and analysis on my website for dogtraining.blogspot.com however the tool recognizes only blogspot.com giving me fake results....
Moz Pro | | 6786486312640 -
What is the best method to solve duplicate page content?
The issue I am having is an overwhelmingly large number of pages on cafecartel.com show that they have duplicate page content. But when I check the errors on SEOmoz it shows that the duplicate content is from www.cafecartel.com not cafecartel.com. So first of all, does this mean that there are two sites? and is this a problem I can fix easily? (i.e. redirecting the URL and deleting the extra pages) Is this going to make all other SEO useless due to the fact that it shows that nearly every page has duplicate page content? Or am I just completely reading the data wrong?
Moz Pro | | MarkP_0 -
Keyword Difficulty Tool Problems?
I was just using the keyword difficulty tool and for some reason, some of the keywords show "not enough data"...I'm not sure why this is the case because a few days ago, there was data... Anyway, is this because it take time for the tool to gather data each month, or is it because there's an issue with the GoogleAPI? Thanks a lot for your help!
Moz Pro | | simonmhchiu870 -
Keyword Difficulty Tool - How to use it the best way?
Hi, I am freshman, both, here on SEOmoz and in SEO generally and have a question concerning the assessment of KW difficulty. I did browse through the Q&A-Section (great content!!) but could not find a relevant answer to my problem. So I am currently building my initial keyword list, our website is only 2 months old, so we are still in the very early stage. Fashion is a very competitive area, so the KW Diff. Tool indicates high difficulty for a lot of words and phrases. however, I identified some with percentages <50 or even lower than that. Then I compared the results of the SEOmoz tool to Google Adwards and the Google results for competitiveness differed significantly. For example: for the KW Personal Shopping I got KWD from Seomoz 33% (in Germany) and from Google 0,5 for broad and 0,64 for exact search. I am quite confused how to make the right choices for my KWs now. Which metrics should I consider most? What else apart from the competition factor is behind the metric KW diff.? Does it matter in any way that I search from Germany for Germany in German? Do you have any further recommendations for the process of identifying the best Kws? Thanks a lot in advance, best from Berlin Tani
Moz Pro | | TaniBogi0 -
About NOFOLLOW tag for SEOmoz analysis
Hi all, Another issue while trying to resolve all the duplicate content SEOmoz reports to me. May be some of you guys can help: I have a dynamic error page on our website, generated in case of error, that can happen on many urls. Of course that one should not be indexed. I added the following tag on the HEADER: name="robots" content="NOODP,NOINDEX,NOFOLLOW" /> To me this should prevent from having this page indexed, but also from having this page reported by SEOmoz analyzer as duplicate content. Any hints?
Moz Pro | | nuxeo0