TD*IDF analysis Tools
-
Hi guys,
I was wondering if anyone knew of free TD*IDF analysis tools on the market?
I know about onpage.org and Text-tools.net both paid.
I was wondering if anyone knows of other tools?
Cheers,
Chris
-
Hi Chris,
I don't know of any free tools that do this unless you want to write some code yourself. If you go that route we have some open source libraries that you might find useful, especially qdr that implements the TF-IDF scoring and dragnet for parsing/cleaning the HTML. Good luck in your search!
-
Hi Chris,
It's not the TD-IDF solution you're after but may help? SEO Quake (available as a free Chrome plug-in: https://chrome.google.com/webstore/detail/seoquake/akdgnmcogleenhbclghghlkkdndkjdjc) approximates some of this data for you.
It will show the most commonly recurring 1, 2, 3 and 4 word phrases appearing on a web page. It won't compare this to a corpus (e.g. your whole site). It then gives a Density % (broadly, how often this word/phrase appears) and a Prominence % (based around density but also where it appears: title, description, keywords etc.).
Hope that helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
350 (Out the 750) Internal Links Listed by Webmaster Tools Dynamically Generated-Best to Remove?
Greetings MOZ Community: When visitors enter real estate search parameters in our commercial real estate web site, the parameters are somehow getting indexed as internal links in Google Webmaster Tools. About half are 700 internal links are derived from these dynamic URLs. It seems to me that these dynamic alphanumeric URL links would dilute the value of the remaining static links. Are the dynamic URLs a major issue? Are they high priority to remove? The dynamic URLs look like this: /listings/search?fsrepw-search-neighborhood%5B%5D=m_0&fsrepw-search-sq-ft%5B%5D=1&fsrepw-search-price-range%5B%5D=4&fsrepw-search-type-of-space%5B%5D=0&fsrepw-search-lease-type=1 These URLs do not show up when a SITE: URL search is done on Google!
Intermediate & Advanced SEO | | Kingalan10 -
SEO Site Analysis
I am looking for a company doing a SEO analysis on our website www.interelectronix.com and write a optimization proposal incl. a budgetary quote for performing those optimizations.
Intermediate & Advanced SEO | | interelectronix0 -
Limit on Google Removal Tool?
I'm dealing with thousands of duplicate URL's caused by the CMS... So I am using some automation to get through them - What is the daily limit? weekly? monthly? Any ideas?? thanks, Ben
Intermediate & Advanced SEO | | bjs20100 -
Webmaster Tools Internal Links
Hi all, I have around 400 links in the navigation menu (site-wide) and when I use webmaster tools to check for internal links to each page; some have as many as 250K and some as little as 200. Shouldn't the number of internal links for pages found in the navigation menu be relatively the same? Or is Google registering more internal links for pages linked closer to the top of the code Thanks!
Intermediate & Advanced SEO | | Carlos-R0 -
Local Competition Analysis
Hi Mozzers, I've been mainly B2B focused, and am used to estimating the amount of work necessary to best competition for organic results, but now I have a local client. I need a method to estimate the amount of work necessary to get listed in the one-box for my chosen queries. Can someone point me in the right direction? Any help appreciated.
Intermediate & Advanced SEO | | waynekolenchuk0 -
What is the best tool to crawl a site with millions of pages?
I want to crawl a site that has so many pages that Xenu and Screaming Frog keep crashing at some point after 200,000 pages. What tools will allow me to crawl a site with millions of pages without crashing?
Intermediate & Advanced SEO | | iCrossing_UK0 -
Magic keywords in Google Webmaster Tools
Hi All, Recently moved a friend to a new WP back-end website as they were on Flash which is pretty, but not necessarily the best for SEO. http://francesphotography.com My question is that once Google finally indexed the site, I noticed in Google Webmaster tools that it found the most significant keyword to be: automatically On the following top pages: | tag/snow-boarding-photography/ |
Intermediate & Advanced SEO | | BoulderJoe
| tag/style-photography/ |
| tag/underwater-photography/ |
| tag/vacation-photography/ |
| tag/wedding-photography-beaver-creek/ |
| tag/wedding-photography-copper-mountain/ |
| tag/wedding-photography-denver/ |
| tag/wedding-photography/ |
| underwater-photography-scuba-diving-cozumel-mexico/ |
| wedding-photography/ | The goofy thing is I can find anywhere that "automatically" is used - perhaps it is coming from a plug-in or magically keyword beans that Google found? Any guidance is appreciated.0 -
Tool to calculate the number of pages in Google's index?
When working with a very large site, are there any tools that will help you calculate the number of links in the Google index? I know you can use site:www.domain.com to see all the links indexed for a particular url. But what if you want to see the number of pages indexed for 100 different subdirectories (i.e. www.domain.com/a, www.domain.com/b)? is there a tool to help automate the process of finding the number of pages from each subdirectory in Google's index?
Intermediate & Advanced SEO | | nicole.healthline0