Anyways to pull anchor text?
-
Hi guys,
So basically i have a list of URLs/Domains and there backlinks (example: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg) but i'm missing anchor text. Can anyone recommend any tools which can scan a backlink, locate the URL/Domain on the page and then pull the anchor text?
Cheers, Chris
<colgroup><col width="548"><col width="884"></colgroup>
| | | -
Hi Matt!
No i have not yet found a tool which can do this.
The _ScrapeBox Anchor Text plugin _CleverPhD mentioned can only do this for one domain at a time. I need it for multiple domains.
Any other suggestions?
-
Hi Jay! Did you get this worked out?
-
Thanks Jay. If I look on the backlinks side, they all seem to have the same subdomain in some form or another. You would just need to setup the regex in Screaming Frog to look for just that keyword in the subdomain so it should match all the variants of it.
That said, ignore everything I just posted. I was thinking earlier, "Surely there is scraper software out there that does this already." I did not take the time to look. Your mention of Scrapebox reminded me of that.
Scrapebox has a separate addon that does this
http://www.scrapebox.com/anchor-text-checker
The ScrapeBox Anchor Text Checker allows you to enter your domain and then load a list of URL’s that contain your backlink. It will scan all the URL’s containing your link and extract the anchor text used by the websites that link to you.
-
Basically want the anchor text, so I can easily identify the location of the link on the page without needing to view source and search for the URL.
This export is directly from: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg
Scrapebox backlink checker which doesn't give you anchor text.
-
Ok. Can you be more specific on what you are trying to accomplish with this data? I think that may help my understanding of what you are trying to do.
-
Thanks CleverPhD, sorry should had mentioned i'm looking to do this for multiple domain names not just one. So the method you describe works great for a single domain.
-
Screaming Frog can do this with custom extraction and list mode. If I am reading your question correctly, you have a list of URLs and what pages on your site that they link to.
You would upload the list of URLs into Screaming Frog so it knows what pages to scan and run it in list mode
http://www.screamingfrog.co.uk/seo-spider/user-guide/configuration/#15
You would then use the custom extraction tool to grep for the ahref code that has a link to your domain
http://www.screamingfrog.co.uk/web-scraper/
You would need to plug in a regular expression to look for your domain (or versions of it) and then include the rest of the HTML tag that include the anchor text all the way through the ending .
You should then be able to import that data into a spreadsheet and use text to columns to split the anchor text into it's own column.
It is a little tricky as the regular expression may have to be tweaked depending on how other sites link to your site. Run the Frog on a test group of 10 or so to make sure it works. If you have a bunch of errors, take the error examples and tweak the regular expression based on those.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to Avoid Improper Special Characters from Being Pulled into SERPs
If you Google "Progressive careers," for the Progressive.com result you'll see a result where Google pulls in content from the page outside of the meta description (not uncommon) and also pulls in an in-page carrot that indicates a link. This character displays as a square on desktop / Android devices. The site includes this character as an accessibility best practice to indicate a link from a heading for screen readers. On iPhones, that square shows up as a soccer ball emoji even though the entity code is different than our character's entity code (our entity code= , soccer ball entity code is ⚽). Clearly, not the best experience. I know we cannot control what piece of the page is pulled in as the meta description, but does anyone have any tips to hide or help avoid pulling in that special character?
Intermediate & Advanced SEO | | P-C-A0 -
SEO friendly H1 tag with 2 text lines
Hi everyone, I am trying to add span tags in H1, break tag on 2 lines and style each line of H1 differently: Example: Line 1Line 2 I might add a smaller font for line 2 as well... Is this SEO friendly? Will crawlers read entire text or can interfere and block it. Thank you!
Intermediate & Advanced SEO | | bgvsiteadmin0 -
Internal Anchor Text Links
Hi How important are internal anchor text links & rankings? I'm researching competitors and am seeing a lot of internal anchor text links with keywords helping them rank - but they have these links in their menu which at the moment isn't possible for us. We can include our top level 1 categories, but nothing below this in the top navigation Thanks!
Intermediate & Advanced SEO | | BeckyKey1 -
Adding some unique text to search
I do not know how to easily explain that but I will try 🙂 We are running the yellow pages in our country, where most (all) of the companies are listed. Each company has their keywords that are used in our search. All of those search keywords are indexed by Google and it goes something like that domain.com/search/keyword So far this page only lists the companies that match the search query. So that's not much content. I'm thinking about adding a sentence that would go something like this:
Intermediate & Advanced SEO | | FCRMediaLietuva
The search query lists the companies of our country that do KEYWORD. So far COUNT was found. That seems pretty short and I am not sure if that does a lot of help for Google. But I cannot think of more text. Any ideas? Or should I give up on the idea of adding anything at all?0 -
Low text-HTML ratios
Are low text-HTML ratios still a negative SEO ranking factor? Today I ran SEMRUSH site audit that showed 344 out of 345 pages on our website (www.nyc-officespace-leader.com) show an text-HTML ratio that ranges from 8% to 22%. This is characterized as a warning on SEMRUSH. This error did not exist in April when the last SEMRUSH audit was conducted. Is it worthwhile to try to externalize code in order to improve this ratio? Or to add text (major project on a site of this size)? These pages generally have 200-400 words of text. Certain URLs, for example www.nyc-officespace-leader.com/blog/nycofficespaceforlease more text, yet it still shows an text-HTML ratio of only 16%. We recently upgraded to the WordPress 4.2.1. Could this have bloated the code (CSS etcetera) to the detriment of the text-HTML ratio? If Google has become accustomed to more complex code, is this a ratio that I can ignore. Thanks, Alan
Intermediate & Advanced SEO | | Kingalan10 -
Role of Home Page Text
My company operates a commercial real estate web site in New York City. We are competing against brands that can spend hundreds of thousands of dollars per year on marketing and SEO. I am an independent without employees. I am trying to replace generic home page text (my web site is ww.nyc-officespace-leader.com) with more engaging language that will differentiate the site. In terms of SEO best practices, what constraints exist on home page text? Must it discuss the brand in a general way? Must it specifically describe the interior pages of a site? For new home page text, I wanted to take a practical approach and provide visitors with a checklist that would be useful for their search. The check list would be in the form 500-800 words or very practical advice, specific items they should be aware of before commencing a real estate search. No other site is taking this approach and it could save tenants time and grief. I am attaching a very rough mock up of the new text (forgive the grammar, really needs to be edited!). Is there any thing wrong with this approach? I may have stumbled on a good idea, however maybe nobody has taken this route because it is inappropriate for a home page. I would very much appreciate any comments or guidance anyone may have. Best, Alan Rosinsky 0d76Iv2
Intermediate & Advanced SEO | | Kingalan10 -
Anchor Text Usage
Hi, What is the best way to use anchor text during link building after recent updates from Google. I thinking of doing the following: 60% Brand Keyword (my site name)
Intermediate & Advanced SEO | | Vegitss
20% Click here, visit this site etc
20% myurl.com
10% a Mix of both broad & phrase match of my targetted keyword. What do you suggest Does anyone have a working strategy? Will be waiting for your replies...0 -
Multiple sites linking back with pornographic anchor text
I discovered a while ago that we had quite a number of links pointing back to one of our customer's websites. The anchor text of these links contain porn that is extremely bad. These links are originating from forums that seems to link between themselves and then throw my customers web address in there at the same time. Any thoughts on this? I'm seriously worried that this may negatively affect the site.
Intermediate & Advanced SEO | | GeorgeMaven0