Any way to pull anchor text?
-
Hi guys,
So basically I have a list of URLs/domains and their backlinks (example: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg), but I'm missing the anchor text. Can anyone recommend a tool that can scan a backlink page, locate the URL/domain on the page, and then pull the anchor text?
Cheers, Chris
-
Hi Matt!
No, I have not yet found a tool that can do this.
The ScrapeBox Anchor Text plugin that CleverPhD mentioned can only do this for one domain at a time. I need it for multiple domains.
Any other suggestions?
-
Hi Jay! Did you get this worked out?
-
Thanks Jay. If I look on the backlinks side, they all seem to have the same subdomain in some form or another. You would just need to set up the regex in Screaming Frog to look for that keyword in the subdomain, so it should match all the variants of it.
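To make the subdomain idea concrete, here is a rough Python sketch for trying the keyword match on a few backlink URLs before setting anything up in Screaming Frog. `KEYWORD` and `host_matches` are made-up names for illustration, and "example" is only a placeholder for whatever keyword the subdomains share.

```python
import re
from urllib.parse import urlparse

# Hypothetical placeholder: the keyword that all the subdomain variants share.
KEYWORD = "example"

# Case-insensitive match for the keyword anywhere in a host name.
HOST_PATTERN = re.compile(re.escape(KEYWORD), re.IGNORECASE)


def host_matches(url: str) -> bool:
    """Return True if the host part of the URL contains the keyword."""
    # urlparse only fills in the host when a scheme is present.
    if "://" not in url:
        url = "http://" + url
    host = urlparse(url).hostname or ""
    return bool(HOST_PATTERN.search(host))
```

Only the host name is checked, so a URL that merely has the keyword in its path does not count as a variant.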
That said, ignore everything I just posted. I was thinking earlier, "Surely there is scraper software out there that does this already." I did not take the time to look. Your mention of Scrapebox reminded me of that.
ScrapeBox has a separate addon that does this:
http://www.scrapebox.com/anchor-text-checker
The ScrapeBox Anchor Text Checker allows you to enter your domain and then load a list of URL’s that contain your backlink. It will scan all the URL’s containing your link and extract the anchor text used by the websites that link to you.
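If you need the same kind of scan for several domains at once, something like the following Python sketch might be a starting point. It is not how ScrapeBox works internally (I have no idea how that is built), just a minimal illustration using the standard library's `html.parser`. The names `AnchorCollector` and `find_anchors` are invented here, and fetching each backlink page's HTML is left out on purpose.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class AnchorCollector(HTMLParser):
    """Collect (domain, href, anchor text) for links to any target domain."""

    def __init__(self, target_domains):
        super().__init__()
        self.targets = tuple(d.lower() for d in target_domains)
        self.found = []        # results: (domain, href, anchor_text)
        self._current = None   # (domain, href) while inside a matching <a>
        self._text = []        # text pieces seen inside that <a>

    def _match(self, href):
        """Return the first target domain the href points at, or None."""
        host = (urlparse(href).hostname or "").lower()
        for domain in self.targets:
            if host == domain or host.endswith("." + domain):
                return domain
        return None

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href") or ""
            domain = self._match(href)
            if domain:
                self._current = (domain, href)
                self._text = []

    def handle_data(self, data):
        if self._current:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._current:
            domain, href = self._current
            # Collapse whitespace so the anchor text fits on one line.
            anchor = " ".join("".join(self._text).split())
            self.found.append((domain, href, anchor))
            self._current = None


def find_anchors(html, target_domains):
    """Return all (domain, href, anchor_text) matches found in one page."""
    parser = AnchorCollector(target_domains)
    parser.feed(html)
    parser.close()
    return parser.found
```

You would call `find_anchors(page_html, ["site-one.com", "site-two.org"])` once per backlink page. Image-only links come back with empty anchor text in this sketch.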
-
Basically, I want the anchor text so I can easily identify the location of the link on the page without needing to view source and search for the URL.
This export (http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg) is directly from the ScrapeBox backlink checker, which doesn't give you anchor text.
-
Ok. Can you be more specific on what you are trying to accomplish with this data? I think that may help my understanding of what you are trying to do.
-
Thanks CleverPhD. Sorry, I should have mentioned I'm looking to do this for multiple domain names, not just one. The method you describe works great for a single domain.
-
Screaming Frog can do this with custom extraction and list mode. If I am reading your question correctly, you have a list of URLs and the pages on your site that they link to.
You would upload the list of URLs into Screaming Frog so it knows what pages to scan, and run it in list mode:
http://www.screamingfrog.co.uk/seo-spider/user-guide/configuration/#15
You would then use the custom extraction tool to grep for the a href code that links to your domain:
http://www.screamingfrog.co.uk/web-scraper/
You would need to plug in a regular expression that looks for your domain (or versions of it) and then includes the rest of the HTML tag containing the anchor text, all the way through the closing </a>.
You should then be able to import that data into a spreadsheet and use text to columns to split the anchor text into its own column.
It is a little tricky as the regular expression may have to be tweaked depending on how other sites link to your site. Run the Frog on a test group of 10 or so to make sure it works. If you have a bunch of errors, take the error examples and tweak the regular expression based on those.
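For testing the expression outside the Frog, here is a hedged Python sketch of the kind of pattern described above. The regex, `build_link_pattern`, and `extract_anchors` are my own illustration, not Screaming Frog syntax or output, and the pattern assumes quoted href values with no port number in the URL.

```python
import re


def build_link_pattern(domain: str) -> re.Pattern:
    """Compile a regex for <a> tags whose href points at the given domain."""
    return re.compile(
        r'<a\b[^>]*?\bhref\s*=\s*["\']'
        # Optional scheme, optional subdomains, then the domain itself.
        r'(?P<href>(?:https?:)?//(?:[\w-]+\.)*' + re.escape(domain)
        # Optional path, query string, or fragment up to the closing quote.
        + r'(?:[/?#][^"\']*)?)["\'][^>]*>'
        # Everything up to the closing </a> is the raw anchor markup.
        r'(?P<anchor>.*?)</a>',
        re.IGNORECASE | re.DOTALL,
    )


# Removes nested tags such as <b> or <span> from the anchor markup.
TAG_RE = re.compile(r"<[^>]+>")


def extract_anchors(html: str, domain: str) -> list[tuple[str, str]]:
    """Return (href, anchor text) pairs for links to the domain."""
    results = []
    for match in build_link_pattern(domain).finditer(html):
        text = TAG_RE.sub("", match.group("anchor"))
        results.append((match.group("href"), " ".join(text.split())))
    return results
```

Because the anchor text is captured in its own group, the pairs can be written out with Python's `csv` module and land in separate columns without the text to columns step. Run it on the same test group of 10 or so pages and adjust the pattern for whatever link formats it misses.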