Anyways to pull anchor text?
-
Hi guys,
So basically i have a list of URLs/Domains and there backlinks (example: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg) but i'm missing anchor text. Can anyone recommend any tools which can scan a backlink, locate the URL/Domain on the page and then pull the anchor text?
Cheers, Chris
<colgroup><col width="548"><col width="884"></colgroup>
| | | -
Hi Matt!
No i have not yet found a tool which can do this.
The _ScrapeBox Anchor Text plugin _CleverPhD mentioned can only do this for one domain at a time. I need it for multiple domains.
Any other suggestions?
-
Hi Jay! Did you get this worked out?
-
Thanks Jay. If I look on the backlinks side, they all seem to have the same subdomain in some form or another. You would just need to setup the regex in Screaming Frog to look for just that keyword in the subdomain so it should match all the variants of it.
That said, ignore everything I just posted. I was thinking earlier, "Surely there is scraper software out there that does this already." I did not take the time to look. Your mention of Scrapebox reminded me of that.
Scrapebox has a separate addon that does this
http://www.scrapebox.com/anchor-text-checker
The ScrapeBox Anchor Text Checker allows you to enter your domain and then load a list of URL’s that contain your backlink. It will scan all the URL’s containing your link and extract the anchor text used by the websites that link to you.
-
Basically want the anchor text, so I can easily identify the location of the link on the page without needing to view source and search for the URL.
This export is directly from: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg
Scrapebox backlink checker which doesn't give you anchor text.
-
Ok. Can you be more specific on what you are trying to accomplish with this data? I think that may help my understanding of what you are trying to do.
-
Thanks CleverPhD, sorry should had mentioned i'm looking to do this for multiple domain names not just one. So the method you describe works great for a single domain.
-
Screaming Frog can do this with custom extraction and list mode. If I am reading your question correctly, you have a list of URLs and what pages on your site that they link to.
You would upload the list of URLs into Screaming Frog so it knows what pages to scan and run it in list mode
http://www.screamingfrog.co.uk/seo-spider/user-guide/configuration/#15
You would then use the custom extraction tool to grep for the ahref code that has a link to your domain
http://www.screamingfrog.co.uk/web-scraper/
You would need to plug in a regular expression to look for your domain (or versions of it) and then include the rest of the HTML tag that include the anchor text all the way through the ending .
You should then be able to import that data into a spreadsheet and use text to columns to split the anchor text into it's own column.
It is a little tricky as the regular expression may have to be tweaked depending on how other sites link to your site. Run the Frog on a test group of 10 or so to make sure it works. If you have a bunch of errors, take the error examples and tweak the regular expression based on those.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Shall I hide short product review texts from customers (to avoid google panda/quality issues)?
About 30% of product reviews that the clients of our ecommerce store submitted in the last 10 years are 3 words or less (we did not require any minimum length). Would you recommend to hide those very short review texts? Where to draw the limit?
Intermediate & Advanced SEO | | lcourse
Numeric star rating would still go into our accumulated product rating. My only concern here is what impact it may have on google ranking.
To give some context, the site has for a long time some panda/phantom related issues where there are no obvious reasons that we could point to.0 -
Is irrelevant backlinks real? And does anchor text affect all keywords?
I ask because I see many of my competitors with irrelevant backlinks still ranking at the top, despite what SEO say about it. I have a Web Design business, most of my competitors have their backlinks from sites they built ie "website by _____" and thats how they rank at the top. Should I keep getting backlinks from my clients (with permission)? Also does anchor text on backlinks affect all keywords in your website? Example is if you provide multi services ie plastering and decorating. You have 100 links pointing to your site with anchor text of "plastering". Technically your site still has 100 backlinks, will that also help boost the onsite optimisation for the keyword "decorating"?
Intermediate & Advanced SEO | | Marvellous0 -
4 questions about a paragraph of SEO friendly text in my e-com websites header.
Hi guys, I'm trying to understand the SEO behind our websites header. www.mountainjade.co.nz As you can see we have a paragraph of relevant introductory text that is also SEO friendly in our header. What I would like some help with is understanding how google views and assigns 'juice' to information like this in the header or footer of a website. Usually certain pages have content specific to a given topic, and google ranks these pages accordingly. But with a websites header / footer its content appears on every page as the header is always at the top and footer at the bottom. 1. In what way does my website benefit from the paragraph of text in the header? e.g at the domain level? Just the home page? etc etc 2. How does google assign 'juice' to the paragraph of text? (similiar to Q1). 3. How would my website be effected if I moved the text to the footer? (Aesthetic change) 4. When I 'inspect element' on the paragraph, it is labelled 'div id=site description.' Can someone please explain the relevance of a sites description to SEO for me. This paragraph of text was in the websites header before I came onboard, and I've been too concerned to change / move it as I don't know enough about it. Any help would be appreciated! Thanks team, Jake
Intermediate & Advanced SEO | | Jacobsheehan0 -
Links from music/celebrity based fansites - sitewide images with no alt text
We're currently in the middle of a link audit on our website OneDirection.net and a large part of our incoming links come from fansites such as the following: ladygaganow.net nickjonline.com justinbieberhood.com joejonashq.com harrystylesfan.org brunodaily.org onedirectiondaily.com onedirectionfans.net Now, our previous way of thinking was that these are very relevant websites in the same niche as us, and therefore should be passing some value? However all of the links on these sites come from sitewide images with no alt-text. Some of the sites are passing 1000+ links to us. We've been wary to disavow or request removal of these links as we've usually gone with the thinking that Google applies "common-sense" based logic in its algorithms, and therefore these backlinks should be ok - in our opinion. However we think we are suffering from some kind of algorithmic penalty with our current rankings, and are now thinking these could be the cause. What are people's opinions on these links? Should we stay clear of sitewide links altogether? Should we contact the site owners and try to get them to mix up the alt-text? Or should we get rid of them altogether? Thanks, Chris.
Intermediate & Advanced SEO | | PixelKicks0 -
How fast to change non-branded anchor text
A client has most of his links from sitewide partnership links with non-branded anchor text. One we changed to nofollow and one partnership we're ending next month, but the rest he wants to change to branded anchor text There's 7 sitewides and a bunch more single links. Should we change the anchor text all at once or space it out to one major change per month?
Intermediate & Advanced SEO | | BobGW0 -
Is "Car Discount" a problematic anchor text for CarDiscount.com (google penguin)?
I have a couple of partial match domains in the format KEYOWRDdiscount.com and also the website name resembles domain name. "Car Discount" is not my website but just an example to illustrate:
Intermediate & Advanced SEO | | lcourse
Is "Car Discount" a problematic anchor text for CarDiscount.com?
Should I try to modify existing external anchor texts to "CarDiscount" or "CarDiscount.com" instead of "Car Discount" Do you know of any cases where such anchor texts coinciding with partial match domain were likely reason for penguin penalization? Thanks.0 -
Anchor text
What will I need to make amormensagens.com.br is in position 1 in Google to the word "mensagens"? Only anchor text will?
Intermediate & Advanced SEO | | tibtos0 -
Impact of slight character variations in anchor text
Does anyone have experience of how Google deals with slight character variations, e.g. Facade v Façade? From an SEO perspective, are these treated as two completely separate words or is Google clever enough to determine the intent of the searcher & the site?
Intermediate & Advanced SEO | | bjalc20110