Anyways to pull anchor text?
-
Hi guys,
So basically i have a list of URLs/Domains and there backlinks (example: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg) but i'm missing anchor text. Can anyone recommend any tools which can scan a backlink, locate the URL/Domain on the page and then pull the anchor text?
Cheers, Chris
<colgroup><col width="548"><col width="884"></colgroup>
| | | -
Hi Matt!
No i have not yet found a tool which can do this.
The _ScrapeBox Anchor Text plugin _CleverPhD mentioned can only do this for one domain at a time. I need it for multiple domains.
Any other suggestions?
-
Hi Jay! Did you get this worked out?
-
Thanks Jay. If I look on the backlinks side, they all seem to have the same subdomain in some form or another. You would just need to setup the regex in Screaming Frog to look for just that keyword in the subdomain so it should match all the variants of it.
That said, ignore everything I just posted. I was thinking earlier, "Surely there is scraper software out there that does this already." I did not take the time to look. Your mention of Scrapebox reminded me of that.
Scrapebox has a separate addon that does this
http://www.scrapebox.com/anchor-text-checker
The ScrapeBox Anchor Text Checker allows you to enter your domain and then load a list of URL’s that contain your backlink. It will scan all the URL’s containing your link and extract the anchor text used by the websites that link to you.
-
Basically want the anchor text, so I can easily identify the location of the link on the page without needing to view source and search for the URL.
This export is directly from: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg
Scrapebox backlink checker which doesn't give you anchor text.
-
Ok. Can you be more specific on what you are trying to accomplish with this data? I think that may help my understanding of what you are trying to do.
-
Thanks CleverPhD, sorry should had mentioned i'm looking to do this for multiple domain names not just one. So the method you describe works great for a single domain.
-
Screaming Frog can do this with custom extraction and list mode. If I am reading your question correctly, you have a list of URLs and what pages on your site that they link to.
You would upload the list of URLs into Screaming Frog so it knows what pages to scan and run it in list mode
http://www.screamingfrog.co.uk/seo-spider/user-guide/configuration/#15
You would then use the custom extraction tool to grep for the ahref code that has a link to your domain
http://www.screamingfrog.co.uk/web-scraper/
You would need to plug in a regular expression to look for your domain (or versions of it) and then include the rest of the HTML tag that include the anchor text all the way through the ending .
You should then be able to import that data into a spreadsheet and use text to columns to split the anchor text into it's own column.
It is a little tricky as the regular expression may have to be tweaked depending on how other sites link to your site. Run the Frog on a test group of 10 or so to make sure it works. If you have a bunch of errors, take the error examples and tweak the regular expression based on those.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Different text position on mobile and desktop page
Hi there, We have product category descriptions in our webshop, with one smal difference: desktop: the first paragraf is at the top above the products with a scroll down to the other paragrafs below the products mobile: the whole text is below the products I am wondering whether it is allright in regards of the mobile first index, or should we have exactly the same paragraf split on the desktop and mobile version? Eg. dadum.pl/zabawki Thanks in advance for your opinions. Isabelle
Intermediate & Advanced SEO | | isabelledylag0 -
Too much duplicate text?
Last December we started losing traffic to our website https://www.spec.lt (This is in Lithuanian).
Intermediate & Advanced SEO | | anonimas
The thing we did was to every single company page we added QR Code. For example: https://www.spec.lt/imone/onninen-uab (at the bottom of this page). We added some text that goes with it. As you can see here http://imgur.com/a/beaYm
The only difference between those texts is the company name. Can this be the reason why google reduced our positions ? (We didn't lose any of traffic in categories/search/articles - only in company pages). A lot of companies that are new or bancrupt have little to no text at all. Except for this text about QR code, like here for example - https://www.spec.lt/imone/mazoji-bendrija-transportas-2017 Can this be the reason? Or any other on page errors that you see.
Thank you0 -
Does Disavowing Links Negate Anchor Text, or Just Negates Link Juice
I'm not so sure that disavowing links also discounts the anchor texts from those links. Because nofollow links absolutely still pass anchor text values. And disavowing links is supposed to be akin to nofollowing the links. I wonder because there's a potential client I'm working on an RFP for and they have tons of spammy directory links all using keyword rich anchor texts and they lost 98% of their traffic in Pengiun 1.0 and haven't recovered. I want to know what I'm getting into. And if I just disavow those links, I'm thinking that it won't help the anchor text ratio issues. Can anyone confirm?
Intermediate & Advanced SEO | | MiguelSalcido0 -
Anchor text is a url, but not the one I want?
I am currently getting links to my site where the anchor text is, say, domainx.com. However, my website's url is domainy.com - the anchor text is the url of another website. Is this hurting my website in the SERPS, or making my link building look unnatural? The link is on the blog roll of another website, so it has produced hundreds, maybe thousands of copies of this anchor text link. Does that have any impact on my ranking as well?
Intermediate & Advanced SEO | | redfishking0 -
Dynamically change anchor text and URLs remotely
Hey i'm looking to create a widget in javascript which i dynamically change the urls and anchor text which link the widget back to my site remotely (via php) once it spreads. I have heard peopled doing this before, but i can't seem to find a example. Does anyone know of any examples/widgets or anything which can do this?
Intermediate & Advanced SEO | | monster990 -
Company name often shows in anchor text (important keyword phrase within), can this impact ranking?
Hi everyone, My company is called "Hawaii Job Engine" - www.hawaiijobengine.com - and many sites that link to my site use my company name as anchor text "Hawaii Job Engine". I have heard Google may devalue a certain keyword phrase if used too often in anchor text. Does this mean I may, over time, get a poor ranking for the term "Hawaii Job" since that phrase is part of my company's name. Or, will search engines easily notice it is my company name and therefore it will not have a negative impact on rankings? Example: if the anchor text leading to my company's homepage is company's name 95% of the time (on authoritative sites) could this be an issue? I don't know the %, but just to establish if there may be in % levels to keep in mind. thank you, Kristian
Intermediate & Advanced SEO | | knielsen1 -
Cannot Identify Self Cannabilizing Keyword Anchor
I am struggling to locate on our website www.towelsrus.co.uk a self cannibalizing keyword. Can anyone help? http://www.towelsrus.co.uk/Bath-Linen-Bath-Pillows/Aztex/Luxury-Bath-Pillows-With-Suction-Cups-4-Embroidered-Designs_ct463bd182pd2818.htm void Keyword Self-Cannibalization Easy fix <dl> <dt>Cannibalizing link</dt> <dd>"Luxury Bath Pillows With Suction Cups, 4 Embroidered Designs"</dd> <dt>Explanation</dt> <dd>It's a best practice in SEO to target each keyword with a single page on your site (sometimes two if you've already achieved high rankings and are seeking a second, indented listing). To prevent engines from potentially seeing a signal that this page is not the intended ranking target and creating additional competition for your page, we suggest staying away from linking internally to another page with the target keyword(s) as the exact anchor text. Note that using modified versions is sometimes fine (for example, if this page targeted the word 'elephants', using 'baby elephants' in anchor text would be just fine).</dd> <dt>Recommendation</dt> <dd>Unless there is intent to rank multiple pages for the target keyword, it may be wise to modify the anchor text of this link so it is not an exact match.</dd> </dl>
Intermediate & Advanced SEO | | Towelsrus0 -
Fading Text Links Look Like Spammy Hidden Links to a g-bot?
Ah, Hello Mozzers, it's been a while since I was here. Wanted to run something by you... I'm looking to incorporate some fading text using Javascript onto a site homepage using the method described here; http://blog.thomascsherman.com/2009/08/text-slideshow-or-any-content-with-fades/ so, my question is; does anyone think that Google might see this text as a possible dark hat SEO anchor text manipulation (similar to hidden links)? The text will contain various links (4 or 5) that will cycle through one another, fading in and out, but to a bot the text may appear initially invisible, like so; style="display: none;"><a href="">Link Here</a> All links will be internal. My gut instinct is that I'm just being stupid here, but I wanted to stay on the side of caution with this one! Thanks for your time 🙂 http://blog.thomascsherman.com/2009/08/text-slideshow-or-any-content-with-fades
Intermediate & Advanced SEO | | PeterAlexLeigh0