Any tools for scraping blogroll URLs from sites?
-
This question is entirely in the whitehat realm...
Let's say you've encountered a great blog - with a strong blogroll of 40 sites.
The 40-site blogroll is interesting to you for any number of reasons, from link building targets to simply subscribing in your feedreader. Right now, it's tedious to extract the URLs from the site. There are some "save all links" tools, but they are also messy.
Are there any good tools that will
a) allow you to grab the blogroll (only) of any site into a list of URLs (yeah, ok, it might not be perfect since some sites call it "sites I like" etc.)
b) same, but export as OPML so you can subscribe.
Thanks!
Scott
-
Not at all. I guess my feeling here is that there is a sort of untapped social graph defined by blogrolls. If it were simple to harvest them upon visiting a blog (e.g. this blogger recommends...) one could do a stumble-on-steroids approach to a niche.
-
I thought you might be able to use the outbound link scraper to grab the outbound link onto the page. Pop in your URLS of the pages you want to scrape and it will spit out our a list of those domaind and urls. You can take those urls and put them into the contact finder and it will return the contact details for those sites. Combine the two spreadsheets for an epiuc list of blogs to contact for your outreach.
This is obviously for link building rather than subscribing - sorry if I have misunderstood what you were trying to do
-
Hi Keri,
That is a very cool tool, but is overkill for this. It takes far too many steps to accomplish only part of the desired goal of grabbing all blogroll URLs (within the blogroll DIV tag) and exporting the list to a valid OMPL file or URL list.
thanks!
-
nothing I saw there would do this. It looks like it could manage to list all external links, and I suppose you could manually pick the blogroll out of it.
-
Hi there,
Well, Keris response reminded me of this question and the fact that I found a tool for scraping these kind of lists:
Here it is (with some other cool tools) , have fun:
-
Hi Scott,
I'm going through older questions. Did you ever find a tool to do what you wanted to do here?
-
One thing to look at is Outwit Hub for Firefox. It might be able to help with that. It can scrape data from a page and do a lot with it. http://www.outwit.com/products/hub/. Don't know that it meets all of your needs, but I also haven't seen a response with anything better at the moment.
-
Hey Scott,
What a great question and <sigh>I don't have the answer. I am going to back to find out what people come up with here. Surely there is someone that lurks these parts that can throw something together?</sigh>
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Difference keyword tool and related topics
Hello, Can someone explain me the difference between those 2 ? Because to me they are very similar. Thank you,
Moz Pro | | seoanalytics0 -
Why would only 1 pg of an 18 pg site be crawled?
I signed up yesterday. Added 4 sites. The one I really need data on now has only had 1 page crawled. The other sites have had almost all pages crawled (over 40 each) in one day. What is wrong? The main domain has a 301 redirect to another domain name is that the problem? Is there something wrong at Google analytics? I dont manage this site and I'm picking up behind another firm..where should I start my discovery? Thanks so much!
Moz Pro | | moreidea0 -
How reliable are the external link metrics found in the Open Site Explorer research tool?
I have noticed huge discrepancies with the external link metrics of one of the websites that I have been tracking since the launch of the website. Shortly after launch I captured a screenshot of the external link metrics to the website to track the progress. As I look at the same Open Site Explorer metrics now (2 months later) it has increased by 900% which is very inconsistent with the amount of link building done for the client. Thanks!
Moz Pro | | michaeleagar0 -
Keyword Difficulty Tool not working!
Hello everyone, I quite new to the pro service and I am still on my free trial and I have to say that it is perfect! However, I haven't been able to use the keyword difficulty tool for a couple of days as it keeps saying that there is a problem and that I should try again in 20 minutes. I tried several times and 9 out of 10 times I get this message. I have only managed to get a couple of keywords analyzed. Keep in mind that most of the times I am not using English terms but since it worked in the past I would think that this is not the problem. Any thoughts on how to fix this issue? Thanks in advance!
Moz Pro | | Somondo1 -
What web page and domain analysis / error checking / testing tools do you use for competitor analysis? sites like webpagetest
Just wondering what everyone is using, I am looking to get as much insight and detail as I can on websites that are not currently being monitored by me... i.e. potential clients. I use tools like pagespeed, webpagetest, loadimpact and open site explorer, google adwords, ispionage, alexa, semrush and well, looking for more. I really just want to rip a website to the tiniest pieces possible in an organized and coherent manner... is there anything out there? I have tried several other's which i no longer use (compete, 4q, woopra, nuestar, to name a few), I am not sure if I know exactly what i want, i just want more.... damn the human condition. lol
Moz Pro | | atb9900 -
Comparing with Open Site Explorer
Hi, I am trying to compare a website that has a url of e.g. https://mysite.com on Open Site Explorer. Any idea how to do this? It will only compare it when I use www and it also doesn't accept https. So I am comparing www.mysite.com which has redirects on it https://mysite.com but I am worried it's not comparing the right stats? If this makes sense and you can help it would be greatly appreciated. Cheers
Moz Pro | | Hughescov0 -
Looking for Advice on SEO for Minimilistic Site
Hi everyone, I found this site and it looks like a great place for us to start, we have just opened the doors to our new website fenwaymedia thats a dot co dot uk. Now Ihave tried to maintain the most minimilistic design throughout but I believe with this comes certain drawbacks - - - - S E O which is why i am here. My thought were that I would draw my clients from the Web Design, web Marketing & Blog page which I will starting in 3 hours be constantly updating. but Im not sure Im doing the right thing. I am trying to target Edinburgh, Scotland although we would take work from anywhere, if anyone could spare us a moment, I would greatly appreciate any help, tips or comments that might help us attain some visibility. Thanks in advance and thanks for looking. Kind regards, Craig
Moz Pro | | fenwaymedia0 -
Garbled URL's in Private Messages.
Every time I try to put a url in a Private message it gets garbled up with extra chars and then won't go to the right place. http://www.facebook.com/pages/Mariah-Carle-Photography Becomes: http://www.facebook.com/pages/Mariah58973jhsdfui-Carle%8594743Photography Ok after that test I deliberately garbled a url and it STILL work in the open forums....
Moz Pro | | Mcarle0