Any tools for scraping blogroll URLs from sites?
-
This question is entirely in the whitehat realm...
Let's say you've encountered a great blog - with a strong blogroll of 40 sites.
The 40-site blogroll is interesting to you for any number of reasons, from link building targets to simply subscribing in your feedreader. Right now, it's tedious to extract the URLs from the site. There are some "save all links" tools, but they are also messy.
Are there any good tools that will
a) allow you to grab the blogroll (only) of any site into a list of URLs (yeah, ok, it might not be perfect since some sites call it "sites I like" etc.)
b) same, but export as OPML so you can subscribe.
Thanks!
Scott
-
Not at all. I guess my feeling here is that there is a sort of untapped social graph defined by blogrolls. If it were simple to harvest them upon visiting a blog (e.g. this blogger recommends...) one could do a stumble-on-steroids approach to a niche.
-
I thought you might be able to use the outbound link scraper to grab the outbound links on the page. Pop in the URLs of the pages you want to scrape and it will spit out a list of those domains and URLs. You can take those URLs and put them into the contact finder and it will return the contact details for those sites. Combine the two spreadsheets for an epic list of blogs to contact for your outreach.
This is obviously for link building rather than subscribing. Sorry if I have misunderstood what you were trying to do.
-
Hi Keri,
That is a very cool tool, but it is overkill for this. It takes far too many steps to accomplish only part of the desired goal of grabbing all blogroll URLs (within the blogroll DIV tag) and exporting the list to a valid OPML file or URL list.
Thanks!
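For anyone who wants to script this, the goal described above (links inside the blogroll container, exported as a URL list or OPML) can be sketched in Python with only the standard library. This is a minimal sketch, not an existing tool: `BlogrollParser` and `to_opml` are hypothetical names, and it assumes the blogroll container has "blogroll" somewhere in its `id` or `class` and that the markup is well nested, which many themes will not satisfy. It writes each site as an OPML `link` outline; a feed reader would normally want `rss` outlines with an `xmlUrl` feed address, which this sketch does not try to discover.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
import xml.etree.ElementTree as ET

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class BlogrollParser(HTMLParser):
    """Collect (title, url) pairs for links inside a 'blogroll' container."""

    def __init__(self, base_url, marker="blogroll"):
        super().__init__()
        self.base_url = base_url
        self.marker = marker      # assumption: appears in the container's id/class
        self.depth = 0            # 0 means we are outside the container
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attrs = dict(attrs)
        if self.depth:
            self.depth += 1
        else:
            label = f"{attrs.get('id') or ''} {attrs.get('class') or ''}".lower()
            if self.marker in label:
                self.depth = 1
        if self.depth and tag == "a" and attrs.get("href"):
            # Resolve relative links against the page URL.
            self._href = urljoin(self.base_url, attrs["href"])
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip() or self._href
            self.links.append((title, self._href))
            self._href = None
        if self.depth:
            self.depth -= 1


def to_opml(links, title="Blogroll"):
    """Serialize (title, url) pairs as OPML 2.0 'link' outlines."""
    opml = ET.Element("opml", version="2.0")
    head = ET.SubElement(opml, "head")
    ET.SubElement(head, "title").text = title
    body = ET.SubElement(opml, "body")
    for text, url in links:
        ET.SubElement(body, "outline", text=text, type="link", url=url)
    return ET.tostring(opml, encoding="unicode")


if __name__ == "__main__":
    sample = """
    <div id="sidebar">
      <a href="/about">About</a>
      <ul class="blogroll">
        <li><a href="http://example.com/">Example Blog</a></li>
        <li><a href="http://example.org/">Another Blog</a></li>
      </ul>
    </div>
    """
    parser = BlogrollParser("http://myblog.example/")
    parser.feed(sample)
    for _, url in parser.links:
        print(url)                 # plain URL list
    print(to_opml(parser.links))   # OPML document as a string
```

Sites that label the section "sites I like" or similar would need a different `marker`, or a manual pick of the right container.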
-
Nothing I saw there would do this. It looks like it could manage to list all external links, and I suppose you could manually pick the blogroll out of it.
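The "list all external links, then pick by hand" fallback is easy to approximate without any particular tool. The following is a rough sketch under stated assumptions: `LinkCollector` and `external_links` are hypothetical names, "external" is taken to mean "host differs from the page's host" (so `www.` and bare-domain variants count as different hosts), and only `http`/`https` links are kept.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the raw href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def external_links(html, page_url):
    """Return de-duplicated absolute links that point away from page_url's host."""
    collector = LinkCollector()
    collector.feed(html)
    page_host = urlparse(page_url).netloc.lower()
    seen, result = set(), []
    for href in collector.hrefs:
        url = urljoin(page_url, href)          # resolve relative links
        parts = urlparse(url)
        if parts.scheme not in ("http", "https"):
            continue                           # skip mailto:, javascript:, etc.
        if parts.netloc.lower() == page_host:
            continue                           # internal link
        if url not in seen:
            seen.add(url)
            result.append(url)
    return result
```

The output still mixes blogroll entries with every other outbound link in posts, ads, and footers, so the manual pick mentioned above remains necessary.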
-
Hi there,
Well, Keri's response reminded me of this question and the fact that I found a tool for scraping this kind of list.
Here it is (with some other cool tools), have fun:
-
Hi Scott,
I'm going through older questions. Did you ever find a tool to do what you wanted to do here?
-
One thing to look at is Outwit Hub for Firefox. It might be able to help with that. It can scrape data from a page and do a lot with it. http://www.outwit.com/products/hub/. I don't know whether it meets all of your needs, but I also haven't seen a response with anything better so far.
-
Hey Scott,
What a great question and (sigh) I don't have the answer. I am going to check back to find out what people come up with here. Surely there is someone who lurks in these parts who can throw something together?