Link analysis task
-
Hi mozzers,
I am currently working on a phd, and one of the professors asked me for help.
He would like to know how many Danish school websites (n=1500) links to a certain section of a government website (the relevant section has around 1600 pages). The problem is, that the government website is coded very poorly from an seo perspective with lots of strange URL variables, entailing OSE can't give valid data.
So, what would be the best way to check how many of the school websites link? Throw all 1500 website through Xenu, or is there a smarter solution? Maybe the link out feature on Bing?
Any suggestions will be greatly appreciate.
Thanks!
-
Exactely.
Sometimes I am not aware of an answer or question due to to much traffic here in the Q&A section ... I kind of loosing the overview
-
Thank you all for the answers - Incredible helpful!
And yes, this open q&a is great - I am sure it will dominate serious SEO discussions by the end of this year
-
Hey Thomas: I just sat through the video for the tool and it looks amazing! They say it uses 12 different sources for their link data(SEOmoz Linkscape, Majestic SEO, Google data, Yahoo, Technorati, etc.) THen it sounds like they give you a live verification. So to answer your question, it looks like it's doing both.
-
Sweet Petra! This site was completely off my radar. I really like the collective intelligence this Q&A is bringing to SEOmoz.
-
Thank you, Dejan.
Correct me if I am wrong - bur Majestic has a pretty deep crawl of the web right? We are examining websites with potential very few ingoing links.
-
I would suggest Majestic SEO to get solid backlinks and Excel to filter out URLs that do not belong to a desired section.
-
One more question: Is Cemper's tools based on an a priori crrawling like SEOmoz or a live crawling like Xenu?
-
Cool didn't knoe the tool could do it. ALways avoided the website as I do with most other keyword rich domain names
-
E.g. buying a 72 hours day pass - http://www.linkresearchtools.com/products-overview/ for Euro 20,- - worth the money - even Ann Smarty recommended the tool.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do back-links to non indexed sub-domains / sub-directories considered by Google as website backlinks and pass Pagerank to website?
Hi, If some noindexed links on our website or sub-domain got some backlinks, will that backlinks pass Pagerank / linkjuice to website? Will they be considered as backlinks to website by Google? Here is a statement from Matt cutts for the question. My question is same as below with answer? Eric Enge: Can a NoIndex page accumulate PageRank? Matt Cutts: A NoIndex page can accumulate PageRank, because the links are still followed outwards from a NoIndex page. Thanks
Algorithm Updates | | vtmoz0 -
Keyword cannibalization or linking structure?
Hi all, Recently I got an answer from this community about "why our login page is ranking but not my homepage for primary keyword"? Possibilities are keyword cannibalization or linking structure. In our case, our homepage is not ranking for "primary keyword" but ranking for other keywords. If it is linking structure, what might be wrong? Like do we need to unlink login page from many internal links? Thanks
Algorithm Updates | | vtmoz0 -
Drastic Drop in Link Juice
Hi Back in December we shifted my web domain from a gourmetdirect.com to gourmetdirect.co.nz as part of a site-wide revamp. Everything was going along fine until recently when my Linking domains plummeted and external links fell from 6000 approx to 600. We still have the .com live for loads of disfunctional reasons. Can anyone help? I have gone from a top ranker to a no show and my contractors are all shaking their heads.
Algorithm Updates | | GourmetDirect0 -
Clean up of Links, What to get rid of?
We have been cleaning up our back office and preparing our .com domain to take all our future traffic and have got into a debate about how far to clean up the old past links. We have not ever had a penalty on the site as far as we know, but did once get the site taken offline by Google as they thought it was a malware site back in March this year. They put it straight back up and running in 5 hours, but was very strange as it is an amazon-webstore retail site. We are not sure why Google thought (edit: typo) this, so just in-case we have been combing through the historical links and now started to disavow any links we cannot get removed manually. So far just a couple of sites that have no relevance to our retail business. However, the debate we have been having is around Directory listings: Should we get rid of these too? Gut reaction is Yes, based on the need for quality relevant links for the end user, but then some are passing proper links to relevant sections of our site albeit in a directory format. Dmoz comes to mind Any thoughts? Bruce.
Algorithm Updates | | BruceA0 -
Content Caching Memory & Removal of 301 Redirect for Relieving Links Penalty
Hi, A client site has had very poor link legacy, stretching for over 5 years. I started the campaign a year ago, providing valuable good quality links. Link removals and creating a disavow to Google have been done, however after months and months of waiting nothing has happened. If anything, after the recent penguin update, results have been further affected. A 301 redirect was undertaken last year, consequently associating those bad links with the new site structure. I have since removed the 301 redirect in an attempt to detach this legacy, however with little success. I have read up on this and not many people appear to agree whether this will work. Therefore, my new decision is to start a fresh using a new domain, switching from the .com to .co.uk version, helping remove all legacy and all association with the spam ridden .com. However, my main concern with this is whether Google will forever cach content from the spammy .com and remember it, because the content on the new .co.uk site will be exactly the same (content of great quality, receiving hundreds of visitors each month from the blog section along) The problem is definitely link related and NOT content as I imagine people may first query. This could then cause duplicate content, knowing that this content pre-existed on another domain - I will implement a robots.txt file removing all of the .com site , as well as a no index no follow - and I understand you can present a site removal to Google within webmaster tools to help fast track the deindexation of the spammy .com - then once it has been deindexed, the new .co.uk site will go live with the exact same content. So my question is whether Google will then completely forget that this content has ever existed, allowing me to use exactly the same content on the new .co.uk domain without the threat of a duplicate content issue? Also, any insights or experience in the removal of a 301 redirect, detaching legacy and its success would also be very helpful! Thank you, Denver
Algorithm Updates | | ProdoDigital0 -
Do scraped or borrowed articles with my links still pass page rank?
I wrote some articles for Ezine Articles a few years back and i still see links in the ose to my site that are from these articles that were borrowed from the Ezine Articles bank. Do the links in these articles still count toward my site including link juice and anchor text or does google discount them as duplicate content? I was told that Google counts these links for about 3 weeks and then discounts them as duplicate content so it's like they don't exist. Any truth to this or should i make the articles on my site available for people to copy and paste into their blogs as long as they keep my links intact? Thanks, Ron
Algorithm Updates | | Ron100 -
Awarding badges to bloggers as a link-building strategy?
Hello - In the past, doing "Blog awards" and subsequently offering winners' badges with links and anchor text embedded has been something that worked very well for a client. However, we are now noticing that the badging strategy does not seem to be producing the same results for us. We are only reaching out to quality blogs in particular niches - e.g., Top Fashion Blogs, Top Health Blogs, Top Design Blogs, etc. Most blogs that post the badges are in the PageRank 3-5 range. Has anyone else engaged in badging strategy and noticed that the success rates are declining? Could it be that in-text, contextual links are given much more weight than sidebar links? (The badges are most typically posted in bloggers' sidebars). Any insight would be appreciated, thanks!
Algorithm Updates | | brmckenna0 -
FLASH vs HTML links in SEO
In terms of a small flash slideshow and having text and links on various slides within, is such text and links as easily index-able (or even at all) compared to static html text on a webpage?
Algorithm Updates | | heritageseo0