How is my competition causing bad crawl errors and links on my site
-
We have a compeditor who we are in a legal dispute at the moment, and they are using under hand tactics to cause us to have bad links and crawl errors and i do not know how they are doing it or how to stop it.
The crawl errors we are getting is the site having two urls together, for example www.testsite.com/www.testsite.com and other errors are pages that we do not even have or pages that are spelt wrong or have a dot after the page name.
We have been told off a number of people in our field that this has also happened to them and i would like to know how they are doing it so we can have this stopped
Since they have been doing this our traffic has gone down by half
-
Hi no there is only me who deals with the site. I have put copyright notice on the site but i will read about authorship accorss the site.
-
Hi Diane,
I like the way Ryan thinks!
...but I am hoping we won't have to go to the length of having to resort to a content bomb.
The lesson in a situation like this is to realize that being a good white hat SEO unfortunately means needing an understanding of some of the tactics used by black hats
or maybe just a little help from some friends
Since there is obviously an issue with your content being copied, the first thing I would do is to implement Authorship markup across your entire site. By doing this you ensure that any content that is "borrowed" is immediately "outed" to the search engines because it doesn't have an external link from your Google profile page, which acts as verification that you are in fact the author of the content. Matt Cutts and Othar Hansson have given a really easy rundown on implementation in this Google Webmaster Help video
For the moment though, it would be better if you can avoid making any changes to the site until we can identify all of the issues in play.
BTW ... I think I have an inkling of what might be going on here ... is there a programmer or designer who is or has been involved in the development of the site besides yourself?
In the meantime, I'm continuing with a diagnostic based on the information we have and will let you know as soon as I have confirmed my suspicions or otherwise.
Hang in there,
Sha
-
May I suggest planting a "bomb" in your content?
Most thieves are lazy. Rather then create content themselves they steal from others. Their laziness is dependable.
Take a look at their copies and determine what content is and is not being stolen. If they are copying everything including the HTML code and meta tags, you can add canonical tags to your site and other helpful code.
If they are not copying the meta tags, you can add fake content and use the noindex, nofollow tag to protect your site, but provide content which otherwise would cause a site to be removed from Google's index. When they steal the content, it wont have the noindex tag and the site would get nailed.
There are many other possibilities but I am confident you can outsmart them if you try. In addition, be sure to report the site(s) to Google: http://www.google.com/support/bin/static.py?page=ts.cs&ts=1114905
Another idea is to copyright your work, thereby protecting it and giving you legal proof the content is yours.
-
thanks for this, will send private message. we have had to redo the site so many times with new content because the content keeps on being stolen by a franchise group who then pass it on to their franchisees.
-
Hi Diane,
Ryan's response is spot on and his suggestions are excellent.
If you can provide the URL(s), then we can take a look and see exactly what is going on with the referring page(s).
If you don't want to share the information publicly in the Q&A, you can private message each of us through your SEOmoz profile page.
If you ever think that someone has access to edit pages on your site without your permission, the first thing to do is to check with your service provider whether there are any active ftp accounts that you are unaware of. I have seen situations before where people have managed to get a "back door" set up and then it is as simple as logging in and changing pages without your knowledge.
Given that this involves a legal dispute, if we can do a proper diagnosis and trace the source of the errors (or if there happens to be a back door in place), then you would be able to:
- issue a cease and desist
- better secure the server against unauthorized access
- Ban any ip addresses identified as malicious
Hope that helps,
Sha
-
You mentioned these are crawl errors. Are you using the SEOmoz crawl report? If so, please look at the "referrer" field. It will offer the page on your site which is providing the bad URL.
If you are willing to share the referring page, we can take a look and possibly provide more detail.
Anyone can create bad links to your site which can appear in Google or Bing WMT. Only someone with the ability to add content on your site can create crawl errors. Either your site is open to user generated content and someone created a bad link, or someone with access to your web server created the content.
-
will do thanks
-
If iit is happniong to many, then maybe its a reason not to suspect them, but like i say go to Bing WMT and have a look at where the links ae comming from.
-
the reason why i know they are behind it, is because other companies in the field of the website have had the same problems and after doing research for our legal team over this matter, we spent time speaking to over 40 people in the field of the website and found it happened to them aswell.
These people are cowboys but hopefully it will all be sorted out soon.
-
I would look in Bing WMT to see where the links are comming from for a start.
i must say also, If you dont know how they are doing it, then maybe you dont know if they are doing it., it may be somthing quite inocent.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Link juice and max number of links clarification
I understand roughly that "Link Juice" is passed by dividing PR by the number of links on a page. I also understand the juice available is reduced by some portion on each iteration. 50 PR page 10 links on page 5 * .9 = 4.5 PR goes to each link. Correct? If so and knowing Google stops counting links somewhere around 100, how would it impact the flow to have over 100 links? IE 50 PR page 150 links on the page .33 *.9 = .29PR to each link BUT only for 100 of them. After that, the juice is just lost? Also, I assume Google, to the best of its ability, organizes the links in order of importance such that content links are counted before footer links etc.
Technical SEO | | sprynewmedia0 -
Cross links between sites
hi, We have several ecommerce sites and we cross linked 3 of them by mistake. We realize that the sites were linked through WMT, We have shut down 2 of the sites about 2 months ago, but WMT still shows the links coming from those 2 sites. how do we make sure that google will see the sites are shut down. Is there a better of way resolving this issue. We are no longer using those sites, so do not need them to be active. whats the best solution to show google that the links are no longer there. Crawler shows that it was able to crawl the site 45 days after it is shut down. thanks nick
Technical SEO | | orion680 -
How does Google Crawl Multi-Regional Sites?
I've been reading up on this on Webmaster Tools but just wanted to see if anyone could explain it a bit better. I have a website which is going live soon which is going to be set up to redirect to a localised URL based on the IP address i.e. NZ IP ranges will go to .co.nz, Aus IP addresses would go to .com.au and then USA or other non-specified IP addresses will go to the .com address. There is a single CMS installation for the website. Does this impact the way in which Google is able to search the site? Will all domains be crawled or just one? Any help would be great - thanks!
Technical SEO | | lemonz0 -
Any way to modify SERP site extension links?
I am not sure if this is doable but one of my clients have their website's SERP with sitelinks extension. In example. www.example.com -example1 -example2 -example3 -example 4 Anyway to modify the sitelinks on the SERP?
Technical SEO | | William.Lau0 -
Nofollow links if you have more than one link on a page to the same destination.
Hi, I am wondering if someone can confirm that its best practice to have nofollow on secondary links on a page. For instance the contact page may have a link in the navigation and in the the blurb down the page have another link to the contact page saying contact us here etc.. So in this instance i would put a nofollow on the secondary link in the blurb would this be the best way to impliment this. Many thanks Chris
Technical SEO | | InteractiveRed670 -
How to remove the 4XX Client error,Too many links in a single page Warning and Cannonical Notices.
Firstly,I am getting around 12 Errors in the category 4xx Client error. The description says that this is either bad or a broken link.How can I repair this ? Secondly, I am getting lots of warnings related to too many page links of a single page.I want to know how to tackle this ? Finally, I don't understand the basics of Cannonical notices.I have around 12 notices of this kind which I want to remove too. Please help me out in this regard. Thank you beforehand. Amit Ganguly http://aamthoughts.blogspot.com - Sustainable Sphere
Technical SEO | | amit.ganguly0 -
Lots of overdynamic URL and crawl errors..
Just wanted some advice. SEOmoz crawl found out about 18,000 errors. The error URLs are all mainly URLs like the one below, which seem to be the registration URL with a re-direct on, going back the product after registration: http://www.DOMAIN.com/index.php?_g=co&_a=reg&redir=/index.php?_a=viewProd%26productId=3465 We have the following line in the robots file to stop the login page from being crawled: Disallow: /index.php?act=login If I add the following, will it stop the error? Disallow: /index.php?act=reg Thanks in advance**.**
Technical SEO | | filarinskis0 -
E-Commerce Site Crawling Problem
Our website displays all of the products in our website If you attempt to visit a category or page that doesn't exist but conforms to our site url structure. Somehow google crawled these pages and indexed them, and they have TONS of duplicate content that hurt us. How do I deal with this problem?
Technical SEO | | 13375auc30