GSC is reporting a lot of chopped URLs
-
Recently, in the last two weeks, I started seeing a lot of odd 404 errors in GSC for my site. Upon investigation, the URLs are for fairly new articles, and the URLs are chopped in various places. From missing a character at the end to missing about 10 characters at the end of the URL. (an old similar issue is that GSC reports duplicate contents on weird subdomains that we've never used like 'smtp' 'ww1' or even random ones like 'bobo'.)
GSC doesn't report any 'linked from' for those odd URLs and I know for sure these links aren't on the site itself. They're definitely not errors in the CMS.
The site is a long established site (started 1997-1998) and we've been subject to a lot of negative SEO. I recently had to disavow about 1000 .ru domain linking to us, with some domains containing over a million link each.
Could these chopped links be a new tactic of negative SEO? How do I find these seemingly intentionally broken links to us?
-
Thanks for the question. It isn't uncommon for there to be strange 404 errors in Search Console with little information/bad information. They are working hard to improve this, but I wouldn't take everything you see there as set-in-stone.
This doesn't sound like a negative SEO tactic. I would just mark them all as fixed, and see if they appear again in about a week. If they do, I'd make sure they are actually served as 4xx status and not worry too much about it. If you want to do more digging...
Some ideas of where you could look further
- Logs logs logs. This will be the ultimate truth - you will be able to see whether or not GoogleBot is actually hitting those URLs.
- It could be something weird happening with a plugin of yours that generates those URLs (particularly on Wordpress).
- Perhaps you have a filtering system setup that generates these URLs?
- If you have a search function on the site, sometimes weird URLs can be generated through that.
- Do the URLs come-up when you crawl the site at all?
Just a few ideas!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
URL structure for new product launch
Hello, I work for a company (let's call it companyX) that is about to launch a new product, lets call it ProductY. www.CompanyX.com is an old domain with a good domain authority. The market in which ProductY is being launched is extremely competitive. The marketing department want's to launch ProductY on a new website at www.ProductY.com.
Intermediate & Advanced SEO | | Lvet
My opinion is that we should instead create a subfolder with product information at www.CompanyX.com/ProductY. By doing this we could leverage on the existing domain authority of CompanyX.com Additionally for campaigns, and in order to have a more memorable URL we could use ProductY.com with a 301 redirect to www.CompanyX.com/ProductY What do you think is the best strategy from an SEO point of view? Cheers
Luca0 -
Should you shorten very long URLs?
Hi Moz Community! If the nav architecture URL is long, like this: https://www.savewildlife.org/wildlife-conservtion/endangered-species-act-protections/endangered-species-list/birds/mexican-spotted-owl can I and should I shorten that new destination URL to make it easy for Google to see that the page topic is really the owl, like this: https://savewildlife.org/endangered-species-list/mexican-spotted-owl Thank you! Jane
Intermediate & Advanced SEO | | CalamityJane771 -
Will have /index in my url hurt?
I am trying to setup permalinks on a wordpress blog that is installed on iis. I can't update the web.config file so I have to make every page /index/pagetitle. as shown here-http://codex.wordpress.org/Using_Permalinks#PATHINFO:_.22Almost_Pretty.22 How much of a difference is there between no /index and having the /index in there?
Intermediate & Advanced SEO | | EcommerceSite0 -
Which URL is better for SEO?
We have a URL structure question: Because we have websites in multiple countries and in multiple languages, we need to add additional elements to our URL structure. Of the two following options, what would be better for SEO? Option 1: www.abccompany.com/abc-ca-en/home.htm Option 2: www.abccompany.com/home.abc.ca.en.htm
Intermediate & Advanced SEO | | northwoods-2603420 -
Company Blog at a different URL
Ok, I have been doing a lot of work over the past 6 months, disavowing low quality links from spammy directories to our company website, etc. However, my efforts seem to have had a negative, not positive effect. This has brought me back to reconsidering what we are doing as we have lost a good amount of traction on the nationwide Google rankings specifically. Considering our company blog - platinumcctv(dot)net - we have used this blog for a long time to inform customers of new products, software developments and then to provide them links to purchase those components. Last week, I revamped the nearly default wordpress theme to another on a piece of advice. However, someone told me that all of our links should be nofollow, even though it is a company blog because we have many links coming from this domain, and it could be found as spammy. Potato/Potato - But before I start the tedious task of changing every link to no follow on a whim, i searched a lot, but have found no CLEAR substantiation of this. Any ideas? Other recommendations appreciated as well! Platinum-CCTV(dot)com
Intermediate & Advanced SEO | | PTCCTV0 -
301 redirect with /? in URL
For a Wordpress site that has the ending / in the URL with a ? after it... how can you do a 301 redirect to strip off anything after the / For example how to take this URL domain.com/article-name/?utm_source=feedburner and 301 to this URL domain.com/article-name/ Thank you for the help
Intermediate & Advanced SEO | | COEDMediaGroup0 -
What will the effect of normalising the case of my URLs be?
Hi all, I have a web site with a selection of pages with excellent rankings, mostly in the top 3 for the keywords we want to rank for. Currently, the URLs are mostly presented mixed case, like this: www.mydomain.com/Type/ITEM-IDENTIFIER/ However we have problems of different cases being used in different parts of our application, and also it's obviously not that attractive the way it is. What we are proposing to do is deploy a change to our web site that lowercases all URLs in internal links, as well as present the URLs in lowercase in our sitemap.xml, and provide any links to partners from this point on in lowercase format. We are also proposing to 301 redirect any non-lowercase URLs to the lowercase version. These pages already have a canonical link tag due to us hosting different versions of these pages on multiple domains, for skinning purposes. The link in the canonical link tag will also be changed to be lowercase. What I am concerned about is, URLs of the case above have been in the rankings for a few years now, and if all of a sudden our links are all lowercase, will they drop off the rankings? Or will the above measures mean that the pagerank is transferred to the lowercase version of the URL? Thanks in advance, James
Intermediate & Advanced SEO | | SeeTickets0 -
Googlebot crawling partial URLs
Hi guys, I've checked my email this morning and I've got a number of 404 errors over the weekend where Google has tried to crawl some of my existing pages but not found the full URL. Instead of hitting 'domain.com/folder/complete-pagename.php' it's hit 'domain.com/folder/comp'. This is definitely Googlebot/2.1; http://www.google.com/bot.html (66.249.72.53) but I can't find where it would have found only the partial URL. It certainly wasn't on the domain it's crawling and I can't find any links from external sites pointing to us with the incorrect URL. GoogleBot is doing the same thing across a single domain but in different sub-folders. Having checked Webmaster Tools there aren't any hard 404s and the soft ones aren't related and haven't occured since August. I'm really confused as to how this is happening.. Thanks!
Intermediate & Advanced SEO | | panini0