Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies.
Why are Seemingly Randomly Generated URLs Appearing as Errors in Google Webmaster Tools?
-
I've been confused by some URLs that are showing up as errors in our GWT account. They seem to just be randomly generated alphanumeric strings that Google is reporting as 404 errors. The pages do 404 because nothing ever existed there or was linked to. Here are some examples that are just off of our root domain:
/JEzjLs2wBR0D6wILPy0RCkM/WFRnUK9JrDyRoVCnR8=
/MevaBpcKoXnbHJpoTI5P42QPmQpjEPBlYffwY8Mc5I=
/YAKM15iU846X/ymikGEPsdq 26PUoIYSwfb8 FBh34=
I haven't been able to track down these character strings in any internet index or anywhere in our source code so I have no idea why Google is reporting them.
We've been pretty vigilant lately about duplicate content and thin content issues, and my concern is that there is an unknown number of URLs like this that Google thinks exist but that don't really.
Has anyone else seen GWT reporting errors like this for their site? Does anyone have any clue why Google would report them as errors?
-
Glad to hear you located the root cause of the issue.
-
Update: I think I found the source of the errors. We have a form submission box on every page of our site, and each page contains a unique random alphanumeric string that looks like the URLs Google is reporting as errors. Maybe Google is trying to use our form submission tool.
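One rough way to sanity-check this theory is to compare the shape of the reported URLs with the form token. The Python sketch below assumes the token is standard base64 (whose alphabet includes "/" and "+", so a token can look like a path) and that the spaces in the reported URLs were originally "+" characters. Both are guesses on my part, and the helper names are made up for illustration.

```python
import base64
import binascii

# Hidden field value from our form and the URLs Google reported (from this thread).
TOKEN = "ElCkBCszxMddaqVyVIx+flkzRlM9xb7mHb3cFISlJvY="
REPORTED = [
    "/JEzjLs2wBR0D6wILPy0RCkM/WFRnUK9JrDyRoVCnR8=",
    "/MevaBpcKoXnbHJpoTI5P42QPmQpjEPBlYffwY8Mc5I=",
    "/YAKM15iU846X/ymikGEPsdq 26PUoIYSwfb8 FBh34=",
]


def decoded_length(candidate):
    """Return the decoded byte length if candidate is valid base64, else None."""
    # Assumption: a space in a reported URL was a "+" before URL-decoding.
    restored = candidate.replace(" ", "+")
    try:
        return len(base64.b64decode(restored, validate=True))
    except (binascii.Error, ValueError):
        return None


def looks_like_token(reported_path, token_value):
    """True when the path decodes to the same number of bytes as the token."""
    token_len = decoded_length(token_value)
    return token_len is not None and decoded_length(reported_path) == token_len


for path in REPORTED:
    print(path, looks_like_token(path, TOKEN))
```

With the examples from this thread, each reported URL decodes to the same number of bytes as the token. That is consistent with the theory, but it doesn't prove where Google picked the strings up.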
It's still really odd that Google would think they are URLs and report them as errors, since they appear in an <input> tag, not an <a> tag:
```
<input id="authenticity_token" name="authenticity_token" type="hidden" value="ElCkBCszxMddaqVyVIx+flkzRlM9xb7mHb3cFISlJvY=" />
```
-
The next step would be to check Google WMT > Diagnostics > Crawl errors > Not Found tab.
Click the "Linked from Detected" text. That will show the source of the link.
-
That's not the case. We have an in-house content system and no shopping cart. These URLs return a 404 error and always have because they've never existed.
No crawlers have returned URLs that look like this, and I can't get these URLs to show up in Google's index either.
-
Hi Jimmy,
My bet is you are using a CMS or Shopping Cart on your site. The software is generating these seemingly random URLs, and the Google crawler is seeing them.
Crawlers don't move through your site the way people do. If you take your home page and right-click, then choose View Page Source, you will find many URLs. The crawler will then follow each and every URL throughout your site. The URLs you are seeing could be generated on any given page.
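To make that concrete, here is a minimal Python sketch of how a very simple crawler might pull URLs out of page source. It is an illustration only, not how Googlebot actually works: the `LinkCollector` class, the choice of `href`, `src`, and `action` attributes, and the example.com address are all assumptions.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect URLs found in common link-bearing attributes."""

    # Assumption: a simple crawler only looks at these attributes.
    URL_ATTRS = {"href", "src", "action"}

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <input ... />.
        for name, value in attrs:
            if name in self.URL_ATTRS and value:
                # Resolve relative paths against the page's own URL.
                self.urls.append(urljoin(self.base_url, value))


def extract_urls(base_url, html):
    """Return every URL the collector finds in the given HTML, in order."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.urls


page = (
    '<a href="/about">About</a>'
    '<form action="/search"><input type="hidden" value="/abc=" /></form>'
    '<img src="logo.png">'
)
for url in extract_urls("http://example.com/page", page):
    print(url)
```

Running the same kind of extraction over each page and noting where an unexpected URL first appears is one way to narrow down which template or script is producing it. A collector like this one ignores a hidden field's `value`, so strings that only appear there would have to be discovered some other way.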
To track the issue, use the SEOmoz crawler on your site, then analyze the results. When you find these types of URLs, take note of the page they are located on.
Alternatively, you can contact your CMS or shopping cart's software developer and inquire about the URLs. If these URLs show a folder path before the random characters, that information can help track down which pages or software are generating them.