Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies.
Why are Seemingly Randomly Generated URLs Appearing as Errors in Google Webmaster Tools?
-
I've been confused by some URLs that are showing up as errors in our GWT account. They seem to just be randomly generated alphanumeric strings that Google is reporting as 404 errors. The pages do 404 because nothing ever existed there or was linked to. Here are some examples that are just off of our root domain:
/JEzjLs2wBR0D6wILPy0RCkM/WFRnUK9JrDyRoVCnR8=
/MevaBpcKoXnbHJpoTI5P42QPmQpjEPBlYffwY8Mc5I=
/YAKM15iU846X/ymikGEPsdq 26PUoIYSwfb8 FBh34=
I haven't been able to track down these character strings in any internet index or anywhere in our source code so I have no idea why Google is reporting them.
We've been pretty vigilant lately about duplicate content and thin content issues, and my concern is that there is an unknown number of URLs like this that Google thinks exist but that really don't.
Has anyone else seen GWT reporting errors like this for their site? Does anyone have any clue why Google would report them as errors?
-
Glad to hear you located the root cause of the issue.
-
Update: I think I found the source of the errors. We have a form submission box on every page of our site, and each page contains a unique random alphanumeric string that looks like the URLs Google is reporting as errors. Maybe Google is trying to use our form submission tool.
It's still really odd that Google would think they are URLs and report them as errors, since they appear in an <input> tag, not an <a> tag:
```
<input id="authenticity_token" name="authenticity_token" type="hidden" value="ElCkBCszxMddaqVyVIx+flkzRlM9xb7mHb3cFISlJvY=" />
```
-
The next step would be to check the Google WMT > Diagnostics > Crawl errors > Not Found tab.
Click the "Linked from Detected" text. That will show the source of the link.
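Once you know which pages the errors are linked from, you can check whether the reported strings actually appear somewhere in those pages' source. Here is a minimal Python sketch of that check, using only the standard library. The function and class names are made up for illustration, and it assumes you have already saved the page source as a string; it is not something WMT provides.

```python
from html.parser import HTMLParser
from urllib.parse import unquote


class AttributeValueCollector(HTMLParser):
    """Collect every non-empty attribute value found in a page."""

    def __init__(self):
        super().__init__()
        self.values = []  # list of (tag, attribute, value) tuples

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <input ... /> are also routed here.
        for name, value in attrs:
            if value:
                self.values.append((tag, name, value))


def find_token_sources(html, reported_paths):
    """Return (path, tag, attribute) for each reported path that
    appears inside an attribute value of the given page source."""
    collector = AttributeValueCollector()
    collector.feed(html)
    matches = []
    for path in reported_paths:
        # Drop the leading slash and undo any percent-encoding.
        needle = unquote(path).lstrip("/")
        if not needle:
            continue
        for tag, name, value in collector.values:
            if needle in value:
                matches.append((path, tag, name))
    return matches
```

If a reported path comes back matched to something like `("input", "value")`, the string is coming from a form field on that page rather than from a real link.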
-
That's not the case. We have an in-house content system and no shopping cart. These URLs return a 404 error and always have, because they've never existed.
No crawlers have returned URLs that look like this, and I can't get these URLs to show up in Google's index either.
-
Hi Jimmy,
My bet is you are using a CMS or Shopping Cart on your site. The software is generating these seemingly random URLs, and the Google crawler is seeing them.
Crawlers don't move through your site the way people do. If you take your home page and right-click, then choose View Page Source, you will find many URLs. The crawler will then follow each and every URL throughout your site. The URLs you are seeing could be generated on any given page.
To track the issue, use the SEOmoz crawler on your site, then analyze the results. When you find these types of URLs, take note of the page they are located on.
Alternatively, you can contact your CMS or shopping cart's software developer and inquire about the URLs. If these URLs show a folder path before the random characters, that information can help track down which pages or software are generating them.
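To get a rough idea of what a crawler can pull out of a page's source, here is a small Python sketch that lists every `href`, `src`, and `action` value on a page, resolved against the page's address. It is only an illustration: the names are hypothetical, the choice of attributes is an assumption, and real crawlers (Google's included) may find URLs in other ways.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Attributes that commonly carry URLs (an assumed, non-exhaustive set).
URL_ATTRIBUTES = {"href", "src", "action"}


class LinkExtractor(HTMLParser):
    """Collect URL-bearing attribute values from a page's source."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []  # list of (tag, attribute, absolute_url) tuples

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in URL_ATTRIBUTES and value:
                # Resolve relative paths against the page's own address.
                self.links.append((tag, name, urljoin(self.base_url, value)))


def extract_links(html, base_url):
    """Return every (tag, attribute, absolute_url) found in the page source."""
    extractor = LinkExtractor(base_url)
    extractor.feed(html)
    return extractor.links
```

Running this over the source of each page and searching the output for the odd-looking strings is one way to note which page they are located on.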