Why HTML entities gets crawled as content keywords in Google search console?
-
My Google search console shows HTML parameters such as div, class, img, src, gif, align as content keywords, but why google crawls HTML parameters as keywords? because of this, I would be losing traffic for my on-page content keywords.
Please let me know how to solve this.
Thanks,
Jenifer
-
Thank u so much for your response chris.
-
Hi Jenifer,
This is an interesting one and I honestly can't figure out what's going on here. Could there be some section(s) of the site that have been obscured that maybe I can't see but Google is picking up?
There are quite a few problems I found in a quick skim of the HTML which might be the problem but honestly this is grasping at straws. It's something I would definitely recommend fixing for separate reasons. See if it makes any difference to this problem but I doubt it will.
An example of what I've noticed:
_widht="100%">
This report from W3C highlights a bunch of others, though not all of them. This report is mostly a red flag that the HTML could do with some attention to tidy up and get more functional._
-
Hi Jenifer,
Very hard to say without being able to take a look at the site. If you're comfortable dropping a link here I can take a closer look for you?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How do I authenticate a script with Search Console API to pull data
In regards to this article, https://moz.com/blog/how-to-get-search-console-data-api-python I've gotten all the way to the part where I need to authenticate the script to run. I give access to GSC and the local host code comes up. In the article, it says to grab the portion between = and #, but that doesnt seem to be the case anymore. This is what comes up in the browser http://localhost/?code=4/igAqIfNQFWkpKyK6c0im0Eop9soZiztnftEcorzcr3vOnad6iyhdo3DnDT1-3YFtvoG3BgHko4n1adndpLqjXEE&scope=https://www.googleapis.com/auth/webmasters.readonly When I put portions of it in, it always comes back with an error. Help!
Technical SEO | | Cnvrt0 -
Having trouble getting listed on Google News
My friend has a site: http://www.behindthehedges.com/ and is having trouble getting accepted into Google News. I read over the technical guidelines and best practices listed on Google's site. It appears they have everything covered. However, they keep getting rejected. Any ideas?
Technical SEO | | ninel_P0 -
Crawl Issues / Partial Fetch Via Google
We recently launched a new site that doesn't have any ads, but in Webmaster Tools under "Fetch as Google" under the rendering of the page I see: Googlebot couldn't get all resources for this page. Here's a list: URL Type Reason Severity https://static.doubleclick.net/instream/ad_status.js Script Blocked Low robots.txt https://googleads.g.doubleclick.net/pagead/id AJAX Blocked Low robots.txt Not sure where that would be coming from as we don't have any ads running on our site? Also, it's stating the the fetch is a "partial" fetch. Any insight is appreciated.
Technical SEO | | vikasnwu0 -
Can Google Crawl This Page?
I'm going to have to post the page in question which i'd rather not do but I have permission from the client to do so. Question: A recruitment client of mine had their website build on a proprietary platform by a so-called recruitment specialist agency. Unfortunately the site is not performing well in the organic listings. I believe the culprit is this page and others like it: http://www.prospect-health.com/Jobs/?st=0&o3=973&s=1&o4=1215&sortdir=desc&displayinstance=Advanced Search_Site1&pagesize=50000&page=1&o1=255&sortby=CreationDate&o2=260&ij=0 Basically as soon as you deviate from the top level pages you land on pages that have database-query URLs like this one. My take on it is that Google cannot crawl these pages and is therefore having trouble picking up all of the job listings. I have taken some measures to combat this and obviously we have an xml sitemap in place but it seems the pages that Google finds via the XML feed are not performing because there is no obvious flow of 'link juice' to them. There are a number of latest jobs listed on top level pages like this one: http://www.prospect-health.com/optometry-jobs and when they are picked up they perform Ok in the SERPs, which is the biggest clue to the problem outlined above. The agency in question have an SEO department who dispute the problem and their proposed solution is to create more content and build more links (genius!). Just looking for some clarification from you guys if you don't mind?
Technical SEO | | shr1090 -
CDN Being Crawled and Indexed by Google
I'm doing a SEO site audit, and I've discovered that the site uses a Content Delivery Network (CDN) that's being crawled and indexed by Google. There are two sub-domains from the CDN that are being crawled and indexed. A small number of organic search visitors have come through these two sub domains. So the CDN based content is out-ranking the root domain, in a small number of cases. It's a huge duplicate content issue (tens of thousands of URLs being crawled) - what's the best way to prevent the crawling and indexing of a CDN like this? Exclude via robots.txt? Additionally, the use of relative canonical tags (instead of absolute) appear to be contributing to this problem as well. As I understand it, these canonical tags are telling the SEs that each sub domain is the "home" of the content/URL. Thanks! Scott
Technical SEO | | Scott-Thomas0 -
We're no longer turning up in Google SERP for our brand search when we used to be #1 after our site update. Any ideas why?
We recently updated our website and during the push, someone mistakenly 301 redirected "www.brandx.com" to "brandx.com" instead of the otherway. Since then, our website no longer turns up for the search "brandx" on Google. We have reversed the mistake a few days ago, but we're still not turning up, and we used to rank #1 in Google SERP. Could it just be due to timing between the crawls and that our www. site didn't make it in Google's index due to this mistake? We have submitted our new sitemap to google a couple of days ago as well, as a side we're still showing up #1 in Bing's results however. And it should still show up based on SEOMoz's SERP report. Any help would help as I'm growing increasingly concerned.
Technical SEO | | JoeLin0 -
"To keyword or not to keyword" in the URL string?
We are debating on whether to use primary keywords in the URL for every page for a new client for the sake of SEO. What is the feeling in the Community on which version is smarter? Version 1: www.abccompany.com/miami-moving-company/about-us www.abccompany.com/miami-moving-company/contact-us etc. etc. Version 2: www.abccompany.com/about-us Thank you for your thoughts!
Technical SEO | | theideapeople0 -
Google Web Master Tools - Keyword Variants & misspelling
We have millions of urls and the technical expertise to write code to fix the spelling of keyword variants Google has discovered and shows us in Web Master tools. Since Google has recognized these as variants, is it worth our time to write code that will fix the spelling of obvious misses?
Technical SEO | | snoopcat0