News Errors In Google Search Console
-
Years ago a site I'm working on was publishing news as one form of content on the site.
Since then, has stopped publishing news, but still has a q&a forum, blogs, articles... all kinds of stuff.
Now, it triggers "News Errors" in GWT under crawl errors. These errors are
"Article disproportionately short"
"Article fragmented" on some q&a forum pages
"Article too long" on some longer q&a forum pages
"No sentences found"
Since there are thousands of these forum pages and it's problem seems to be a news critique, I'm wondering what I should do about it. It seems to be holding these non-news pages to a news standard:
https://support.google.com/news/publisher/answer/40787?hl=en
For instance, is there a way and would it be a good idea to get the hell out of Google News, since we don't publish news anymore? Would there be possible negatives worth considering?
What's baffling is, these are not designated news urls. The ones we used to have were /news/title-of-the-story per...
https://support.google.com/news/publisher/answer/2481373?hl=en&ref_topic=2481296
Or, does this really not matter and I should just blow it off as a problem.
The weird thing is that we recently went from http to https and The Google News interface still has us as http and gives the option to add https, which I am reluctant to do sine we aren't really in the news business anymore.
What do you think I should do?
Thanks!
-
Update: 5 months later, the problem has long since gone away.
-
Thanks for the answers Matthew & Martijn. So, I'm going to go with what I have in place... 301s to the main forums page to pick up/recycle the incoming links to that section and removing the /news category from where Google News looks for news, but not leave the Google News program altogether right now.
Thanks, again!
-
I'd opt for 301s if there is any link equity on those pages worth pushing to the forum pages. If not, I'd robots.txt them out to save on wasted crawl.
-
HI Matthew,
Thanks for the insight. On your, "I think the important part is that Google doesn't suddenly crawl a bunch of 404s that they think are news pages of some importance" would you robots.txt out the old category they looked at (/news) or leave it and figure out via the mass of 301s and the removal from the url structure we told them to crawl, so that Google figures it out on it's own?
-
Hi there,
It wouldn't be a problem to get out of Google News, but by the same logic, it wouldn't hurt just to leave it alone and let them figure out you're not publishing news anymore. It won't affect your web search rankings, since you're not targeting traffic from the news onebox or news.google.com.
These old pages redirect to forum pages? I think the important part is that Google doesn't suddenly crawl a bunch of 404s that they think are news pages of some importance.
-
Possible, hard to guess what your current set-up looks like. What you could do alternatively is set up a robots.txt with Disallow statement that are only targeting the Google News bot instead of just the general Google Bot.
-
Hi Martijn,
We still have the old /news url still in Google News. I don't think we've ever submitted a news site map.
Is there any downside to deleting the url structure they're currently looking at, which does forward to our forum? Or, would it be better to just get out of Google News altogether, since we don't really have or publish news anymore? Is there a downside to that?
To summarize... delete the url they look to as news or get out of Google News altogether... what do you think?
Thanks!
-
Are you still submitting a Google News Sitemap to Google Search Console? Because usually that's the biggest reason where these errors are coming from because Google is picking up these kind of 'new' pages/articles as news content and then seeing it doesn't match with their guidelines.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Href Lang Errors
Hi Im getting href lang errors in my site audit for having no self referntial tags. I can't see anything wrong? Can anyone enlighten me 🙂 From this page ** http://www.key.co.uk/en/key/0-4-tonne-wll-6mm-single-leg-wire-rope-sling-580a078n**
Intermediate & Advanced SEO | | BeckyKey0 -
Should I worry about rendering problems of my pages in google search console fetch as google?
Some elements are not properly shown when I preview our pages in search console (fetch as google), e.g.
Intermediate & Advanced SEO | | lcourse
google maps, css tables etc. and some parts are not showing up since we load them asynchroneously for best page speed. Is this something should pay attention to and try to fix?0 -
Website using search term as URL brand name to cheat Google
Google has come a long way over the past 5 years, the quality updates have really helped bring top quality content to the top that is relevant for users search terms, although there is one really ANNOYING thing that still has not been fixed. Websites using brand name as service search term to manipulate Google I have got a real example but I wouldn't like to use it in case the brand mentions flags up in their tools and they spot this post, but take this search for example "Service+Location" You will get 'service+location.com' rank #1 Why? Heaven knows. They have less than 100 backlinks which are of a very low, spammy quality from directories. The content is poor compared to the competition and the competitors have amazing link profiles, great social engagement, much better website user experience and the data does not prove anything. All the competitors are targeting the same search term but yet the worst site is ranking the highest. Why on earth is Google not fixing this issue. This page we are seeing rank #1 do not even deserve to be ranking on the first 5 pages.
Intermediate & Advanced SEO | | Jseddon920 -
Hreflang Tags with Errors in Google Webmaster Tools
Hello, Google Webmaster tools is giving me errors with Hreflang tags that I can't seem to figure out... I've double checked everything: all the alternate and canonical tags, everything seems to match yet Google finds errors. Can anyone help? International Targeting | Language > 'fr' - no return tags
Intermediate & Advanced SEO | | GlobeCar
URLs for your site and alternate URLs in 'fr' that do not have return tags.
Status: 7/10/15
24 Hreflang Tags with Errors Please see attached pictures for more info... Thanks, Karim KQgb3Pn0 -
Number of images on Google?
Hello here, In the past I was able to find out pretty easily how many images from my website are indexed by Google and inside the Google image search index. But as today looks like Google is not giving you any numbers, it just lists the indexed images. I use the advanced image search, by defining my domain name for the "site or domain" field: http://www.google.com/advanced_image_search and then Google returns all the images coming from my website. Is there any way to know the actual number of images indexed? Any ideas are very welcome! Thank you in advance.
Intermediate & Advanced SEO | | fablau1 -
Robots.txt error message in Google Webmaster from a later date than the page was cached, how is that?
I have error messages in Google Webmaster that state that Googlebot encountered errors while attempting to access the robots.txt. The last date that this was reported was on December 25, 2012 (Merry Christmas), but the last cache date was November 16, 2012 (http://webcache.googleusercontent.com/search?q=cache%3Awww.etundra.com/robots.txt&ie=utf-8&oe=utf-8&aq=t&rls=org.mozilla:en-US:official&client=firefox-a). How could I get this error if the page hasn't been cached since November 16, 2012?
Intermediate & Advanced SEO | | eTundra0 -
Somthing weird in my Google Webmaster Tools Crawl Errors...
Hey, I recently (this past may) redesigned my e-commerce site from .asp to .php. I am trying to fix all the old pages with 301 redirects that didn't make it in the switch, but I keep getting weird pages coming up in GWT. I have about 400 pages under crawl errors that look like this "emailus.php?id=MD908070" I delete them and they come back. my site is http://www.moondoggieinc.com the id #'s are product #'s for products that are no longer on the site, but the site is .php now. They also do not show a sitemap they are linked in or any other page that they are linked from. Are these hurting me? and how do I get rid of them? Thanks! KristyO
Intermediate & Advanced SEO | | KristyO0 -
What is a good content for google?
When we start to study SEO and how google see our webpage, one important point is to have good content. But, for beginners like me, we get lost on this. Is not so black and white: what for you is a good content? the text amount matters? there is any trick that all good content websites need to have?
Intermediate & Advanced SEO | | Naghirniac0