Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies.
Google News URL Format
-
Hi,
We are currently redesigning our gaming website (www.totallygn.com) and one of our main goals is to get listed by Google News in future.
The Google News URL requirements state: "The URL for each article must contain a unique number consisting of at least three digits."
How does this affect SEO structure? I was planning on using a format such as
www.totallygn.com/xbox-360/360-reviews/fifa-12-review
How would this compare to something like the following?
www.totallygn.com/xbox-360/360-reviews/fifa-12-review234
Thanks in advance for your help
-
Hi all,
Is it still the case that you can submit EITHER with 3 digits in the URL OR via a news sitemap? I can't see anything in the official instructions about the sitemap route... they seem pretty insistent on the 3 digit rule though.
-
Can we do it just by submitting a news sitemap via GWT?
-
Do you still have to go through the inclusion process here: http://support.google.com/news/publisher/bin/bin/static.py?hl=en&ts=2394225&page=ts.cs&from=191208
Thanks guys... MB.
-
My site was just accepted into Google News yesterday, and when I went to check the news sitemap, Google Webmaster Tools showed errors for it.
So I tried every WordPress plugin I could find and submitted the news sitemap from each.
Each one had errors. The only one that worked for me (my site is now showing in Google News) is the BWP Google XML Sitemaps plugin.
Hope that helps
-
Hi WalesDragon,
Did these answers solve your question, or are you looking for some more advice still?
-
No worries!
I am pretty sure that plugin is the one which allows the WP admin to select JUST posts and leave out pages... but I am not 100% sure.
The reason I recommended that particular plugin though, is that from experience, many of the other Google news sitemap plugins seem to cause some sort of XML error when submitting the sitemap to Google news, but this one doesn't, so using it should save a few headaches, and having to 'shop around', so to speak!
Another thing to bear in mind: if one section of your site (say, domain.com/news) carries an RSS feed of a different section of your website (say, domain.com/self-promotional-company-blog), and a post on that second blog for any reason ends up with 3 unique digits in its URL, then Google News can find the link in the RSS feed on your news section and index the page on the self-promotional blog in error.
Sounds harmless, but if the news team then decided that you were actually TRYING to get self-promotional stuff (even company news) into Google News, you could lose your news-approved status. The short solution is just to be careful when putting any RSS feeds (of other parts of your site/domain) on your news section! (Hope that makes sense?!) I learned this the hard way (didn't get dropped or anything, as I acted swiftly to sort the issue!).
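One way to audit this is to list every item link in the feed shown on the news section and flag any that point outside it. The following is only a rough sketch: the function name, the `/news/` prefix, and the sample feed are all made up for illustration, and it assumes a plain RSS 2.0 feed.

```python
import xml.etree.ElementTree as ET
from typing import List
from urllib.parse import urlparse


def links_outside_section(rss_xml: str, section_prefix: str) -> List[str]:
    """Return RSS item links whose URL path does not start with section_prefix."""
    root = ET.fromstring(rss_xml)
    outside = []
    # Only look at <item><link> entries, not the channel's own <link>.
    for item in root.iter("item"):
        href = (item.findtext("link") or "").strip()
        if href and not urlparse(href).path.startswith(section_prefix):
            outside.append(href)
    return outside


# Hypothetical feed mixing a news post and a company-blog post.
SAMPLE_FEED = """<rss version="2.0"><channel>
  <title>Example feed</title>
  <link>http://domain.com/news/</link>
  <item><link>http://domain.com/news/some-story-1234</link></item>
  <item><link>http://domain.com/self-promotional-company-blog/offer-567</link></item>
</channel></rss>"""

print(links_outside_section(SAMPLE_FEED, "/news/"))
```

Anything the function returns is a link that a crawler following the news section's feed could pick up from another part of the domain.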
Hope that helps!
Mike.
-
Mike,
Thanks for this, I personally found it helpful. I like the idea of the Google News Plugin and will test it out on a small site.
Good info, Robert
-
In addition to the excellent response by Robert Fisher, below, you do not actually NEED to do this, but you CAN do it automatically if you choose to.
Google News needs...
EITHER a unique 3 digit code in the URL...
OR
A Google news specific sitemap.
So, your options are to either change your WP Permalinks settings (I checked, your site is WordPress based, yes?) to include the post ID, OR use a Google News sitemap plugin.
You can always put a number in front of the post id, so use something like:
/%postname%/1%post_id%
So, you are adding a numerical '1' before %post_id% in your permalinks.
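To see what that structure produces, here is a minimal sketch of the tag substitution. WordPress does this expansion itself; the `expand_permalink` helper and the sample post ID are purely illustrative.

```python
def expand_permalink(structure: str, postname: str, post_id: int) -> str:
    """Substitute the two permalink tags used in the structure above."""
    return (
        structure
        .replace("%postname%", postname)
        .replace("%post_id%", str(post_id))
    )


# A post with the two-digit ID 57 ends up with the three-digit number 157.
print(expand_permalink("/%postname%/1%post_id%", "fifa-12-review", 57))
# prints "/fifa-12-review/157"
```

Note that a single-digit post ID would still only give two digits (for example 1 followed by 5), so the prefix helps once post IDs have reached at least 10.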
If you are worried about lots of 404 errors due to changing your URL structure, then how about using Dean's Permalinks Migration (install it BEFORE changing your permalink settings!) - http://wordpress.org/extend/plugins/permalinks-migration-plugin-for-wordpress/
As for a Google News sitemap... For WordPress, I recommend this one: http://wordpress.org/extend/plugins/gn-xml-sitemap/
If you go down the sitemap route, do be sure that ONLY news posts are included... E.G. NOT your static, non-news content pages!
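For anyone not using a plugin, the idea of "news posts only" can be sketched as below. This is a rough illustration, not what any of the plugins actually do: the `build_news_sitemap` function, the `is_news` flag, and the sample articles are assumptions, while the two XML namespaces are the standard sitemap and Google News sitemap ones.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
NEWS_NS = "http://www.google.com/schemas/sitemap-news/0.9"


def build_news_sitemap(articles, publication_name, language="en"):
    """Build a news sitemap string, skipping anything not flagged as news."""
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("news", NEWS_NS)
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for art in articles:
        if not art.get("is_news"):
            continue  # static pages and other non-news content stay out
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = art["loc"]
        news = ET.SubElement(url, f"{{{NEWS_NS}}}news")
        pub = ET.SubElement(news, f"{{{NEWS_NS}}}publication")
        ET.SubElement(pub, f"{{{NEWS_NS}}}name").text = publication_name
        ET.SubElement(pub, f"{{{NEWS_NS}}}language").text = language
        ET.SubElement(news, f"{{{NEWS_NS}}}publication_date").text = art["published"]
        ET.SubElement(news, f"{{{NEWS_NS}}}title").text = art["title"]
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical content: one news post and one static page.
articles = [
    {"loc": "http://www.example.com/news/fifa-12-review", "title": "FIFA 12 review",
     "published": "2011-10-01", "is_news": True},
    {"loc": "http://www.example.com/about", "title": "About us",
     "published": "2011-01-01", "is_news": False},
]
print(build_news_sitemap(articles, "Example News"))
```

Only the first article ends up in the output; the static "about" page is filtered out before any XML is written.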
IN TERMS OF SEO -
I don't feel it will affect things too much, so long as everything else is good as regards your on-page SEO etc.
Hope that helps!
-
The requirement for three or more digits is about ensuring that there is a unique page for each individual article. If you look at www.totallygn.com/xbox-360/360-reviews/fifa-12-review, it appears to me that the second 360 is still associated with reviews of Xbox 360 games, so it does not identify a particular article. The fifa-12-review appears to be a soccer game (I have never played on one of those things; I am an intelligent worker and not involved in any type of warfare, even modern).
So the second format, where you have review234, does work, because the three-digit number gives that article a unique numeric identifier. (Note: if it is a four-digit number, it cannot start with 199 or 200.)
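The rule as described in this answer can be sketched as a quick check. This is an approximation of the thread's description, not Google's actual logic: the `has_article_id` function is hypothetical, it only looks at the last path segment (on the assumption that numbers in section names such as xbox-360 are shared by many articles), and it cannot tell whether the number is unique across the site.

```python
import re
from urllib.parse import urlparse


def has_article_id(url: str) -> bool:
    """True if the last path segment holds a 3+ digit number that is not year-like."""
    if "://" not in url:
        url = "http://" + url  # let urlparse split host and path correctly
    slug = urlparse(url).path.rstrip("/").rsplit("/", 1)[-1]
    for run in re.findall(r"\d+", slug):
        if len(run) < 3:
            continue
        # Per the note above: a four-digit number must not start with 199 or 200.
        if len(run) == 4 and run.startswith(("199", "200")):
            continue
        return True
    return False


print(has_article_id("www.totallygn.com/xbox-360/360-reviews/fifa-12-review"))     # prints False
print(has_article_id("www.totallygn.com/xbox-360/360-reviews/fifa-12-review234"))  # prints True
```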
In the event there is something that would prevent you from using this convention, you can always create a news Sitemap. Google Support News Sitemap.
Hope this helps, best,
Edit: missed the SEO question. It has a positive effect on SEO, as it follows Google's convention. (One question is whether having a news sitemap would give more credence/weight as a news site than the unique identifier. My guess is it would.)