%20 Rewrite in CMS doesn't get picked up by Search Engiens
-
Hi Mozzers I have a little issue on a rewrite that was implemented on a CMS. The CMS was built for my client without the option to put custom slugs in. So it takes the title of a post or page and uses it as a URL, the site was launched with a rewrite so that any space in the title is replaced with a - and that is the permanent URL for that post/page. This morning when I was busy doing my checkup on the site I found that the URLs are being indexed as %20 and not - however, if you navigate through the site the URLs are displaying correctly. How is it that search engines pick this up as a space in the slug if it has clearly been set as a -
anyone had this issue before? Its causing duplicate content issues on the site because both ways display the same post/page. Cheers, Chris Captivate.
-
Hi Irving
Fully aware of that, but if I am not mistaken a URL with a - in and one with %20 in is still seen as 2 different URLs and is duplicate content if they both have the same content on?
Here is an example of the SERP, you can check on it is the URL that is indexed with the - but once you click it, it goes off to the %20 version, however, if you navigate through the site to this blog post it shows as the - URL, now comes the interesting part, if you use the site search and search for "Southern Right Whale" this blog post comes up first and it is indexed in the site search as the %20 version.
Here is the serps
let me know your thoughts.
Chris. -
Google treats - as a space, but that is very strange, the CMS might be rewriting the dash as a space?
Can you send a URL example, I'd like to see this with my own eyes.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How do internal search results get indexed by Google?
Hi all, Most of the URLs that are created by using the internal search function of a website/web shop shouldn't be indexed since they create duplicate content or waste crawl budget. The standard way to go is to 'noindex, follow' these pages or sometimes to use robots.txt to disallow crawling of these pages. The first question I have is how these pages actually would get indexed in the first place if you wouldn't use one of the options above. Crawlers follow links to index a website's pages. If a random visitor comes to your site and uses the search function, this creates a URL. There are no links leading to this URL, it is not in a sitemap, it can't be found through navigating on the website,... so how can search engines index these URLs that were generated by using an internal search function? Second question: let's say somebody embeds a link on his website pointing to a URL from your website that was created by an internal search. Now let's assume you used robots.txt to make sure these URLs weren't indexed. This means Google won't even crawl those pages. Is it possible then that the link that was used on another website will show an empty page after a while, since Google doesn't even crawl this page? Thanks for your thoughts guys.
Intermediate & Advanced SEO | | Mat_C0 -
What Happens If a Hreflang Sitemap Doesn't Include Every Language for Missing Translated Pages?
As we are building a hreflang sitemap for a client, we are correctly implementing the tag across 5 different languages including English. However, the News and Events section was never translated into any of the other four languages. There are also a few pages that were translated into some but not all of the 4 languages. Is it good practice to still list out the individual non-translated pages like on a regular sitemap without a hreflang tag? Should the hreflang sitemap include the hreflang tag with pages that are missing a few language translations (when one or two language translations may be missing)? We are uncertain if this inconsistency would create a problem and we would like some feedback before pushing the hreflang sitemap live.
Intermediate & Advanced SEO | | kchandler0 -
Getting Your Website Listed
Do you have any suggestiongs? I do not know local websites where I can get some easy backlinks. I guess a record in Google Places.would be great as well. Any sound suggestion will be appreciated. Thanks!
Intermediate & Advanced SEO | | stradiji0 -
Mystery 404's
I have a large number of 404's that all have a similar structure: www.kempruge.com/example/kemprugelaw. kemprugelaw keeps getting stuck on the end of url's. While I created www.kempruge.com/example/ I never created the www.kempruge.com/example/kemprugelaw page or edited permalinks to have kemprugelaw at the end of the url. Any idea how this happens? And what I can do to make it stop? Thanks, Ruben
Intermediate & Advanced SEO | | KempRugeLawGroup0 -
I have two sitemaps which partly duplicate - one is blocked by robots.txt but can't figure out why!
Hi, I've just found two sitemaps - one of them is .php and represents part of the site structure on the website. The second is a .txt file which lists every page on the website. The .txt file is blocked via robots exclusion protocol (which doesn't appear to be very logical as it's the only full sitemap). Any ideas why a developer might have done that?
Intermediate & Advanced SEO | | McTaggart0 -
Why isn't google indexing our site?
Hi, We have majorly redesigned our site. Is is not a big site it is a SaaS site so has the typical structure, Landing, Features, Pricing, Sign Up, Contact Us etc... The main part of the site is after login so out of google's reach. Since the new release a month ago, google has indexed some pages, mainly the blog, which is brand new, it has reindexed a few of the original pages I am guessing this as if I click cached on a site: search it shows the new site. All new pages (of which there are 2) are totally missed. One is HTTP and one HTTPS, does HTTPS make a difference. I have submitted the site via webmaster tools and it says "URL and linked pages submitted to index" but a site: search doesn't bring all the pages? What is going on here please? What are we missing? We just want google to recognise the old site has gone and ALL the new site is here ready and waiting for it. Thanks Andrew
Intermediate & Advanced SEO | | Studio330 -
How to get rid from unwanted backlinks
Hello, Before joining seomoz I was desperate to hire seo expert for my website and each posts on my website. I did some research and found some cheap services on fiverr.com as I didn't had huge marketing budget and I purchased some services (gig) on fiverr.com to add many backlinks to my latest post. Now before ordering this services I was on 3rd page of Google for my relevant keywords but after adding backlinks I am not on first 50 pages for my specific keywords. I admit that when I was on 3rd page the competition for my keywords was about 50,000 (with comma ofcourse) and now it is 186,000 so I do believe that competition is increased but I didn't expect such a drop in rankings. I suspect that may be Google put my site far behind because of so much backlinks and that too generated within one week. I didn't know that it will cause such a drop in rankings. I even suspect that may be there were spam backlinkgs or linkings which google doesn't like. So my question is first of all how do I know that is there any backlinks on my website or posts which can harm my rankings and if yes how do I get rid of them. Please guide me as I want to do proper and genuine seo which google likes and finally my rankings get better. Thanks Bhadresh
Intermediate & Advanced SEO | | intmktcom0