Canonical and Sitemap issue
-
Hi all,
I was told that I could change my homepage Canonical tag to match that of my XML sitemap, this sitemap is being generated for me automatically and shows the homepage as e.g. https://www.mysite.com/index.html, yet my Canonical tag has been set to https://www.mysite.com.
Google currently shows as https://www.mysite.com/ being indexed, but https://www.mysite.com/index.html is not currently displayed in search results.
Can someone please tell me if I should change the Canonical to the index.html version, or if I should do nothing, or remove the Canonical tag altogether?
Thank you for looking.
-
I agree with the others. Given "https://www.mysite.com/index.html is not currently displayed in search results", in all likelihood it is being redirected to https://www.mysite.com (and should be). So you don't want to change the canonical to the index.html version of the page only to have it redirected back to https://www.mysite.com. It'll unnecessarily slow the site and might even create a loop.
-
Thank you both, I'll leave it as it is, I'm not able to edit the XML my side sadly.
-
Yes, that's a good point. Canonicals are suggestions for Google, not commands.
-
I see your point, and don't worry about it. Sitemaps help Google find all of your pages and can provide certain other information, but they are not required so no need to overthink them. In general Google is pretty good at finding what it needs to find. And it will certainly find your homepage.
-
I agree with Linda here, I would leave the canonical tag as is. It is a cleaner, better looking URL for the SERPs. If anything, manually update the XML file to reflect the canonical version of the homepage. The main purpose of the XML sitemap is to help search engines crawl and index a website. The homepage is going to be the most frequently crawled page so Google will not have a problem finding it.
Also, do not worry about Google disliking the canonical pointing to .com instead of /index.html. If Google determines that is not the ideal URL for it's index it will ignore the canonical tag.
-
Hi,
Thanks, basically I was concerned that Google may not like that https://www.mysite.com/ was not in the sitemap, yet index.html was and the canonical was pointing to https://www.mysite.com.
If that makes any sense....
-
What are you trying to achieve? Do you particularly want the index.html version to be the canonical? The https://www.mysite.com/ version is more straightforward and what most people would expect your homepage URL to be.
Unless there is some pressing reason to do otherwise, I'd leave it the way it is.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do we need both an .XML Sitemap and a .aspx sitemap?
Hi Mozers, We recently switched servers and it came to my attention that we have two sitemaps a XML version of the sitemap and a .aspx version of the sitemap. This came to light as the .aspx version of the sitemap is causing the site to come to a screeching halt as it has some complex code and lists over 80,000 products. My question is do we need both versions of the sitemap? My understanding is that the XML version is for Search Engine bots and the .aspx version is for customers. I can't imagine that anyone is using our .aspx version as it is basically a page with 80,000 links and it's buried away on the site, so we were hoping to kill off the .aspx version of the sitemap and keep the .xml version for Search Engine Bots. I wanted to check here first to make sure we did not any negative search engine implications. Any help would be most appreciated. Thanks so much! Patrick
Web Design | | gatorpool0 -
How can I fix New 4XX Issue on Site Crawl?
Hi all, My recent site crawl shows 27 4xx issues on this website http://www.rrbusinessconsultants.com/ All of them are for 'posts' on this wordpress website. Here is an example of the issue: http://www.rrbusinessconsultants.com/rr-business-consultants-on-the-rise-of-glassdoor-and-how-companies-are-coping/void(null) The blog page seems to be creating links ending in void(null) which are defaulting to 404 pages. I cannot see the links on the site so cannot see how to remove them. Can anyone provide any insight into how to correct his issue? Many thanks in advance.
Web Design | | skehoe0 -
Redirects Not Working / Issue with Duplicate Page Titles
Hi all We are being penalised on Webmaster Tools and Crawl Diagnostics for duplicate page titles and I'm not sure how to fix it.We recently switched from HTTP to HTTPS, but when we first switched over, we accidentally set a permanent redirect from HTTPS to HTTP for a week or so(!).We now have a permanent redirect going the other way, HTTP to HTTPS, and we also have canonical tags in place to redirect to HTTPS.Unfortunately, it seems that because of this short time with the permanent redirect the wrong way round, Google is confused as sees our http and https sites as duplicate content.Is there any way to get Google to recognise this new (correct) permanent redirect and completely forget the old (incorrect) one?Any ideas welcome!
Web Design | | HireSpace0 -
W3C My site has 157 Errors, 146 warning(s) Is it an issue?
Is having this number of W3C errors & warnings an issue and will be impacting my site's performance? When the site was built 6 months ago my developers told me that it "was nothing to worry about", but I have read that any errors aren't good, let alone the huge number my site has? Your advice please Thanks Ash
Web Design | | AshShep10 -
Sitemap Update Frequency?
Hello, My question today is regarding sitemaps. I'm often confused by this and because I am a bit obsessive I believe I may be giving myself more work than needed.. Basically my question is, do I need to update and/or re-generate my sitemap every time I make a change to the site? I mean, I must have to if I add a page, correct? And so in Google's Webmaster Tools, do I just delete the current sitemap and re-upload a new one for Google to crawl? Is it possible to overdo this? Any sitemap suggestions would be fantastic. I feel like there's been a few weeks where I've updated the sitemap daily and re-submitted it and I worry that might be hurting my site. Thanks!
Web Design | | jesse-landry0 -
Best Practice issue: Modx vs Wordpress
Lately I've been working a lot with Modx to create a new site for our own firm as well for other projects. But so far I haven't seen the advantages for SEO purposes other then the fact that with ModX you can manage almost everything yourself including snippets etc without to much effort. Wordpress is a known factor for blogging and since the last 2 years or so for websites. My question is: Which platform is better suited for SEO purposes? Which should I invest my time in? ModX or Wordpress? Hope to hear your thought on the matter
Web Design | | JarnoNijzing0 -
How to fix and issue with robot.txt ?
I am receiving the following error message through webmaster tools http://www.sourcemarketingdirect.com/: Googlebot can't access your site Oct 26, 2012
Web Design | | skehoe
Over the last 24 hours, Googlebot encountered 35 errors while attempting to access your robots.txt. To ensure that we didn't crawl any pages listed in that file, we postponed our crawl. Your site's overall robots.txt error rate is 100.0%. The site has dropped out of Google search.0 -
Landing Page/Home Page issues
Hi. I was speaking with my designer last night (we are setting up a new website) and we were discussing the design of our homepage, now the designer said he wanted the first page of the website to be a sort of landing page page were the visitor has to click and enter, im sure everyone has all come across these before. However, I am concerned as to the SEO implications of this? Any help guys?
Web Design | | CompleteOffice0