Canonical and Sitemap issue
-
Hi all,
I was told that I could change my homepage Canonical tag to match that of my XML sitemap, this sitemap is being generated for me automatically and shows the homepage as e.g. https://www.mysite.com/index.html, yet my Canonical tag has been set to https://www.mysite.com.
Google currently shows as https://www.mysite.com/ being indexed, but https://www.mysite.com/index.html is not currently displayed in search results.
Can someone please tell me if I should change the Canonical to the index.html version, or if I should do nothing, or remove the Canonical tag altogether?
Thank you for looking.
-
I agree with the others. Given "https://www.mysite.com/index.html is not currently displayed in search results", in all likelihood it is being redirected to https://www.mysite.com (and should be). So you don't want to change the canonical to the index.html version of the page only to have it redirected back to https://www.mysite.com. It'll unnecessarily slow the site and might even create a loop.
-
Thank you both, I'll leave it as it is, I'm not able to edit the XML my side sadly.
-
Yes, that's a good point. Canonicals are suggestions for Google, not commands.
-
I see your point, and don't worry about it. Sitemaps help Google find all of your pages and can provide certain other information, but they are not required so no need to overthink them. In general Google is pretty good at finding what it needs to find. And it will certainly find your homepage.
-
I agree with Linda here, I would leave the canonical tag as is. It is a cleaner, better looking URL for the SERPs. If anything, manually update the XML file to reflect the canonical version of the homepage. The main purpose of the XML sitemap is to help search engines crawl and index a website. The homepage is going to be the most frequently crawled page so Google will not have a problem finding it.
Also, do not worry about Google disliking the canonical pointing to .com instead of /index.html. If Google determines that is not the ideal URL for it's index it will ignore the canonical tag.
-
Hi,
Thanks, basically I was concerned that Google may not like that https://www.mysite.com/ was not in the sitemap, yet index.html was and the canonical was pointing to https://www.mysite.com.
If that makes any sense....
-
What are you trying to achieve? Do you particularly want the index.html version to be the canonical? The https://www.mysite.com/ version is more straightforward and what most people would expect your homepage URL to be.
Unless there is some pressing reason to do otherwise, I'd leave it the way it is.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How important is anchor text in your sitemap?
I've read in a few recent articles that using keyword anchor text in your HTML sitemap is a good idea i.e. important. How important do you think it is? I'd love to hear your thoughts. Example 1: Widgets: View All Colors: Red | Blue | Green | Yellow | Orange | Purple Types: Oversized | Large | Small | Miniature Example 2: Widgets: View All Widgets Colors: Red Widgets | Blue Widgets | Green Widgets | Yellow Widgets | Orange Widgets | Purple Widgets Types: Oversized Widgets | Large Widgets | Small Widgets | Miniature Widgets
Web Design | | Choice0 -
Pagenation - Crawl Issue
Hi,
Web Design | | semvibe
We have a site with large number of products (6000 +) under each categories and so we have made a page under each category to list out all products (View all page), which lists out product in pagenation setup built on Ajax. The problem is only our 1st page is crawlable and all the other pages beyond 1st page remains hidden,
We need make all our pagenation URL’s crawlable, our requirements are we never want a change in URL as user goes to next page, want to show the user the same URL for all the pagenation numbers. Is there a perfect solution?0 -
How to fix and issue with robot.txt ?
I am receiving the following error message through webmaster tools http://www.sourcemarketingdirect.com/: Googlebot can't access your site Oct 26, 2012
Web Design | | skehoe
Over the last 24 hours, Googlebot encountered 35 errors while attempting to access your robots.txt. To ensure that we didn't crawl any pages listed in that file, we postponed our crawl. Your site's overall robots.txt error rate is 100.0%. The site has dropped out of Google search.0 -
Panda and Penquin Fall - Could HTML Design an Issue?
Hi, We were hit hard by Panda 3.4 on March 23rd 2012. Then Penguin came along and slapped us down a little farther on April 24th. White hat SEO for 13 years on the site. I have been trying to discover the reason we got hit so hard, to date 90% down. We ae wiped. I have a couple of keywords still #2 and #3 and we see up and down changes in Google webmaster tools, i.e. a keyword is supposedly up 50 points then another down 50. All other 150 keywords that we used to rank on the first page for are not even showing up. I have a person that is about to do a full link analysis but since we never went after links I just never had the feeling that is where our problem is at, but definitely going to explore it. The reason for my post is that last night I spoke with an SEO person that has some pretty good credentials (9 years experience and works currently at large online marketing company with seo with clients like Honda) and he was nice enough to just take a quick look at the site. He said he saw nothing really wrong and did not think that we were hit for any of the normal issues people are listing, i.e. duplicate content, backlinks. His first impression was that we were knocked down because the site is "hard to index". He said the site still uses tables and a lot of our Doc Statements were for HTML 4.01 from 1999. As we all know, there are 'many' experts in this industry. So I wanted a little feedback from the community. Our main site was built in Dreamweaver using tables. We do have a Wordpress blog that is very small and just now posting to add fresh content. (posts seem to rank pretty good, this is why I thought, you know he may be right) Would an older site be penalized like this for using tables? What would you do at this stage if you had a site that is not recovering? I have now reached panic mode and have to do something, just not sure of the next step. I will be happy to post the URL if anyone wants to help with advice. Thanks,
Web Design | | Force7
Force70 -
Mobile Sitemap for Site with Media Queries
I'm doing SEO for a site. It uses Media Queries and the CSS to automatically resize the site for the screen size in use. I.e. the site detects the screen size of say an iPhone and the CSS knows which elements to hide for that screen size and still make it look good. This is great because it will automatically cut down the content to display nicely on small screens - obviating the need for a separate mobile site. What kind of sitemap should be generated since the urls are for desktop and mobile use? Yoast (sweet SEO) said it should have both regular and mobile style sitemap to get both the regular and mobile bots to visit, but didn't elaborate on how that sitemap should look. Do you have a recommendation for how exactly the sitemap should look? Should the sitemap have the urls all twice, i.e. once regular and once with the mobile indicator?
Web Design | | GregoryHaze1 -
Canonical Tag
I've been helping someone out with their website, and I noticed the person who built the site made the canonical tags like this:
Web Design | | StandUpCubicles
href="http://www.example.com/" rel="canonical" /> I'm use to seeing it how seomoz does it: Does this matter? Is it ok to have it inverted? They also have another canonical tag in there like this:
var hs_canonical_url = "http\x3A\x2F\x2Fwww.example.com\x2Fhome" Any idea what that is? Could it be hurting the site?0 -
Custom URL's with Bigcommerce Issue (Is it worth it?)
We're building out a store in Bigcommerce, who for all intensive purposes is perfect for SEO besides the fact that you can not change the URL's to be custom. My question is, does this kill the SEO value of bigcommerce, despite everything else being great? So for example the URL's for a category page would be something like this www.mysite.com/categories/keyword and the product URL's are pulled in by product name, so product URL's could be something like www.mysite.com/products/Product-Description-Long-223.html (notice the words will be capitalized and their is no way to remove the trailing .html) I could go with Interspire (the liscenced version of Bigcommerce) or Magento so I can custom edit this stuff. But then its a lot more work for my employee's on the buildout.
Web Design | | iAnalyst.com0 -
Facing design issue in IE6 and IE7
Hi, I implemented custom search code on my website which is on a test url http://61.246.241.232/. It is working perfectly fine on all the browsers except on IE6 and IE7. Can anyone please let me know the solution to this problem. Thanks & Regards
Web Design | | IM_Learner0