Should a sitemap include all site links or just the ones we want indexed?
-
I have a quick sitemap question. We have a client's site built in OpenCart and are getting ready to submit the sitemap. The default sitemap setting generates URLs directly off the root, for example site.com/product. These URLs are also accessible through the site itself. We prefer to give the site some depth and have structured the products so the URLs are site.com/category/product. All of the product pages have canonicals that include the category, so we should not have to worry about duplicate content on the /product page vs. the /category/product page. My question: both types of product URLs are included in the sitemap at the moment. Since we don't want Google to index the /product URLs, should we leave them off the sitemap even though they are readily accessible from the frontend (though not linked)? Or should we just leave them in and let the canonical tag direct Google as to which URLs to index? Thanks in advance.
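To make the first option concrete, here is a minimal sketch of filtering a sitemap so that only the deeper /category/product style URLs remain. It assumes that every URL with a single path segment is a root-level product URL, which would also drop pages like /about, so the rule would need adjusting for a real site. The function name `keep_category_urls` and the sample URLs are hypothetical and not part of OpenCart.

```python
from urllib.parse import urlparse
import xml.etree.ElementTree as ET

# Namespace used by the standard sitemap protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def keep_category_urls(sitemap_xml: str) -> list[str]:
    """Return only the <loc> URLs whose path has at least two segments.

    Assumption: one-segment paths (site.com/product) are the unwanted
    root-level product URLs; deeper paths are the preferred ones.
    """
    root = ET.fromstring(sitemap_xml)
    kept = []
    for loc in root.iter(f"{{{SITEMAP_NS}}}loc"):
        url = (loc.text or "").strip()
        segments = [s for s in urlparse(url).path.split("/") if s]
        if len(segments) >= 2:
            kept.append(url)
    return kept


# Hypothetical two-entry sitemap mirroring the URLs in the question.
example = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://site.com/product</loc></url>
  <url><loc>https://site.com/category/product</loc></url>
</urlset>"""

print(keep_category_urls(example))
```

The kept list could then be written back out as the sitemap that gets submitted, so it names only the URLs the canonicals already point to.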
-
Hi again JS,
I think it's great that you continue to evaluate your platform from all perspectives and weigh its strengths and weaknesses. Many times, a platform can do a lot of the basics well but fall short on the details that differentiate us from our competition. For example, OpenCart may handle the basic SEO requirements well but not include ecommerce microdata (schema.org), which can have a significant impact on our search listings.
You can do a lot of harm or good with the robots.txt file, like blocking crawlers from your entire website (probably not a good thing) or blocking certain directories (your /product issue). I would dig deeper into what you can do with the robots.txt file and how you need it to perform for your business.
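As a rough illustration of what a directory block does, the sketch below uses Python's standard `urllib.robotparser` to test URLs against a rule set. The `/checkout/` rule and the helper name `is_crawl_allowed` are assumptions made up for this example. Python's parser does simple prefix matching, so it may not behave exactly like Googlebot for wildcard patterns.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: block one directory for all crawlers.
ROBOTS_TXT = """\
User-agent: *
Disallow: /checkout/
"""


def is_crawl_allowed(robots_txt: str, url: str, agent: str = "*") -> bool:
    """Return True if the given rules let `agent` crawl `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


print(is_crawl_allowed(ROBOTS_TXT, "https://site.com/checkout/cart"))
print(is_crawl_allowed(ROBOTS_TXT, "https://site.com/category/product"))
```

A prefix rule like this works for a directory, but it cannot single out root-level product URLs such as site.com/product without listing each one.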
-
Hey Raymond,
Thanks for the response. I feel like I'm overthinking this a bit, as usually we just leave our OpenCart setups as is, other than a few minor tweaks. Lately I've really been scrutinizing OpenCart's SEO setup and how to improve it, since it seems there are a lot of gaps in the way it handles this.
I thought robots.txt would be a good way to block the pages, but the issue is that I would need to block every single product page, since OpenCart automatically creates a page for every product at site.com/product, and because we are adding lots of products there should be a better way to handle this. After I posted, I came across this tidbit from a six-year-old Google Webmaster Central blog post. It states: 'While we can't guarantee that our algorithms will display that particular URL in search results, it's still helpful for you to indicate your preference by including that URL in your Sitemap.' I think going this route along with the canonical should do the trick.
-
Hi JStrong,
Great question to be asking and an important topic to do your due diligence on, especially when dealing with an eCommerce-related website.
Google uses a sitemap as a guideline for crawling your site. So, just because you put a URL in your sitemap doesn't mean that the URL will actually be indexed. You can see those stats in your Google Webmaster Tools account, under the Sitemaps area. It will display how many URLs are in the sitemap and how many of those URLs are indexed.
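If you do not want Google to crawl certain pages, you would need to adjust your robots.txt file to give Google those instructions. Keep in mind that robots.txt controls crawling rather than indexing, and Google cannot read the canonical tag on a page it is blocked from crawling.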
As long as you have the correct canonical configurations, you should avoid any duplicate content issues from the URLs you've described above.
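One way to spot-check the canonical configuration is to pull the `rel="canonical"` link out of a page's HTML and compare it with the URL you expect. The sketch below uses only Python's standard `html.parser`. The class and function names, and the sample markup, are hypothetical and stand in for a real product page.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Records the href of the first <link rel="canonical"> tag seen."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.canonical is not None:
            return
        attr_map = dict(attrs)
        # rel can hold several space-separated tokens.
        rel = (attr_map.get("rel") or "").lower().split()
        if "canonical" in rel:
            self.canonical = attr_map.get("href")


def find_canonical(html: str):
    """Return the canonical URL declared in `html`, or None if absent."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


# Hypothetical markup for the root-level /product page.
page_html = (
    '<html><head>'
    '<link rel="canonical" href="https://site.com/category/product">'
    '</head><body></body></html>'
)

print(find_canonical(page_html))
```

If both site.com/product and site.com/category/product return the same /category/product value, the canonicals are set up the way the question describes.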
Good luck!