Google Ignoring Canonical Tag for Hundreds of Sites
-
Bazaarvoice provides a pretty easy-to-use product review solution for websites (especially sites on Magento): https://www.magentocommerce.com/magento-connect/bazaarvoice-conversations-1.html
If your product has over a certain number of reviews/questions, the plugin cuts off the number of reviews/questions that appear on the page.
To see the reviews/questions that are cut off, you have to click the plugin's next or back function. The next/back buttons' URLs have a parameter of "bvstate....."
I have noticed Google is indexing this "bvstate..." URL for hundreds of sites, even with the proper rel canonical tag in place.
Here is an example with Microsoft:
My website is seeing hundreds of these "bvstate" URLs being indexed even though we have a proper rel canonical tag in place. It seems that Google is ignoring the canonical tag. In Webmaster Console, the main source of my duplicate titles/meta descriptions in the HTML Improvements section is the "bvstate" URLs.
I don't necessarily want to block "bvstate" in robots.txt, as that would prevent Google from seeing the reviews that were cut off. The same goes for stopping Google from crawling "bvstate" via the Parameters section of Webmaster Console.
Should I just keep my fingers crossed that Google honors the rel canonical tag?
Home Depot is another site that has this same issue:
-
I have had something similar. This is the response I received:
You don’t have canonical tags on the URL and that’s expected.
On pages where BVSEO is implemented, canonical tags must be updated or removed when the product contains more than one page (more than eight) of reviews. BVSEO paginates the product page so all reviews are in the search engines’ index. Canonical tags that point away from a pagination URL will cause search engines to ignore the paginated content.
When any of the BVSEO pagination parameters are present (bvstate, bvrrp, bvqap, bvsyp, bvpage), do one of the following:
• Remove the canonical tag. This is the most common, recommended solution.
• Append the "name=value" pair to the canonical URL.
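The second option could be sketched in Python roughly like this. It is only an illustration, not Bazaarvoice's code: the function name `canonical_for`, the example URLs, and the example parameter value are all made up, and it assumes every non-BVSEO query parameter should be dropped from the canonical.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Pagination parameters listed in the response above.
BV_PAGINATION_PARAMS = {"bvstate", "bvrrp", "bvqap", "bvsyp", "bvpage"}


def canonical_for(requested_url):
    """Build a canonical URL that keeps only BVSEO pagination pairs.

    Hypothetical helper: it assumes all other query parameters
    (tracking, sorting, etc.) should be stripped from the canonical.
    """
    parts = urlsplit(requested_url)
    kept = [
        (name, value)
        for name, value in parse_qsl(parts.query, keep_blank_values=True)
        if name in BV_PAGINATION_PARAMS
    ]
    # safe=":/" leaves ":" and "/" unescaped in the kept values.
    query = urlencode(kept, safe=":/")
    # Drop the fragment; an empty query yields the bare product URL.
    return urlunsplit((parts.scheme, parts.netloc, parts.path, query, ""))


# Made-up URLs, for illustration only.
print(canonical_for("https://example.com/widget?color=red"))
print(canonical_for("https://example.com/widget?color=red&bvstate=pg:2/ct:r"))
```

With this approach the first page of reviews canonicalizes to the plain product URL, while a paginated request keeps its own "name=value" pair, so the canonical no longer points away from the pagination URL.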
-
I think I found out what is going on.
I have found that the source code does contain the proper rel canonical tag.
However, the Bazaarvoice plugin generates a code snippet in the page's body that includes a [base_url]. The [base_url] should match the canonical tag, but for some reason it doesn't: the generated [base_url] contains the "bvstate" parameter.
Tools like the Mozbar, and I believe even Googlebot, are extracting the [base_url] that appears in the code and using it to override the rel canonical tag.
Complex!
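For anyone who wants to check what the raw source declares, here is a rough Python sketch. It only reads `<link rel="canonical">` elements from static HTML. It does not run JavaScript, and it makes no claim about how the Mozbar or Googlebot pick a canonical. The names `CanonicalFinder`, `find_canonicals`, and `has_bvstate` are my own, and the sample HTML is invented.

```python
from html.parser import HTMLParser
from urllib.parse import parse_qs, urlsplit


class CanonicalFinder(HTMLParser):
    """Collects the href of every <link rel="canonical"> in a page."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        # rel can hold several space-separated tokens.
        rels = (attr.get("rel") or "").lower().split()
        if "canonical" in rels and attr.get("href"):
            self.canonicals.append(attr["href"])


def find_canonicals(html):
    """Return all canonical hrefs found in the given HTML source."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonicals


def has_bvstate(url):
    """True if the URL's query string carries a bvstate parameter."""
    return "bvstate" in parse_qs(urlsplit(url).query, keep_blank_values=True)


# Invented page source, for illustration only.
sample = (
    '<html><head><link rel="canonical" '
    'href="https://example.com/widget"></head><body></body></html>'
)
for href in find_canonicals(sample):
    print(href, "contains bvstate:", has_bvstate(href))
```

If this reports a clean canonical for a page while a toolbar reports one containing "bvstate", the difference is probably coming from something other than the tag in the static source.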
-
Yeah, it's very strange... if you view-source on the BVSTATE url that is cached, the proper canonical tag is in there. Don't know why toolbar apps like Mozbar show otherwise. I think you're right, must be a deeper issue.
-
I just ran this query for bvstate URLs indexed for the H&R Block site. Mozbar shows canonical tags with bvstate in them, and Screaming Frog finds no canonical tags at all. There is a deeper issue that is not simply Google ignoring them.
-
Hey Logan -
The Microsoft canonical is not being obeyed. The canonical tag points to the one representative URL for the product whereas the "bvstate" URL is shown as being cached.
If you do a search in Google for inurl:"bvstate" , you will see hundreds of sites like H&R Block, Kohls, etc.
-
Do you have different examples? The Home Depot link doesn't work when trying to view the actual page on the site. With the Microsoft link, the canonical is working, as the version with the parameter is not indexed in Google, but the canonical version is indexed, which is what I would expect for a canonical that is being obeyed.