Is it a problem that Google's index shows paginated page urls, even with canonical tags in place?
-
Since Google shows more pages indexed than makes sense, I used Google's API and some other means to get everything Google has in its index for a site I'm working on.
The results bring up a couple of oddities.
It shows a lot of urls to the same page, but with different tracking code.The url with tracking code always follows a question mark and could look like:
http://www.MozExampleURL.com?tracking-example
http://www.MozExampleURL.com?another-tracking-examle
http://www.MozExampleURL.com?tracking-example-3
etc
So, the only thing that distinguishes one url from the next is a tracking url. On these pages, canonical tags are in place as:
<link rel="canonical<a class="attribute-value">l</a>" href="http://www.MozExampleURL.com" />
So, why does the index have urls that are only different in terms of tracking urls? I would think it would ignore everything, starting with the question mark. The index also shows paginated pages. I would think it should show the one canonical url and leave it at that. Is this a problem about which something should be done? Best... Darcy
-
Hi Samuel,
Thank you for the detailed answer. A couple of things;
My two "L" typo is just as written here... not on the site. Sorry about that.
On the use of the url parameters indexed, those are used internally, but they're set in GWT as having no effect and to only look at the representative url,.. everything before the question mark.
On your point about rel canonicals, one way we use them is in a category pages which are long lists of other pages. In that case it looks at page one of the long list as the canonical.
With that in mind, along with all the duplicate stuff in the index (paginated page #s, ignored url parameters), what would you suggest I change?
Thanks... Darcy
-
A couple of things. First, a rel=canonical tag -- like many other things -- is only a suggestion to search engines. Google and others can choose to ignore it, though they rarely do. In your post above, you have "canonicall" spelled with two "l"s -- so it might be as simple as changing that!
Second, just to clarify your teminology: What you are showing is not "tracking code" but "URL paramaters." I'm curious as to why the pages with tracking paramaters are being indexed -- normally, this should not happen at all. How are you using the paramaters? Usually, it should only be used to track traffic from external websites. For example: If I run a Facebook ad campaign, I can add a parameter to the ad's destination URL to track the results of the campaign. Google, however, would not index that special URL as a separate page. I'd review Google's information and recommendations on URL paramaters and perhaps change any settings in Google Webmaster Tools.
Third, the recommended practice for paginated pages is to have a "single page" version of the article and make that canonical for search engines (have all paginated pages point to that single-page one with a rel=canonical tag). This can be done whether you want to show a single-page version for users -- though I'd recommend it because most pagination is a cheap attempt just to get more pageviews for advertising revenue, and it's annoying.
Good luck -- I hope this helps!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Startpage and shop page shows the same thing, shall i set canonical url?
Our startpage http://siga-sverige.se/ and http://siga-sverige.se/butik/ shows the same woocommerce loop of all our products. Shall i set canonical url for http://siga-sverige.se/butik/ to http://siga-sverige.se/? Thanks! / Jonas
Intermediate & Advanced SEO | | knubbz0 -
How to switch from URL based navigation to Ajax, 1000's of URLs gone
Hi everyone, We have thousands of urls generated by numerous products filters on our ecommerce site, eg./category1/category11/brand/color-red/size-xl+xxl/price-cheap/in-stock/. We are thinking of moving these filters to ajax in order to offer a better user experience and get rid of these useless urls. In your opinion, what is the best way to deal with this huge move ? leave the existing URLs respond as before : as they will disappear from our sitemap (they won't be linked anymore), I imagine robots will someday consider them as obsolete ? redirect permanent (301) to the closest existing url mark them as gone (4xx) I'd vote for option 2. Bots will suddenly see thousands of 301, but this is reflecting what is really happening, right ? Do you think this could result in some penalty ? Thank you very much for your help. Jeremy
Intermediate & Advanced SEO | | JeremyICC0 -
Partial Match or RegEx in Search Console's URL Parameters Tool?
So I currently have approximately 1000 of these URLs indexed, when I only want roughly 100 of them. Let's say the URL is www.example.com/page.php?par1=ABC123=&par2=DEF456=&par3=GHI789= All the indexed URLs follow that same kinda format, but I only want to index the URLs that have a par1 of ABC (but that could be ABC123 or ABC456 or whatever). Using URL Parameters tool in Search Console, I can ask Googlebot to only crawl URLs with a specific value. But is there any way to get a partial match, using regex maybe? Am I wasting my time with Search Console, and should I just disallow any page.php without par1=ABC in robots.txt?
Intermediate & Advanced SEO | | Ria_0 -
Could this be seen as duplicate content in Google's eyes?
Hi I'm an in-house SEO and we've recently seen Panda related traffic loss along with some of our main keywords slipping down the SERPs. Looking for possible Panda related issues I was wondering if the following could be seen as duplicate content. We've got some very similar holidays (travel company) on our website. While they are different I'm concerned it may be seen as creating content that is too similar: http://www.naturalworldsafaris.com/destinations/africa-and-the-indian-ocean/kenya/suggested-holidays/the-wildlife-and-beaches-of-kenya.aspx http://www.naturalworldsafaris.com/destinations/africa-and-the-indian-ocean/kenya/suggested-holidays/ultimate-kenya-wildlife-and-beaches.aspx http://www.naturalworldsafaris.com/destinations/africa-and-the-indian-ocean/kenya/suggested-holidays/wildlife-and-beach-family-safari.aspx They do all have unique text but as you can see from the titles, they are very similar (note from an SEO point of view the tabbed content is all within the same page at source level). At the top level of the holiday pages we have a filtered search:
Intermediate & Advanced SEO | | KateWaite
http://www.naturalworldsafaris.com/destinations/africa-and-the-indian-ocean/kenya/suggested-holidays.aspx These pages have a unique introduction but the content snippets being pulled into the boxes is drawn from each of the individual holiday pages. I'm just concerned that these could be introducing some duplicating issues. Any thoughts?0 -
What's the best way to redirect categories & paginated pages on a blog?
I'm currently re-doing my blog and have a few categories that I'm getting rid of for housecleaning purposes and crawl efficiency. Each of these categories has many pages (some have hundreds). The new blog will also not have new relevant categories to redirect them to (1 or 2 may work). So what is the best place to properly redirect these pages to? And how do I handle the paginated URLs? The only logical place I can think of would be to redirect them to the homepage of the blog, but since there are so many pages, I don't know if that's the best idea. Does anybody have any thoughts?
Intermediate & Advanced SEO | | kking41200 -
2 page titles, 1 url in Google SERPS: WTF!?!?
Hey guys, Hope everybody is having a good day. Today i came across something i have never seen in the serps before that i would like to share and getting feedback on. When i search for 'woonverzekering' on google.nl #1 is: **Url: ** www.independer.nl/woonverzekering/intro.aspx
Intermediate & Advanced SEO | | PrizeWize
**page titel: **Woonverzekering - Independer.nl When i search for 'woonhuisverzekering' on google.nl #1 is: **Url: ** www.independer.nl/woonverzekering/intro.aspx
page titel: Woonhuisverzekering? Vergelijk alle soorten woonverzekeringen - Independer.nl So basically 2 different queries show the same url with 2 different page titles in the serps. The only 'weird' thing i could find was a nobreakspace in the page title code: Woonhuisverzekering? Vergelijk alle soorten woonverzekeringen - Independer.nl I'm i missing something completely obvious here? Is this a commonly used technique. Is the page title getting chopped up because of ? What are they doing to get 2 page title results on 1 url?0 -
Should I use both Google and Bing's Webmaster Tools at the same time?
Hi All, Up till now I've been registered only to Google WMT. Do you recommend using at the same time Bing's WMT? Thanks
Intermediate & Advanced SEO | | BeytzNet0 -
Google swapped our website's long standing ranking home page for a less authoritative product page?
Our website has ranked for two variations of a keyword, one singular & the other plural in Google at #1 & #2 (for over a year). Keep in mind both links in serps were pointed to our home page. This year we targeted both variations of the keyword in PPC to a products landing page(still relevant to the keywords) within our website. After about 6 weeks, Google swapped out the long standing ranked home page links (p.a. 55) rank #1,2 with the ppc directed product page links (p.a. 01) and dropped us to #2 & #8 respectively in search results for the singular and plural version of the keyword. Would you consider this swapping of pages temporary, if the volume of traffic slowed on our product page?
Intermediate & Advanced SEO | | JingShack0