Tracking links and duplicate content
-
Hi all,
I have a bit of a conundrum for you all pertaining to a tracking link issue I have run into on a client's site. They currently have a duplicate content problem: over 15,000 pages are being crawled (using Screaming Frog), but only 7,000+ are legitimate pages in the sense that they are not duplicates of other pages.
The client is using Omniture instead of Google Analytics and using an advanced tracking system on their site for internal and external links (ictids and ectids) in the URL parameters. This is creating thousands of duplicated pages being crawled by Google (as seen on their Search Console and on Screaming Frog).
They also are in the middle of moving over from http to https and have thousands of pages currently set up for both, again, creating a duplicate content issue.
What I have suggested for the tracking links is setting up a URL parameter in Search Console for these tracking links. I've also suggested they canonical all tracking links to point to the clean page so the pages that have already been indexed point to the correct clean URL. Does this seem like the appropriate strategy?
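To illustrate what "canonical to the clean page" means in practice, here is a minimal Python sketch that strips tracking parameters from a URL and builds the matching canonical tag. The parameter names `ictid` and `ectid` are an assumption based on the "ictids and ectids" mentioned above, and the function names and example.com URLs are hypothetical.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumed parameter names; the thread only refers to "ictids and ectids".
TRACKING_PARAMS = {"ictid", "ectid"}


def canonical_url(url: str) -> str:
    """Return the URL without tracking parameters (other parameters are kept)."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in TRACKING_PARAMS
    ]
    # The fragment is dropped because it is not part of the indexed URL.
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link rel="canonical"> tag for the page served at this URL."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}">'


if __name__ == "__main__":
    print(canonical_link_tag("http://example.com/shoes?ictid=nav1&color=red"))
```

Parameters that do change the page content (such as `color` in the example) are left in place, so only the tracking variants collapse onto one URL.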
Additionally, I've told them before they submit a new sitemap to Google, they need to switch their website over to https to avoid worsening their duplicate content issue. They have not submitted a sitemap to Google Search Console since March 2015.
Thank you for any help you can offer!
-
Personally, I would submit a clean sitemap ASAP. It's helpful whenever you upload it, and SEO fixes are best made as soon as possible, otherwise you're just leaving traffic on the table.
Plus, I'm skeptical that a move to https will be fast.
That said, there's no reason why you can't move your site to https without already having a clean XML sitemap for the http version. So it's really up to you.
Sorry, that's a little ambiguous! Such is SEO.
Good luck!
Kristina
-
Hey Kristina,
Thanks for the reply! Yes, I have already gone ahead and changed their URL parameters accordingly. I've asked their developer to go through and canonical all tracking link URLs to the clean URL.
The sitemap is a more complicated issue as they haven't had one created in a little more than a year, so it is very out of date. We are working with them to get a clean version of their sitemap in place after we restructure some of their navigation and content.
Do you think there is value in submitting a clean sitemap (without the tracking links) before switching over to https, or should we just wait until after that change is made?
Thanks again for the reply. This is one of the most complicated sites I've ever tackled. Glad to hear I am on the right track!
-
Hey there,
I think you've given your client some good advice. Just to make sure we're all on the same page about how to handle duplicate content created by tracking parameters:
- Make sure to keep the XML sitemap up to date, and only include the canonical versions of URLs
- Canonical all parameter-ed URLs back to a single source, without parameters
- Mark those tracking parameters as "Doesn't affect page content (tracks usage)" in Google Search Console
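As a rough illustration of the first point, the sketch below filters a crawl export down to parameter-free URLs and writes them into a sitemap. It assumes the tracking parameters are named `ictid` and `ectid` and that the crawl is available as a plain list of URLs; the function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from urllib.parse import parse_qsl, urlsplit

# Assumed parameter names; adjust to whatever the site actually uses.
TRACKING_PARAMS = {"ictid", "ectid"}
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def has_tracking(url: str) -> bool:
    """True if the URL carries any of the tracking parameters."""
    query = urlsplit(url).query
    return any(
        key.lower() in TRACKING_PARAMS
        for key, _ in parse_qsl(query, keep_blank_values=True)
    )


def build_sitemap(urls: list[str]) -> str:
    """Return sitemap XML listing each clean URL once, in input order."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    seen = set()
    for url in urls:
        if has_tracking(url) or url in seen:
            continue  # skip tracking variants and exact repeats
        seen.add(url)
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    crawl = [
        "https://example.com/shoes",
        "https://example.com/shoes?ictid=nav1",
        "https://example.com/shoes",
    ]
    print(build_sitemap(crawl))
```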
The parameter issue is something I would fix ASAP, then tackle https when that comes around. At that point, you'll need to make sure ALL http pages 301 redirect to https versions of the page. I haven't worked with Omniture much, but make sure that doesn't break tracking.
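For the https step, a small Python sketch of the expected redirect mapping can help when spot-checking a crawl. It assumes the https site uses the same host and paths as the http one and that URLs carry no explicit port; the function name is hypothetical. The query string is passed through unchanged so tracking parameters still reach the analytics tag after the redirect.

```python
from typing import Optional
from urllib.parse import urlsplit


def https_redirect_target(url: str) -> Optional[str]:
    """Return the https URL an http URL should 301 to, or None if no redirect is needed."""
    parts = urlsplit(url)
    if parts.scheme != "http":
        return None
    # Only the scheme changes; host, path and query string are preserved.
    return parts._replace(scheme="https").geturl()


if __name__ == "__main__":
    print(https_redirect_target("http://example.com/shoes?ectid=mail"))
```

Comparing this expected target with the `Location` header the server actually returns for each http URL is one way to confirm that every page redirects to its own https version rather than, say, the home page.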
Good luck!
Kristina