Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
CGI Parameters: should we worry about duplicate content?
-
Hi,
My question is directed to CGI Parameters. I was able to dig up a bit of content on this but I want to make sure I understand the concept of CGI parameters and how they can affect indexing pages.
Here are two pages:
No CGI parameter appended to end of the URL:
http://www.nytimes.com/2011/04/13/world/asia/13japan.html
CGI parameter appended to the end of the URL:
http://www.nytimes.com/2011/04/13/world/asia/13japan.html?pagewanted=2&ref=homepage&src=mv
Questions:
Can we safely say that CGI parameters = URL parameters that append to the end of a URL? Or are they different? And given that you have rel canonical implemented correctly on your pages, search engines will move ahead and index only the URL that is specified in that tag?
Thanks in advance for giving your insights. Look forward to your response.
Best regards,
Jackson
-
Since it is a duplicate and meant for mobile devices, then yes, I would use a canonical tag or even noindex if you don't want it in the index anyway. Either method would eliminate the duplicate content problem.
-
The page content is the exact same, the the layout is built for a mobile device. So in essence we don't know why it would be indexed, unless that happens for mobile browsing pages...
So the solution is to put a rel-canonical tag on that trailing parameter page to prevent duplicate content.
-
Is the page with device=iphone&c=y different than example.html? If not, you should make sure to add the canonical tag to it. If it is different, then you shouldn't add it because it's not a duplicate.
-
Hi Steve,
Another thing I came across... a page with trailing parameters like ?device=iphone&c=y is rendering a different set of code. So we have the original page with the content, and then we have www.example.html?device=iphone&c=y. The one with the trailing parameter doesn't have a canonical tag attached to it, but it's indexed in Google (when we search the www.example.html URL) it shows up as number two.
Do you have any insights into this? Will this be a duplicate content issue?
Thanks!
Jackson
-
Thank you Steve for your response. I had come across Dr. Pete's post in the past but forgot about it. Nonetheless, the CGI parameter explanation and the use of canonical tags answers my question.
Jackson
-
Yes, you can say CGI parameters = URL parameters. I don't think many people refer to them as CGI parameters anymore though.
To answer your question, yes, as long as you have rel canonical set up correctly, then the URL parameters won't hurt your indexing.
For example, if you have your rel canonical set to http://mysite.com/japan.html
Then, only that page will be indexed, even if there are various parameters such as
http://mysite.com/japan.html?source=something&whateva=somethingelse
Just MAKE SURE to setup rel canonical correctly because it can be bad if you don't. Check out Dr. Pete's post about this: http://www.seomoz.org/blog/catastrophic-canonicalization
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content and Subdirectories
Hi there and thank you in advance for your help! I'm seeking guidance on how to structure a resources directory (white papers, webinars, etc.) while avoiding duplicate content penalties. If you go to /resources on our site, there is filter function. If you filter for webinars, the URL becomes /resources/?type=webinar We didn't want that dynamic URL to be the primary URL for webinars, so we created a new page with the URL /resources/webinar that lists all of our webinars and includes a featured webinar up top. However, the same webinar titles now appear on the /resources page and the /resources/webinar page. Will that cause duplicate content issues? P.S. Not sure if it matters, but we also changed the URLs for the individual resource pages to include the resource type. For example, one of our webinar URLs is /resources/webinar/forecasting-your-revenue Thank you!
Technical SEO | Apr 1, 2024, 6:55 PM | SAIM_Marketing0 -
Duplicate content, although page has "noindex"
Hello, I had an issue with some pages being listed as duplicate content in my weekly Moz report. I've since discussed it with my web dev team and we decided to stop the pages from being crawled. The web dev team added this coding to the pages <meta name='robots' content='max-image-preview:large, noindex dofollow' />, but the Moz report is still reporting the pages as duplicate content. Note from the developer "So as far as I can see we've added robots to prevent the issue but maybe there is some subtle change that's needed here. You could check in Google Search Console to see how its seeing this content or you could ask Moz why they are still reporting this and see if we've missed something?" Any help much appreciated!
Technical SEO | Jun 9, 2022, 2:29 PM | rj_dale0 -
404 Error Pages being picked up as duplicate content
Hi, I recently noticed an increase in duplicate content, but all of the pages are 404 error pages. For instance, Moz site crawl says this page: https://www.allconnect.com/sc-internet/internet.html has 43 duplicates and all the duplicates are also 404 pages (https://www.allconnect.com/Coxstatic.html for instance is a duplicate of this page). Looking for insight on how to fix this issue, do I add an rel=canonical tag to these 60 error pages that points to the original error page? Thanks!
Technical SEO | May 9, 2016, 12:27 PM | kfallconnect0 -
Handling of Duplicate Content
I just recently signed and joined the moz.com system. During the initial report for our web site it shows we have lots of duplicate content. The web site is real estate based and we are loading IDX listings from other brokerages into our site. If though these listings look alike, they are not. Each has their own photos, description and addresses. So why are they appear as duplicates – I would assume that they are all too closely related. Lots for Sale primarily – and it looks like lazy agents have 4 or 5 lots and input the description the same. Unfortunately for us, part of the IDX agreement is that you cannot pick and choose which listings to load and you cannot change the content. You are either all in or you cannot use the system. How should one manage duplicate content like this? Or should we ignore it? Out of 1500+ listings on our web site it shows 40 of them are duplicates.
Technical SEO | Apr 26, 2015, 10:21 PM | TIM_DOTCOM0 -
Duplicate Page Content and Titles from Weebly Blog
Anyone familiar with Weebly that can offer some suggestions? I ran a crawl diagnostics on my site and have some high priority issues that appear to stem from Weebly Blog posts. There are several of them and it appears that the post is being counted as "page content" on the main blog feed and then again when it is tagged to a category. I hope this makes sense, I am new to SEO and this is really confusing. Thanks!
Technical SEO | Mar 11, 2017, 9:08 AM | CRMI0 -
How to deal with duplicated content on product pages?
Hi, I have a webshop with products with different sizes and colours. For each item I have a different URL, with almost the same content (title tag, product descriptions, etc). In order to prevent duplicated content I'am wondering what is the best way to solve this problem, keeping in mind: -Impossible to create one page/URL for each product with filters on colour and size -Impossible to rewrite the product descriptions in order to be unique I'm considering the option to canonicolize the rest of de colours/size variations, but the disadvantage is that in case the product is not in stock it disappears from the website. Looking forward to your opinions and solutions. Jeroen
Technical SEO | Mar 16, 2015, 11:43 AM | Digital-DMG0 -
Duplicate Content Issue WWW and Non WWW
One of my sites got hit with duplicate content a while ago because Google seemed to be considering hhtp, https, www, and non ww versions of the site all different sites. We thought we fixed it, but for some reason https://www and just https:// are giving us duplicate content again. I can't seem to figure out why it keeps doing this. The url is https://bandsonabudget.com if any of you want to see if you can figure out why I am still having this issue.
Technical SEO | Feb 5, 2015, 7:50 PM | Michael4g1 -
How much to change to avoid duplicate content?
Working on a site for a dentist. They have a long list of services that they want us to flesh out with text. They provided a bullet list of services, we're trying to get 1 to 2 paragraphs of text for each. Obviously, we're not going to write this off the top of our heads. We're pulling text from other sources and trying to rework. The question is, how much rephrasing do we have to do to avoid a duplicate content penalty? Do we make sure there are changes per paragraph, sentence, or phrase? Thanks! Eric
Technical SEO | Mar 20, 2012, 4:58 PM | ericmccarty0