Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
CGI Parameters: should we worry about duplicate content?
-
Hi,
My question is directed to CGI Parameters. I was able to dig up a bit of content on this but I want to make sure I understand the concept of CGI parameters and how they can affect indexing pages.
Here are two pages:
No CGI parameter appended to end of the URL:
http://www.nytimes.com/2011/04/13/world/asia/13japan.html
CGI parameter appended to the end of the URL:
http://www.nytimes.com/2011/04/13/world/asia/13japan.html?pagewanted=2&ref=homepage&src=mv
Questions:
Can we safely say that CGI parameters = URL parameters that append to the end of a URL? Or are they different? And given that you have rel canonical implemented correctly on your pages, search engines will move ahead and index only the URL that is specified in that tag?
Thanks in advance for giving your insights. Look forward to your response.
Best regards,
Jackson
-
Since it is a duplicate and meant for mobile devices, then yes, I would use a canonical tag or even noindex if you don't want it in the index anyway. Either method would eliminate the duplicate content problem.
-
The page content is the exact same, the the layout is built for a mobile device. So in essence we don't know why it would be indexed, unless that happens for mobile browsing pages...
So the solution is to put a rel-canonical tag on that trailing parameter page to prevent duplicate content.
-
Is the page with device=iphone&c=y different than example.html? If not, you should make sure to add the canonical tag to it. If it is different, then you shouldn't add it because it's not a duplicate.
-
Hi Steve,
Another thing I came across... a page with trailing parameters like ?device=iphone&c=y is rendering a different set of code. So we have the original page with the content, and then we have www.example.html?device=iphone&c=y. The one with the trailing parameter doesn't have a canonical tag attached to it, but it's indexed in Google (when we search the www.example.html URL) it shows up as number two.
Do you have any insights into this? Will this be a duplicate content issue?
Thanks!
Jackson
-
Thank you Steve for your response. I had come across Dr. Pete's post in the past but forgot about it. Nonetheless, the CGI parameter explanation and the use of canonical tags answers my question.
Jackson
-
Yes, you can say CGI parameters = URL parameters. I don't think many people refer to them as CGI parameters anymore though.
To answer your question, yes, as long as you have rel canonical set up correctly, then the URL parameters won't hurt your indexing.
For example, if you have your rel canonical set to http://mysite.com/japan.html
Then, only that page will be indexed, even if there are various parameters such as
http://mysite.com/japan.html?source=something&whateva=somethingelse
Just MAKE SURE to setup rel canonical correctly because it can be bad if you don't. Check out Dr. Pete's post about this: http://www.seomoz.org/blog/catastrophic-canonicalization
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content, although page has "noindex"
Hello, I had an issue with some pages being listed as duplicate content in my weekly Moz report. I've since discussed it with my web dev team and we decided to stop the pages from being crawled. The web dev team added this coding to the pages <meta name='robots' content='max-image-preview:large, noindex dofollow' />, but the Moz report is still reporting the pages as duplicate content. Note from the developer "So as far as I can see we've added robots to prevent the issue but maybe there is some subtle change that's needed here. You could check in Google Search Console to see how its seeing this content or you could ask Moz why they are still reporting this and see if we've missed something?" Any help much appreciated!
Technical SEO | | rj_dale0 -
Recurring events and duplicate content
Does anyone have tips on how to work in an event system to avoid duplicate content in regards to recurring events? How do I best utilize on-page optimization?
Technical SEO | | megan.helmer0 -
Duplicate content on job sites
Hi, I have a question regarding job boards. Many job advertisers will upload the same job description to multiple websites e.g. monster, gumtree, etc. This would therefore be viewed as duplicate content. What is the best way to handle this if we want to ensure our particular site ranks well? Thanks in advance for the help. H
Technical SEO | | HiteshP0 -
Duplicate Page Content and Titles from Weebly Blog
Anyone familiar with Weebly that can offer some suggestions? I ran a crawl diagnostics on my site and have some high priority issues that appear to stem from Weebly Blog posts. There are several of them and it appears that the post is being counted as "page content" on the main blog feed and then again when it is tagged to a category. I hope this makes sense, I am new to SEO and this is really confusing. Thanks!
Technical SEO | | CRMI0 -
.com and .co.uk duplicate content
hi mozzers I have a client that has just released a .com version of their .co.uk website. They have basically re-skinned the .co.uk version with some US amends so all the content and title tags are the same. What you do recommend? Canonical tag to the .co.uk version? rewrite titles?
Technical SEO | | KarlBantleman0 -
Squarespace Duplicate Content Issues
My site is built through squarespace and when I ran the campaign in SEOmoz...its come up with all these errors saying duplicate content and duplicate page title for my blog portion. I've heard that canonical tags help with this but with squarespace its hard to add code to page level...only site wide is possible. Was curious if there's someone experienced in squarespace and SEO out there that can give some suggestions on how to resolve this problem? thanks
Technical SEO | | cmjolley0 -
Mod Rewrite / .htaccess avoid duplicate content
I have been searching and testing for hours but cannot find a solution. I am able to get a URL to display with out the file exntension. i.e domain.com/file instead of domain.com/file.php The problem is both versions of the URL above work, therefore a duplicate content issue. How can I force the URL with the file extension not to resolve and give a 404 error? Or just redirect to the non extension URL? IF it helps here is my code. Options +FollowSymLinks
Technical SEO | | MiamiWebCompany
RewriteEngine On RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteCond %{REQUEST_FILENAME}.php -f
RewriteRule ^(.+)$ $1.php [L,QSA]0 -
Duplicate content and http and https
Within my Moz crawl report, I have a ton of duplicate content caused by identical pages due to identical pages of http and https URL's. For example: http://www.bigcompany.com/accomodations https://www.bigcompany.com/accomodations The strange thing is that 99% of these URL's are not sensitive in nature and do not require any security features. No credit card information, booking, or carts. The web developer cannot explain where these extra URL's came from or provide any further information. Advice or suggestions are welcome! How do I solve this issue? THANKS MOZZERS
Technical SEO | | hawkvt10