Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Duplicate Content with ?Page ID's in WordPress
-
Hi there,
I'm trying to figure out the best way to solve a duplicate content problem caused by the page IDs that WordPress automatically assigns to pages. I know that to resolve this I have to use canonical URLs, but I can't figure out the URL structure.
Moz is showing me thousands of duplicate content errors that are mostly related to page IDs.
For example, this is what a page's URL should look like on my site
Moz is telling me there are 50 duplicate content errors for this page. The page ID for this page is 82, so the duplicate content errors appear as follows
and so on, for 47 more pages. The problem repeats itself with other pages as well.
My permalinks are set to "Post Name" so I know that's not an issue.
What can I do to resolve this? How can I use canonical URLs to solve this problem? Any help will be greatly appreciated.
-
This might do it as well:
A flexible pattern URL mapping is a way of redirecting all URLs that match a particular pattern, to equivalent destination URLs, using a single mapping. It does this by allowing you to parse out and name portions of the requested URL to substitute into the destination URL. These types of URL mappings are useful when you are changing the structure or format of your URLs, but want to make sure you can redirect requests for pages under their old URL structure to their new URLs.
An example of a flexible pattern URL is the following:
/myblog/:post-year/:post-month/*rest-of-url?id=:post-id
Each portion of the URL above that starts with a colon (:) creates a named component that is matched until the next delimiter character (./=&?), and any portion that starts with an asterisk (*) creates a named component that is matched until the end of the URL (up to the query string).
The named components can then be used in the URL mapping's destination, with each name included inside of curly braces. For example, the named components defined in the flexible pattern URL above could be used to create the following destination:
/newblog/{post-year}/{post-month}/post-{post-id}/{rest-of-url}
To demonstrate how this flexible pattern URL mapping would work, let's consider the following example requested URL and where it would be redirected. The named components in the requested and destination URLs are highlighted.
Requested URL: http://www.mydomain.com/myblog/2013/12/marketing/inbound-marketing-rocks?id=98765
Redirected to: http://www.mydomain.com/newblog/2013/12/post-98765/marketing/inbound-marketing-rocks
With this pattern-based URL mapping we were able to retain all of the important, identifying parts of the original URL and insert them into the new URL structure. In addition, with this particular mapping, we were able to:
- capture the variable-length {rest-of-url} component (i.e. marketing/inbound-marketing-rocks) to be used in the destination url, by using an asterisk (*) at the beginning of that component's definition
- move the {post-id} component from the query string in the original URL into the middle of the URL in the destination
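The mapping described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the implementation of any particular redirect tool: the function names `compile_pattern` and `apply_mapping` are hypothetical, and the matching rules are assumed from the description (a `:name` component stops at the next `./=&?` delimiter, a `*name` component runs up to the query string).

```python
import re

# A named component starts with ":" or "*" followed by a name such as "post-year".
TOKEN = re.compile(r"([:*])([A-Za-z][A-Za-z0-9-]*)")


def compile_pattern(pattern):
    """Turn a flexible pattern into a regex plus the ordered component names."""
    names = []
    regex = ""
    pos = 0
    for m in TOKEN.finditer(pattern):
        # Literal text between components must match exactly.
        regex += re.escape(pattern[pos:m.start()])
        names.append(m.group(2))
        if m.group(1) == ":":
            regex += r"([^./=&?]+)"  # up to the next delimiter
        else:
            regex += r"([^?]+)"      # up to the query string
        pos = m.end()
    regex += re.escape(pattern[pos:])
    return re.compile("^" + regex + "$"), names


def apply_mapping(url, pattern, destination):
    """Return the destination URL for a matching request, or None."""
    rx, names = compile_pattern(pattern)
    m = rx.match(url)
    if m is None:
        return None
    for name, value in zip(names, m.groups()):
        destination = destination.replace("{" + name + "}", value)
    return destination


PATTERN = "/myblog/:post-year/:post-month/*rest-of-url?id=:post-id"
DESTINATION = "/newblog/{post-year}/{post-month}/post-{post-id}/{rest-of-url}"

print(apply_mapping(
    "/myblog/2013/12/marketing/inbound-marketing-rocks?id=98765",
    PATTERN,
    DESTINATION,
))
```

Run against the path of the example request above, this prints the path of the "Redirected to" URL, which is a quick way to check a pattern before relying on it.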
-
You may want to remove the Flash file from the website. It is not compatible with iPhones, and Flash is really something I would stay away from on websites.
This is the Flash file:
http://www.spamedica.com/wp-content/themes/spamedia/flash/spamedica_flash.swf
See why that's not good
http://www.browserstack.com/screenshots/46bca267a16b8726a26c7438e76317ef51b877be
Try
http://www.browserstack.com/responsive
Here's an example of the exact same thing using the gif
http://cl.ly/image/2t2d470b3A2F/Screen Recording 2014-09-23 at 12.58 PM.gif
iPhone
Your host, http://www.siteground.com/, is great for shared hosting.
Your IP:
181.224.137.194 - 126 other sites hosted on this server
I honestly would give them a call and have them correct your .htaccess file.
However, if you want something that's going to be faster and focused 100% on WordPress, use a managed WordPress hosting provider. With one like GetFlywheel you get your own VPS, with only one site per IP and no shared resources, and this type of stuff is never a problem for them to fix.
In addition, you can migrate 100% free.
Overall I would say that your site needs some TLC.
http://tools.pingdom.com/fpt/#!/exa05N/http://www.spamedica.com/
You have over 100 requests, and your server does a good job of caching after the page has been loaded, but not much before that, so the first visit will be slow.
The most important part is that the site needs to be responsive and work properly on mobile devices.
That includes using CSS 3 and HTML 5 as a substitute for Flash, so that it renders properly and people can browse it on all devices.
http://www.spamedica.com/
Tested from New York City, New York, USA on September 23 at 14:00:29
Page size: 1.9MB
Load time: 2.31s
Requests: 112
Perf. grade: 69/100
Sincerely,
Thomas
-
You can check it with
https://yoast.com/wp-content/permalink-helper.php
https://yoast.com/change-wordpress-permalink-structure/
If you’re on Apache and you decide to do the redirect, having been on a
/yyyy/mm/dd/%postname%/
structure before, you might benefit from this simple redirect which you could throw into your .htaccess file:
RedirectMatch 301 /\d{4}/\d{2}/\d{2}/(.*) http://example.com/$1
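To see what that rule would do before putting it on a live site, here is a rough Python sketch that applies the same regular expression to a few paths. It only imitates the substitution; it is not how Apache itself processes the directive, the helper name `redirect_target` is made up for this example, and it assumes the pattern is matched anywhere in the URL path rather than anchored.

```python
import re

# Same expression as in the RedirectMatch line: /yyyy/mm/dd/ followed by the rest.
DATE_PREFIX = re.compile(r"/\d{4}/\d{2}/\d{2}/(.*)")


def redirect_target(path, base="http://example.com/"):
    """Return the URL the rule would redirect to, or None if it does not match."""
    m = DATE_PREFIX.search(path)
    if m is None:
        return None
    # $1 in the directive corresponds to the first capture group.
    return base + m.group(1)


for path in ["/2014/09/23/my-post/", "/about/", "/2014/09/my-post/"]:
    print(path, "->", redirect_target(path))
```

Only the first path carries a full /yyyy/mm/dd/ prefix, so only it gets a redirect target; the other two are left alone.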
-
PS: Try setting your permalink structure to the way you want it, and make sure you click the save button.
If that does not work, there is a problem that would require access to WordPress to fix.
http://wordpress.stackexchange.com/questions/39665/custom-htaccess-rewrite-rule-for-page
-
Rewrite the URLs through your hosting company or with a plug-in like
https://yoast.com/wordpress/plugins/seo-premium/
Yoast makes the best redirect plug-in for WordPress that I know of.
You can also use the Redirection plug-in
https://wordpress.org/plugins/redirection/
Hosting
http://getflywheel.com/help/do-you-support-htaccess-files/
You will want to clean up what WordPress did not, via 301 redirects.
It is most likely an error in your .htaccess file.
Your WordPress install is also not up to date; you may want to make sure it is updated to 4.0.
Have http://getflywheel.com/tour/ do it
This is something for which I would recommend a company like Getflywheel.com. At $15 a month you get managed WordPress hosting on your own all-SSD VPS, and all the issues with WordPress are taken care of by them. http://getflywheel.com is a bargain.
Hope that helps,
Thomas