How to prevent duplicate content at a calendar page
-
Hi,
I've a calender page which changes every day.
The main url is
/calendarFor every day, there is another url:
/calendar/2012/09/12
/calendar/2012/09/13
/calendar/2012/09/14So, if the 13th september arrives, the content of the page
/calendar/2012/09/13
will be shown at
/calendarSo, it's duplicate content.
What to do in this situation?
a) Redirect from /calendar to /calendar/2012/09/13 with 301? (but the redirect changes the day after to /calendar/2012/09/14)
b) Redirect from /calendar to /calendar/2012/09/13 with 302 (but I will loose the link juice of /calendar?)
c) Add a canonical tag at /calendar (which leads to /calendar/2012/09/13) - but I will loose the power of /calendar (?) - and it will change every day...
Any ideas or other suggestions?
Best wishes,
Georg.
-
Ah... yeah, that's tricky. There's no magic solution, I'm afraid. You've really got three options:
(1) Leave it alone
(2) Re-organize your site architecture to push individual date pages down a level or two, so that they get less internal link-juice.
(3) Re-organize such that you focus search engines on chunks of time or maybe date/aspect combinations, but then de-index the individual date combos. This would take a much better understanding of your site structure than I currently have. The goal would be to focus your index on some smaller combination of pages that still covers 80% of your search traffic.
The big problem is just that this is a lot potential dilution, and I suspect that many of these pages look very similar to Google. I'm also certain that not all pages have the same value, either for SEO or users, so there's some hybrid approach where you could prune back but not lose everything. Long-term, I think that's worth the time and trouble to sort out, but it's not an emergency or something I'd rush into.
-
Hi Peter,
thanks for your answer!
Well, it's even more complicated!
It's an astrology calendar with planet aspect data for each day starting from 1900-01-01 to 2099-12-31, so there are around 73,000 pages, it's a big database.
People are searching for a date and the planet aspects. So I need the "old pages" and the future pages in the index.
People are also searching their birthday and want to know their zodiac. My calendar is providing this info.
This is an example:
http://www.schicksal.com/horoskop/tageshoroskop/1951/09/10The best thing is to do nothing at the moment I think. The alternativ is to cut the content of the current day from the main page and let the user click a button which redirects to the current day page. But this is not user friendly and I will do nothing at them moment.
Any other idea would be great
Best wishes,
Georg.
-
Sadly, the short answer is that you can't have it all. Either you index the separate calendar pages, get more pages/content out there and risk some "thinning" of your index, or you focus on one page, maximize the SEO value, but then lose the individual pages.
I would not 301 or 302 to the individual calendar URLs - that kind of daily URL shifting is going to look suspicious, Google will not re-cache consistently, and you're going to end up with a long-term mess, I strongly suspect.
I actually tend to agree with Muhammed and Paragon that a viable option would be to let the individual days have their own content, but then canonical to the main calendar page to focus the search results. That way, users can still cycle through each individual day, but Google will focus on the core content. In a way, that's how a blog home-page works - the content changes daily, but you're still keeping the bots focused on one URL.
Think of it in terms of usability, too. How valuable is old/outdated content to search users? They might find something relevant on an old page, but they still probably want to see the main calendar and view recent content.
Where are the links to the individual days, if "/calendar" always has today's content? I'm wondering if there's a hybrid approach, like letting the most recent 30 days all have their own URLs, but then redirecting or using rel-canonical to point to the main page after 30 days.
-
What about adding to all of the other pages i.e not to /calendar/ the links will be followed but not indexed by Google.
-
Hi Georg,
Setting up a redirect or canonicalization for the the calendar page in the ways you describe might make it harder to build up any kind of authority for your calendar.
You could consider adding canonicalization for all the individual day pages that points to the main calendar page. ie. Each page /calendar/YYYY/MM/DD would have rel canonlical=/calendar/. Not sure this is the best idea though.
I don't know how your calendar is setup but you could also look at differentiating the pages by doing just a listing of events on the main page and including summaries or detail on the current day page. Or maybe including some additional information about your calendar on the main page like what type of events are included and how to submit events and not including that information on the individual day pages.
I've always taken the approach of minimizing duplicate content as much as possible but not getting excessive with it. I think in a case like this you could do more harm than good. The calendar page is an ever changing page, it's not like you have the exact same static content on two pages.
Hope this helps!
Zach
-
Hi Muhammed,
because the content is different. This would devaluate all calendar pages.
Best wishes,
Georg. -
Hi Georg What about adding canonical tag(s) from each days (/calendar/2012/09/13) calender pages to the main page (/calendar)
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Handling of Duplicate Content
I just recently signed and joined the moz.com system. During the initial report for our web site it shows we have lots of duplicate content. The web site is real estate based and we are loading IDX listings from other brokerages into our site. If though these listings look alike, they are not. Each has their own photos, description and addresses. So why are they appear as duplicates – I would assume that they are all too closely related. Lots for Sale primarily – and it looks like lazy agents have 4 or 5 lots and input the description the same. Unfortunately for us, part of the IDX agreement is that you cannot pick and choose which listings to load and you cannot change the content. You are either all in or you cannot use the system. How should one manage duplicate content like this? Or should we ignore it? Out of 1500+ listings on our web site it shows 40 of them are duplicates.
Technical SEO | | TIM_DOTCOM0 -
Duplicate Page Content but where?
Hi All Moz is telling me I have duplicate page content and sure enough the PA MR mT are all 0 but it doesnt give me a link to this content! This is the page: http://www.orsgroup.com/index.php?page=Scanning-services But I cant find where the duplicate content is other than on our own youtube page which I will get removed here: http://www.youtube.com/watch?v=Pnjh9jkAWuA Can anyone help please? Andy
Technical SEO | | ORS-Group0 -
Duplicate Page Content
Hi, I just had my site crawled by the seomoz robot and it came back with some errors. Basically it seems the categories and dates are not crawling directly. I'm a SEO newbie here Below is a capture of the video of what I am talking about. Any ideas on how to fix this? Hkpekchp
Technical SEO | | mcardenal0 -
Content and url duplication?
One of the campaign tools flags one of my clients sites as having lots of duplicates. This is true in the sense the content is sort of boiler plate but with the different countries wording changed. The is same with the urls but they are different in the sense a couple of words have changed in the url`s. So its not the case of a cms or server issue as this seomoz advises. It doesnt need 301`s! Thing is in the niche, freight, transport operators, shipping, I can see many other sites doing the same thing and those sites have lots of similar pages ranking very well. In fact one site has over 300 keywords ranked on page 1-2, but it is a large site with an 12yo domain, which clearly helps. Of course having every page content unique is important, however, i suppose it is better than copy n paste from other sites. So its unique in that sense. Im hoping to convince the site owner to change the content over time for every country. A long process. My biggest problem for understanding duplication issues is that every tabloid or broadsheet media website would be canned from google as quite often they scrape Reuters or re-publish standard press releases on their sites as newsworthy content. So i have great doubt that there is a penalty for it. You only have to look and you can see media sites duplication everywhere, everyday, but they get ranked. I just think that google dont rank the worst cases of spammy duplication. They still index though I notice. So considering the business niche has very much the same content layout replicated content, which rank well, is this duplicate flag such a great worry? Many businesses sell the same service to many locations and its virtually impossible to re write the services in a dozen or so different ways.
Technical SEO | | xtopher660 -
Duplicate pages problem
The Moz report shows that I have 600 Duplicate pages, How can I locate the problem and how can I fix it?
Technical SEO | | Joseph-Green-SEO0 -
How do i deal with duplicate content on the same domain?
I'm trying to find out if there's a way we can combat similar content on different pages on the same site, without having to re write the whole lot? Any ideas?
Technical SEO | | indurain0 -
Duplicate Content Question
Just signed up for pro and did my first diagnostic check - I came back with something like 300 duplicate content errors which suprised me because every page is unique. Turns out my pages are listed as www.sportstvjobs.com and just sportstvjobs.com does that really count as duplicate? and if so does anyone know what I should be doing differently? I thought it was just a canonical issue, but best I can tell I have the canonical in there but this still came up as a duplicate error....maybe I did canonical wrong, or its some other issue? Thanks Brian Clapp
Technical SEO | | sportstvjobs0