Content with changing URL and duplicate content
-
Hi everyone,
I have a question regarding content (user reviews), that are changing URL all the time.
We get a lot of reviews from users that have been dining at our partner restaurants, which get posted on our site under (new) “reviews”.
My worry however is that the URL for these reviews is changing all the time. The reason for this is that they start on page 1, and then get pushed down to page 2, and so on when new reviews come in.
http://www.r2n.dk/restaurant-anmeldelser
I’m guessing that this could cause for serious indexing problems? I can see in google that some reviews are indexed multiple times with different URLs, and some are not indexed at all.
We further more have the specific reviews under each restaurant profile. I’m not sure if this could be considered duplicate content?
Maybe we should tell google not to index the “new reviews section” by using robots.txt. We don’t get much traffic on these URLs anyways, and all reviews are still under each restaurant-profile.
Or maybe the canonical tag can be used?
I look forward to your input.
Cheers, Christian
-
Hi Everett,
Thanks for the reply, and choices.
We just implemented choice #2, so Im pleased that you think it's a good solution.
Thanks a bunch and have a great weekend .
Cheers, Christian
-
Hello Christian,
You have three choices here, listed in order of preference in my opinion:
#1 View All Canonical - This may not be a good choice for you, however, because there are hundreds of pages of reviews, which may take too long to load on a single page.
#2 Rel Canonical - You are actually already doing this:
#3 Rel Next Prev - This "would" be the ideal fix, but Google doesn't seem to treat it the way they're supposed to.
In summary: I think you're fine the way things are. No need to be concerned here.
-
Hi Guys,
I would still love an answer to this!?
Have a great day.
Cheers, Christian
-
Hi Federico, thanks for the reply.
I will defiantly look into this schema.org that you mention.
Regarding the "Changing URL". The URL stays the same, but the content (reviews) keeps changing URL, as more reviews comes in.
Example:
A new review comes in, let's call it “MOZ”. This gets automatically published as newest review, on the first review page:
http://www.r2n.dk/restaurant-anmeldelserAs more new reviews come in, "MOZ" will automatically be "pushed down" to page 2.
http://www.r2n.dk/restaurant-anmeldelser?restaurant=-1&sort=0&page=2As more come in “MOZ” will be pushed to page 3, and so on, and so on.
So the URL for the “MOZ” review will keep changing, and I’m worried that this solution causes indexing problems!?
Regards, Christian
-
Christian,
I just checked your site and I wasn't able to find those URLs that are constantly changing as you mention.
If you are referring to the restaurant order presented on the page changes, that's nothing to worry about. You are just sorting the restaurants by the last review, right?
I wouldn't recommend noindexing the review section, in fact, I would recommend you further improve it using schema.org markup to tell Google what those "comments" are. You can even use GWT Data Highlighter to markup your data automatically (although only Google will know the markup if you use that method).
About the duplicate content, there's also nothing to worry about, as in the homepage you present only 1 review while in the restaurant page you show all of them, so the content is different.
You should also use schema.org markup to let crawlers know business locations, details, etc and what you are actually listing.
Hope that helps!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
PDF Instructions come up in Crawl report as Duplicate Content
Hello, My ecommerce site has many PDF instruction pages that are being marked as duplicate content in the site crawl. Each page has a different title, and then a PDF displayed in an iframe with a link back to the previous page & to the category that the product is placed in. Should I add text to the pages to help differentiate them? I included a screenshot of the code that is on all the pages. Thanks! Justin 9tD9HMr
On-Page Optimization | | JustinBSLW0 -
Magento Duplicate Content Question - HELP!
In Magento, when entering product information, does the short description have to be different than the meta description? If they are both the same is this considered duplicate content? Thanks for the help!!!
On-Page Optimization | | LeapOfBelief0 -
Similar URLs
I'm making a site of LSAT explanations. The content is very meaningful for LSAT students. I'm less sure the urls and headings are meaningful for Google. I'll give you an example. Here are two URLs and heading for two separate pages: http://lsathacks.com/explanations/lsat-69/logical-reasoning-1/q-10/ - LSAT 69, Logical Reasoning I, Q 10 http://lsathacks.com/explanations/lsat-69/logical-reasoning-2/q10/ - LSAT 69, Logical Reasoning II, Q10 There are two logical reasoning sections on LSAT 69. For the first url is for question 10 from section 1, the second URL is for question 10 from the second LR section. I noticed that google.com only displays 23 urls when I search "site:http://lsathacks.com". A couple of days ago it displayed over 120 (i.e. the entire site). 1. Am I hurting myself with this structure, even if it makes sense for users? 2. What could I do to avoid it? I'll eventually have thousands of pages of explanations. They'll all be very similar in terms of how I would categorize them to a human, e.g. "LSAT 52, logic games question 12" I should note that the content of each page is very different. But url, title and h1 is similar. Edit: I could, for example, add a random keyword to differentiate titles and urls (but not H1). For example: http://lsathacks.com/explanations/lsat-69/logical-reasoning-2/q10-car-efficiency/ LSAT 69, Logical Reasoning I, Q 10, Car efficiency But the url is already fairly long as is. Would that be a good idea?
On-Page Optimization | | graemeblake0 -
Duplicate content "/"
Hi all, Ran my website through the SEOMOZ campaigns and the crawl diagnostics give me a duplicate error for these urls http://www.mysite.com/cat1/article http://www.mysite.com/cat1/article/ so the url with the "/" is a duplicate of the one without the "/" Can someone point me out to a solution to solve this ? regards, Frederik
On-Page Optimization | | frdrik1230 -
Recommendation: Add a canonical URL tag referencing this URL to the header of the page.
Please clarify: In the page optimization tool, seomoz recommends using the canonical url tag on the unique page itself. Is it the same canonical url tag used when want juice to go to the original page? Although the canonical URL tag is generally thought of as a way to solve duplicate content problems, it can be extremely wise to use it on every (unique) page of a site to help prevent any query strings, session IDs, scraped versions, licensing deals or future developments to potentially create a secondary version and pull link juice or other metrics away from the original. We believe the canonical URL tag is a best practice to help prevent future problems, even if nothing is specifically duplicate/problematic today. Please give example.
On-Page Optimization | | AllIsWell0 -
Summarize your question.Images being seen as duplicate content/pages
My images suddenly are appearing in my crawl reports as duplicate content, without meta tags, this happened over night and cant figure out why.
On-Page Optimization | | RBYoung0 -
Duplicate content
crawler shows following links as duplicate http://www.mysite.com http://mysite.com http://www.mysite.com/ http://mysite.com. http://mysite.com/index.html How can i solve this issue?
On-Page Optimization | | bhanu22170