Duplicate Forum Content
-
Hi everyone,
Great to be here, absolutely loving everything. Please go easy on me, I'm quite a noob when it comes to SEO, but hopefully my question isn't too basic.
After running the initial checks on my websites, I found there are 7,646 duplicate pages. Some are easy fixes, but the majority are not: they are forum pages, where the edit, quote and new post links are coming up as duplicates of the main post.
Does anyone know how to fix this?
Best, Lee
-
How delightful that you refer to Invisionpower, as I recently took the software for a test drive and plan to purchase it in January.
One question has been troubling me, though. I use a paid-for hosting service, which forwards to one of my subdomains. The problem is that I have nearly 7k pages of duplicate content (all from the forum). I really want to add a nofollow, then remove the forum pages from the search index and thus remove all of the duplicate content, but I'm worried that adding nofollow to the subdomain may affect the main site.
I have a lot of page-one search positions: many long tails, which combined result in a lot of visits, but also a couple of short phrases that get 50% of the visits alone. So I can't risk making any changes. Actually, I'm pretty terrified to make any changes, just in case all of my website pages are removed from the index.
-
Invisionpower forum is a good choice, as far as I know. I found duplicated content pages here too; it happens with most forums, but it can be resolved with a canonical, or by blocking crawling of the pages you don't want in robots.txt.
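For anyone wanting to try the robots.txt route, a draft can be checked locally before it goes live. Below is a minimal Python sketch using the standard library's `urllib.robotparser`. The `/forum/edit/`, `/forum/quote/` and `/forum/newpost/` paths and the `forum.example.com` host are assumptions made up for illustration; the forum's real URL layout isn't given in this thread, so substitute whatever the duplicate URLs actually look like. This parser only does plain prefix matching, so the draft sticks to path prefixes.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical draft rules: these paths are placeholders, not the
# real forum's URL layout.
DRAFT_ROBOTS_TXT = """\
User-agent: *
Disallow: /forum/edit/
Disallow: /forum/quote/
Disallow: /forum/newpost/
"""


def load_rules(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_crawlable(parser: RobotFileParser, url: str, agent: str = "*") -> bool:
    """Return True if the parsed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


rules = load_rules(DRAFT_ROBOTS_TXT)
for url in (
    "https://forum.example.com/forum/topic/123",  # the main post (assumed path)
    "https://forum.example.com/forum/edit/123",   # assumed duplicate
    "https://forum.example.com/forum/quote/123",  # assumed duplicate
):
    status = "crawlable" if is_crawlable(rules, url) else "blocked"
    print(url, "->", status)
```

Running it against a handful of real URLs from the duplicate report is a cheap way to confirm the rules block only the action pages and leave the posts themselves crawlable.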
-
Hey
Just found some valuable info which has solved a lot of my problems: I modified .htaccess, which has removed all of the non-www links, so that's part of it solved.
I think what I'm going to do is purchase the new forum software, set up a robots.txt to not crawl the new forums and then, once I have all of the posts transferred to the new forum, remove the old forums from the index.
That should solve it. The new forum is very well established, and if need be I can pay one of their developers to incorporate some SEO feature that will stop the duplicate content issue. Once that's solved I can remove the robots.txt.
Many thanks, Lee
-
Hi, still one more thing, because you sound lost, and that's not the point of this Q&A. Contact the forum service and ask for the canonical; this is a very simple and common thing to do.
-
Ok thanks Leonie,
Still no nearer to knowing what to do. I know there's a problem and generally what I need to do, but I'm still at a complete loss as to how to do it.
But thank you for trying to help.
Best, Lee
-
SEO stuff isn't that difficult, though it can be complicated.
Anyway I wish you luck with the forum and hope you'll manage to get it the way you wanted.
Grtz, Leonie
-
Mmmm, this is becoming more problematic by the minute; I will need to implement robots.txt before removing the URLs.
And there was me thinking this SEO stuff would be easy.
-
Lol, that's also an option.
It's under the URLs index, though the pages have to be removed first, otherwise Google will crawl them again. But there you can remove complete directories at once.
-
Or go somewhere else, lol. I have been considering buying the Invisionpower forum for a while; this might be the final push I needed.
Is there a way of mass deleting pages from the index in Webmaster Tools, i.e. all links that contain edit, quote, new, etc.?
-
Ah okay, that will be difficult then.
Maybe you can ask them to implement the canonical URL. It will help you avoid duplicates.
-
It is a paid forum service, but they give you the ability to add code snippets in the head.
-
Hi Lee,
What kind of CMS are you using?
In WMT, under parameters, you can configure URL parameters. If the URLs contain the same parameter, that is a possibility.
-
Hi Leonie,
Thanks for taking the time to answer.
Not sure if that will work, as there is only one head (hope that makes sense). Forums work pretty much the same as content management systems. I have the ability to change what's contained in the head, but adding rel="canonical" will have the same effect on all pages, even the duplicate ones (I think).
Is there a way of removing all pages from the index (in webmaster tools) that contain the word edit, or quote or new? Kind of like using a wild card?
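Whether Webmaster Tools accepts a wildcard like that isn't settled in this thread, but it is easy to see locally which URLs such a pattern would cover by filtering an exported URL list. A rough Python sketch follows; the sample URLs and the `do=` parameter are invented for illustration, since the forum's actual URL format isn't shown here.

```python
import re
from urllib.parse import urlsplit

# Action words taken from the question; how they appear in the real
# URLs is an assumption.
ACTION_WORDS = {"edit", "quote", "new"}


def is_action_url(url: str) -> bool:
    """True if any whole token in the path or query is an action word."""
    parts = urlsplit(url)
    text = (parts.path + "?" + parts.query).lower()
    tokens = re.split(r"[^a-z0-9]+", text)
    return any(token in ACTION_WORDS for token in tokens)


# Hypothetical sample of URLs from a duplicate-content report.
urls = [
    "http://forum.example.com/index.php?topic=42",
    "http://forum.example.com/index.php?topic=42&do=edit",
    "http://forum.example.com/index.php?topic=42&do=quote",
    "http://forum.example.com/index.php?do=new&forum=3",
    "http://forum.example.com/news/latest",
]

duplicates = [u for u in urls if is_action_url(u)]
for u in duplicates:
    print(u)
```

Matching whole tokens rather than substrings keeps a page such as `/news/latest` out of the list, which a bare search for "new" would wrongly catch.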
-
Hi Lee,
I'm not very familiar with forums, but I think you can solve it by putting a canonical in the head.
If you have a main page, which is the post, put a canonical URL in it (a link element with rel="canonical" pointing at the post's own URL).
The duplicate pages need to have the same canonical.
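To make the idea concrete: every variant of a post (edit, quote, new post) should emit the same canonical URL as the post itself. Here is a minimal Python sketch of that mapping. It assumes the action is selected by a query parameter named `do` or `action`, which is a guess and not something confirmed for this forum, and whether a hosted forum service lets you compute anything per page is also unknown.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical query parameters that only select an action on a post.
ACTION_PARAMS = {"do", "action"}


def canonical_url(url: str) -> str:
    """Drop action parameters and the fragment so all variants map to one URL."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in ACTION_PARAMS
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url: str) -> str:
    """Build the link element to place in the page head."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}">'


post = "http://www.example.com/forum/index.php?topic=42"
for variant in (post, post + "&do=edit", post + "&do=quote"):
    print(canonical_link_tag(variant))
```

All three variants produce the same tag, which is the point: the canonical is derived from the page being served, so it does not have to be one fixed value shared by the whole forum.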
I hope this works for you.
Grtz, Leonie