Duplicate Forum Content
-
HI everyone,
great to be here, absolutely loving everything, please go easy on me I'm quite a noob when it comes to seo, but hopefully my question isn't too basic.
After running the initial checks on my websites, I found there are 7,646 duplicate pages? Some are easy fixes but the majority are not, being forum pages, the edit, quote and new post links are coming up as duplicates of the main post?
Does anyone know how to fix this?
Best Lee
-
How delightful that you refer to Invisionpower as I recently took the software for a test drive and plan to purchase it in January
A question though that has been troubling me. I use a paid for hosting service, which forwards to one of my sub-domains. The problem is I have nearly 7k pages of duplicate content (all from the forum), I really want to add a no follow and then remove the forum pages from the search index and thus remove all of the duplicate content. Though I'm worried that adding no follow to the sub domain may impact on the main site.
I have a lot of page one search positions, many long tails which combined result in a lot of visits, but I also have a couple of short phrases that get 50% of the visits alone, so I can't risk making any changes, actually I'm pretty terrified to make any changes, just in case all of my website pages are removed from the index.
-
invisionpower forum is good choice, as i knew. Found duplicated content pages here, it happens with most of forums, but there is full provision that this can resolved by canonical or stop crawling one you don't want in robots.txt.
-
Hey
just found some valuable info which has solved a lot of my problems, modified .htaccess which has removed all of the non www links, so thats a part solve.
I think what I'm going to do is purchase the new forum software, setup a robot.txt to not crawl the new forums and then once I have all of the posts transfered to the new forum, remove the old forums from the index.
That should solve it, the new forum is very well established and if need be I can pay one of their developers to incorporate some seo feature that will stop the duplicate content issue. Once thats solved I can remove the robot.txt
Many thanks, Lee
-
Hi, still one more thing, because you sound loss, and that's not the meaning of this Q&A. Contact the forumservice and ask for the canonical, this is a very simple and common thing to do.
-
Ok thanks Leonie,
still no nearer to knowing what to do, I know theres a problem and generally what I need to do, but I'm still at a complete loss as to how to do it
But thank you for trying to help,
Best, Lee
-
Seo stuff is'nt that difficult, though it can be comlicated
Anyway I wish you luck with the forum and hope you'll manage to get it the way you wanted.
Grtz, Leonie
-
Mmmm this is becoming more problematic by the minute, will need to implement robot.txt before removing the urls.
And there was me thinking this seo stuff would be easy.
-
Lol, tha'ts also an option
under url's index, though they have to be removed first, otherwise google crawl them again. but there you can remove complete directories at once.
-
or go somewhere else lol, have been considering buying invisionpower forum for a while, this might be the final push I needed.
Is there a way of mass deleting pages from the index in webmaster tools, i.e. all links that contain edit, quote and new ect?
-
Ah okay, that will be difficult than.
Maybe you can ask them to implement the canonical url. It will be helpful for you to avoid duplicates
-
is a paid forum service, but they give you the ability to add code snippets in the head.
-
Hi Lee,
What kind of CMS are you using?
In wmt under parameters you can configure parameters. If the ur's contain the same parameter it is a posibility.
-
Hi Leonie,
thanks for taking the time to answer
Not sure if that will work, there is only one head (hope that makes sense). Forums work pretty much the same as content management systems. I have the ability to change what's contained in the head, but adding rel="canonical" will have the same effect on all pages, even the duplicate one (I think).
Is there a way of removing all pages from the index (in webmaster tools) that contain the word edit, or quote or new? Kind of like using a wild card?
-
Hi Lee,
I'm not very familiar with forums. but i think you can solve it by putting a canonical in the head.
If you have a main page, which is the post, put a canonical url in it:
the duplicate pages need to have the same canonical.
I hope this work for you.
Grtz, Leonie
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Internal blog with history and some SEO value versus new external blogs with specialized content?
We operate a blog inside a folder on our site and considering the launch of 4 highly focused blogs with specialized content which are now categories on the internal blog. Wondering if there is more value in using the external new blogs or just keep growing the internal blog content. Does fact that the internal blog is buried amongst millions of pages have any impact if we want the content indexed and value given to the links from the blog content to our main site pages.
Content Development | | CondoRich0 -
Removing old content
Ahoy! Variously I have heard the opinion that content which does not generate regular search traffic (let's ballpark it at >10 views in any given month) should be noindexed or even removed. Allegedly this would improve the overall quality of the site, rankings and traffic. I remain doubtful. What would you do if the interest in a given matter goes down over time for any (most) given topics of your content and is replaced by "newer" specific interest? Concrete example: I have a website about (book) reviews. Naturally, there will always be new books; old books are not in the media as much and "forgotten". Nevertheless, the reviews (all unique, based on really having read the books, no trace of the standard back cover copy) are obviously still there. Personally I feel that they do not really lose any value - they are still reviews of that one book, even though it is not the most recent one. So, what would you do: Deindex "older" book reviews after a certain time? Even remove them completely? Just let them run? I am looking forward to your opinions - and even your experience if you have done something like this! Nico
Content Development | | netzkern_AG0 -
Reviving a (very) old blog - is it worth shifting the content onto a new blog?
I look after a few ecommerce sites, one of them doesn't currently have a blog, we are setting up a wordpress blog now for the site. Going way back in time the site did have a blog which was on a separate Typepad domain. What I'm wondering is whether it is worth redirecting this whole blog to the new blog section of the site and copying some of the content over to the new blog as historical posts? I don't think it will be possible to redirect each individual post to a new one so it will just be a straight redirect of the old blog domain to the new one with the same (most of anyway) content. Do you think it is worth doing this for the value of this content which is relevant but dated (many of the links are now expired)? Doing this will take some time to do so it's not 'free' content we'd be getting We have a lot of new content planned out so we won't be short of content, just would be nice to have some historical content on there too Thanks
Content Development | | PeterLeatherland0 -
One Page Website Blog Content Question
Hi guys, I'm new to the art of SEO and am learning every day from all the fantastic content here, I have a question that I can't find an answer to, hope it doesn't stump you like it has me... I have a one page website (www.neilwilliamsvoiceover.com) that I need to put more content on for SEO purposes but needs to be kept as one page. I've set-up a blog via blogger, and have that on the website but it's in iframe, which I've now discovered is ignored by search engines. So, my question is, is there a way to pull my blog feed into the website and have it recognised by search engines as content for the website? Would I use an RSS feed or feed burner or something else completely?! Thanks for your time and help in advance.
Content Development | | BamMK0 -
What is your strategy in looking for content to write relative to your niche?
looking to keep adding to our blog in a big way. Things I use are questions we get a lot, we add them to blog and answer them - works quite nicely. Look at other blogs although in our industry its not really there, etc. What are some of your strategies for looking for content to write about?
Content Development | | PaulDylan1 -
Same content on site blog as a separate blog. Will unpublishing on one blog evade duplicate content issues?
I just discovered my client was posting the same content as the site I'm working on for him on a separate blog. I don't want to run into duplicate content issues. Both are Wordpress sites. Will it suffice to simply unpublish duplicate entries on the other blog and leave the posts as drafts?
Content Development | | locallyrank0 -
Handling duplicate content in Blogs
Many wordpress themes like mine have a homepage where the last 3 to 4 posts are displayed on the frontpage. Each post also has its own url where the post are shown seperately. How do I avoid beeing seen as duplicate content by Google?
Content Development | | wellnesswooz0 -
What is the best way to get around duplicate content when you are advertising exactly the same content on two different sites?
I am currently trying to improve exposure for an online degrees website but the content for the degree program pages is exactly the same as the company's main website. What would you suggest for getting around the duplicate content issue as a lot of the curriculum content will obviously be the same for each module, etc? Thanks
Content Development | | BeattieGroup0