Is it Panda?, how to deal with AP etc newswire articles
-
A site I have lost 30% of its traffic in June then another 10% in July, is it Panda?
The site has 10's of thousands of AP or other syndicated articles on it, they are not there for SE benefits, they are categorized and relevant to the people who read them, the site gets half of its traffic from type ins/bookmarks.
Should I nofollow the articles or rel="canonical" them? what can help......
Cheers
-
Thanks again, I guess I will have to look through keywords and see what traffic these news pages are still getting from google, then weigh up whether to tag them.
-
Panda isn't a penalty per se. It is a algorithmic change to how Google ranks sites and pages. If your site has duplicated content on it, you will need to fix all of it. Once your site has been cleaned up, it will can take a month or more for Google to fully re-index your entire site and see all of the duplicated content gone or properly handled (i.e. noindex or canonicalized).
It's not as if you have 1% of duplicated content that your site is affected but no one knows for sure what exact percentage triggers this effect, so your best course of action is to clean it all up.
By using the canonical tag, these pages will be removed from the index for your site. The "harm" would be that if someone searches for the pages your site wont be listed unless you have relevant comments for the search query.
-
Thanks, is there any way that I could trial this on the site by just adding the tag to a few pages or sections? Is it domain level metrics google is using, they have decided that the site is junk now as it has so much duplicated content?
The articles are slightly changed and there are comments on them.
What harm could I cause by trialing the canonical tag? if I took it of there later could there be some recovery time?
thanks a lot
-
If this content is merely duplications of articles which exist elsewhere then yes, you can add the canonical tag pointing to the source.
You would definitely not want to "nofollow" these pages. By adding a nofollow tag you are telling search engines not to flow page rank to the other links they find on the page. That is not the result you desire.
You could noindex the pages as well. Prior to doing such I would ask if you are offering comments or other user generated comments. If you are not, then the noindex tag is fine. If you do offer UGC, then I would recommend the canonical tag.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should summary pages have the rel canonical set to the full article?
My site has tons of summary pages, Whether for a PDF download, a landing page or for an article. There is a summary page, that explains the asset and contains a link to the actual asset. My question is that if the summary page is just summary of an article with a "click here to read full article" button, Should I set the rel canonical on the summary page to go to the full article? Thanks,
Technical SEO | | Autoboof0 -
Blog Page Titles - Page 1, Page 2 etc.
Hi All, I have a couple of crawl errors coming up in MOZ that I am trying to fix. They are duplicate page title issues with my blog area. For example we have a URL of www.ourwebsite.com/blog/page/1 and as we have quite a few blog posts they get put onto another page, example www.ourwebsite.com/blog/page/2 both of these urls have the same heading, title, meta description etc. I was just wondering if this was an actual SEO problem or not and if there is a way to fix it. I am using Wordpress for reference but I can't see anywhere to access the settings of these pages. Thanks
Technical SEO | | O2C0 -
Our Panda Content Audit Process
We've put together this process over the past year that has shown success when it comes to sites that appear to be hit by Panda. The idea was to put together a process that would allow us to give our clients an understanding of the problem at hand and metrics we can use to explain how recovery is going. Would love to hear your opinion or if you have a different/similar strategy.
Technical SEO | | eyeflow1 -
Dealing with closely related pages
I have a book with 8 pages which I offer free on my site: http://www.pottytrainingchart4kids.com/free-potty-training-book/ For technical reasons each of the 8 pages are on a seperate page. This might cause thin content/duplicate content since most of the code is the same besides the images and there isn't much on each page. How would you suggest I deal with this? I remember once reading about rel prev or something like that but I am not sure if it is applicable. I would like all page rank to go to the main page. Should I add no index to the other pages? I am not really sure what I should do to prevent a Panda penalty. Thanks in advance!
Technical SEO | | JillB20130 -
Syndicated content outranks my original article
I have a small site and write original blog content for my small audience. There is a much larger, highly relevant site that is willing to accept guest blogs and they don't require original content. It is one of the largest sites within my niche and many potential customers of mine are there. When I create a new article I first post to my blog, and then share it with G+, twitter, FB, linkedin. I wait a day. By this time G has seen the links that point to my article and has indexed it. Then I post a copy of the article on the much larger site. I have a rel=author tag within the article but the larger site adds "nofollow" to that tag. I have tried putting a link rel=canonical tag in the article but the larger site strips that tag out. So G sees a copy of my content on this larger site. I'm hoping they realize it was posted a day later than the original version on my blog. But if not will my blog get labeled as a scraper? Second: when I Google the exact blog title I see my article on the larger site shows up as the #1 search result but (1) there is no rich snippet with my author creds (maybe because the author tag was marked nofollow?), and (2) the original version of the article from my blog is not in the results (I'm guessing it was stripped out as duplicate). There are benefits for my article being on the larger site, since many of my potential customers are there and the article does include a link back to my site (the link is nofollow). But I'm wondering if (1) I can fix things so my original article shows up in the search results, or (2) am I hurting myself with this strategy (having G possibly label me a scraper)? I do rank for other phrases in G, so I know my site hasn't had a wholesale penalty of some kind.
Technical SEO | | scanlin0 -
A possible hit by panda
Hi there, on the panda update a few weeks ago i noticed i dropped 8 places for a very competitive keyword. All my content in the site is unique and quality. Its also not published elsewhere. I think the update did knock this one keyword down for me - no others got hti though. Perhaps the landing page got it? too many mentions of the keyword? however i didn't think i spammed it on the page, perhaps jsut mentioned once or twice - any ideas? Any help aprpeciated 🙂
Technical SEO | | pauledwards0 -
Large Scale Ecommerce. How To Deal With Duplicate Content
Hi, One of our clients has a store with over 30,000 indexed pages but less then 10,000 individual products and make a few hundred static pages. Ive crawled the site in Xenu (it took 12 hours!) and found it to by a complex mess caused by years of hack add ons which has caused duplicate pages, and weird dynamic parameters being indexed The inbound link structure is diversified over duplicate pages, PDFS, images so I need to be careful in treating everything correctly. I can likely identify & segment blocks of 'thousands' of URLs and parameters which need to be blocked, Im just not entirely sure the best method. Dynamic Parameters I can see the option in GWT to block these - is it that simple? (do I need to ensure they are deinxeded and 301d? Duplicate Pages Would the best approach be to mass 301 these pages and then apply a no-index tag and wait for it to be crawled? Thanks for your help.
Technical SEO | | LukeyJamo0 -
Dealing with indexable Ajax
Hello there, My site is basically an Ajax application. We assume lots of people link into deep pages on the site, but bots won't be able to read past the hashmarks, meaning all links appear to go to our home page. So, we have decided to form our Ajax for indexing. And so many questions remain. First, only Google handles indexable Ajax, so we need to keep our static "SEO" pages up for Bing and Yahoo. Bummer, dude, more to manage. 1. How do others deal with the differences here? 2. If we have indexable Ajax and static pages, can these be perceived as duplicate content? Maybe the answer is to disallow google bot from indexing the static pages we made. 3. What does your canonical URL become? Can you tell different search engines to read different canonical URLs? So many more questions, but I'll stop there. Curious if anyone here has thoughts (or experience) on the matter. Erin
Technical SEO | | ErinTM2