How to make Google not index quotes from other sites?
-
Hey guys,
I have a site where we post quite a lot of info from other sites. We don't want Google to de-index our pages because parts of them are quotes from other sites. What would you use so that Google sees a passage is a quote from another site? Or to just make Google not index the quote?
Thanks!
-
We're using vBulletin, and we really do want the rest of the page to get indexed. I'll just link back to them then. Thanks to both of you!
-
Hi Gianluca,
Yes, I did. Thanks for pointing it out.
-
If the quotes are only a small part of the page content, I would not worry too much about possible problems, as long as you always cite the source of the quote.
If the quote makes up most of the page content but you still want the page to be indexed, then a canonical tag pointing to the original source is not the solution, because the search engine will filter out your page and show only the source. In that case I think a link back to the source could be enough.
There could be an alternative, but it is just an idea, as I don't know if it would work in your case: use a schema (microdata) to better specify the source to the search engines: http://schema.org/Article and http://schema.org/BlogPosting
Finally, if you don't want the quoting pages to appear in search results at all, simply make them non-indexable with noindex,follow, or use the canonical tag with the source URL in it.
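To make the microdata idea above a bit more concrete, here is a rough sketch of how a post with a quoted passage could be marked up. It is only an illustration: the URLs, headline, and text are placeholders, and the choice of the schema.org "citation" property together with the HTML blockquote "cite" attribute is my assumption about a reasonable way to name the source. There is no guarantee that Google will treat the passage differently because of this markup.

```html
<!-- Hypothetical post: all URLs, titles, and text below are placeholders. -->
<article itemscope itemtype="http://schema.org/BlogPosting">
  <h1 itemprop="headline">Our commentary on the quoted piece</h1>
  <p itemprop="articleBody">Our own original commentary goes here.</p>

  <!-- The quoted passage: the cite attribute names the source URL. -->
  <blockquote cite="http://www.original-site.com/original-article">
    <p>Text quoted from the other site.</p>
  </blockquote>

  <!-- A visible link back to the source, also exposed as a schema.org citation. -->
  <p>
    Source:
    <a itemprop="citation" href="http://www.original-site.com/original-article">Original article title</a>
  </p>
</article>
```

The visible link back is the part the advice above actually relies on; the microdata just restates the same source in machine-readable form.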
-
1. Add a nofollow tag to the head of the page so that it doesn't get indexed
<meta name="robots" content="noindex, follow" />
I think you meant: Add a noindex tag..., right?
-
Hi,
I would say that you have a couple of options:
1. Add a nofollow tag to the head of the page so that it doesn't get indexed
<meta name="robots" content="noindex, follow" />
2. Add a canonical link to the head of the page pointing back to the original content
<link href="http://www.original-site.com" rel="canonical">
I would try to go for the 2nd option if possible, but it can be difficult to implement if you are using a CMS. Also make sure that you don't have more quoted content on your site than original content.