Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings are inherently the same content, because in many cases the same exact information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content which is going to be the same for about 100 pages, we were wondering if its better to have an image in place instead of duplicate text content (this would basically be an image of the text in question). We know this is a problem, because this is inherently duplicate content as well (only its a duplicate image instead of duplicate text). However, what's the best solution to this problem, and is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index image-embedded text on a webpage (currently only .pdf documents)
If you want a little more insurance, which you won't really need, use your handy robot.txt or rel="canonical"
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use rel=canonical tag on duplicate content pages. Google can read text embedded as an image through OCR algorithm. So duplicate image is not a good option. Moreover think how these images will increase the load time of the web pages.
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable for search engines: flash, iframe and images.
As far as good ideas, I much prefer to offer real content which is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about their universities by name, awards they have won, landmarks and other items of interest such as their incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy as clients will often mention specifics about where they are from "John from Miami, FL" and where they visited.
In short, you will achieve better results if you work within Google's system then by trying to work around it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Help with Alt Image
Hi. Moz is alerting me that my blog has no alt image. I actually don't have an image inside the blog but it's in the header, as part of my wordpress theme. Now when I check this image in the header, the alt image keyword is there. I'm not sure how to fix it. Please help.
Content Development | | VAPartners0 -
High qualitty images and mobile
We know that google likes high quality unique images. On the other side google likes fast load times (in particular on mobile). What about showing two different dimensions images on desktop and mobile? Is it good? If yes than any wordpress plugins for that?
Content Development | | kitoks2 -
Can We Publish Duplicate Content on Multi Regional Website / Blogs?
Today, I was reading Google's official article on Multi Regional website and use of duplicate content. Right now, We are working on 4 different blogs for following regions. And, We're writing unique content for each blog. But, I am thinking to use one content / subject for all 4 region blogs. USA: http://www.bannerbuzz.com/blog/ UK: http://www.bannerbuzz.co.uk/blog/ AUS: http://www.bannerbuzz.com.au/blog/ CA: http://www.bannerbuzz.ca/blog/ Let me give you very clear ideas on it. Recently, We have published one article on USA website. http://www.bannerbuzz.com/blog/choosing-the-right-banner-for-your-advertisement/ And, We want to publish this article / blog on UK, AUS & CA blog without making any changes. I have read following paragraph on Google's official guidelines and It's inspire me to make it happen. Which is best solution for it? Websites that provide content for different regions and in different languages sometimes create content that is the same or similar but available on different URLs. This is generally not a problem as long as the content is for different users in different countries. While we strongly recommend that you provide unique content for each different group of users, we understand that this may not always be possible. There is generally no need to "hide" the duplicates by disallowing crawling in a robots.txt file or by using a "noindex" robots meta tag. However, if you're providing the same content to the same users on different URLs (for instance, if both example.de/ and example.com/de/ show German language content for users in Germany), you should pick a preferred version and redirect (or use the rel=canonical link element) appropriately. In addition, you should follow the guidelines on rel-alternate-hreflang to make sure that the correct language or regional URL is served to searchers.
Content Development | | CommercePundit0 -
Does Google penalize for duplicate blog posts?
Occasionally, I get asked by another blogger if they can repost (in full) one of our blog posts on their blog as a guest post. I've always been under the impression that Google penalizes this type of behavior, but I haven't seen any evidence. Is this true?
Content Development | | Event360300 -
301 Redirect & Duplicate Content
We currently have 16465 audiobook products presented at our Web store. 5411 of them are out-of-publication (OOP). Here's an example: Harry Potter Audiobook 2 : Harry Potter and the Chamber of Secrets - J.K. Rowling - cassette audiobook Many of the 5411 OOP products are duplicates and triplicates of one title but were offered on a different medium (cassette, CD or MP3 CD) or were a different type (abridged, unabridged, dramatized). The description (story-line) is the same for all. Because we know once a page gets on the Internet, it can live there for years, we decided to keep OOP product pages at our Web store to: Let those who may have searched for the product and clicked on a link to an OOP product's page that it was no longer available. Invite them to explore our Web store. Let them know that although the product may not be available on cassette, CD or MP3 CD, that it might be available as a digital download. We know that Google does NOT like duplicate content from one site to another and even within the same site. If we redirect all the 5411 pages to one OOP page, will this eliminate this duplicate content issue? The OOP page would explain that the title they were looking for is no longer available but that it might be available as a digital download.
Content Development | | lbohen0 -
Blog Anchor Text SEO Query
Apologies if the following is a basic question but I am just starting out with SEO and I have seen this quite a lot recently on sites I have been on. My question is if you have a self hosted blog e.g. blog.site.com or site.com/blog and you use keyword anchor text in your blog post does that bring SEO benefits in itself or does the blog post have to be shared/commented by readers to have an effect? I have seen/heard of many sites spending a lot of money getting copy done or spending their own time and resources starting a blog but the blogs remain unshared or comment-less. I am starting my own blog to go with my social media and website so I wanted to establish the basics. Thanks in advance guys.
Content Development | | jannkuzel0 -
Blogger - Multiple partial duplicate content and canonical
In Blogger, have at least three pages produced for each post - main post, archive and tag - each has their own canonical tag - are these considered duplicate content by Google? Not sure the best way to handle this.
Content Development | | holdtheonion0 -
Is it considered as duplicate content ?
Hello, I see a lot of errors on my webmaster tools because of this ajax code on my questions pages of the site (screen) : www.dismoicomment.fr The code : | / ADD ANSWER FORM |
Content Development | | elitepronostic
| | $("#answer-add-button").click(function () { |
| | $.ajax({ |
| | type: 'POST', |
| | url: '/answers/quelle-assurance-choisir-pour-un-scooter/', |
| | data: $("form#answer-add").serialize(), |
| | dataType: 'html', |
| | success: function(data) { |
| | |
| | if(data=="answer") { |
| | $('.answer-add-message').show().empty(); |
| | $(document).ready(function() { |
| | $(' Vous avez déjà répondu à cette question. ').appendTo('.answer-add-message'); |
| | }); | I have add a line on my robots.txt : http://www.dismoicomment.fr/robots.txt for remove all urls with /answers/. These urls with /answers/ aren't indexed in google. Do you think that it is dangerous and that can be considered as duplicate content ? 1129546035.png0