Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings are inherently the same content, because in many cases the same exact information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content which is going to be the same for about 100 pages, we were wondering if its better to have an image in place instead of duplicate text content (this would basically be an image of the text in question). We know this is a problem, because this is inherently duplicate content as well (only its a duplicate image instead of duplicate text). However, what's the best solution to this problem, and is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index image-embedded text on a webpage (currently only .pdf documents)
If you want a little more insurance, which you won't really need, use your handy robot.txt or rel="canonical"
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use rel=canonical tag on duplicate content pages. Google can read text embedded as an image through OCR algorithm. So duplicate image is not a good option. Moreover think how these images will increase the load time of the web pages.
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable for search engines: flash, iframe and images.
As far as good ideas, I much prefer to offer real content which is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about their universities by name, awards they have won, landmarks and other items of interest such as their incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy as clients will often mention specifics about where they are from "John from Miami, FL" and where they visited.
In short, you will achieve better results if you work within Google's system then by trying to work around it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content
Dear community, We have 15 product specific landing pages. They all share a block called "Why invest in VanEck ETFs?", see e.g., https://www.vaneck.com/de/en/mining-etf https://www.vaneck.com/de/en/space-etf/ https://www.vaneck.com/de/en/esports-etf/
Content Development | | marketing-europe
Can this lead to SEO penalization because of duplicate content?0 -
Content Curation & Duplicate Content
Hi, I have a client that wants to do content curation but it has been my understanding that adding external content that is already live on another website to your website, you get penalized for duplicate content. I have read that you can create an excerpt and then Google won't penalizes you for duplicate content. Can anyone shed more light on this topic. Thanks
Content Development | | M_80 -
Duplicate Legal Content
Oftentimes lawyer websites will publish laws (codes, statutes, regulations, case law, etc). They add no value to the text, it's just copy pasted. Therefore, the same text/content may be on potentially hundreds of websites. Does google interpret this as duplicate content, or does it recognize government content as special? I want to have the laws on my website as well, however I am debating whether to add no follow tags or not. Or I'm thinking about adding value to the content by breaking down the specific law. However, even then at least 50% of the content on the page will still be the law, and I'm not sure if that is enough to be considered duplicate content.
Content Development | | irnikij0 -
Will our two retail sites get hit with duplicate content?
Our retail site just rolled out a second online store. The URL is new and it is showing some of the same products from the same vendors (probably about 40% of the fist store is in the second store). Down the road, we will remove the products from the first site, however, we are keeping it for now. The products show up on both sites, with the same images, and the same descriptions and almost the same URL query string. Are we going to get hit with any penalties due to duplicate content?
Content Development | | klmarketing0 -
Does Google penalize for duplicate blog posts?
Occasionally, I get asked by another blogger if they can repost (in full) one of our blog posts on their blog as a guest post. I've always been under the impression that Google penalizes this type of behavior, but I haven't seen any evidence. Is this true?
Content Development | | Event360300 -
301 Redirect & Duplicate Content
We currently have 16465 audiobook products presented at our Web store. 5411 of them are out-of-publication (OOP). Here's an example: Harry Potter Audiobook 2 : Harry Potter and the Chamber of Secrets - J.K. Rowling - cassette audiobook Many of the 5411 OOP products are duplicates and triplicates of one title but were offered on a different medium (cassette, CD or MP3 CD) or were a different type (abridged, unabridged, dramatized). The description (story-line) is the same for all. Because we know once a page gets on the Internet, it can live there for years, we decided to keep OOP product pages at our Web store to: Let those who may have searched for the product and clicked on a link to an OOP product's page that it was no longer available. Invite them to explore our Web store. Let them know that although the product may not be available on cassette, CD or MP3 CD, that it might be available as a digital download. We know that Google does NOT like duplicate content from one site to another and even within the same site. If we redirect all the 5411 pages to one OOP page, will this eliminate this duplicate content issue? The OOP page would explain that the title they were looking for is no longer available but that it might be available as a digital download.
Content Development | | lbohen0 -
How to titling images in WP blog
What is the best way to title an image in a blog post. The wording will relate to the post discussion so I am not discussing the word stuffing, rather how to enter the words. Here are my options for the title: 1. dayton engagement photos 2. dayton_engagement_photos 3. daytonengagementphotos 4. Is there another preferred method? Should I increment the image title for each image such as: daytonengagementphotos1, daytonengagementphotos2, etc. What about the alternative text area? Does the same concept apply there? Thanks hR3ua.jpg
Content Development | | maximphotostudio0 -
Duplicate content?
I am not understanding this - I see a duplicate content warning. When I look into it I see these two urls: http;//search-engine-upgrade.com http;//search-engine-upgrade.com/default.asp (NOT a blog)
Content Development | | dcmike0