Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings are inherently the same content, because in many cases the same exact information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content which is going to be the same for about 100 pages, we were wondering if its better to have an image in place instead of duplicate text content (this would basically be an image of the text in question). We know this is a problem, because this is inherently duplicate content as well (only its a duplicate image instead of duplicate text). However, what's the best solution to this problem, and is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index image-embedded text on a webpage (currently only .pdf documents)
If you want a little more insurance, which you won't really need, use your handy robot.txt or rel="canonical"
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use rel=canonical tag on duplicate content pages. Google can read text embedded as an image through OCR algorithm. So duplicate image is not a good option. Moreover think how these images will increase the load time of the web pages.
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable for search engines: flash, iframe and images.
As far as good ideas, I much prefer to offer real content which is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about their universities by name, awards they have won, landmarks and other items of interest such as their incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy as clients will often mention specifics about where they are from "John from Miami, FL" and where they visited.
In short, you will achieve better results if you work within Google's system then by trying to work around it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Index Issue
2 months ago, I registered a domain named www.nextheadphone.com I had a plan to learn SEO and create a affiliate blog site. In my website I had 3 types of content. Informative Articles Headphone Review articles Product Comparision Review articles Problem is, Google does not index my informative articles. I dont know the reasons. https://www.nextheadphone.com/benefits-of-noise-cancelling-headphones/
Content Development | | NextHeadphone
https://www.nextheadphone.com/noise-cancelling-headphones-protect-hearing/ Is there anyone who can take a look and find the issues why google is not indexing my articles? I will be waiting for your reply0 -
Would a lot of images on one post be categorized as thin content?
As an example, if i write an article on 12 best print ads by BMW, it will have 12 images and possible 12 single liners and a paragraph. The images will have the necessary alt tags. But overall, will this post be counted as low content and is there changes of being penalized by google for it?
Content Development | | marketing910 -
Duplicated Terms and Conditions?
Does Google penalise duplicated terms and conditions. I know solicitors use the same legal terms and phrases all the time... Thanks
Content Development | | GaryVictory0 -
How much duplicate content counts as duplicate content to Google?
Hi everyone! I've had a look through some duplicate content posts and I can't see the answer to this query, so I thought I'd ask in case someone could help. I've been looking at a website that competes with the site that I work on. They have profile pages containing content that has been copied and pasted straight from the suppliers' websites. Their pages have all their own code framing the content, which is diluting the concentration of duplicate copy. How much duplicate content can a page have before it gets penalised or ignored by Google? Any suggestions very gratefully received 🐵
Content Development | | ceecee0 -
Google adsense image - text and alternatives
Hi, there has been a lot of talk on the internet about google adsense and which is better, image or text as well as what are the best alternatives to google adsense, so i thought i would throw out the topic on here and see what real website experts think about the topic. I have experienced google adsense image as better than text and would like to know what other people think and beside affliate ads what alternatives to google adsense do people like
Content Development | | ClaireH-1848860 -
301 Redirect & Duplicate Content
We currently have 16465 audiobook products presented at our Web store. 5411 of them are out-of-publication (OOP). Here's an example: Harry Potter Audiobook 2 : Harry Potter and the Chamber of Secrets - J.K. Rowling - cassette audiobook Many of the 5411 OOP products are duplicates and triplicates of one title but were offered on a different medium (cassette, CD or MP3 CD) or were a different type (abridged, unabridged, dramatized). The description (story-line) is the same for all. Because we know once a page gets on the Internet, it can live there for years, we decided to keep OOP product pages at our Web store to: Let those who may have searched for the product and clicked on a link to an OOP product's page that it was no longer available. Invite them to explore our Web store. Let them know that although the product may not be available on cassette, CD or MP3 CD, that it might be available as a digital download. We know that Google does NOT like duplicate content from one site to another and even within the same site. If we redirect all the 5411 pages to one OOP page, will this eliminate this duplicate content issue? The OOP page would explain that the title they were looking for is no longer available but that it might be available as a digital download.
Content Development | | lbohen0 -
Duplicate content?
I am not understanding this - I see a duplicate content warning. When I look into it I see these two urls: http;//search-engine-upgrade.com http;//search-engine-upgrade.com/default.asp (NOT a blog)
Content Development | | dcmike0 -
Best way to avoid duplicate content issues here.
I am planning to write an article that refutes some claims made in another article. The original article is a 20 page pdf. What I plan to do is to take quotes from this PDF and then under each quote write my arguments for or against the quote. If I take direct quotes from the article, is Google likely to see this as duplicate content?
Content Development | | MarieHaynes0