Black Hat or Bulletproof?
-
I run a blog and a e-commerce website. Their not connected but their about the same thing. I want to put my blog articles onto my website (just a couple not every last one) but I'm afraid of the duplicate content issues.
Can I take an image of a blog post, make it a PDF, and put it under a category of my e-commerce site which is helping users with useful content.
This sounds like a great idea that Google wouldn't be able to tell the difference, in fact Google would like it and see it as a useful document.
To me this seems to good to be true, perhaps a form of black hat.
So my question is, is it black hat? Could I ever get penalized for doing this?
-
Just a quick note -- I've seen Google index PDFs that were scanned images of a cut-and-paste newsletter from the 1980s with a variety of different fonts. This is not a guaranteed way to keep Google out, and images will also make your files much bigger than just text.
-
I don't think so, but it will help keep you from dropping. You are doing it for your users and that is great I just worry if that would not be obvious to Google - that's all.
-
That is what I thought, I was hoping it wouldn't be considered a bad thing to do though. Oh well it is still useful for customers. So making these canonical will not boost my overall website ranking in the least bit?
-
It Takes about 2 minutes per post. Print Screen, Crop, Use Acrobat to make the PDF, upload to site, & write a quick paragraph.
-
Wouldn't you still need some supplemental text to go along with the pdf to explain why a visitor should download it? Seems like a lot of extra work converting blogs into pdfs, uploading them, and extra writing work. A link back makes more sense to me.
-
I wouldn't do that. It would work, but in case your site was ever manually looked at for any reason and that was noticed, that could look like an attempt to manipulate search results and you could get hit. I would just put it on as text and either noindex the page in your robots.txt file or do as Raymond and Nakul suggest and set up a canonical tag. In my very humble opinion I think the safest thing would just be to block bots from the page but the canonical isn't a bad suggestion at all.
-
Seeing how it would be an image and google's crawlers cant crawl the text in that image does it still need to be no follow or canonical?
-
Is your blog blog.yourdomain.com or yourdomain.com/blog/ or yourblogdomain.com ? As Raymond recommended, I would suggest doing a cross domain canonical and you should be good. I hope this helps.
-
Or you can link back to the original article with a rel="cannonical" or if you want to be 100% sure just make it rel="nofollow".
-
You want to add the content to help your users, right? You aren't trying to get it indexed, correct? Just noindex those pages...
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Not Picking Up Posts
I am trying to work out why from March 4th Google is not seeing my posts. Our google impressions have dropped from 8,000 to 40. If you put in the full article name with speach marks it does not find it, and instead shows the home page in google. We have not had any warnings. We did have work done on our site but nothing else i could think of to cause this. Can anyone let me know what may have caused this. All articles are original
Technical SEO | | headlinesplus0 -
Surge in spammy links
Hi, Our website www.foodjet.com has recently seen a huge amount of spammy incoming links to non-exisiting URLS:
Technical SEO | | FoodJEtThey all target pages that lead to a 404 and which clearly do not exist on our website. Since they have started to appear our DA has plummeted. I have already disavowed some domains, but more re-appear just as fast. I have also checked if our site has been hacked, which does not seem to be the case. What am I missing? And/or what can I do?
0 -
FAQ Schema Markup
I was wondering which blog posts would qualify for an FAQ Schema markup. For instance, we have a blog post which is more like a Q&A interview with our customer where our product gets mentioned several times. Would we get dinged for including our product name in the markup? First of all does that kind of blog posts even qualify for the markup? Example of the blog post: https://www.revulytics.com/blog/qa-techsmith-snagit-strategy-lead-daniel-foster
Technical SEO | | revulytics0 -
Internal link structure for my loan website
Hi folks. I own a Norwegian consumer loan/financing website, which has been monetized with links. I've created various silos for my content, according to what I believe is most relevant to the user.
Technical SEO | | llevy
However, as a result each article now has a sidebar list, which in turn links to all other articles within the same category (silo). As you can see here, it has about 30 links in the sidebar: forbrukslån.no/beste-lån. With 30 articles in a silo, that corresponds to over 900 internal links, in just one silo alone. I wonder if this could be hurting me SEO wise? I know G cares a lot about relevance and user experience. So I have a feeling it could be interpreted as spammy. Reason I did this in the first place, is that the header links are also being repeated on all pages, without any issue. T4FHxHw0 -
Google will index us, but Bing won't. Why?
Bing is crawling our site, but not indexing it, and we cannot figure out why -- plus it's being indexed fine in Google. Any ideas on what the issue with Bing might be? Here's are some details to let you know what we've already checked/established: We have 4 301’s and the rest of our site checks out We’ve already established our Robots is ok, and that we are fixing our site map/it's in fine shape We do not see anything blocking bingbot access to the site There is no varnish or any load balancers, so nothing on that end that would be blocking the access We also don't see any rules in the apache or the .htaccess config that would be blocking the access
Technical SEO | | Alex_RevelInteractive0 -
Black listed or not, struggling on this one.
I have a client who said they are black listed and they do not come up for any search query other than their name. I have done what I would expect to find the issues, like hurtful backlinks, poor coding etc however the code is fine, yes backlinks are a little slim. They have also said Penguin hit them hard last year. I am confused with this one as I have worked with clients who got hit by penguin and they improved but this particular client has not. http://www.specialistpaintsonline.co.uk is the website, and if anyone can shed some light as I may be missing something head on. regards
Technical SEO | | Shuffled0 -
DISQUS COMMENTS backlinks-good for seo? YES/NO?
DISQUS COMMENTS backlinks-good for seo? YES/NO? I have just started commenting on "powered by disquus" websites in the Disqus comments box and left a link to my website in the name field! Having googled whether Disqus comments backlinks are any good for seo purposes i have discovered that there is a 50/50 view on the subject with some people saying they are a "goldmine" for getting high PR backlinks and others saying they are a waste of time because googlebot cannot read Java. My own experience of commenting on Disqus powered websites is that wordpress blogs powered by disqus comments ARE INDEXED by GOOGLE and the "BACKLINK IS IN THE SOURCE OF THE PAGE" When i comment on normal websites using the Disqus comment system i have found that my Disqus comments ARE NOT indexed by Google and there IS NO BACKLINK in the page source! Has anybody got any views on whether Disqus comments backlinks are any good?
Technical SEO | | Freebetsuk2 -
What tool do you use to check for URLs not indexed?
What is your favorite tool for getting a report of URLs that are not cached/indexed in Google & Bing for an entire site? Basically I want a list of URLs not cached in Google and a seperate list for Bing. Thanks, Mark
Technical SEO | | elephantseo3