PDFs and indexing
-
Hello and good morning.
I work for a paint manufacturing company in the UK on their seo campaigns across a couple of websites, this is my question. as paint and chemicals require data and tech sheets by law, available to be downloadable for said product, should these be included in the sitemap, we auto generate our sitemaps which they include these files, with low priorities and never change in terms of name etc.
they basically have a name of say 092847.pdf for example which cannot be changed, but from an seo view this doesn't mean a thing? so theres my question should they be included and would they carry any value?
-
thank you, I'm not saying they couldn't be changed it would just cause a lot of stress for our labs and tech guys who create these and work by the number. were as having a naming structure things would become a mess and everything up in the air.
I will look into the back end keywords, authors, company name which may give them some sort of impact from what I read on the link above.
-
-
Hi
Sitemaps - yes, include anything in Sitemaps that you want users to be able to find, so the more ways you can lead a Search Engine to it, the better.
Filename - it would help if you could change the filenames to include keywords, but if that's not an option then there are other things you can do to optimise each PDF.
There's a good overview of optimising PDFs here - How To Optimize PDF Documents For Search
As that post mentions, include links back to your site for maximum value, especially if these documents are shared on other websites. Also, a bit of branding within each PDF (just add a logo) could help you out in some way.
Hope that's helpful
-
Case A:
If the content of the PDF's is valuable, if it contains also some text about the product, I would make them indexable. It will make niche searchers find you.
You might want to make a separate sitemap for these PDF's, just to keep things clean.Case B:
If it's only numbers and very technical jibber jabber, I wouldn't let it index, since Google won't understand it either.Update with an interesting story:
A client of mine also had technical PDF sheets online. He has put a lot of effort in that. There were a few (4-5) competitors using direct links to the PDF's. After a while, we referred all that competitor traffic to a special landing page trying to convince why my client is a better deal. It's still online on some of the sites, since some competitors never really checked the PDF's.
Made my client very happy. -
Hey there
I can't imagine them having any SEO value, but I can't see the PDFs doing any harm either.
PDFs are crawlable and indexable by the search engines, so I would want to keep it in your sitemap for the user. I'm quite familiar with your industry (my dad worked with providing paint and chemical coatings) and I can imagine your target audience being quite specific in their searches, looking for products by code and specifications. A PDF would probably be the ideal solution for this and so having it indexed and sitting on your domain could bring in some organic traffic.
I'd make sure that the PDFs are branded if possible containing clear links back to your site, in order to funnel any long-tail traffic back to your homepage and sales pages.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Whats the best way to move 30% of our content behind a paywall and still get indexed without penalties and without letting people see our content before they subscribe.
Hi all - We want to create a membership program so that they can get more great stuff from us and offers, deals, etc. but only if they qualify to be a member via a purchase for example. The question is we want to move only some of our content (c.30%) behind the membership curtain - will be a mix of SEO value content. There are few questions/ concerns I am hoping you the SEO community can help me with: How can i ensure Google continues to index it without getting penalized. If i tell google bot to index but not allow Google and other sites to see the membership content will that create a penalty? Is that considered a form of cloaking? How can i prevent having to reveal 3 pages a day under Google's First Click Free set-up. I suppose i want my cake and eat it and i suspect the answer is well i cant. Any help or insights that can help me make this decision better is gratefully accepted.
Reporting & Analytics | | Adrian-phipps0 -
How often does the indexing take place?
Hi there, I am currently looking at one of my campaigns and have noticed that since i last checked on the 31st Jan there have been no change in analytics despite me having done work on the campaign itself. I saw in the external links section that it said "38 down since last index" and I was just wondering how often the index takes place. Any help with this would be greatly appreciated! Many thanks.
Reporting & Analytics | | VoodooCreativeLtd0 -
"index.htm" for all url's in google analytics
I don't have this issue with other wordpress websites, only this one website, and I don't know what's causing the issue: Google Analytics is adding an "index.htm" to every single page on the website. So it is tracking the pages, I see no errors - is it tracking the right page? When I click on the page link in a report, I naturally go to a "404 page not found" since the website address isn't "www.example.com/rewards/index.htm" - but instead the actual address would be:
Reporting & Analytics | | cceebar
"www.example.com/rewards/". I have navigated to View Settings in GA to insure "default page" is empty. Although adding anything else to this field does not effect the page url in analytics reports either. Could it be htaccess file - or a plugin effecting the htaccess file?_Cindy0 -
How to detect where Google gets indexed URL's
Google index some kind of way some links that create duplicate content. We doesn't understand how these are created so we would like detect where Google robots find these links. We tried: Moz Crawl Diagnostics but it shows 0 as Internal Link Count for these kind of links. Find some information from Google Analytics, that maybe there is trace (site content - all content) from visitors side. There wan't. We tried to find some information in Webmaster Tools under Internal link and HTML Improvements but didn't find any trace. Tried some search commands. Is there maybe some good one to search. TO search URL's form code with https://search.nerdydata.com.
Reporting & Analytics | | raido0 -
Google Webmaster says "0" pages indexed
Built my first Wordpress site. It launched a few months ago. Google has crawled 76 pages so far. But why are 0 indexed?
Reporting & Analytics | | cschwartzel0 -
Why do I have few different index URL addresses?
Yes I know, sorry guys but I also have a problem with duplicate pages. It shows that almost every page of my site has a duplicate content issue and looking at my folders in the server, I don't see all these pages... This is a static Website with no shopping cart or anything fancy. The first on the list is my [index] page and this is giving me a hint about some sort of bad settings on my end with the SEOMOZ crawler??? Please advice and thank you! index-variations.jpg
Reporting & Analytics | | cssyes0 -
Will Bing/Google's engine index a page that has only been on social media?
Will Bing's engine index and rank a page that has only been seen on social media and has no inbound links? Will Google's? Are inbound links absolutely required to get a page indexed and ranking and getting traffic? If unknown, how would you go about testing this?
Reporting & Analytics | | SarahGoliger0