Very strange HTML docs - what should I do with them through site migration?
-
I've just been looking at a website and it includes numerous web pages with addresses like this. I click on the URL and it takes me to a fully functional web page (not an image) and when I run it through Screaming Frog this comes up as an HTML page. The site has around 150 unique pages and over 450 pages like this one - how should I deal with these pages during an SEO migration (only a few are backlinked to)? I look forward to reading your thoughts.
http://www.[companyname].co.uk/property/caravan-sleeps-4/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/images/cottageTypes/blank.png
-
Or should I fix the issue first via htaccess rule before attempting the migration
I quite honestly think that the problem is WITH htaccess, not that you have to fix something else with htaccess.
And as an answer to your question - you always can migrate with issues and hope that nothing breaks during the process, or try to patch it up so it seems to be working fine and, again, hope that it doesn't break on you, OR you can get it fixed at the root of the problem and don't worry about it in the future.
-
Thanks Dmitrii - I will take a look - as the client is migrating on Tuesday, would you think I could get away with ignoring these during migration, or perhaps I should redirect the pages (very few are backlinked - I can redirect those that are backlinked and leave the rest).
Or should I fix the issue first via htaccess rule before attempting the migration? The web developer has disappeared (before my arrival on the scenes), and nobody has any access, so not sure if I can make much progress - no access to hosting at the mo, or anything.
-
Hi there.
It looks like some problems with htaccess rewrite rule or redirects. Most likely rewrite rule, since you say that it takes you to html page, not image.
Check that out.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How do I canonicalize an old HTML static site?
Hey All, I have an old static HTML site, and the crawl errors are showing "http://www.website.com" and http://website.com" as the two separate pages because there is no canonicalization. Can I fix that with a rel="canonical" tag? There is just a folder of HTML files to add the tag to, so if the www. version is the true version, can I just add to all the pages? Or is there a better way to do this??
Intermediate & Advanced SEO | | mbodine0 -
Sitewide links and owned site
Hi everyone, I need the community opinion on something. I am webmarketer and SEO for a pure player who runs a couple of e-commerce sites. On one side we have bigsite.com. It makes all our revenue. I have been in charge for years. Results are good. We have smallsite.com. It is starting. But small revenues for the moment. We have a new SEO working on this. My question is : We always had a banner on bigsite.com's homepage, sending valuable traffic to smallsite.com.T he new SEO, has footer sitewide links from smallsite.com to bigsite.com homepage. Considering both sites share same ssl, server and company name, I am quite sure this is out of google's guide lines and would hurt bigsite.com. Do you agree that this is wrong from the new SEO, and that it could hurt my work and the search results for bigsite.com and smallsite.com, as well as team work ? Thanks
Intermediate & Advanced SEO | | Kepass0 -
6 .htaccess Rewrites: Remove index.html, Remove .html, Force non-www, Force Trailing Slash
i've to give some information about my website Environment 1. i have static webpage in the root. 2. Wordpress installed in sub-dictionary www.domain.com/blog/ 3. I have two .htaccess , one in the root and one in the wordpress
Intermediate & Advanced SEO | | NeatIT
folder. i want to www to non on all URLs Remove index.html from url Remove all .html extension / Re-direct 301 to url
without .html extension Add trailing slash to the static webpages / Re-direct 301 from non-trailing slash Force trailing slash to the Wordpress Webpages / Re-direct 301 from non-trailing slash Some examples domain.tld/index.html >> domain.tld/ domain.tld/file.html >> domain.tld/file/ domain.tld/file.html/ >> domain.tld/file/ domain.tld/wordpress/post-name >> domain.tld/wordpress/post-name/ My code in ROOT htaccess is <ifmodule mod_rewrite.c="">Options +FollowSymLinks -MultiViews RewriteEngine On
RewriteBase / #removing trailing slash
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule ^(.*)/$ $1 [R=301,L] #www to non
RewriteCond %{HTTP_HOST} ^www.(([a-z0-9_]+.)?domain.com)$ [NC]
RewriteRule .? http://%1%{REQUEST_URI} [R=301,L] #html
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule ^([^.]+)$ $1.html [NC,L] #index redirect
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index.html\ HTTP/
RewriteRule ^index.html$ http://domain.com/ [R=301,L]
RewriteCond %{THE_REQUEST} .html
RewriteRule ^(.*).html$ /$1 [R=301,L]</ifmodule> The above code do 1. redirect www to non-www
2. Remove trailing slash at the end (if exists)
3. Remove index.html
4. Remove all .html
5. Redirect 301 to filename but doesn't add trailing slash at the end0 -
Why is this site not ranking?
http://www.petstoreunlimited.com They get good grades from the on-page tool. The links are not amazing, but are not super spammy. Yet it ranks for nothing they target Any reason why?
Intermediate & Advanced SEO | | Atomicx0 -
What this site is doing? Does it look like cloaking to you?
Hi here, I was studying our competitors SEO strategies, and I have noticed that one of our major competitors has setup something pretty weird from a SEO stand point for which I would like to know your thoughts about because I can't find a clear explanation for it. Here is the deal: the site is musicnotes.com, and their product pages are located inside the /sheetmusic/ directory, so if you want to see all their product pages indexed on Google, you can just type in Google: site:musicnotes.com inurl:/sheetmusic/ Then you will get about 290,000 indexed pages. No, here is the tricky part: try to click on one of those links, then you will get a 302 redirect to a page that includes a meta "noindex, nofollow" directive. Isn't that pretty weird? Why would they want to "nonidex, nofollow" a page from a 302 redirect? And how in the heck the redirecting page is still in the index?!! And how Google can allow that?! All this sounds weird to me and remind me spammy techniques of the 90s called "cloaking"... what do you think?
Intermediate & Advanced SEO | | fablau0 -
It appears that Googlebot Mobile will look for mobile redirects from the desktop site, but still use the SEO from the desktop site.
Is the above statement correct? I've read that its better to have different SEO titles & descriptions for mobile sites as users search differently on mobile devices. I've also read it's good to link build, keep text content on mobile sites etc to get the mobile site to rank. If I choose to not have titles & descriptions on my mobile site will Google just rank our desktop version & then redirect a user on a mobile device to our mobile site or should I be adding in titles & descriptions into the mobile site? Thanks so much for any help!
Intermediate & Advanced SEO | | DCochrane0 -
Do I have to tell WBT site moved to a subdirectory on another internal site?
I am moving content from one site to another and redirecting the DNS from www.oldsite.com to www.newsite.com/old-site. I have put the 301 in place but I wanted to make sure I have to also tell Webmaster Tools to change the old site to the new domain? We still want the old domain name to answer and redirect to www.newsite.com/old-site. Thanks
Intermediate & Advanced SEO | | GeorgeLaRochelle0 -
Merging three sites to one
Hi guys, I just wanted confirmation if this is the right way to go about doing this. I need to merge three websites and I've never done three websites in to a brand new site before. Ok so we have Sitex.com
Intermediate & Advanced SEO | | Profero
Sitey.com
Sitez.com We've created a SiteB.com SiteB.com has SiteB.com/SiteXCat
SiteB.com/SiteYCat
SiteB.com/SiteZCat Each X,Y and Z have over 1,000 pages. They only have about 10 pages each with Page Authority above 10 and the domains arn't that strong. What i plan to do is: 301 redirect each site domain (X,Y,,Z) to it's corresponding category. e.g. Sitex.com > SiteB.com/SiteXCat 301 redirect each page off X,Y,Z that has a Page Authority above 10 to their new pages on SiteB.com Then, I'm unsure if i should 410 every other URL... I don't think its worht 301 every single URL if they arn't in search results much - but maybe it is if they have a lot of inbound links even with low page authority? Any ideas and does the above seem the best practise? Thanks.0