How do I diagnose a site that has had a corrupted database restored?
-
Here's the situation:
We downloaded a backup of a full database from cPanel because we needed to run some queries on it.
After restoring it, we found that cPanel had not been able to zip up the full database because the server memory limit was set so low (some posts weren't showing up after the restore).
So, how would I go about determining exactly what content is missing from the site? What search engine queries would you perform? Is there a plugin I could use to establish the inconsistencies between the database content and the search results?
Your help is appreciated in advance!
-
Thanks a lot, Ryan. This response was really helpful.
-
Prior to SEO, my time was spent as a Microsoft Database Administrator. Your site is almost certainly using a MySQL database, which differs from Microsoft SQL Server and its T-SQL dialect, but the comparison likely holds.
Whenever I set up a MS SQL database, a task would be scheduled to automatically shrink the database. There are temp tables which are added and removed, files within the database which can have unused pages removed, etc. A database size reduction of 5% does not indicate to me any data loss.
At a high level, you need to compare the "corrupted" database with the one you restored on a file or table level to determine any differences. That is the only reasonable way to achieve your goal. The work should ideally be performed by a professional who is highly experienced in MySQL.
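As a rough illustration of the table-level comparison described above, here is a minimal Python sketch. It assumes you have already exported the primary-key IDs of each table from both copies of the database (for example with a query along the lines of `SELECT ID FROM wp_posts` run against each copy). The function name `diff_tables` and the sample IDs are hypothetical, made up for this example only.

```python
from typing import Dict, Set


def diff_tables(older: Dict[str, Set[int]], newer: Dict[str, Set[int]]) -> Dict[str, Set[int]]:
    """Return, per table, the IDs found in the older copy but absent from the newer one."""
    missing: Dict[str, Set[int]] = {}
    for table, old_ids in older.items():
        # A table that is absent from the newer copy counts as entirely missing.
        new_ids = newer.get(table, set())
        gone = old_ids - new_ids
        if gone:
            missing[table] = gone
    return missing


# Hypothetical sample data: IDs exported from the three-day-old backup
# and from the restored database.
older_backup = {"wp_posts": {1, 2, 3, 4, 5}, "wp_comments": {10, 11, 12}}
restored = {"wp_posts": {1, 2, 4, 5, 6}, "wp_comments": {10, 11, 12, 13}}

missing = diff_tables(older_backup, restored)
for table, ids in sorted(missing.items()):
    print(table, sorted(ids))
```

An ID reported here is only a candidate for review: a row could also be absent because it was legitimately deleted after the earlier backup was taken.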
If you desire further assistance, I recommend pursuing the topic on a MySQL forum as this does not really pertain to SEO nor even WordPress.
-
Hi Ryan,
Apologies for not mentioning the software. The website is built on WordPress.
Here's a bit of extra information regarding the issue: after seeing that the database was corrupted, we compared the file size of the most recent backup with that of a backup taken three days earlier. The most recent backup was around 5% smaller than the earlier one. If anything, it should be larger.
Regarding timestamps, the latest posts are there, and the comments seem to be there, so determining what exactly IS missing is something I'm going to need help with.
I can't restore the earlier version of the database either, because important data has been added since then.
The site works fine, for now. I'm just worried somewhere down the road we're going to find that there are 100 posts missing that are now turning up as 404 pages and lost links.
Does that make sense? Thanks for your help.
-
Based on your inquiry, it seems likely you are using specific software to run your site. It is unclear what type of software is being used, which is a critical factor. It could be a CMS such as WordPress, a shopping cart such as ZenCart, a forum such as vBulletin, etc.
You would likely receive the fastest and most accurate response by using the support site of the specific software in use.
Based on your questions, you are in far over your head and should ideally step aside and find a programmer who can resolve the issue. With that said, I'll try to answer your questions.
"how would I go about determining exactly what content is missing from the site?"
In order to determine what is missing, you need a baseline. You need to understand the site's function and activity. For example, if you are running an ecommerce site, what is the timestamp of the last order placed on the site?
"What search engine queries would you perform?"
None. You have an onsite issue. That is where your attention needs to be focused.
"Is there a plugin I could use to establish the inconsistencies between the database content and the search results?"
No. The search results should not even be a consideration. Search engines may choose to index or not index your content based on numerous factors including the robots.txt file, the meta tags on each page, the content on each page and so forth. Asking this question indicates you are grasping at straws. If your site is important to you, hire a professional developer to fix the problem. If the site is not of great importance (i.e. it does not generate revenue) then you can visit the site of the software in use and spend a day or two reading various articles, forums and such, then making various setting changes in an attempt to restore the site.
Another option: contact the web host and ask them to restore a full backup of the entire site. This option would likely be best, but you would lose all data added after the backup was taken.