How do I diagnose a site that has had a corrupted database restored?
-
Here's the situation:
Downloaded a backup of a full database from CPanel, because we needed to perform some queries on it.
Found out that after restoring it, CPanel had not been able to zip up the full database because the server memory was set so low (some posts weren't showing up after the restore).
SO, how would I go about determining exactly what content is missing from the site? What search engine queries would you perform? Is there a plugin I could use to establish the inconsistencies between the database content and the search results?
Your help is appreciated in advance!
-
Thanks a lot, Ryan. This response was really helpful.
-
Prior to SEO, my time was spent as a Microsoft Database Administrator. Your site almost certainly is using a MySQL database which is a different format then T-SQL, but the comparison likely holds.
Whenever I set up a MS SQL database, a task would be scheduled to automatically shrink the database. There are temp tables which are added and removed, files within the database which can have unused pages removed, etc. A database size reduction of 5% does not indicate to me any data loss.
At a high level, you need to compare the "corrupted" database with the one you restored on a file or table level to determine any differences. That is the only reasonable way to achieve your goal. The work should ideally be performed by a professional who is highly experienced in MySQL.
If you desire further assistance, I recommend pursuing the topic on a MySQL forum as this does not really pertain to SEO nor even WordPress.
-
Hi Ryan,
Apologies for not mentioning the software. The website is built on Wordpress.
Here's a bit extra information for you regarding the issue: upon inspection after seeing that the database was corrupted, comparing file sizes of the recently backed-up database, with a database that was backed-up three days beforehand, we found that the most recently backed-up database was around 5% smaller in file size than the earlier one—if anything it should be larger.
Regarding timestamps, the latest posts are there, and the comments seem to be there, so determining what exactly IS missing is something I'm going to need help with.
I can't restore the earlier version of the database either, because important data has been added since then.
The site works fine, for now. I'm just worried somewhere down the road we're going to find that there are 100 posts missing that are now turning up as 404 pages and lost links.
Does that make sense? Thanks for your help.
-
Based on your inquiry, it seems likely you are using specific software to run your site. It is unclear what type of software is being used, which is a critical factor. It could be a CMS such as WordPress. a shopping cart such as ZenCart, a forum such as vBulletin, etc.
You would likely receive the fastest and most accurate response by using the support site of the specific software in use.
Based on your questions, you are in far over your head and should ideally step aside and find a programmer who can resolve the issue. With that said, I'll try to answer your questions.
"how would I go about determining exactly what content is missing from the site?"
In order to determine what is missing, you need a baseline. You need to understand the site's function and activity. For example, if you are running an ecommerce site, what is the timestamp of the last order placed on the site?
"What search engine queries would you perform?"
None. You have an onsite issue. That is where your attention needs to be focused.
** "Is there a plugin I could use to establish the inconsistencies between the database content and the search results?"**
No. The search results should not even be a consideration. Search engines may choose to index or not index your content based on numerous factors including the robots.txt file, the meta tags on each page, the content on each page and so forth. Asking this question indicates you are grasping at straws. If your site is important to you, hire a professional developer to fix the problem. If the site is not of great importance (i.e. it does not generate revenue) then you can visit the site of the software in use and spend a day or two reading various articles, forums and such, then making various setting changes in an attempt to restore the site.
Another option....contact the web host and request them to restore a full backup of the entire site. This option would likely be best, but you would lose all data from after the time the backup was taken.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Site scraped over 400,000 urls
Our business is heavily dependent on SEO traffic from long tail search. We have over 400,000 pieces of content, all of which we found scraped and published by another site based out of Hong Kong (we're in the US). Google has a process for DMCA takedown, but doing so would be beyond tedious for such a large set of urls. The scraped content is outranking us in many searches and we've noticed a drastic decrease in organic traffic, likely from a duplicate content penalty. Has anyone dealt with an issue like this? I can't seem to find much help online.
Technical SEO | | Kibin0 -
Staging site and "live" site have both been indexed by Google
While creating a site we forgot to password protect the staging site while it was being built. Now that the site has been moved to the new domain, it has come to my attention that both the staging site (site.staging.com) and the "live" site (site.com) are both being indexed. What is the best way to solve this problem? I was thinking about adding a 301 redirect from the staging site to the live site via HTACCESS. Any recommendations?
Technical SEO | | melen0 -
302 error removing site from results
I have a client who had a screwy url structure based off of parameters and all. They hired a developer that added the keyword to the end of the url and set up 302 redirects to the new keyword included url. Since then the entire site has virtually gone missing in the results but it is not penalized. I put in a request with webmaster tools for reconsideration and they said there was no penalty. I only just found the 302 problem today and think this is probably the problem. Could this remove a site from the search results?
Technical SEO | | webfeatseo0 -
Diagnosing a Penguin Demotion
Whilst ive noyt hit the panic button just yet ive learnt that a specific target term I was optimising a site flor has dropped off the Google radar from forth to zero 😞 The site is www.davidclick.com the target term was "York wedding photographer" and the demotion happened at the end of March this year, below illustrates the carnage.. http://i216.photobucket.com/albums/cc53/zymurgy_bucket/target-term-demotioncopy.jpg When i first learnt about this I began to wonder if I was a victim of the Penguin update. I suspected i might so I ran a test to see how many inbound links used the target term York wedding photographer and here is what i found... Only one site http://www.singingwaitersx2.com/links/ and at a push this site also http://york-wedding-photographer.blogspot.co.uk/ So my question is have i fallen foul to the penguin update? If yes then is the only remedy to block the aforementioned site links via this technique: http://www.htaccess-guide.com/deny-visitors-by-referrer/ Thanks in advance, David
Technical SEO | | Nightwing0 -
What to do next with my site gamblingsites.co
So I have this site gamblingsites.co, which I launched about a year ago (I think.) This used to be internetgamblingsites.net (a domain I bought, but never managed to get in the index, and it appeared to violate the T/Cs after asking in GWMT) and before that the site used to be casinowarehouse.eu. After moving to gamblingsites.co, the pages were indexed almost instantly. I kept a 301 in place until today as I had some links pointing to internetgamblingsites.net. Now, until a few weeks ago, everything was fine. The site was ranking top 10 for gambling sites (8-10) and I had some traffic everyday. This site wasn't my top priority, so besides adding new unique content, I didn't do much with it. In each case no shady link building or what-so-ever. On February first of this year, however, it lost all of its rankings, and I have no idea why. Much worse site appear in the top 50, where a sub page of my site appears somewhere on the 9th SERP for keyword 'gambling sites.' Last week I started contacting some people and asked them to update my links. I also used my own sites (all on unique hosting accounts) to build some branded links, i.e. 'GamblingSites.co' and similar terms to down tune the exact match. I also decreased the instances of the exact match on the homepage, to avoid over optimization. Finally, I removed the 301 from internetgamblingsites.net, since the better links have been changed (or are about to get changed soon.) Now, couple of days later... no changes, but it's probably to early to judge. My question to you: "What would you do next, to try to save the site and at least get some traffic to it?" Thank you for your help, Giorgio PS: Feel free to ask for more information.
Technical SEO | | VisualSense0 -
Adding .html To Wordpress Site
I am working on a team (my part is the SEO) where the developer added the .html extension to the permalinks. I don't understand why, on Wordpress, you would do this. Is there any benefit, or penalty for it as an SEO standpoint? I usually just set mine up %postname% as the permalink structure, but I am not the web design lead on this project. I asked the designer why, but he seems to be reluctant to answer any of my emails about his work, (like he is above that or something). Not wanting to make things worse, I dropped it and thought I would ask here since I saw a few posts in reference to it today in the forum. Is there an advantage or disadvantage (either way) to using the .html extension on a Wordpress site?
Technical SEO | | kbloemendaal0 -
Optimizing a website which uses information from a database
Hi, Sorry if this question is very general. I am in the process of building a website for local business, and it it will be heavily dependent on a database. The database will contain loads of information which can be optimized such as business directory listings, articles, forums, questions and answers etc. The businesses will also be able to link to and from the site. Which is the best way to display this information so that it can be optimized the best? I was going to use drop down boxes on a single page, ie main category, sub catagory, then display the business listings based on this. However, as the information on the page changes constantly based on the drop down the robot / user uses, I am assuming this is very hard to get optimized well. Does anyone know a better way? Thanks.
Technical SEO | | PhatJP0 -
Will rankings for my micro site rank better if I 301 redirect it to my main site?
This is my first time asking so I will try to be as clear as possible. Ok, I have a micro site that is an (exact match domain) and the domain is a couple 3-4 years old and ranks very well for several search terms. The main two terms it ranks for are like this. houses for rent in XXXXX XXXXX homes for rent (XXXXX equals a city name) The issue is this site has no backlinks, zero advanced SEO, I only did basic optimization to it when i set the site up. Even site structure, url structure all are not good.
Technical SEO | | Robbie8299
The only page I have ever even seen rank is the main root url. But with all that the site does really good in the top 1-2 results for key search terms. Now, I have a main site that is a very big site that has steadily been climbing in search terms every month with great backlinks, optimized for the city and all.
It currently ranks on second page for the listed search terms listed above. What I want to do is 301 redirect this microsite to my city page on my main site that is much better optimized for the key city terms.
The 301 redirect would point this "root domain" (mymicrosite.com) to my city page that looks like this. www.mymaindomain.com/city/XXXXXXX If I do this will Google rank my main URL city page as well as it ranks this microsite with zero links, seo, etc, etc. What happens if it does not? Will I be able to turn off the 301 redirect and keep the microsite rankings? My main reason for wanting this is I want this city page to rank well and I only want to optimize one site instead of both. Any help would be great!0