How do I diagnose a site that has had a corrupted database restored?
-
Here's the situation:
Downloaded a backup of a full database from CPanel, because we needed to perform some queries on it.
Found out that after restoring it, CPanel had not been able to zip up the full database because the server memory was set so low (some posts weren't showing up after the restore).
SO, how would I go about determining exactly what content is missing from the site? What search engine queries would you perform? Is there a plugin I could use to establish the inconsistencies between the database content and the search results?
Your help is appreciated in advance!
-
Thanks a lot, Ryan. This response was really helpful.
-
Prior to SEO, my time was spent as a Microsoft Database Administrator. Your site almost certainly is using a MySQL database which is a different format then T-SQL, but the comparison likely holds.
Whenever I set up a MS SQL database, a task would be scheduled to automatically shrink the database. There are temp tables which are added and removed, files within the database which can have unused pages removed, etc. A database size reduction of 5% does not indicate to me any data loss.
At a high level, you need to compare the "corrupted" database with the one you restored on a file or table level to determine any differences. That is the only reasonable way to achieve your goal. The work should ideally be performed by a professional who is highly experienced in MySQL.
If you desire further assistance, I recommend pursuing the topic on a MySQL forum as this does not really pertain to SEO nor even WordPress.
-
Hi Ryan,
Apologies for not mentioning the software. The website is built on Wordpress.
Here's a bit extra information for you regarding the issue: upon inspection after seeing that the database was corrupted, comparing file sizes of the recently backed-up database, with a database that was backed-up three days beforehand, we found that the most recently backed-up database was around 5% smaller in file size than the earlier one—if anything it should be larger.
Regarding timestamps, the latest posts are there, and the comments seem to be there, so determining what exactly IS missing is something I'm going to need help with.
I can't restore the earlier version of the database either, because important data has been added since then.
The site works fine, for now. I'm just worried somewhere down the road we're going to find that there are 100 posts missing that are now turning up as 404 pages and lost links.
Does that make sense? Thanks for your help.
-
Based on your inquiry, it seems likely you are using specific software to run your site. It is unclear what type of software is being used, which is a critical factor. It could be a CMS such as WordPress. a shopping cart such as ZenCart, a forum such as vBulletin, etc.
You would likely receive the fastest and most accurate response by using the support site of the specific software in use.
Based on your questions, you are in far over your head and should ideally step aside and find a programmer who can resolve the issue. With that said, I'll try to answer your questions.
"how would I go about determining exactly what content is missing from the site?"
In order to determine what is missing, you need a baseline. You need to understand the site's function and activity. For example, if you are running an ecommerce site, what is the timestamp of the last order placed on the site?
"What search engine queries would you perform?"
None. You have an onsite issue. That is where your attention needs to be focused.
** "Is there a plugin I could use to establish the inconsistencies between the database content and the search results?"**
No. The search results should not even be a consideration. Search engines may choose to index or not index your content based on numerous factors including the robots.txt file, the meta tags on each page, the content on each page and so forth. Asking this question indicates you are grasping at straws. If your site is important to you, hire a professional developer to fix the problem. If the site is not of great importance (i.e. it does not generate revenue) then you can visit the site of the software in use and spend a day or two reading various articles, forums and such, then making various setting changes in an attempt to restore the site.
Another option....contact the web host and request them to restore a full backup of the entire site. This option would likely be best, but you would lose all data from after the time the backup was taken.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
We are migrating a site and are seeing alot of 301s and 302s already in the old site is it ok to leave those as is?
For the 3xx’s I’m not sure if it’s okay for us to redirect to these so please advise on that
Technical SEO | | lina_digital0 -
Direct To Site Traffic Decline
Hi Everyone, My company's site went from Http to Https and responsive design last year. After 6 months passed, we noticed a 40% decline from direct visits but revenue saw an increase. Could the 301 redirects from Http to Https cause the drop in direct traffic? Thanks!
Technical SEO | | JMSCC1 -
Sudden traffic down after mobile site
Hi to all. I have a huge problem about my web site traffic. We published our mobile site version under mobile domain in m.etilercerrental.com. And we published mobile site sitemap under main site such as sitemap-mobile.xml and notify google from webmaster tools. After a while whole site traffic gone down so badly. Almost 60% gone down which doesnt make sense. Mobile sitemap is hosted under desktop site. And we didnt add any link to switch sites between mobile and desktop version. We automaticly redirect between desktop and mobile versiyons by detecting useragent. One another this is we have announcement section in dekstop version but we dont have it in mobile. But we are not making any redirection to mobile site when user visits the announcement section even by mobile phone. Because we are not displaying it in mobile version.
Technical SEO | | gkhnrtk
One last this is we are making redirections automatically and not asking user preference such as showing a link in the bottom of page to switch sites betwenn mobile and pc version. While developing mobile site we followed instructions in this https://developers.google.com/webmasters/smartphone-sites/?hl=tr For find the details I would like to give you some site urls. Site Url : http://www.etilercarrental.com/
Sitemap Xml : http://www.etilercarrental.com/sitemap.xml
Mobile Sitemap Xml : http://www.etilercarrental.com/sitemap-mobile.xml0 -
Google having trouble accessing my site
Hi google is having problem accessing my site. each day it is bringing up access denied errors and when i have checked what this means i have the following Access denied errors In general, Google discovers content by following links from one page to another. To crawl a page, Googlebot must be able to access it. If you’re seeing unexpected Access Denied errors, it may be for the following reasons: Googlebot couldn’t access a URL on your site because your site requires users to log in to view all or some of your content. (Tip: You can get around this by removing this requirement for user-agent Googlebot.) Your robots.txt file is blocking Google from accessing your whole site or individual URLs or directories. Test that your robots.txt is working as expected. The Test robots.txt tool lets you see exactly how Googlebot will interpret the contents of your robots.txt file. The Google user-agent is Googlebot. (How to verify that a user-agent really is Googlebot.) The Fetch as Google tool helps you understand exactly how your site appears to Googlebot. This can be very useful when troubleshooting problems with your site's content or discoverability in search results. Your server requires users to authenticate using a proxy, or your hosting provider may be blocking Google from accessing your site. Now i have contacted my hosting company who said there is not a problem but said to read the following page http://www.tmdhosting.com/kb/technical-questions/other/robots-txt-file-to-improve-the-way-search-bots-crawl/ i have read it and as far as i can see i have my file set up right which is listed below. they said if i still have problems then i need to contact google. can anyone please give me advice on what to do. the errors are responce code 403 User-agent: *
Technical SEO | | ClaireH-184886
Disallow: /administrator/
Disallow: /cache/
Disallow: /components/
Disallow: /includes/
Disallow: /installation/
Disallow: /language/
Disallow: /libraries/
Disallow: /media/
Disallow: /modules/
Disallow: /plugins/
Disallow: /templates/
Disallow: /tmp/
Disallow: /xmlrpc/0 -
What happens if my site was down for 10 hours?
Hello, We have a fairly large site. Due to an attack it was taken down for about 10 hours. We have finally been able to resolve the security issues, restore the code and put everything back online. Now, we are a little bit worried about what GoogleBot will think of us. We have been working hard during the past months to get a better SEO. Should we be worried?
Technical SEO | | jgenesto0 -
Site Disappeared off of Search
A friend of mine has a site (http://bit.ly/q4iWkM ) that was ranking number one for their key word (Drimnagh() and has now completely disappeared off of the ranking. I did some checking and can't see a problem. She does have duplicate meta and titles throughout but this shouldn't be a punishable offence that I know of and is something that I am going to correct with a quick plugin install. I couldn't see any redirects or code stopping search either. When you do site:URL it shows up OK as well. She is client of mine (for website not for SEO) and she is really upset about it so any help from the forum would be appreciated. This isn't even a site I did but you couldn't get a better person to work with so I am eager to help where and if possible. Guinness all round if someone solves it next time you are in Ireland
Technical SEO | | kdaly1000 -
Partial Site Move -- Tell Google Entire Site Moved?
OK this one's a little confusing, please try to follow along. We recently went through a rebranding where we brought a new domain online for one of our brands (we'll call this domain 'B' -- it's also not the site linked to in my profile, not to confuse things). This brand accounted for 90% of the pages and 90% of the e-comm on the existing domain (we'll call the existing domain 'A') . 'A' was also redesigned and it's URL structure has changed. We have 301s in place on A that redirect to B for those 90% of pages and we also have internal 301s on A for the remaining 10% of pages whose URL has changed as a result of the A redesign What I'm wondering is if I should tell Google through webmaster tools that 'A' is now 'B' through the 'Change of Address' form. If I do this, will the existing products that remain on A suffer? I suppose I could just 301 the 10% of URLs on B back to A but I'm wondering if Google would see that as a loop since I just got done telling it that A is now B. I realize there probably isn't a perfect answer here but I'm looking for the "least worst" solution. I also realize that it's not optimal that we moved 90% of the pages from A to B, but it's the situation we're in.
Technical SEO | | badgerdigital0 -
301 an old site to a newer site...
Hi First, to be upfront - these are not my websites, I'm asking because they are trying to compete in my niche. Here's the details, then the questions... There is a website that is a few months old with about 200 indexed pages and about 20 links, call this newsite.com There is a website that is a few years old with over 10,000 indexed pages and over 20,000 links, call this oldsite.com newsite.com acquired oldsite.com and set a 301 redirect so every page of oldsite.com is re-directed to the front page of newsite.com newsite.com & oldsite.com are on the same topic, the 301 occurred in the past week. Now, oldsite.com is out of the SERPs and newsite.com is pretty much ranking in the same spot (top 10) for the main term. Here are my questions; 1. The 10,000 pages on oldsite.com had plenty of internal links - they no longer exists, so I imagine when the dust settles - it will be like oldsite.com is a one page site that re-diretcts to newsite.com ... How long will a ranking boost last for? 2. With the re-direct setup to completely forget about the structure and content of oldsite.com, it's clear to me that it was setup to pass the 'Link Juice' from oldsite.com to newsite.com ... Do the major SE's see this as a form of SPAM (manipulating the rankings), or do they see it as a good way to combine two or more websites? 3. Does this work? Is everybody doing it? Should I be doing it? ... or are there better ways for me to combat this type of competition (eg we could make a lot of great content for the money spent buying oldsite.com - but we certainly wouldn't get such an immediate increase to traffic)?
Technical SEO | | RR5000