How to Build the Most Effective Backup System — A Conversation with the Expert

Why Should Databases Be Backed Up?

I think the answer to this question is already obvious. So, rather than answering this question, I would like to answer another question: what risks can be prevented through database backup? In fact, since its generation, data has always been accompanied by the risks of data loss caused by natural disasters, power failures, network faults, hardware faults, software faults, and human faults.

Which Challenges Are Presented by Database Backups?

The first challenge is taking stock of database assets. For an individual user, all these database assets may just be one instance, and the user clearly knows the assets even without stocktaking. However, for an enterprise user, especially a user from a large-sized enterprise, the database can have multiple instances and various database types due to business diversity. In this case, the O&M personnel need to clearly know the numbers, distribution, types (production or core databases), and functions of different databases.

What Is an Effective Backup System?

Different databases can be used for different purposes, and the effectiveness of a backup system varies accordingly. According to their functions, databases can be classified as test databases, production databases, and core databases.

  1. Not verifying the validity of the backup data is even worse than not backing up the data. Imagine that all of your business data has been completely destroyed in a disaster. However, when you want to recover the data, you may find that the backup data is corrupted, the files that you backed up are incorrect, or some other terrible thing has happened. In this case, what can you do? A data backup solution without validation can be an even bigger disaster.
    You must validate the backup content to ensure that the data has been properly backed up and can be used for recovery. Don’t wait until it is too late.
  2. Don’t insist on large and comprehensive solutions. Diversified requirements must be met by a variety of solutions. In particular, for the core database, the entire instance must be backed up regularly to prevent hardware failures and damage to instances. In addition, each table must be backed up in real time, which often reduces the data recovery time at crunch time by up to 90%.
  3. Either manual or automatic data validation aims to verify the validity of the backup data used for recovery (also referred to as the recovery data). Verification of the integrity of the recovery data is pretty challenging. In most cases, the recovery data and production data are sampled and compared with each other based on the business characteristics. Alternatively, the recovery database serves as the secondary database and is synchronized with the primary database to verify data integrity.

Which Solutions Are Applicable?

Again, be prepared before the data is lost. Act now to protect your database. Here are some of the solutions that are deployed based on Alibaba Cloud products:

  1. If your database is located on an Alibaba Cloud ECS instance, use Database Backup Service (DBS) to back up the data to OSS. It takes as little as five minutes to purchase, configure, and start the backup service.
  2. If your database is located on a local IDC, and remote access to the Internet is enabled for the database, use DBS to directly back the data up. If you have activated Express Connect, use DBS to back up data to OSS. Depending on the DBS region of your choice, you can also implement remote backup.
  3. If your database is hosted by a cloud vendor other than Alibaba Cloud and remote access to the Internet is enabled for the database, use DDS to directly back up the database. If you have activated the deployment agent service or Express Connect, use DBS to back up data to OSS and implement cross-cloud backup on Alibaba Cloud.

Could You Give Us a Brief Introduction to Your Work?

I am currently in charge of an Alibaba Cloud product called DBS. Have you ever heard of it? As a database backup channel, DBS has been put into commercial use, and is used together with OSS to develop a cloud database backup solution. It takes only five minutes for such a solution to implement real-time backup with a second-level Recovery Point Objective (RPO). The RPO indicates the maximum duration allowed for data loss when the database fails. And of course, a smaller RPO is always desired.

About the Author

Heng Tiegang (nickname Pei’en) joined Alibaba in 2011, and was once the MySQL DBA of Alibaba Group. He is currently a database product manager responsible for designing database backup products.



Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Alibaba Cloud

Alibaba Cloud

Follow me to keep abreast with the latest technology news, industry insights, and developer trends. Alibaba Cloud website: