Engineering, Backend

Improving MySQL® Cluster Uptime: Designing Advanced Detection, Mitigation, and Consensus with Group Replication

December 2 / Global
Figure 1: MySQL cluster at Uber.
Figure 2: Old mitigation system architecture.
Figure 3: Architecture of a consensus-based MySQL cluster.
Figure 4: Insert latency comparison.
Figure 5: Update latency comparison.
Figure 6: Cluster uptime comparison of async versus MGR cluster.
Siddharth Singh

Siddharth Singh is a Staff Software Engineer on the Core Storage team at Uber focusing on the reliability, availability, quality, and resiliency charters for MySQL/Vitess. He played a pivotal role as the lead for high availability via consensus in the MySQL project.

Raja Sriram Ganesan

Raja Sriram Ganesan is a Sr Staff Software Engineer on the Core Storage team at Uber. He’s the tech lead for MySQL initiatives and has led critical reliability and modernization projects for MySQL at Uber.

Amit Jain

Amit Jain is an Engineering Manager in the Storage organization and leads the charter of MySQL, sharded MySQL, CDC (Change Data Capture), and snapshot systems. His work centers on building reliable and scalable database infrastructure that delivers high availability and performance at Uber’s global scale.

Debadarsini Nayak

Debadarsini Nayak is a Senior Engineering Manager who leads the development and management of various online data technologies.

Posted by Siddharth Singh, Raja Sriram Ganesan, Amit Jain, Debadarsini Nayak