Ping a Specific Port

Question

OverTheRainbow

Asked: 2009-08-26 23:38:00 +0800 CST2009-08-26 23:38:00 +0800 CST 2009-08-26 23:38:00 +0800 CST

How to keep multiple read/write DB servers in sync?

772

I'm curious to know how big sites spread the load between the different DB server in the case where users write as much as they read, ie. when the standard solution of having one master to accept write, and several slaves that only let users read data doesn't work because it simply turns the master server into the bottleneck.

For those of you who manage a big site with a load balancer -> multiple web servers -> multiple DB servers, how do you spread the load evenly between the DB servers so that users (at best) don't have to wait for the master to update the slaves, or (at worst) users end up reading dirty data from slaves that haven't been updated yet?

Thank you.

6 Answers

Voted

JamesRyan · Answer 1 · 2009-08-27T01:16:59+08:00

JamesRyan

2009-08-27T01:16:59+08:002009-08-27T01:16:59+08:00

Check out http://highscalability.com/

You can use more complicated methods of storing the data basically to denormalise and segment it into chunks that you can load balance across servers. Look for shards.

The general answer seems to be to make the single writing DB machine more and more powerful for as long as possible before you move to those other methods though.

In most cases the best way to solve the problem is to rethink how your site works to cut down the number of writes/make them batchable.

3

wolfgangsz · Answer 2 · 2009-08-27T06:03:42+08:00

wolfgangsz

2009-08-27T06:03:42+08:002009-08-27T06:03:42+08:00

What you need is a proper multi-master database. And as far as I know the only DB engine that has so far implemented this in a reliable way is Oracle. Which goes some way to explain why all the big boys use Oracle.

Having said that, MySql does support multi-master replication, although (AFAIK) not in a full production release. See http://dev.mysql.com/doc/refman/5.1/en/mysql-cluster-replication-multi-master.html for more detail.

1

Tom · Answer 3 · 2017-11-23T07:49:54+08:00

Tom

2017-11-23T07:49:54+08:002017-11-23T07:49:54+08:00

This answer does not answer the title of the question because it makes no attempt to keep the DBs in sync but it does answer the body of the question to do with distributing requests for high scale websites.

You can use Sharding to divide your data so for example you have 26 database servers one for each letter of the alphabet. All the users with name beginning with A go to one server. You can use various algorithms to divide up your requests evenly. It's a complex solution that shouldn't really be used until other options have been exhausted.

https://en.wikipedia.org/wiki/Shard_(database_architecture)

1

Istvan · Answer 4 · 2009-08-27T03:33:49+08:00

Istvan

2009-08-27T03:33:49+08:002009-08-27T03:33:49+08:00

I presume you are talking about MySQL, based on your terms. Unfortunately this DBMS has lack of support for the distributed writes, only the NDB supports that.

http://dev.mysql.com/doc/refman/5.0/en/mysql-cluster-overview.html

http://dev.mysql.com/doc/refman/5.0/en/mysql-cluster-nodes-groups.html

Another solution can be: use DNS level partition based on your client GEO location resolv different IP addresses where to connect to and basically separate the data by this info. There is a problem with this sort of solution, if you have a query for example you want to know how many items do you have globally then this won't work very well.

0

mrdenny · Answer 5 · 2009-08-27T05:43:11+08:00

mrdenny

2009-08-27T05:43:11+08:002009-08-27T05:43:11+08:00

It depends on the site and the part of the site.

Some pieces will have a single write server, which will then replicate to a bunch of read servers.

Other pieces of the site will have lots of servers each holding a small part of the data in them. For example a couple of million customer accounts per database server with logic in the application so that it knows which server you are on based on your UserId.

0

rolaf · Answer 6 · 2009-08-27T06:05:15+08:00

rolaf

2009-08-27T06:05:15+08:002009-08-27T06:05:15+08:00

A solution is to rethink your application so that you can split data between multiple database servers. Sometimes it's easy... sometimes not.

0

How to keep multiple read/write DB servers in sync?

Ping a Specific Port

What port does SFTP use?

Resolve host name from IP address

How can I sort du -h output by size

Command line to list users in a Windows Active Directory group?

What's the command-line utility in Windows to do a reverse DNS look-up?

How to check if a port is blocked on a Windows machine?

What port should I open to allow remote desktop?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?