Deployment in two data centers

Uladzislau · February 7, 2024, 11:22am

Hi, please help me figure this out.

I need to deploy a multi data center cluster. Exactly 2 data centers. Two clusters with bidirectional replication are not suitable, since we have many tables that need to be replicated. And in the setup_universe_replication command you need to specify the IDs of all tables. It seems there is no way to specify that certain databases need to be replicated.

Would it be ok to do the following setup instead? 3 nodes in one data center and 3 nodes in another data center in one cluster. On each node there is a master + tserver. Set the replication factor to 6. We need that if the connection between the data centers is lost, both clusters continue to work. Also, if one data center is lost, another cluster could service client requests.
Will this option be fault-tolerant and is it normal to use this approach?

dorian_yugabyte · February 7, 2024, 12:09pm

Hi @Uladzislau

Can you be more clear why you mean that this is not suitable?

RF must be an odd number, see Deployment checklist for YugabyteDB clusters | YugabyteDB Docs.

This option will require you to have 2 separate clusters and be connected with xCluster xCluster replication (2+ regions) in YugabyteDB | YugabyteDB Docs.

It won’t work with a single cluster with synchronous replication (even if you had 3 regions).

Uladzislau · February 7, 2024, 4:16pm

It is not convenient that you need to manually or using a script get the IDs of all tables and list them in the setup_universe_replication command. Also, when changing the database schema, you will have to constantly add new tables to replication.

I would like to be able to replicate all databases with all schemas and tables without having to list all the IDs of all tables.

Maybe I missed something?

dorian_yugabyte · February 8, 2024, 7:31am

This is in the roadmap, to replicate all tables. DDL conflicts is a separate issue though.

You’ll need 3 DCS with normal synchronous replication.

Uladzislau · February 8, 2024, 11:44am

But we only have 2 data centers. A third is not planned.

Uladzislau · February 8, 2024, 11:45am

If I get you right. There is no way to replicate 2 data centers with asynchronous replication and not list the IDs of all tables.

dorian_yugabyte · February 8, 2024, 11:57am

With the constraints you mentioned (2 dcs being able to function if either one of them goes down) you can only use xCluster.

Not yet.

Uladzislau · February 8, 2024, 5:49pm

Thanks for the info.

Hari_yb · June 18, 2024, 5:03pm

We will support the database level APIs from the next release
[xCluster] Support Database/Keyspace level replication · Issue #10984 · yugabyte/yugabyte-db · GitHub .

But this is going to be for YSQL only. And it will be transactional uni-direction replication. We do not support Bi-Directional Transactional xCluster since with async replication there is always a lag, and we cannot guarantee consistency when writing to both sides at the same time. You can have 2 DBs, with unidirectional transactional xCluster on different directions and write to the appropriate side.
You will have to run the YSQL DDL change on both sides, but you dont have to worry about getting the ID and adding it to replication. The automatic replication of DDLs itself is planned in [DocDB] xcluster: Automatically propagate DDL changes across clusters · Issue #11537 · yugabyte/yugabyte-db · GitHub

Topic		Replies	Views
Multi cluster setup active-active on bare machine , and both cluster share data asynchronously General	4	1451	June 3, 2021
2 server set up using manual installation of YugaByte with replication factor of 2 General	4	161	June 20, 2024
Migrate von single DC setup to multi DC geo replication General	1	588	August 12, 2022
2-region YugabyteDB cluster General	3	679	June 15, 2021
What is the behaviour of connection failure between masters in multi-master setup? General	5	962	July 26, 2020

Deployment in two data centers

Related topics