Microservices approach: a centralized YugaByte DB for all services or one per service?

Alex_Arica · May 22, 2020, 12:23pm

Hello,

I am coming from a microservice world where each microservice has a dedicated instance of PostgreSql server. For example a “product” microservice has a dedicated instance of PostgreSql server, and an “account” microservice as a dedicated instance of PostgreSql. Etc…

We have about 40 different microservices with such design.

This design was put in place to isolate each microservice data. And also to avoid a centralized but potentially slow PostgreSql server in case of one or many microservice running resource intensive queries impacting others microservices.

My question is about your recommendation if I decide to migrate from PostgreSql to YugaByte DB.

Should I have one centralized installation of YugaByte DB server with one database for each microservice?
OR is the architecture that I followed with PostgreSql possible/recommendable with YugaByte DB?

Thank you for your help.

dorian_yugabyte · May 22, 2020, 12:51pm

Hi @Alex_Arica,

How much data do you have in all the postgresql instances ?
And what size are they in terms of ram/cpu/ssd-disks ?

Alex_Arica · May 22, 2020, 2:10pm

Hi @dorian_yugabyte,

Thank you for your message.

We have 40 instances of PostgreSql with about 30 GB of data in total. Some instances have less than 50MB of data when some have about 1GB. They are stored on SSD disks.

Some instances run with 256MB RAM when some others run with 2GB. In total I would say 20GB of RAM are used.

The CPU usage is usually low. But we have some increases to 20% CPU for some instances for short times based on specific events.

Also note that some microservices are caching heavily and consequently the load on PostgreSql is lower than the demand. If we move to YugaByte DB, we are planning to remove the caching logics from our microservices.

The services run on Kubernetes on 2 data-centers per region. Each data-center has 12 nodes. On each region, the primary data-center hosts the master PostgreSql instances when the secondary data-center hosts the standby/slave PostgreSql instances.

Please let me know if you have more questions.

dorian_yugabyte · May 22, 2020, 2:51pm

Usually you try to architect based on current needs + estimated (real estimation, not google scale) growth. Even 100x growth, 1 cluster will be able to support your needs.

Check hardware requirements regarding server size.

We’ll help with schema/queries design so each table/db will be able to scalable.

This will also depend what you’re exactly caching. If it’s single rows, then yes it will work great. While if it’s complex computations (cpu intensive), then you may still need to cache (and even cache in yugabytedb).

It would be better to have 3 regions when replication factor is 3. Loosing any region the cluster will still be live. While having 2 regions it will not because it can’t form a majority consensus. (see the link above)

Alex_Arica · May 22, 2020, 3:29pm

Hi @dorian_yugabyte,

Thank you for your help.

Let me resume your recommendations:

If a cached query is CPU intensive, the caching can be moved from the services to Yugabyte DB as long as it is cached using the YEDIS caching API,
If a cached query is not CPU intensive and returns single rows, then YugaByte DB can handle it via its YSQL API,
to benefit from the multi-clusters capability of YugaByte DB, I must have at least 3 clusters as required by the RAFT consensus algo
I should use one unique installation of YugaByte DB for all microservices and I can create one database for each microservice

In regards to the last point, I would say that for our company it is very convenient for each microservice to have their specific instances of PostgreSql. Developers owning a microservice are able to upgrade a version of PostgreSql without impacting other services. It seems like with YugaByte DB it won’t be possible since we will be using a centralised installation of it.

Saying that, the rolling upgrade capability of YugaByte DB with Kubernetes seems to be a reliable. But I am not sure if we are comfortable (yet) with having to upgrade a centralised system for 40 microservices.

Please let me know if I interpreted you wrongly.

sid.choudhury · May 22, 2020, 3:48pm

YEDIS is not a caching API – it is fully-distributed, strongly-consistent, persistent key-value database that simply happens to speak the Redis wire protocol (and has limited support for the Redis command library). It is not under active development and hence we do not recommend for new use cases YEDIS | YugabyteDB Docs

sid.choudhury · May 22, 2020, 3:50pm

And nothing stops you from going to a fully-decentralized database architecture where each microservice has its own YugabyteDB cluster. You simply have to account for the additional administration overhead given that you are no longer running a single-node PostgreSQL but rather running a minimum 3-node distributed database.

Alex_Arica · May 22, 2020, 3:59pm

Hi @sid.choudhury,

Thank you for your messages. Very helpful.

dorian_yugabyte · May 22, 2020, 4:05pm

You can use the YCQL api (because it supports TTL) for caching or even YSQL if you have invalidating logic.

Yes. For region failover, you must also have 3 regions.

This will be the most efficient. You can create a database for each microservice on a single cluster.

It is possible. You can create a cluster for each microservice.

We support rolling upgrades. But you can use as many clusters as you want.

Alex_Arica · May 22, 2020, 4:11pm

HI @dorian_yugabyte,

Thank you very much for your help. Much appreciated. Very helpful.

Next week I will start testing YugaByte DB with few of our services.

Topic		Replies	Views
Introducing YugaByte DB General	1	1339	September 15, 2018
How to create a multi-cluster deployment on Kubernetes? General	10	2864	June 5, 2019
YugaByte DB for instant messaging app General	9	921	July 25, 2022
Yugabyte and Zabbix or Nagios Design Discussions	5	728	January 4, 2023
Yugabyte vs MySQL General	13	3780	October 2, 2019

Microservices approach: a centralized YugaByte DB for all services or one per service?

Related topics