From the above points, I would love to have a Java library published that can be brought into projects/applications that would like to digest changes in yugabyte in real time. And I’m happy to help in getting that fully fleshed out. I’ve looked through the current Java code and have a pretty good idea on how that can currently be used to satisfy my goals, but it’s currently unpublished in Maven, which makes me think that it’s likely to change substantially.
Having access to a dependency like a
yugabytedb-cdc-subscriber could be really nice.
I’ve viewed a recent presentation from @neha about CDC and noticed that a large percentage of it was devoted to CDC particularly in relation to cross DC replication (I’m also eager to use it for this feature).
I have two initial questions to help me in exploring the codebase and better understanding the the implementation around CDC so far. Is it possible for me to see the code that was written for the Yugabyte to Kafka connector jar that’s in this codebase? And, which area/files of the C++ implementation of Yugabyte should I look in if I want to better understand how the CDC producer process works? (I’ve been poking around the tablet server code so far, but don’t believe I’ve found the meat of it).