

Streaming Cassandra at WePay - Part 2

This post originally appeared on the WePay Engineering blog.

In the first half of this blog post series, we explained our decision-making process of designing a streaming data pipeline for Cassandra at WePay. In this post, we will break down the pipeline into three sections and discuss each of them in more detail:

  1. Cassandra to Kafka with CDC agent

  2. Kafka to BigQuery with KCBQ

  3. Transformation with BigQuery view

Cassandra to Kafka with CDC Agent

The Cassandra CDC agent is a JVM process that is intended to be deployed on each node in a Cassandra cluster. The agent is composed of several interdependent processors that run concurrently and work together to publish change events to Kafka.

Snapshot Processor

This processor is responsible for bootstrapping new tables. It looks up the CDC configuration to determine the snapshot mode, and performs a snapshot of CDC-enabled tables if needed. To snapshot a table, the agent performs a full table scan, converts each row in the result set into an individual create event, and then sequentially enqueues these events into an in-memory BlockingQueue.
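For illustration, a snapshot pass built on the DataStax Java driver could look roughly like the sketch below. This is not the connector's actual code; the ChangeEvent class and the shared queue are hypothetical stand-ins for the agent's internal structures.

import com.datastax.driver.core.ColumnDefinitions;
import com.datastax.driver.core.ResultSet;
import com.datastax.driver.core.Row;
import com.datastax.driver.core.Session;
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.BlockingQueue;

public class SnapshotProcessorSketch {

    // Hypothetical change-event holder; the real agent uses a richer structure.
    public static final class ChangeEvent {
        public final String table;
        public final Map<String, Object> columns;
        public final long timestampMicros;

        public ChangeEvent(String table, Map<String, Object> columns, long timestampMicros) {
            this.table = table;
            this.columns = columns;
            this.timestampMicros = timestampMicros;
        }
    }

    static void snapshotTable(Session session, String keyspace, String table,
                              BlockingQueue<ChangeEvent> queue) throws InterruptedException {
        // Full table scan; each row in the result set becomes an individual create event.
        ResultSet rows = session.execute("SELECT * FROM " + keyspace + "." + table);
        for (Row row : rows) {
            Map<String, Object> columns = new HashMap<>();
            for (ColumnDefinitions.Definition def : row.getColumnDefinitions()) {
                columns.put(def.getName(), row.getObject(def.getName()));
            }
            // Enqueue sequentially into the shared in-memory queue.
            queue.put(new ChangeEvent(keyspace + "." + table, columns,
                    System.currentTimeMillis() * 1000));
        }
    }
}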

Commit Log Processor

This processor is responsible for watching the CDC directory for new commit logs, parsing the commit log files via Cassandra’s CommitLogReader, transforming deserialized mutations into standardized change events, and finally enqueuing them to the same queue as the snapshot processor.
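A minimal sketch of the directory-watching portion is shown below, using java.nio's WatchService; the actual deserialization via CommitLogReader is reduced to a placeholder method, and the CDC directory path is an assumption.

import java.nio.file.FileSystems;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardWatchEventKinds;
import java.nio.file.WatchEvent;
import java.nio.file.WatchKey;
import java.nio.file.WatchService;

public class CommitLogWatcherSketch {

    public static void watchCdcDir(Path cdcDir) throws Exception {
        WatchService watcher = FileSystems.getDefault().newWatchService();
        // Commit log segments are moved into the CDC directory when they are flushed.
        cdcDir.register(watcher, StandardWatchEventKinds.ENTRY_CREATE);

        while (true) {
            WatchKey key = watcher.take(); // blocks until a new file shows up
            for (WatchEvent<?> event : key.pollEvents()) {
                if (event.kind() != StandardWatchEventKinds.ENTRY_CREATE) {
                    continue; // ignore overflow events in this sketch
                }
                Path newFile = cdcDir.resolve((Path) event.context());
                if (newFile.getFileName().toString().startsWith("CommitLog")) {
                    processCommitLog(newFile);
                }
            }
            key.reset();
        }
    }

    private static void processCommitLog(Path commitLog) {
        // Placeholder: this is where Cassandra's CommitLogReader would deserialize
        // mutations, which are then transformed into change events and enqueued
        // onto the same BlockingQueue used by the snapshot processor.
        System.out.println("New commit log segment: " + commitLog);
    }

    public static void main(String[] args) throws Exception {
        watchCdcDir(Paths.get("/var/lib/cassandra/cdc_raw")); // assumed CDC directory
    }
}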

At this point, some readers may have concerns about running the Snapshot Processor and the Commit Log Processor concurrently rather than serially. The reason this is safe is that Cassandra uses a client-side timestamp to determine event order and resolves conflicts with last write wins. This client-side timestamp is deliberately stored in each change event. This is why snapshotting doesn’t have to precede commit log processing: the ordering is determined later, when the data is queried in the data warehouse.

Queue Processor

This processor is responsible for dequeuing change events, transforming them into Avro records, and sending them to Kafka via a Kafka producer. It also tracks the position of the most recently sent event, so that on restart it is able to pick up from where it left off.
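A hedged sketch of the dequeue-and-publish loop follows, reusing the hypothetical ChangeEvent type from the snapshot sketch above. The broker address, the Avro serialization, and the persistence of the commit log position are assumptions reduced to placeholders.

import java.util.Properties;
import java.util.concurrent.BlockingQueue;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.Producer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;

public class QueueProcessorSketch {

    public static void run(BlockingQueue<SnapshotProcessorSketch.ChangeEvent> queue,
                           String topicPrefix) throws InterruptedException {
        Properties props = new Properties();
        props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // assumed broker
        props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG,
                "org.apache.kafka.common.serialization.StringSerializer");
        props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG,
                "org.apache.kafka.common.serialization.ByteArraySerializer");

        try (Producer<String, byte[]> producer = new KafkaProducer<>(props)) {
            while (!Thread.currentThread().isInterrupted()) {
                SnapshotProcessorSketch.ChangeEvent event = queue.take(); // blocks until available
                byte[] value = serializeToAvro(event);                    // Avro serialization elided
                producer.send(new ProducerRecord<>(topicPrefix + event.table, event.table, value),
                        (metadata, exception) -> {
                            if (exception == null) {
                                // Record the position of the most recently sent event so the
                                // agent can resume from here after a restart.
                                markProcessed(event);
                            }
                        });
            }
        }
    }

    private static byte[] serializeToAvro(SnapshotProcessorSketch.ChangeEvent event) {
        return new byte[0]; // placeholder
    }

    private static void markProcessed(SnapshotProcessorSketch.ChangeEvent event) {
        // placeholder for persisting the processed commit log position
    }
}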

Implementing an in-memory queue in the CDC agent seems like overkill at first. Given that there is only a single thread doing the enqueue and another thread doing the dequeue, the performance boost is negligible. The motivation here is to decouple the work of parsing commit logs, which must be done serially in the right order, from the work of serializing and publishing Kafka events, which can be parallelized across multiple threads for different tables. Although such parallelization is not implemented at the moment, we want the flexibility to add this feature in the near future.

Some may also wonder why Kafka Connect is not used here, as it seems like a natural fit for streaming. It would be a great option if we wanted distributed parallel processing with fault tolerance. However, it is more complicated to deploy, monitor, and debug than a plain Kafka producer. For the purpose of building a minimum viable infrastructure, we chose the Kafka producer at the time.

Schema Processor

In order to support automatic schema evolution, this processor periodically polls the database for the latest table schema and updates the in-memory schema cache if a change is detected. The Snapshot Processor and Commit Log Processor both look up the table schema in this cache and attach it to each change event prior to enqueue. Upon dequeue, the Queue Processor transforms the attached table schema into an Avro schema for record serialization.
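Conceptually, the schema cache can be as simple as the following sketch: a scheduled task periodically refreshes a concurrent map keyed by table name, which the other processors read from. The fetchSchema and cdcEnabledTables calls are hypothetical placeholders for queries against Cassandra's schema metadata.

import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

public class SchemaProcessorSketch {

    // table name -> latest known schema (represented as a plain string for brevity)
    private final Map<String, String> schemaCache = new ConcurrentHashMap<>();
    private final ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();

    public void start(long pollIntervalSeconds) {
        scheduler.scheduleAtFixedRate(this::refresh, 0, pollIntervalSeconds, TimeUnit.SECONDS);
    }

    private void refresh() {
        for (String table : cdcEnabledTables()) {
            String latest = fetchSchema(table);
            // Only replace the cached entry when a change is detected.
            schemaCache.merge(table, latest, (old, fresh) -> old.equals(fresh) ? old : fresh);
        }
    }

    /** Looked up by the Snapshot and Commit Log processors before enqueuing a change event. */
    public String lookup(String table) {
        return schemaCache.get(table);
    }

    // Placeholders for the parts that would query Cassandra's schema metadata.
    private List<String> cdcEnabledTables() { return List.of("ks.table_a"); }
    private String fetchSchema(String table) { return "schema-of-" + table; }
}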

Commit Log Post Processor

This processor is responsible for cleaning up commit logs after they have been processed. The default Commit Log Post Processor implementation simply performs deletion. A custom Commit Log Post Processor can be configured for use cases such as archiving commit log files to S3 or GCS.
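A sketch of what a pluggable post processor might look like is shown below, with deletion as the default and a local archiving variant standing in for an S3/GCS upload; the interface name is illustrative, not the connector's actual API.

import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Illustrative contract: invoked once a commit log segment has been fully processed. */
interface CommitLogPostProcessor {
    void handle(Path processedCommitLog) throws IOException;
}

/** Default behaviour: simply delete the processed segment to free up CDC disk space. */
class DeletingPostProcessor implements CommitLogPostProcessor {
    @Override
    public void handle(Path processedCommitLog) throws IOException {
        Files.deleteIfExists(processedCommitLog);
    }
}

/** Alternative: move the segment into an archive directory, standing in for an S3/GCS upload. */
class ArchivingPostProcessor implements CommitLogPostProcessor {
    private final Path archiveDir;

    ArchivingPostProcessor(Path archiveDir) {
        this.archiveDir = archiveDir;
    }

    @Override
    public void handle(Path processedCommitLog) throws IOException {
        Files.createDirectories(archiveDir);
        // In a real setup the file would be uploaded to S3 or GCS here before local removal.
        Files.move(processedCommitLog, archiveDir.resolve(processedCommitLog.getFileName()),
                StandardCopyOption.REPLACE_EXISTING);
    }
}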

Kafka to BigQuery with KCBQ

Once the events arrive in Kafka, we use KCBQ to send the event data to BigQuery without performing special transformations, just like in our MySQL streaming data pipeline. We have written a previous blog post explaining this connector in more detail.

Transformation with BigQuery View

Once the events are in BigQuery, this is where the heavy lifting is done. We create virtual views on top of the raw tables to merge the data in a way that mirrors the source table in Cassandra. Note that each row in the raw tables contains limited data: only columns that have been modified carry state. This means that selecting the latest row for each primary key will not produce data that is consistent with the source. Instead, the query must identify the latest cell of each column for each primary key. This can be achieved with self-joins on the primary key, one for each column in the table. Although such joins would be slow in MySQL, BigQuery’s parallel execution engine and columnar storage make this feasible: a view on top of a 1TB Cassandra table in BigQuery takes about 100 seconds to query.

Compaction

Because the BigQuery view is virtual, each query of the view essentially triggers a full compaction of the raw data. This means the cost grows with the number of queries, and the duplicated events amplify the amount of data that needs to be processed by a factor of N, where N is the replication factor. To save cost and improve performance, periodic compaction by materializing the view is necessary.

Future Development Work

Support for Cassandra 4.0

In Cassandra 4.0, the improved CDC feature allows the connector to parse events in real time as they are written, rather than in micro-batches on each commit log flush. This reduces latency substantially.

Performance Optimization

As mentioned earlier, a single thread is responsible for dequeuing, serializing, and publishing Kafka records. If the agent cannot keep up as write throughput increases, a backlog of unprocessed commit logs will build up, which could eventually impact the health of our production database. The next step is to leverage parallel processing of events to optimize performance.

Streamline with Debezium and Kafka Connect

We initially built the Cassandra CDC agent as a standalone project. Now that it is open-sourced as a Debezium connector, we can replace some of our custom classes with existing ones in Debezium. Another improvement is to support common features that all Debezium connectors have, such as support for multiple serialization formats. Finally, the CDC agent is not fault tolerant, so robust alerting and monitoring are required as part of deployment. One area to explore in the future is building the CDC agent on top of Kafka Connect as a source connector; this would further align the Cassandra connector with the other Debezium connectors and provide scalability and fault tolerance for free.

Closing Remarks

Cassandra, being a peer-to-peer distributed database, poses some really interesting challenges for CDC that do not exist in relational databases like MySQL and Postgres, or even in a single-master NoSQL database like MongoDB. It is worth evaluating these limitations before rolling out your own real-time data pipeline for Cassandra.

Besides understanding Cassandra internals, we learned a few lessons on engineering productivity along the way:

Minimum Viable Product Philosophy

By stripping away all features except for the essentials, we were able to build, test, and deploy a working solution in a reasonable time with limited resources. Had we aimed to design a pipeline that encompasses all features upfront, it would have taken a lot longer and required much more resources.

Community Involvement

Cassandra is an open-source project. Rather than tackling the problem solo, we engaged with the Cassandra community from the very start (sharing experiences with committers and users via meetups, discussing proposals on the mailing list, presenting proofs of concept at conferences, etc.), all of which provided us with valuable feedback throughout the design and implementation stages.


Streaming Cassandra at WePay - Part 1

This post originally appeared on the WePay Engineering blog.

Historically, MySQL had been the de facto database of choice for microservices at WePay. As WePay scales, the sheer volume of data written into some of our microservice databases forced us to make a scaling decision between sharded MySQL (i.e. Vitess) and a natively sharded NoSQL database. After a series of evaluations, we picked Cassandra, a NoSQL database, primarily because of its high availability, horizontal scalability, and ability to handle high write throughput.

Batch ETL Options

After introducing Cassandra to our infrastructure, our next challenge was to figure out a way to expose data in Cassandra to BigQuery, our data warehouse, for analytics and reporting. We quickly built an Airflow hook and operator to execute full loads. This obviously doesn’t scale, as it rewrites the entire database on each load. To scale the pipeline, we evaluated two incremental load approaches, but both have their shortcomings:

  1. Range query. This is a common ETL approach where data is extracted via a range query at regular intervals, such as hourly or daily. Anyone familiar with Cassandra data modelling would quickly realize how unrealistic this approach is. Cassandra tables need to be modeled to optimize the query patterns used in production. Adding this query pattern for analytics in most cases means cloning the table with different clustering keys. RDBMS folks might suggest a secondary index to support this query pattern, but secondary indexes in Cassandra are local, so this approach would pose performance and scaling issues of its own.

  2. Process unmerged SSTables. SSTables are Cassandra’s immutable storage files. Cassandra offers an sstabledump CLI command that converts SSTable content into human-readable JSON. However, Cassandra is built on the concept of a Log-Structured Merge (LSM) tree, meaning SSTables are periodically merged into new compacted files. Depending on the compaction strategy, detecting unmerged SSTable files out-of-band may be challenging. (We later learned about the incremental backup feature in Cassandra, which only backs up uncompacted SSTables, so this approach could have worked as well.)

Given these challenges, and having built and operated a streaming data pipeline for MySQL, we began to explore streaming options for Cassandra.

Streaming Options

Double-Writing

[Image: the writer sends two distinct writes, one to Cassandra and one to Kafka]

The idea is to publish to Kafka every time a write is performed on Cassandra. This double-writing could be performed via the built-in trigger mechanism or a custom wrapper around the client. There are performance problems with this approach. First, because we now need to write to two systems instead of one, write latency increases. More importantly, when a write to one system fails due to a timeout, it is indeterminate whether the write succeeded. To guarantee data consistency across both systems, we would have to implement distributed transactions, but multiple round trips for consensus would increase latency and reduce throughput further. This defeats the purpose of a high-write-throughput database.
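For illustration only, a client-side wrapper that performs the double write might look like the sketch below; the session and producer fields stand in for real Cassandra and Kafka clients, and the comment marks the window where the two systems can diverge.

import com.datastax.driver.core.Session;
import org.apache.kafka.clients.producer.Producer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class DoubleWritingClientSketch {

    private final Session cassandraSession;                // DataStax driver session
    private final Producer<String, String> kafkaProducer;  // Kafka producer

    public DoubleWritingClientSketch(Session session, Producer<String, String> producer) {
        this.cassandraSession = session;
        this.kafkaProducer = producer;
    }

    public void write(String cql, String topic, String changeEventJson) {
        // Write 1: the source of truth.
        cassandraSession.execute(cql);

        // If the process crashes here, or the Kafka write below times out, Cassandra and
        // Kafka diverge and there is no cheap way to know whether the event was published.
        // Guaranteeing consistency would require a distributed transaction across both systems.

        // Write 2: the change event for downstream consumers.
        kafkaProducer.send(new ProducerRecord<>(topic, changeEventJson));
    }
}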

Kafka as Event Source

[Image: writes are sent to Kafka and then applied to the downstream database]

The idea is to write to Kafka rather than directly to Cassandra, and then apply the writes to Cassandra by consuming the events from Kafka. Event sourcing is a pretty popular approach these days. However, if you already have existing services writing directly to Cassandra, this approach would require a change in application code and a nontrivial migration. It also violates read-your-writes consistency: the requirement that if a process performs a write, a subsequent read by the same process must observe the write’s effects. Since writes are routed through Kafka, there will be a lag between when a write is issued and when it is applied; during this time, reads from Cassandra will return stale data. This may cause unforeseeable production issues.

Parsing Commit Logs

[Image: commit log events are sent to Kafka]

Cassandra introduced a change data capture (CDC) feature in 3.0 to expose its commit logs. Commit logs are write-ahead logs in Cassandra, designed to provide durability in case of machine crashes; they are typically discarded upon flush. With CDC enabled, they are instead transferred to a local CDC directory upon flush, where they can be read by other processes on the Cassandra node. This allows us to use the same CDC mechanism as in our MySQL streaming pipeline. It decouples production operations from analytics, and thus does not require additional work from application engineers.

Ultimately, after considering throughput, consistency, and separation of concerns, the final option – parsing commit logs – became the top contender.

Commit Log Deep Dive

Aside from exposing commit logs, Cassandra also provides the CommitLogReader and CommitLogReadHandler classes to help with the deserialization of the logs. It seems like the hard work has already been done, and what’s left is applying transformations: converting the deserialized representations into Avro records and publishing them to Kafka. However, as we dug further into the implementation of the CDC feature and of Cassandra itself, we realized there were many new challenges.

Delayed Processing

Commit logs only arrive in the CDC directory once a commit log segment is full and gets flushed. This implies there is a delay between when an event is logged and when it is captured. If few or no writes are being executed, the delay in event capturing could be arbitrarily long.

Space Management

In MySQL, you can set a binlog retention period such that logs are automatically deleted after that period. In Cassandra, however, there is no such option. Once commit logs are transferred to the CDC directory, a consumer must be in place to clean them up after processing. If the disk space consumed by the CDC directory exceeds a given threshold, further writes to the database will be rejected.

Duplicated Events

Commit logs on an individual Cassandra node do not reflect all writes to the cluster; they only reflect writes to the node. This makes it necessary to process commit logs on all nodes. But with a replication factor of N, N copies of each event are sent downstream.

Out-of-Order Events

Writes to an individual Cassandra node are logged serially as they arrive. However, these events may arrive out of order relative to when they were issued. Downstream consumers of these events must understand the event time and implement last-write-wins logic, similar to Cassandra’s read path, to arrive at the correct result.
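The last-write-wins resolution itself is straightforward. Below is a minimal sketch, assuming each downstream event carries its primary key and the client-side write timestamp (the Event shape is hypothetical); the same logic also collapses the duplicate copies produced by replication, since identical events resolve to a single winner per key.

import java.util.HashMap;
import java.util.Map;

public class LastWriteWinsSketch {

    // Hypothetical shape of a downstream event: primary key, client-side write timestamp, payload.
    record Event(String primaryKey, long writeTimestampMicros, String payload) {}

    /** Keep only the latest event per primary key, mirroring Cassandra's read-path resolution. */
    static Map<String, Event> resolve(Iterable<Event> events) {
        Map<String, Event> latest = new HashMap<>();
        for (Event e : events) {
            latest.merge(e.primaryKey(), e,
                    (current, incoming) ->
                            incoming.writeTimestampMicros() >= current.writeTimestampMicros()
                                    ? incoming : current);
        }
        return latest;
    }
}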

Out-of-Band Schema Change

Table schema changes are communicated via a gossip protocol and are not recorded in the commit logs. Therefore, schema changes can only be detected on a best-effort basis.

Incomplete Row Data

Cassandra does not perform a read before write; as a result, change events do not capture the state of every column, only the state of modified columns. This makes a change event less useful than if the full row were available.

Once we acquired a deep understanding of Cassandra commit logs, we re-assessed our requirements against the given constraints in order to design a minimum viable infrastructure.

Minimum Viable Infrastructure

Borrowing from the minimum viable product philosophy, we want to design a data pipeline with a minimum set of features and requirements to satisfy our immediate customers. For Cassandra CDC, this means:

  • Production database’s health and performance should not be negatively impacted by introducing CDC; slowed operations and system downtimes are much costlier than a delay in the analytics pipeline

  • Querying Cassandra tables in our data warehouse should match the results of querying the production database (barring delays); having duplicate and/or incomplete rows amplifies the post-processing workload for every end user

With these criteria in front of us, we began to brainstorm solutions, and ultimately came up with three approaches:

Stateless Stream Processing

This solution is inspired by Datastax’s advanced replication blog post. The idea is to deploy an agent on each Cassandra node to process local commit logs. Each agent is considered “primary” for a subset of writes based on partition keys, such that every event has exactly one primary agent. During CDC, in order to avoid duplicate events, each agent only sends an event to Kafka if it is the primary agent for that event. To handle eventual consistency, each agent sorts events into per-table, time-sliced windows as they arrive (but doesn’t publish them right away); when a window expires, the events in that window are hashed, and the hash is compared against those of the other nodes. If the hashes don’t match, data is fetched from the inconsistent node so the correct value can be resolved by last write wins. Finally, the corrected events in that window are sent to Kafka. Any out-of-order event beyond the time-sliced windows would have to be logged into an out-of-sequence file and handled separately. Since deduplication and ordering are done in memory, concerns about agent failover causing data loss, OOM issues impacting the production database, and the overall complexity of this implementation stopped us from exploring it further.

Stateful Stream Processing

This solution is the most feature-rich. The idea is that the agent on each Cassandra node processes commit logs and publishes events to Kafka without deduplication or ordering. A stream processing engine then consumes these raw events and does the heavy lifting (such as filtering out duplicate events with a cache, managing event order with event-time windowing, and capturing the state of unmodified columns by performing a read before write against a state store), and publishes these derived events to a separate Kafka topic. Finally, KCBQ is used to consume events from this topic and upload them to BigQuery. This approach is appealing because it solves the problem generically: anyone can subscribe to the latter Kafka topic without needing to handle deduplication and ordering on their own. However, it introduces a nontrivial amount of operational overhead; we would have to maintain a stream processing engine, a database, and a cache.

Processing-On-Read

Similar to the previous approach, the idea is to process commit logs on each Cassandra node and send events to Kafka without deduplication or ordering. Unlike the previous approach, the stream processing portion is eliminated entirely. Instead, the raw events are uploaded directly to BigQuery via KCBQ, and views are created on top of the raw tables to handle deduplication, ordering, and the merging of columns into complete rows. Because BigQuery views are virtual tables, the processing is done lazily each time the view is queried. To prevent the view queries from getting too expensive, the views are materialized periodically. This approach removes both operational complexity and code complexity by leveraging BigQuery’s massively parallel query engine. The drawback is that non-KCBQ downstream consumers must do all of this work on their own.

Given that our main purpose of streaming Cassandra is data warehousing, we ultimately decided to implement processing-on-read. It provides the essential features for our existing use case, and offers the flexibility to expand into the other two more generic solutions mentioned above in the future.

Open Source

While building a real-time data pipeline for Cassandra, we received a substantial amount of interest in this project. As a result, we decided to open-source the Cassandra CDC agent under the Debezium umbrella as an incubating connector. If you would like to learn more or contribute, check out the work-in-progress pull request for the source code and documentation.

In the second half of this blog post series, we will elaborate on the CDC implementation itself in more detail. Stay tuned!


Tutorial for Adding Sentry into Debezium Container Images

Debezium has received a huge improvement to the structure of its container images recently, making it extremely simple to extend its behaviour.

This is a small tutorial showing how you can, for instance, add Sentry, "an open-source error tracking [software] that helps developers monitor and fix crashes in real time". Here we’ll use it to collect and report any exceptions from Kafka Connect and its connectors. Note that this is only applicable to Debezium 0.9+.

We need a few things to get Sentry working; we’ll add all of them below and then write a Dockerfile which glues it all together correctly:

  • Configure Log4j

  • SSL certificate for sentry.io, since it’s not in the JVM trusted chain by default

  • The sentry and sentry-log4j libraries

Log4j Configuration

Let’s create a file config/log4j.properties in our local project, which is a copy of the one shipped with the Debezium images, and add Sentry to it. Note that we added Sentry to log4j.rootLogger and created the section log4j.appender.Sentry; the rest remains the same as the original configuration:

kafka.logs.dir=logs

log4j.rootLogger=INFO, stdout, appender, Sentry

# Disable excessive reflection warnings - KAFKA-5229
log4j.logger.org.reflections=ERROR

log4j.appender.stdout=org.apache.log4j.ConsoleAppender
log4j.appender.stdout.threshold=INFO
log4j.appender.stdout.layout=org.apache.log4j.PatternLayout
log4j.appender.stdout.layout.ConversionPattern=%d{ISO8601} %-5p  %X{dbz.connectorType}|%X{dbz.connectorName}|%X{dbz.connectorContext}  %m   [%c]%n

log4j.appender.appender=org.apache.log4j.DailyRollingFileAppender
log4j.appender.appender.DatePattern='.'yyyy-MM-dd-HH
log4j.appender.appender.File=${kafka.logs.dir}/connect-service.log
log4j.appender.appender.layout=org.apache.log4j.PatternLayout
log4j.appender.appender.layout.ConversionPattern=%d{ISO8601} %-5p  %X{dbz.connectorType}|%X{dbz.connectorName}|%X{dbz.connectorContext}  %m   [%c]%n

log4j.appender.Sentry=io.sentry.log4j.SentryAppender
log4j.appender.Sentry.threshold=WARN

Sentry.io SSL certificate

Download the getsentry.pem file from sentry.io and put it in your project’s directory under ssl/.

The Dockerfile

Now we can glue everything together in our Debezium image:

  • Let’s first create a JKS file with our Sentry certificate; this uses a Docker multi-stage build, in which we generate a certificates.jks file that we later copy into our Kafka Connect with Debezium stage

  • Copy log4j.properties into $KAFKA_HOME/config/log4j.properties

  • Copy the JKS file from the multi-stage build

  • Set ENV with the Sentry version and md5sums

  • Download the Sentry dependencies. The docker-maven-download script is a helper which we ship by default in our images; in this case we’re using it to download a JAR file from Maven Central and put it in the Kafka libs directory. We do that by setting the ENV var MAVEN_DEP_DESTINATION=$KAFKA_HOME/libs:

FROM fabric8/java-centos-openjdk8-jdk:1.6 as ssl-jks

ARG JKS_STOREPASS="any random password, you can also set it outside via the arguments from docker build"

USER root:root

COPY /ssl /ssl

RUN chown -R jboss:jboss /ssl

USER jboss:jboss

WORKDIR /ssl

RUN keytool -import -noprompt -alias getsentry \
    -storepass "${JKS_STOREPASS}" \
    -keystore certificates.jks \
    -trustcacerts -file "/ssl/getsentry.pem"

FROM debezium/connect:0.10 AS kafka-connect

EXPOSE 8083

COPY config/log4j.properties "$KAFKA_HOME/config/log4j.properties"

COPY --from=ssl-jks --chown=kafka:kafka /ssl/certificates.jks /ssl/

ENV SENTRY_VERSION=1.7.23 \
    MAVEN_DEP_DESTINATION=$KAFKA_HOME/libs

RUN docker-maven-download \
        central io/sentry sentry "$SENTRY_VERSION" 4bf1d6538c9c0ebc22526e2094b9bbde && \
    docker-maven-download \
        central io/sentry sentry-log4j "$SENTRY_VERSION" 74af872827bd7e1470fd966449637a77

Build and Run

Now we can simply build the image:

$ docker build -t debezium/connect-sentry:1 --build-arg=JKS_STOREPASS="123456789" .

When running the image, we now have to configure our Kafka Connect application to load the JKS file by setting KAFKA_OPTS: -Djavax.net.ssl.trustStore=/ssl/certificates.jks -Djavax.net.ssl.trustStorePassword=<YOUR TRUSTSTORE PASSWORD>.

Sentry can be configured in many ways; I like to do it via environment variables. The minimum we can set is the Sentry DSN (which is necessary to point to your project) and the name of the actual running environment (e.g. production, staging).

In this case we can configure the variables: SENTRY_DSN=<GET THE DSN IN SENTRY’S DASHBOARD>, SENTRY_ENVIRONMENT=dev.

In case you’d like to learn more about using the Debezium container images, please check our tutorial.

And that’s it: a basic recipe for extending our Docker setup, using Sentry as an example; other modifications should be just as simple. For an example of how a RecordTooLarge exception from the Kafka producer looks in this setup, see the picture below:

[Image: Sentry exception example]

Conclusion

Thanks to the recent refactoring of the Debezium container images, it has become very easy to amend them with your own custom extensions. Downloading external dependencies and adding them to the images is now a trivial task, and we’d love to hear your feedback about it!

If you are curious about the refactoring itself, you can find the details in pull request debezium/docker-images#131.

About Debezium

Debezium is an open source distributed platform that turns your existing databases into event streams, so applications can see and respond almost instantly to each committed row-level change in the databases. Debezium is built on top of Kafka and provides Kafka Connect compatible connectors that monitor specific database management systems. Debezium records the history of data changes in Kafka logs, so your application can be stopped and restarted at any time and can easily consume all of the events it missed while it was not running, ensuring that all events are processed correctly and completely. Debezium is open source under the Apache License, Version 2.0.

Get involved

We hope you find Debezium interesting and useful, and want to give it a try. Follow us on Twitter @debezium, chat with us on Gitter, or join our mailing list to talk with the community. All of the code is open source on GitHub, so build the code locally and help us improve our existing connectors and add even more connectors. If you find problems or have ideas for how we can improve Debezium, please let us know or log an issue.


Debezium 0.10.0.Beta2 Released

It’s my pleasure to announce the release of Debezium 0.10.0.Beta2!

This further stabilizes the 0.10 release line, with lots of bug fixes to the different connectors. 23 issues were fixed for this release; a couple of those relate to the DDL parser of the MySQL connector, e.g. around RENAME INDEX (DBZ-1329), SET NEW in triggers (DBZ-1331) and function definitions with the COLLATE keyword (DBZ-1332).

For the Postgres connector we fixed a potential inconsistency when flushing processed LSNs to the database (DBZ-1347). Also, the "include.unknown.datatypes" option now works as expected during snapshotting (DBZ-1335), and the connector won’t stumble over materialized views during snapshotting any longer (DBZ-1345).

The SQL Server connector will use much less memory in many situations (DBZ-1065), and it is now configurable whether it should emit tombstone events for deletions or not (DBZ-835). The same option was also added to the Oracle connector, bringing consistency across all the connectors.

Note that this release can be used with Apache Kafka 2.x, but not with 1.x. This was an unintentional change and compatibility with 1.x will be restored for the Beta3 release (the issue to track is DBZ-1361).

Please refer to the 0.10.0.Beta2 release notes to learn more about all resolved issues and the upgrading procedure.

Many thanks to everybody from the Debezium community who contributed to this release: Cheng Pan, Guillaume Rosauro, Mariusz Strzelecki and Stathis Souris.



Debezium Wears Fedora

The Debezium project strives to provide an easy deployment of connectors, so users can try and run connectors of their choice mostly by getting the right connector archive and unpacking it into the plug-in path of Kafka Connect.

This is true for all connectors except the Debezium PostgreSQL connector. This connector is special in that it requires a logical decoding plug-in to be installed inside the PostgreSQL source database(s) themselves. Currently, there are two supported logical decoding plug-ins:

  • postgres-decoderbufs, which uses Protocol Buffers as a very compact transport format and which is maintained by the Debezium community

  • wal2json, which is based on JSON and which is maintained by its own upstream community

These plug-ins can be consumed and deployed in two ways; the easiest one is to use one of our pre-made Postgres container images, which contain both plug-ins and are already configured as required. If you are using containers in your datacenter, and/or if you start a fresh database from scratch, then this can be a great option.

The other approach is building from source. Even though this is usually an easy task, it still raises a barrier to getting started and requires non-trivial knowledge of the Linux operating system.

To bridge the gap between those two extremes we’ve created and published an RPM package, available for Fedora 30 and later. By installing this package you will have the necessary binaries deployed, and the only task remaining is to configure PostgreSQL to enable the plug-in. The RPM is based on the latest stable Debezium release, 0.9.5.Final at this point.

Example

Let’s show how the package works. We will use the Vagrant tool as an easy way of firing up a pre-provisioned virtual machine with Fedora. Of course, that’s not a requirement, and the same steps apply to any other way of running Fedora.

Create and start a virtual machine with Fedora 30:

$ vagrant init fedora/30-cloud-base

A `Vagrantfile` has been placed in this directory. You are now
ready to `vagrant up` your first virtual environment! Please read
the comments in the Vagrantfile as well as documentation on
`vagrantup.com` for more information on using Vagrant.

$ vagrant up

Bringing machine 'default' up with 'virtualbox' provider...
.
.
.
==> default: Machine booted and ready!

Log into the virtual machine:

$ vagrant ssh

Install the PostgreSQL server and Protocol Buffers logical decoding plug-in:

$ sudo dnf -y install postgresql postgres-decoderbufs
.
.
.
Installed:
  postgres-decoderbufs-0.9.5-1.fc30.x86_64              postgresql-11.3-1.fc30.x86_64
  postgis-2.5.1-1.fc30.x86_64                           armadillo-9.400.4-1.fc30.x86_64
  blas-3.8.0-12.fc30.x86_64                             cairo-1.16.0-5.fc30.x86_64
  cups-libs-1:2.2.11-2.fc30.x86_64                      fontconfig-2.13.1-8.fc30.x86_64
  lapack-3.8.0-12.fc30.x86_64                           libgfortran-9.1.1-1.fc30.x86_64
  libpq-11.3-2.fc30.x86_64                              libquadmath-9.1.1-1.fc30.x86_64
  mariadb-connector-c-3.0.10-1.fc30.x86_64              mariadb-connector-c-config-3.0.10-1.fc30.noarch
  nss-3.44.0-2.fc30.x86_64                              nss-softokn-3.44.0-2.fc30.x86_64
  nss-softokn-freebl-3.44.0-2.fc30.x86_64               nss-sysinit-3.44.0-2.fc30.x86_64
  nss-util-3.44.0-2.fc30.x86_64                         poppler-0.73.0-9.fc30.x86_64
  postgresql-server-11.3-1.fc30.x86_64                  proj-5.2.0-2.fc30.x86_64
  proj-datumgrid-1.8-2.fc30.noarch                      uriparser-0.9.3-1.fc30.x86_64
  SuperLU-5.2.1-6.fc30.x86_64                           arpack-3.5.0-6.fc28.x86_64
  atk-2.32.0-1.fc30.x86_64                              avahi-libs-0.7-18.fc30.x86_64
  cfitsio-3.450-3.fc30.x86_64                           dejavu-fonts-common-2.37-1.fc30.noarch
  dejavu-sans-fonts-2.37-1.fc30.noarch                  fontpackages-filesystem-1.44-24.fc30.noarch
  freexl-1.0.5-3.fc30.x86_64                            fribidi-1.0.5-2.fc30.x86_64
  gdal-libs-2.3.2-7.fc30.x86_64                         gdk-pixbuf2-2.38.1-1.fc30.x86_64
  gdk-pixbuf2-modules-2.38.1-1.fc30.x86_64              geos-3.7.1-1.fc30.x86_64
  giflib-5.1.9-1.fc30.x86_64                            graphite2-1.3.10-7.fc30.x86_64
  gtk-update-icon-cache-3.24.8-1.fc30.x86_64            gtk2-2.24.32-4.fc30.x86_64
  harfbuzz-2.3.1-1.fc30.x86_64                          hdf5-1.8.20-6.fc30.x86_64
  hicolor-icon-theme-0.17-5.fc30.noarch                 jasper-libs-2.0.14-8.fc30.x86_64
  jbigkit-libs-2.1-16.fc30.x86_64                       lcms2-2.9-5.fc30.x86_64
  libXcomposite-0.4.4-16.fc30.x86_64                    libXcursor-1.1.15-5.fc30.x86_64
  libXdamage-1.1.4-16.fc30.x86_64                       libXfixes-5.0.3-9.fc30.x86_64
  libXft-2.3.2-12.fc30.x86_64                           libXi-1.7.9-9.fc30.x86_64
  libXinerama-1.1.4-3.fc30.x86_64                       libaec-1.0.4-1.fc30.x86_64
  libdap-3.20.3-1.fc30.x86_64                           libgeotiff-1.4.3-3.fc30.x86_64
  libgta-1.0.9-2.fc30.x86_64                            libjpeg-turbo-2.0.2-1.fc30.x86_64
  libkml-1.3.0-19.fc30.x86_64                           libspatialite-4.3.0a-11.fc30.x86_64
  libtiff-4.0.10-4.fc30.x86_64                          libwebp-1.0.2-2.fc30.x86_64
  netcdf-4.4.1.1-12.fc30.x86_64                         nspr-4.21.0-1.fc30.x86_64
  ogdi-3.2.1-4.fc30.x86_64                              openblas-0.3.5-5.fc30.x86_64
  openblas-openmp-0.3.5-5.fc30.x86_64                   openblas-serial-0.3.5-5.fc30.x86_64
  openblas-threads-0.3.5-5.fc30.x86_64                  openblas-threads64_-0.3.5-5.fc30.x86_64
  openjpeg2-2.3.1-1.fc30.x86_64                         pango-1.43.0-3.fc30.x86_64
  pixman-0.38.0-1.fc30.x86_64                           poppler-data-0.4.9-3.fc30.noarch
  protobuf-c-1.3.1-2.fc30.x86_64                        unixODBC-2.3.7-4.fc30.x86_64
  xerces-c-3.2.2-2.fc30.x86_64

Complete!

Next, initialize the database:

$ sudo /usr/bin/postgresql-setup --initdb

Now enable the plug-in in the database server configuration file /var/lib/pgsql/data/postgresql.conf by adding the following parameters:

# MODULES
shared_preload_libraries = 'decoderbufs'

# REPLICATION
wal_level = logical             # minimal, archive, hot_standby, or logical (change requires restart)
max_wal_senders = 8             # max number of walsender processes (change requires restart)
wal_keep_segments = 4           # in logfile segments, 16MB each; 0 disables
#wal_sender_timeout = 60s       # in milliseconds; 0 disables
max_replication_slots = 4       # max number of replication slots (change requires restart)

Configure the security file /var/lib/pgsql/data/pg_hba.conf for the database user that will be used by Debezium (e.g. debezium) by adding these parameters:

local   replication     debezium                          trust
host    replication     debezium  127.0.0.1/32            trust
host    replication     debezium  ::1/128                 trust

Finally, restart PostgreSQL:

$ sudo systemctl restart postgresql

And that’s it: now we have a PostgreSQL database that is ready to stream changes to the Debezium PostgreSQL connector. Of course, the plug-in can also be installed into an already existing database (Postgres versions 9 and later), just by installing the RPM package and setting up the config and security files in the described way.

Outlook: pgoutput

While the decoderbufs plug-in is our recommended choice for a logical decoding plug-in, there are cases where you may not be able to use it. Most notably, you typically don’t have the flexibility to install custom plug-ins in cloud-based environments such as Amazon RDS.

This is why we’re currently exploring a third alternative to decoderbufs and wal2json, which is to leverage Postgres’ logical replication mechanism. There’s a built-in plug-in, pgoutput, based on this, which exists in every Postgres database since version 10. We’re still in the process of exploring the implications (and possible limitations) of using pgoutput, but so far things look promising, and it may eventually be a valuable tool to have in the box.

Stay tuned for more details coming soon!


