Post Processors
Post processors perform lightweight, per-message mutations, similar to the modifications that are performed by single message transformations (SMTs). However, Debezium calls post processors earlier in the event chain than transformations, enabling post processors to act on messages before they are handed off to the messaging runtime. Because post processors can act on messages from within the Debezium context, they are more efficient at modifying event payloads than transformations.
For a transformation to modify a message, it must recreate the message’s immutable ConnectRecord
, or more aptly, its SourceRecord
.
By contrast, because a post processor acts within the Debezium scope, it can operate on mutable Struct
types in the event payload of a message, modifying payloads before the construction of the SourceRecord
.
Close integration with Debezium provides post processors with access to Debezium internals, such as Debezium metadata about database connections, relational schema model, and so forth.
In turn, this access enhances efficiency when performing tasks that rely on such internal information.
Debezium provides the following post processor implementations:
Post Processor | Description |
---|---|
Re-selects specific columns that may not have been provided by the change event, such as TOASTed columns or Oracle LOB columns that were not modified by the current event’s change. |
Post processors are designed to modify change event records emitted by Debezium source connectors only. |