
Debezium — Change Data Capture for Real-time Replication

20. 09. 2023 1 min read intermediate

Debezium captures database changes in real time. Every committed INSERT, UPDATE and DELETE is streamed to Kafka as a change event, with little extra load on the source database.

Change Data Capture

CDC reads the database's transaction log (the WAL in PostgreSQL, the binlog in MySQL) instead of querying the tables, so it adds little load to the source database.
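To make the idea of a log-based change event concrete, here is a minimal Python sketch of how a consumer might interpret one. The envelope fields (before, after, source, op, ts_ms) follow Debezium's event format; the sample event, its column names and the describe_change helper are hand-written illustrations, not output from a real connector. Whether before is populated for updates depends on the table's REPLICA IDENTITY setting in PostgreSQL.

```python
# Operation codes found in the "op" field of a Debezium change event.
OP_NAMES = {"c": "insert", "u": "update", "d": "delete", "r": "snapshot read"}


def describe_change(event: dict) -> str:
    """Summarise one change event envelope as a short readable string."""
    op = OP_NAMES.get(event["op"], "unknown")
    # Deletes carry the old row in "before"; other operations carry the new row in "after".
    row = event["before"] if event["op"] == "d" else event["after"]
    table = event["source"]["table"]
    return f"{op} on {table}: {row}"


# Hand-written sample event; column names and values are illustrative only.
sample_update = {
    "before": {"id": 1, "status": "new"},
    "after": {"id": 1, "status": "paid"},
    "source": {"db": "app", "schema": "public", "table": "orders"},
    "op": "u",
    "ts_ms": 1695200000000,
}

print(describe_change(sample_update))
```

The consumer never touches the source database here: everything it needs arrives inside the event.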

{
  "name": "postgres-cdc",
  "config": {
    "connector.class": "io.debezium.connector.postgresql.PostgresConnector",
    "database.hostname": "postgres",
    "database.dbname": "app",
    "topic.prefix": "cdc",
    "table.include.list": "public.orders",
    "plugin.name": "pgoutput",
    "transforms": "unwrap",
    "transforms.unwrap.type": "io.debezium.transforms.ExtractNewRecordState"
  }
}
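One way to deploy a configuration like the one above is to POST it to the Kafka Connect REST API. The Python sketch below only builds the request and does not send it. The endpoint URL, the user name and the password are placeholders, not values from this article, and the sketch adds database.port, database.user and database.password, which a real deployment would normally need so the connector can log in to PostgreSQL.

```python
import json
import urllib.request

# Assumed Kafka Connect REST endpoint; adjust to your deployment.
CONNECT_URL = "http://localhost:8083/connectors"


def build_connector_payload(user: str, password: str) -> dict:
    """Return the connector definition from the article plus connection credentials."""
    return {
        "name": "postgres-cdc",
        "config": {
            "connector.class": "io.debezium.connector.postgresql.PostgresConnector",
            "database.hostname": "postgres",
            "database.port": "5432",        # PostgreSQL default port (assumption)
            "database.user": user,          # placeholder, not from the article
            "database.password": password,  # placeholder, not from the article
            "database.dbname": "app",
            "topic.prefix": "cdc",
            "table.include.list": "public.orders",
            "plugin.name": "pgoutput",
            "transforms": "unwrap",
            "transforms.unwrap.type": "io.debezium.transforms.ExtractNewRecordState",
        },
    }


def build_register_request(payload: dict, url: str = CONNECT_URL) -> urllib.request.Request:
    """Build (but do not send) the POST request that creates the connector."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}, method="POST"
    )


def topic_name(prefix: str, schema: str, table: str) -> str:
    """Debezium names table topics <topic.prefix>.<schema>.<table>."""
    return f"{prefix}.{schema}.{table}"


request = build_register_request(build_connector_payload("debezium", "change-me"))
print(request.get_method(), request.full_url)
print(topic_name("cdc", "public", "orders"))
# To actually register the connector: urllib.request.urlopen(request)
```

With topic.prefix set to cdc, changes to public.orders should appear on the topic cdc.public.orders.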

CDC → Data Lake

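  • Debezium → Kafka: change events land in one topic per captured table
  • Flink/Spark → processing: a streaming job reads and transforms the events
  • Delta/Iceberg/Hudi → upsert: changes are merged into the lake table by primary key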
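The upsert step at the end of this pipeline can be sketched in a few lines of Python. This is a simplified illustration of the merge logic only: a plain dict stands in for the lake table, the apply_change helper and the id key column are hypothetical, and a real pipeline would express the same idea as a MERGE in Flink or Spark against Delta, Iceberg or Hudi. It assumes events for the same key are applied in the order they were produced.

```python
def apply_change(table: dict, event: dict, key: str = "id") -> None:
    """Apply one Debezium change event to an in-memory table keyed by primary key."""
    if event["op"] == "d":
        # Delete: remove the row identified by the old key, if it is present.
        table.pop(event["before"][key], None)
    else:
        # Insert, update or snapshot read: overwrite the row with its new state.
        row = event["after"]
        table[row[key]] = row


# Hand-written events in commit order; values are illustrative only.
events = [
    {"op": "c", "before": None, "after": {"id": 1, "status": "new"}},
    {"op": "c", "before": None, "after": {"id": 2, "status": "new"}},
    {"op": "u", "before": None, "after": {"id": 1, "status": "paid"}},
    {"op": "d", "before": {"id": 2}, "after": None},
]

orders = {}
for event in events:
    apply_change(orders, event)

print(orders)  # {1: {'id': 1, 'status': 'paid'}}
```

Because every write is keyed by primary key, replaying the same events leaves the table in the same final state, which is useful when a consumer restarts and re-reads part of a topic.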

Summary

Debezium is a de facto standard for CDC in the Kafka ecosystem. It provides near real-time replication with little added load on the source database.

debezium, cdc, replication, kafka connect

CORE SYSTEMS team

We build core systems and AI agents that keep operations running. 15 years of experience in enterprise IT.