What is Change Data Capture
Tracking database changes
CDC (Change Data Capture) — Capturing Data Changes
CDC is a pattern for tracking and capturing database changes for real-time replication to other systems.
CDC Implementation Methods
| Method | Description | Pros/Cons | |--------|-------------|-----------| | Log-based | Reading WAL/binlog | Low overhead, reliable | | Trigger-based | DB triggers | Flexible, DB overhead | | Timestamp | By updated_at field | Simple, misses deletes | | Query-based | Periodic polling | Simple, high latency |
Popular Tools
- Debezium — open-source, Kafka Connect
- AWS DMS — managed AWS service
- Striim — enterprise solution
- Airbyte — ETL with CDC support
CDC Applications
- Microservices synchronization
- Data Warehouse replication
- Cache invalidation
- Event Sourcing
- Change auditing
Architecture Example
PostgreSQL → Debezium → Kafka → Consumers