Data Platform Deployer (dpd) is a CLI tool for automatically generating configurations and deploying a data platform based on a declarative description.
- 📜 Configuration generation for Kafka, ClickHouse, PostgreSQL, S3, and more.
- 🚀 Automated deployment of the data platform.
- 🛠 Flexible configuration via JSON.
- 🏗 Supports Docker Compose and (future) Kubernetes.
Try dpd in your browser without installation:
👉 DPD Playground (Streamlit UI)
Here's the high-level component diagram of what dpd can deploy:
Install from PyPI:
pip install data-platform-deployer
Once installed, you can run dpd from the command line:
dpd --help
Generate configuration files
dpd generate --config config.yaml
Example config:
project:
  name: data-platform
  version: 1.0.0
  description: This is a test project
sources:
  - type: postgres
    name: postgres_1
  - type: postgres
    name: postgres_2
  - type: s3
    name: s3_1
streaming:
  kafka:
    num_brokers: 3
  connect:
    name: connect-1
storage:
  clickhouse:
    name: clickhouse-1
bi:
  superset:
    name: superset-1
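
Before running `dpd generate`, it can help to sanity-check a config for simple mistakes such as duplicate source names. The sketch below is not part of dpd: `check_config` is a hypothetical helper, and the rules it applies (unique source names, at least one Kafka broker) are assumptions about what a reasonable config looks like, not documented dpd validation. The dict mirrors the example config as a YAML loader would return it, assuming `connect` sits under `streaming` next to `kafka`.

```python
from collections import Counter

# The example config as a plain dict (the shape a YAML loader would produce).
# Assumption: `connect` is nested under `streaming`, alongside `kafka`.
config = {
    "project": {
        "name": "data-platform",
        "version": "1.0.0",
        "description": "This is a test project",
    },
    "sources": [
        {"type": "postgres", "name": "postgres_1"},
        {"type": "postgres", "name": "postgres_2"},
        {"type": "s3", "name": "s3_1"},
    ],
    "streaming": {
        "kafka": {"num_brokers": 3},
        "connect": {"name": "connect-1"},
    },
    "storage": {"clickhouse": {"name": "clickhouse-1"}},
    "bi": {"superset": {"name": "superset-1"}},
}


def check_config(cfg):
    """Hypothetical pre-flight check (not a dpd API); returns a list of problems."""
    problems = []

    # Each source should have a unique name so generated services do not collide.
    counts = Counter(src.get("name") for src in cfg.get("sources", []))
    for name, count in counts.items():
        if count > 1:
            problems.append(f"duplicate source name: {name}")

    # If a Kafka broker count is given, it should be a positive integer.
    brokers = cfg.get("streaming", {}).get("kafka", {}).get("num_brokers")
    if brokers is not None and (not isinstance(brokers, int) or brokers < 1):
        problems.append(f"invalid num_brokers: {brokers!r}")

    return problems


print(check_config(config))  # prints []
```

An empty list means none of these assumed checks failed; dpd itself may accept or reject configs on different grounds.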