Open Data Platform
An open, developer-first data platform that combines orchestration, lakehouse processing, governance, BI, and observability in one stack.
Core Capabilities
Medallion Architecture
Ingest and transform data through Bronze, Silver, and Gold layers with both Spark-compatible and Postgres/dbt transformation paths.
Full Observability
Integrated Prometheus metrics, Grafana dashboards, Loki logs, and Tempo traces with OpenTelemetry instrumentation.
Deploy Anywhere
Run locally with Docker Compose, scale with Kubernetes (kind), or deploy to Azure AKS and Scaleway with Terraform.
Architecture
Three planes — Operator, Control, and Data — connected through a React launchpad that links all platform surfaces.
Technology Stack
Built on battle-tested open source technologies.
Quick Start
Get the full platform running locally in three commands.
# Clone and configure
$ cp .env.template .env
# Bootstrap the full stack
$ ./scripts/platform/bootstrap_all.sh --auto-fill-env
# Open the platform launchpad
$ open http://localhost:3000 See the full Development Guide for prerequisites and advanced setup.
Documentation
Everything you need to understand, deploy, and extend the platform.
Architecture
Component-level architecture, data flows, and integration points.
Data Model
Medallion layer entities, warehouse schema, and data lineage.
Deployment
Docker Compose, Kubernetes, AKS, and Scaleway deployment guides.
Development
Local setup, workflows, Make targets, and troubleshooting.
Ingestion Guide
Step-by-step guide for adding new data sources to the platform.
Governance
Operating model, ownership, data quality, and lifecycle management.
CI/CD
Pipeline decisions, runbooks, required checks, and assumptions.
Testing
E2E testing strategy, dataset contracts, and QA framework.