DATABRICKS LAKEBASE ACCELERATOR

This project is designed to streamline the testing and deployment of customer OLTP workloads to Lakebase, Databricks' managed Postgres solution. It is particularly focused on supporting reverse ETL use cases. The accelerator provides an easy way for users to evaluate Lakebase and quickly get started with their POC and testing needs.

Prerequisites

  • Python 3.11
  • Databricks Workspace Requirements:
    • Unity Catalog enabled: CREATE CATALOG, USE CATALOG, CREATE SCHEMA permissions
    • Lakebase Service: CREATE DATABASE INSTANCE, USE DATABASE INSTANCE permissions
    • Delta Tables: For source data synchronization
    • Databricks SQL Warehouse: For table size calculations (optional but recommended)
    • Databricks SDK: For programmatic workspace access

Environment Setup

  1. Set up a Python virtual environment
# Install uv (if not already installed)
pip install uv
# or
curl -LsSf https://astral.sh/uv/install.sh | sh

Set up Python environment and install required packages

uv venv
source .venv/bin/activate
uv pip install -r requirements.txt
  2. Install the Databricks CLI from https://docs.databricks.com/dev-tools/cli/databricks-cli.html
$ brew tap databricks/tap
$ brew install databricks
$ databricks --version

Databricks CLI v0.267+ is required. If you have an older version, upgrade the CLI:

$ brew update && brew upgrade databricks && databricks --version | cat
  3. Authenticate to your Databricks workspace using OAuth Authentication (recommended), if you have not done so already:

    Configure OAuth:

    databricks auth login --host https://your-workspace.cloud.databricks.com --profile DEFAULT

    This will:

    • Open your browser for authentication
    • Create a profile in ~/.databrickscfg
    • Store OAuth credentials securely

    Verify Databricks Auth

    $ databricks auth profiles

Application Features

The project includes a full-stack web application for interactive workload configuration, cost estimation, and deployment automation using the Databricks Python SDK.

  • 🧮 Lakebase Calculator: Interactive cost estimation with real table size calculation
  • 🚀 Automatic Deployment: Direct deployment using Databricks Python SDK
  • 📁 Manual Deployment: Generate and download Databricks Asset Bundle files
  • 🧪 Concurrency Testing: Upload and execute SQL queries for performance testing

Option 1: Starting the Web Application on Databricks Apps (RECOMMENDED for production)

Follow the instructions in DEPLOY_WITH_DAB.md for more details on how to deploy Databricks Apps with Databricks Asset Bundles, or follow the Quick Deploy steps below.

Quick Deploy (All Steps)

# 1. Build frontend
./npm-build.sh

# 2. Deploy
databricks bundle validate
databricks bundle deploy
databricks bundle run lakebase_accelerator_app

# 3. Get URL
databricks apps get <your-app-name>
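
If you prefer a programmatic check, the following is a hedged sketch of looking up the deployed app's URL with the Databricks Python SDK (it assumes your installed SDK version exposes the Apps API; <your-app-name> is the same placeholder as in the CLI command above):

# Hedged sketch: fetch the deployed app's URL via the Databricks Python SDK.
# Replace <your-app-name> with the actual app name from your bundle.
from databricks.sdk import WorkspaceClient

w = WorkspaceClient(profile="DEFAULT")
app = w.apps.get(name="<your-app-name>")
print(app.url)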

Option 2: Starting the Web Application - self-hosted on a local machine (for development)

Ensure you have completed the Environment Setup and authenticated with Databricks CLI.

Then, from the project root directory, run:

# build frontend
./npm-build.sh

# Start the app
python app.py

The app will run at http://0.0.0.0:8000.

Authentication

When self-hosting on a local machine, authentication is handled via your Databricks CLI profiles, as set up in the Environment Setup section. The backend uses these CLI profiles to authenticate the Databricks Python SDK (WorkspaceClient) with your user credentials.
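
For reference, here is a minimal sketch of how the SDK picks up a CLI profile (the profile name DEFAULT matches the Environment Setup step; substitute your own profile name):

# Minimal sketch: authenticate the Databricks Python SDK with a CLI profile.
# "DEFAULT" is the profile created in the Environment Setup section.
from databricks.sdk import WorkspaceClient

w = WorkspaceClient(profile="DEFAULT")
print(w.current_user.me().user_name)  # prints the authenticated user's name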

When running the app on Databricks, the service principal assigned to the app performs all actions, so it may need the following permissions:

  • Database Instance Management (see Database instance ACLs)
  • Unity Catalog privileges, including CREATE CATALOG, USE CATALOG, and CREATE SCHEMA on the target catalog
  • SELECT on any source Delta tables to be synced
  • USE SCHEMA and CREATE TABLE on the storage catalog and schema for Lakeflow-synced Delta pipelines
  • databricks-superuser permission to query tables
  • "Allow unrestricted cluster creation" enabled
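
As a rough sketch of how an admin might grant some of the Unity Catalog privileges above to the app's service principal via the Python SDK (the catalog name "main" and the application ID are hypothetical placeholders; SQL GRANT statements work equally well):

# Hedged sketch: grant Unity Catalog privileges to the app's service principal.
# The catalog name "main" and the application ID are hypothetical placeholders.
from databricks.sdk import WorkspaceClient
from databricks.sdk.service import catalog

w = WorkspaceClient(profile="DEFAULT")
w.grants.update(
    securable_type=catalog.SecurableType.CATALOG,
    full_name="main",
    changes=[
        catalog.PermissionsChange(
            principal="<app-service-principal-application-id>",
            add=[catalog.Privilege.USE_CATALOG, catalog.Privilege.CREATE_SCHEMA],
        )
    ],
)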
