
- Campbell
Stars
Open, Multi-modal Catalog for Data & AI
Machine Learning Engineering Open Book
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
Spark + HDFS cluster using docker compose
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Apache Drill is a distributed MPP query layer for self describing data