Skip to content

Latest commit

 

History

History
25 lines (15 loc) · 728 Bytes

README.md

File metadata and controls

25 lines (15 loc) · 728 Bytes

PySpark with TDD

An example of how to implement test driven development with data science.

Motivation

Nowadays, data science projects have been increasing due to the demand in the commerce.

Setup

Environment

This project contains a setup for a virtual environment. To create and install libraries required on the virtual environment run:

make setup

Test

Configure pythonpath when running tests in Makefile with the path where tests are located.

All tests should be under folder test

For running tests:

make test

Useful resources

pytest: ModuleNotFoundError: No module named solution