This project serves as a proof of concept for statistical data analysis automatization. The project uses Python and GoodData.CN. To understand what this project is about please first read this short article .
If you don't already have an instance of GoodData.CN up and running on your system.
Follow these steps:
-
Run the GoodData.CN docker image using the following command:
docker run -d -i -t -e LICENSE_AND_PRIVACY_POLICY_ACCEPTED=YES -p 3000:3000 -p 5432:5432 -v gd-volume:/data gooddata/gooddata-cn-ce:2.1
After few minutes your GoodData.CN will be ready.
-
Access your GoodData.CN deployment using your internet browser. The URL is http://localhost:3000, where you should see the login page.
-
Log in using the following credentials:
- Email Address = [email protected]
- Password = demo123
- Token (for REST APIs or Python SDK) = YWRtaW46Ym9vdHN0cmFwOmFkbWluMTIz
For more information you may check out docker hub.
We recommend you prepare a virtual environment and install all requirements there. If you want to use your existing Python requirement, feel free to skip to Step 3.
Follow these steps:
-
Create Python virtual environment:
python -m venv .venv
-
Activate the Python virtual environment:
source .venv/bin/activate
-
Install Python requirements:
python -m pip install -r requirements.txt
If you're new to GoodData.CN, we recommend you take a look at some videos on GoodData Developers YouTube channel, have quick run through the GoodData University GoodData.CN course, or just explore it on your own. :)
For checking proof of concept run the main.py
script:
python main.py
Please note that receiving data from GoodData.CN and checking assumptions may take some time.
For example on MacBook Pro (14-inch, 2021) with Python 3.10.8
CPU | Apple M1 Pro |
---|---|
RAM | 32 GB |
It took us 1 minute 5 seconds. :)