The purpose of this project is to conduct the exploratory data analysis (EDA) in banking using Pandas framework.
During this project we will do the following
- Explore a banking dataset with Pandas framework.
- Build pivot tables.
- Visualize the dataset with various plot types.
In this project, we will try to give answers to a set of questions that may be relevant when analyzing banking data:
- What is the share of clients attracted in our source data?
- What are the mean values of numerical features among the attracted clients?
- What is the average call duration for the attracted clients?
- What is the average age among the attracted and unmarried clients?
- What is the average age and call duration for different types of client employment?
Python Version:3.9.12
Packages:pandas,numpy,matplotlib
Data Source:https://archive.ics.uci.edu
The datasets used in this project were downloaded from https://archive.ics.uci.edu. I then read the csv files using the pd.read_csv() command.
The data did not require any type of cleaning. There were no missing values and the data was of the correct type.
I looked at different distributions for both the numeric and categorical data. Below are highlights from the data visualization section