Predicting Kickstarter project success with Machine Learning

_{Image from http://clipart-library.com/}

Introduction

Kickstarter, founded in 2009, is a crowdfunding platform where project creators can raise money from the public, circumventing traditional avenues of investment. It has an all-or-nothing funding model, whereby a project is only funded if it meets its goal amount; otherwise no money is given by backers to a project.

A huge variety of factors contribute to the success or failure of a project on Kickstarter. Some of these factors are able to be quantified or categorized, which allows machine learning models to predict whether a project will succeed or not.

The goal of this project is to predict if a Kickstart project will succeed or fail through using Exploratory Data Analysis and supervised Machine Learning models.

More generally, the aim is to help potential project creators as well as potential investors assess what their chances of success on Kickstarter will be.

About the data

The dataset contains data on all projects hosted on Kickstarter between the company’s launch in April 2009 until the date of the webscrape on March 14, 2019. The dataset contains 209222 projects.

Requirements and Environment

Requirements:

pyenv with Python: 3.9.8

Environment:

For installing the virtual environment you can either use the Makefile and run make setup or install it manually with the following commands:

pyenv local 3.9.8
python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt

About this repo

In this repository you will find:

column_names.md : Descriptions of the columns of the dataset
1_Intro_and_Data_Cleaning.ipynb: Jupyter notebook for introduction to this project, importing the data, combining the csv files and cleaning the data
2_EDA_and_Data_Visualization.ipynb : Jupyter notebook for Exploratory Data Analysis (EDA) and data visualization
3_Modeling.ipynb : Jupyter notebook with machine learning model implementation and error analysis
Kickstarter_Project_Slides.pdf: Final presentation of the project

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting Kickstarter project success with Machine Learning

Introduction

About the data

Requirements and Environment

Requirements:

Environment:

About this repo

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
assets		assets
data		data
.gitignore		.gitignore
1_Intro_and_Data_Cleaning.ipynb		1_Intro_and_Data_Cleaning.ipynb
2_EDA_and_Data_Visualization.ipynb		2_EDA_and_Data_Visualization.ipynb
3_Modeling.ipynb		3_Modeling.ipynb
Kickstarter_Project_Slides.pdf		Kickstarter_Project_Slides.pdf
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
column_names.md		column_names.md
requirements.txt		requirements.txt

License

suleenwong/Predicting-crowdfunding-success

Folders and files

Latest commit

History

Repository files navigation

Predicting Kickstarter project success with Machine Learning

Introduction

About the data

Requirements and Environment

Requirements:

Environment:

About this repo

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages