Popularity_predictor

We have created a web app using Streamlit and deployed a custom machine learning model that can predict whether any GitHub repository is safe to consume or not just by passing the URL. Here, we generate a score based on the data we scrapped from some famous and random repositories on GitHub.

Folder Notebooks contains data and script to extract data, analysis of data or the model creation code.
We have used github api and Kaggle to collect the github data stored in the file github_api.csv and kaggle_data.csv respectively which has columns repo_name, star, fork, watch, issue, tags, most_used_lang, discription, contributors, license, and repo_url.
data_extraction.ipynb file contains script to extract the information from repositories, analysis.ipynb file contains cleaning and visualization operations on the dataset. model.ipynb building a machine learning model that can predict which repositories will gain how much stars in the future. 😃

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.streamlit		.streamlit
Notebooks		Notebooks
media		media
myEnviroment		myEnviroment
pretrained_model		pretrained_model
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirement.txt		requirement.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Popularity_predictor

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

sarthak31122000/Popularity_predictor

Folders and files

Latest commit

History

Repository files navigation

Popularity_predictor

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages