Duke NLP Final Project

We collect data on seven different topics on twitter to build an LDA model as well as a discriminative Neural Network to benchmark their performance in topic classification.

Read our final report here.

Hashtags

crypto
tesla
championsleague
formula1
thanksgiving
holidays
covid19

Steps to Reproduce

1) Create & activate virtual environment

python3 -m venv venv
source venv/bin/activate

2) Install dependencies

make install

3) Collect data (takes long, please skip and use content in `data/`)

make data-collect

4) Train LDA model

cd src
python3 lda_modeling.py

5) Train Neural Network (Requires multiple hours on a GPU enabled device)

cd src
# tune hyperparameters
python3 tf_hyperparameter_tuning.py

Visually inspect results in visualize_study.ipynb

# run neural network with chosen params
cd src
python3 tf_train_model_with_best_params.py

Contributors

Name	Reference
Anna Dai	GitHub Profile
Satvik Kishore	GitHub Profile
Moritz Wilksch	GitHub Profile

Name		Name	Last commit message	Last commit date
Latest commit History 254 Commits
artefacts		artefacts
data		data
report		report
src		src
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Duke NLP Final Project

Hashtags

Steps to Reproduce

1) Create & activate virtual environment

2) Install dependencies

3) Collect data (takes long, please skip and use content in `data/`)

4) Train LDA model

5) Train Neural Network (Requires multiple hours on a GPU enabled device)

Contributors

About

Releases

Packages

Contributors 3

Languages

dai-anna/Duke-NLP-FinalProject

Folders and files

Latest commit

History

Repository files navigation

Duke NLP Final Project

Hashtags

Steps to Reproduce

1) Create & activate virtual environment

2) Install dependencies

3) Collect data (takes long, please skip and use content in data/)

4) Train LDA model

5) Train Neural Network (Requires multiple hours on a GPU enabled device)

Contributors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

3) Collect data (takes long, please skip and use content in `data/`)

Packages