Download Data from: http://www.di.unipi.it/~gulli/AG_corpus_of_news_articles.html top 4 classes title and description Download XML version use jupyter notebook to parse and create train test datasets