Configurable webpage crawler that returns the most common words used and the number of times they are used.

tomny-dev/Crawler-Challenge

Crawler Coding Challenge

This application displays the most common words used in the History section of https://en.wikipedia.org/wiki/Microsoft, along with the number of times each word is used.
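
As a rough illustration of the counting step, here is a minimal sketch that assumes the section text has already been fetched and extracted as a plain string. The function name `most_common_words`, its parameters, and the tokenizing regex are hypothetical and not taken from this repository; the actual implementation in main.py may differ.

```python
import re
from collections import Counter


def most_common_words(text, num_words=10, excluded_words=()):
    """Return up to num_words (word, count) pairs, most frequent first.

    Hypothetical helper: the name, the default of 10, and the
    tokenizing rule are assumptions made for this sketch.
    """
    # Compare case-insensitively so "The" and "the" count as one word.
    excluded = {word.lower() for word in excluded_words}
    # Treat runs of letters (with an optional inner apostrophe) as words.
    words = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    counts = Counter(word for word in words if word not in excluded)
    return counts.most_common(num_words)


if __name__ == "__main__":
    sample = "The cat and the hat and the cat"
    for word, count in most_common_words(sample, num_words=2, excluded_words=["and"]):
        print(word, count)
```

Fetching the page and isolating the History section are deliberately left out of the sketch, since the README does not say how the application does either.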

Set up your virtual environment

pipenv install
pipenv shell

Run the application

python main.py

Configurable options

$ python main.py -h
usage: main.py [-h] [-n NUM_WORDS] [excluded_words ...]

Displays the most common words used in a portion of a webpage.

positional arguments:
  excluded_words        Words to exclude

options:
  -h, --help            show this help message and exit
  -n NUM_WORDS, --num_words NUM_WORDS
                        Number of words to return
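
A command-line interface with this help text could be declared with argparse roughly as follows. This is a sketch reconstructed from the help output above, not the repository's code: the `build_parser` name and the default of 10 for `--num_words` are assumptions, since the help text does not state a default.

```python
import argparse


def build_parser():
    """Build a parser matching the usage shown above (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Displays the most common words used in a portion of a webpage.",
    )
    # Zero or more positional words to leave out of the results.
    parser.add_argument("excluded_words", nargs="*", help="Words to exclude")
    # The default of 10 is an assumption; the help output shows none.
    parser.add_argument(
        "-n",
        "--num_words",
        type=int,
        default=10,
        help="Number of words to return",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.num_words, args.excluded_words)
```

With a parser like this, `python main.py -n 5 the of and` would request five words while excluding "the", "of", and "and".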

Thanks for reading and have a nice day!