Marvel SNAP Scrapr

Scraper for https://marvelsnapzone.com to retrieve metadata of Marvel SNAP cards. This is a fork of vlmaier/marvel-snap-scrapr.

How it works

The script uses the Beautiful Soup Python library, which pulls data out of HTML or XML files. It scrapes the website https://marvelsnapzone.com which is well-structured and provides all required metadata about Marvel SNAP cards. Selenium web driver is required because of the dynamic loading on the website. Otherwise, the card links are not available when going for a static approach. In the end, a dictionary of all available cards is created and saved to data.json alongside downloaded card thumbnails.

How to use it

Ensure Python is installed, then in the project's root directory, run pip install -r requirements.txt.

The script is run like python scrapr.py and, unless modified, will save the files in a subdirectory of the working directory. Therefore, it's best to run the script from the directory the output files should be in (e.g. src/assets) rather than running the script from its own root directory.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.github		.github
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
scrapr.py		scrapr.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Marvel SNAP Scrapr

How it works

How to use it

About

Releases

Packages

Languages

License

bagley2014/marvel-snap-scrapr

Folders and files

Latest commit

History

Repository files navigation

Marvel SNAP Scrapr

How it works

How to use it

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages