Skip to content

Latest commit

 

History

History
22 lines (11 loc) · 640 Bytes

README.md

File metadata and controls

22 lines (11 loc) · 640 Bytes

annual_report_crawler

A script to crawl annual reports of various companies using public repositories/websites.

Example website used is http://www.annualreports.com

This program takes in a Excel File as an input with "Company Name" containing the company names whose reports will be generated.

Note:

Make sure you have python library installers like pip and install dependencies such as all the imports and 'lxml' - pip install lxml

Usage:

python annual_report_crawler.py -f companiesList.xlsx

python annual_report_crawler.py (-h | --help)

python annual_report_crawler.py --version