Simple Product Parser is a small Python app for learning how web scraping works. It opens catalog pages, reads product details, and prints the results on screen.
It can help you understand how to:
- load web pages
- find product data on a page
- read text from page elements
- print clean output for each product
This project is meant for beginners who want a simple example they can run on Windows.
Visit this page to download and run the app:
If the page contains a source code ZIP or a release file, download it to your computer first.
- Open the link above in your browser.
- Click the green Code button or the release file, if one is provided.
- Download the ZIP file to your Downloads folder.
- Right-click the ZIP file and choose Extract All.
- Open the extracted folder.
- Look for a file named
main.py,app.py, or a similar Python file. - If Python is installed, double-click the file or run it from Command Prompt.
- Read the text output in the window.
If Windows asks what app should open the file, choose Python.
This project is built with:
- Python 3
- requests
- BeautifulSoup
- lxml
For normal use on Windows, you need:
- Windows 10 or Windows 11
- Python 3 installed
- internet access
- a web browser
- the files from this repository
When the app runs, it reads multiple catalog pages and prints product data. The output may include:
- product title
- article number
- brand
- producer
- size
- old price
- new price
The data appears in the console window or terminal. This lets you see how a parser collects page content and turns it into readable text.
The script follows a simple flow:
- It opens a catalog page.
- It reads the HTML from the page.
- It searches for product blocks.
- It extracts text from key fields.
- It prints the result for each product.
The app uses requests to get page data and BeautifulSoup with lxml to read the page structure.
You can expect files like these:
- Python script files
- a
README.mdfile - package files
- project folders for source code
- dependency lists such as
requirements.txt
If you see a requirements file, install the listed packages before you run the script.
If you want to run the script from source, install the needed packages first.
Open Command Prompt in the project folder and run:
pip install requests beautifulsoup4 lxml
If the project includes a requirements.txt file, you can also use:
pip install -r requirements.txt
After you install Python and the packages, run the main file from the project folder.
Example:
python main.py
If the file has a different name, use that name instead.
If the app opens a console window and closes fast, run it from Command Prompt so you can read the output.
This app does not need a lot of setup.
- Open the project folder.
- Run the Python script.
- Wait while it reads the product pages.
- Check the printed list of products.
The script is made for learning, so the output is plain and easy to follow.
If Windows says Python is not found:
- install Python 3 from the Python website
- check the box that adds Python to PATH during setup
- close and reopen Command Prompt
- run the script again
If you see an error about requests, bs4, or lxml:
- open Command Prompt in the project folder
- run
pip install requests beautifulsoup4 lxml - try again
If the window opens and nothing appears:
- wait a few seconds
- check that your internet connection works
- make sure the target page still loads in your browser
- run the script from Command Prompt
This project is useful if you want to learn:
- how Python reads web pages
- how HTML parsing works
- how to extract product data
- how to print structured results
- how to use common scraping tools
It is a basic example, so it keeps the code simple and focused.
This repository is related to:
- beautifulsoup
- beautifulsoup4
- education
- educational project
- parser
- parsing
- python
- python script
- python3
- scraper python
Simple Product Parser is provided for educational use. It shows how a small Python script can read catalog pages and pull out product details in a way that beginners can follow.
- download the project from the link above
- extract the files
- install Python 3
- install
requests,beautifulsoup4, andlxml - run the main Python file
- read the product data in the terminal