hamawebdev/Nodejs-Web-Crawler

Web Crawler Project

A Node.js web crawler that analyzes internal links on websites.

Features

  • Crawls web pages recursively starting from a base URL
  • Extracts and normalizes URLs from HTML content
  • Tracks internal link counts between pages
  • Generates a report listing the most-linked pages first
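
As a rough illustration of the normalization and counting features above, here is a minimal sketch built on Node's built-in URL class. The names normalizeURL and countInternalLinks are hypothetical and may not match what crawl.js actually exports. The choice to key pages by host plus path, ignoring protocol and trailing slashes, is also an assumption.

```javascript
// Hypothetical helper (not confirmed by the repository): reduce a URL to
// "host/path" so that variants of the same page share one key.
function normalizeURL(urlString) {
  const url = new URL(urlString);
  let path = url.pathname;
  // Drop trailing slashes so "/path/" and "/path" map to the same key.
  while (path.endsWith('/')) path = path.slice(0, -1);
  return `${url.host}${path}`;
}

// Hypothetical helper: tally how often each internal page is linked.
// `links` may hold absolute or relative hrefs; relative ones are resolved
// against baseURL. Links to other hosts (or hostless schemes) are skipped.
function countInternalLinks(baseURL, links, pages = {}) {
  const baseHost = new URL(baseURL).host;
  for (const link of links) {
    let absolute;
    try {
      absolute = new URL(link, baseURL);
    } catch {
      continue; // ignore hrefs that cannot be parsed
    }
    if (absolute.host !== baseHost) continue;
    const key = normalizeURL(absolute.href);
    pages[key] = (pages[key] ?? 0) + 1;
  }
  return pages;
}
```

Keying on host plus path means http and https variants of a page are counted together, which is usually what a link report wants but is a design choice rather than a requirement.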

Key Components

  • crawl.js - Core crawling logic and URL handling
  • report.js - Report generation
  • main.js - CLI entry point
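
The report step can be sketched as a sort over the link counts. This assumes the crawler produces a plain object mapping normalized URLs to counts. The function names and the wording of the output lines are illustrative, not taken from report.js.

```javascript
// Hypothetical helper: turn { url: count } into [url, count] pairs,
// ordered from most linked to least linked.
function sortPages(pages) {
  return Object.entries(pages).sort(([, a], [, b]) => b - a);
}

// Hypothetical helper: build one report line per page. The line format is
// an assumption for illustration.
function formatReport(pages) {
  return sortPages(pages).map(
    ([url, count]) => `Found ${count} internal links to ${url}`
  );
}
```

A CLI entry point could then print the result with something like formatReport(pages).forEach((line) => console.log(line)).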

Technologies

  • Node.js
  • Jest for testing
  • JSDOM for HTML parsing
  • Native fetch API for HTTP requests
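
To show how these pieces might fit together, here is a sketch of a recursive crawl. It is not the repository's actual crawl.js. The fetchFn and extractLinks parameters are assumptions introduced so the traversal can be exercised without network access or JSDOM installed. In a real run fetchFn would default to the native fetch (Node 18+), and extractLinks could be a JSDOM-based function like the one outlined in the comment.

```javascript
// A possible JSDOM-based extractor (requires `npm install jsdom`):
//
//   const { JSDOM } = require('jsdom');
//   const extractLinks = (html, pageURL) =>
//     [...new JSDOM(html).window.document.querySelectorAll('a')]
//       .map((a) => a.getAttribute('href'))
//       .filter(Boolean)
//       .map((href) => new URL(href, pageURL).href);

// Hypothetical recursive crawl. `pages` maps "host/path" keys to the number
// of times each page was encountered (the first visit counts as 1).
async function crawlPage(baseURL, currentURL, pages, { fetchFn = fetch, extractLinks }) {
  const base = new URL(baseURL);
  let current;
  try {
    current = new URL(currentURL, baseURL);
  } catch {
    return pages; // skip hrefs that cannot be parsed
  }
  if (current.host !== base.host) return pages; // stay on the same site

  // Already seen: bump the count and do not fetch the page again.
  const key = current.host + current.pathname.replace(/\/+$/, '');
  if (pages[key] !== undefined) {
    pages[key] += 1;
    return pages;
  }
  pages[key] = 1;

  let response;
  try {
    response = await fetchFn(current.href);
  } catch {
    return pages; // network failure: keep what has been collected so far
  }
  const contentType = response.headers.get('content-type') ?? '';
  if (!response.ok || !contentType.includes('text/html')) return pages;

  const html = await response.text();
  for (const link of extractLinks(html, current.href)) {
    await crawlPage(baseURL, link, pages, { fetchFn, extractLinks });
  }
  return pages;
}
```

Crawling pages one at a time keeps the sketch simple and avoids flooding the target site. A concurrent version would need a shared visited set and some form of rate limiting.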
